pdf.js

Author	SHA1	Message	Date
Tim van der Meij	c5663f2f6b	Merge pull request #12141 from phillipj/populate-findbar-on-search-hash Populate the find field with the search query when URL has #search hash	2020-07-31 00:07:18 +02:00
Tim van der Meij	f676a00762	Merge pull request #12143 from escapewindow/fix-12107 fix reftests after #12107	2020-07-30 23:56:54 +02:00
Aki Sasaki	7bb65bab7f	fix reftests after #12107 The f1040-annotations reftest started hanging after #12107. We traced this to `TypeError: can't access property "getOrCreateValue", storage is undefined`. We essentially need to add `annotationStorage` to the parameters in test/driver.js.	2020-07-30 12:25:27 -07:00
Linus Gasser	f1bbfdc16d	Add typescript definitions This PR adds typescript definitions from the JSDoc already present. It adds a new gulp-target 'types' that calls 'tsc', the typescript compiler, to create the definitions. To use the definitions, users can simply do the following: ``` import {getDocument, GlobalWorkerOptions} from "pdfjs-dist"; import pdfjsWorker from "pdfjs-dist/build/pdf.worker.entry"; GlobalWorkerOptions.workerSrc = pdfjsWorker; const pdf = await getDocument("file:///some.pdf").promise; ``` Co-authored-by: @oBusk Co-authored-by: @tamuratak	2020-07-30 11:10:37 +02:00
Phillip Johnsen	50f73092e1	Populate the find field with the search query when URL has #search hash These changes improves the existing search functionality triggered when the URL contains a `#search` hash. In addition to performing the actual search immediately like before, this will ensure the search text field inside the find bar gets populated with the text currently being searched for.	2020-07-30 10:11:30 +02:00
Tim van der Meij	eb4d6a0652	Merge pull request #12107 from calixteman/checkbox Add support for checkboxes printing	2020-07-30 00:11:41 +02:00
Calixte Denizet	cb60523a15	Add support for checkboxes printing	2020-07-29 16:42:57 +02:00
Tim van der Meij	6537e64cb8	Merge pull request #12136 from Snuffleupagus/PDFFetchStreamRangeReader-AbortError Ignore `fetch()` errors, in `PDFFetchStreamRangeReader`, once the request has been aborted	2020-07-29 00:09:51 +02:00
Tim van der Meij	bcbbd03f7b	Merge pull request #12135 from Snuffleupagus/pdfManagerReady-loadDocument-promise [src/core/worker.js] Remove a useless Promise handler from the `pdfManagerReady` function	2020-07-29 00:07:25 +02:00
Jonas Jenwald	1b720a4b23	Ignore `fetch()` errors, in `PDFFetchStreamRangeReader`, once the request has been aborted Besides making general sense, as far as I can tell, this patch should also prevent one source of `Uncaught (in promise) ...` exceptions. Unfortunately `reason instanceof AbortError` doesn't work here, since `AbortError` isn't actually defined in browsers; note how even the DOM specification contains an example using the `name` property: https://dom.spec.whatwg.org/#aborting-ongoing-activities This patch prevents the following errors from being logged in the console, when the unit-tests are running: - Firefox: `Uncaught (in promise) DOMException: The operation was aborted.` - Chrome: `Uncaught (in promise) DOMException: The user aborted a request.`	2020-07-28 17:18:49 +02:00
Jonas Jenwald	fbe90b63ec	[src/core/worker.js] Remove a useless Promise handler from the `pdfManagerReady` function Looking carefully at this code, you'll notice that the `loadDocument` function has no less than three Promise handling functions. This obviously makes no sense, since a Promise can only have one resolve and one reject handler. Hence the final `onFailure`-case is unreachable, which only serves to add confusion when reading the code. Note that this code has been re-factored more than once over the years, but it seems as if this may even have been incorrect already in PR 3310 (and no-one have noticed for seven years :-).	2020-07-28 14:51:50 +02:00
Tim van der Meij	403816040e	Merge pull request #12127 from Snuffleupagus/type3Dependencies Improve how Type3-fonts with dependencies are handled	2020-07-27 23:44:51 +02:00
Jonas Jenwald	835b5ffddd	Only check `isType3Font` the first time that `TranslatedFont.loadType3Data` is called If the `TranslatedFont.type3Loaded` property exists, then you already know that the font must be a Type3 one.	2020-07-27 13:20:15 +02:00
Jonas Jenwald	f3ff526019	Send/receive Type3 images the same way as other globally-cached images There's quite frankly no particular reason to special-case Type3-fonts with image resources, which are very rare anyway, now that we have a general mechanism for sending/receiving images globally.	2020-07-27 13:20:15 +02:00
Jonas Jenwald	7c9d0d5939	Improve how Type3-fonts with dependencies are handled While the `CharProcs` streams of Type3-fonts usually don't rely on dependencies, such as e.g. images, it does happen in some cases. Currently any dependencies are simply appended to the parent operatorList, which in practice means only the operatorList of the first page where the Type3-font is being used. However, there's one thing that's slightly unfortunate with that approach: Since fonts are global to the PDF document, we really ought to ensure that any Type3 dependencies are appended to the operatorList of all pages where the Type3-font is being used. Otherwise there's a theoretical risk that, if one page has its rendering paused, another page may try to use a Type3-font whose dependencies are not yet fully resolved. In that case there would be errors, since Type3 operatorLists are executed synchronously. Hence this patch, which ensures that all relevant pages will have Type3 dependencies appended to the main operatorList. (Note here that the `OperatorList.addDependencies` method, via `OperatorList.addDependency`, ensures that a dependency is only added once to any operatorList.) Finally, these changes also remove the need for the "waiting for the main-thread"-hack that was added to `PartialEvaluator.buildPaintImageXObject` as part of fixing issue 10717.	2020-07-27 13:20:13 +02:00
Tim van der Meij	c7eb79ca66	Merge pull request #12125 from timvandermeij/puppeteer Improve test bot stability	2020-07-26 21:55:33 +02:00
Tim van der Meij	65e76a3c6b	Fix a bug in the temporary folder check in the test runner The `noPrompt` option doesn't exist and should be `noPrompts`.	2020-07-26 20:41:19 +02:00
Tim van der Meij	c7c6c90062	Limit the allowed versions for Jasmine and Puppeteer Jasmine >= 3.6.0 causes intermittent test failures because of random task abortions. Puppeteer >= 4.0.0 causes ENOTEMPTY/EBUSY errors during shutdown on the Windows bot. Moreover, `jasmine-core` is a dependency of `jasmine` so it doesn't have to be required separately.	2020-07-26 20:40:55 +02:00
Tim van der Meij	01e2610cf4	Merge pull request #12126 from Snuffleupagus/unittest-shall_fail_cleanup Attempt to reduce intermittent failures in the "cleans up document resources during rendering of page" unit-test	2020-07-26 14:33:12 +02:00
Jonas Jenwald	86a8fd9810	Attempt to reduce intermittent failures in the "cleans up document resources during rendering of page" unit-test This patch should hopefully remove the `Unhandled promise rejection: ...` errors, by returning the "final" promise. Also, by pausing/delaying of rendering slightly the likelihood of the test failing in the first place should thus be reduced.	2020-07-26 14:05:46 +02:00
Tim van der Meij	7ffbda3396	Merge pull request #12124 from Snuffleupagus/unittest-browser-name Include the browser name when printing unit-test results	2020-07-26 14:02:25 +02:00
Jonas Jenwald	e4ad91be05	Include the browser name when printing unit-test results This uses a similar format to the reference-test logging, and will help determine in exactly which browser the failure occurred (since the tests run concurrently).	2020-07-26 12:54:16 +02:00
Tim van der Meij	311ca6e796	Merge pull request #12122 from Snuffleupagus/update-packages Update packages and translations	2020-07-26 12:34:29 +02:00
Jonas Jenwald	ca61910a2c	Update l10n files	2020-07-26 11:18:56 +02:00
Jonas Jenwald	47ab676225	Update `npm` packages	2020-07-26 11:18:41 +02:00
Tim van der Meij	bf539deada	Merge pull request #12106 from calixteman/storage Add an annotation storage in order to save annotation data in acroforms	2020-07-24 23:49:37 +02:00
Calixte Denizet	584902dbf8	Add an annotation storage in order to save annotation data in acroforms	2020-07-24 10:50:11 +02:00
Tim van der Meij	7b0c52fdfe	Merge pull request #12114 from Snuffleupagus/viewer-PDFJSDev-cleanup Remove a couple of unnecessary `PDFJSDev` checks from the viewer	2020-07-23 23:47:26 +02:00
Jonas Jenwald	1c809c87af	Remove a couple of unnecessary `PDFJSDev` checks from the viewer - Given the `DefaultExternalServices` implementation, the `PDFViewerApplication.supportsDocumentFonts` getter is guaranteed to be defined and we can thus remove some (now) unnecessary `PDFJSDev` checks from the `webViewerInitialized` function. - By slightly tweaking the "pdfBugEnabled" definition in `web/app_options`, similar to the existing ones for "workerSrc" and "cMapUrl", we can remove some `PDFJSDev` checks from the `PDFViewerApplication._parseHashParameters` method.	2020-07-23 18:24:11 +02:00
Tim van der Meij	d69fb446bf	Merge pull request #12023 from escapewindow/issue9297 Fix auto-rotate for printing landscape PDFs where the first page is also landscape	2020-07-17 19:48:14 +02:00
Tim van der Meij	90689cf08e	Merge pull request #12101 from Snuffleupagus/Dict-size-getRawValues Add a new `size` getter and `getRawValues` method, to `Dict` instances, to simplify some code	2020-07-17 19:22:07 +02:00
Aki Sasaki	04db9d902f	ignore `isFirstPagePortrait` in `getPagesOverview` The current behavior for `getPagesOverview` assumes we want to only auto-rotate if: - `enablePrintAutoRotate` is `true` - `isFirstPagePortrait !== isPortraitOrientation(size)` This second check is what is breaking #9297. The two PDFs linked have a landscape orientation first page, as well as subsequent pages. Since `false === false`, we print portrait. Let's drop the comparison with `isFirstPagePortrait`, and print landscape if `!isPortraitOrientation(size)`. Fixes #9297.	2020-07-17 08:22:04 -07:00
Jonas Jenwald	684a7b89ac	Remove unnecessary duplication in the `addChildren` helper function (used by the `ObjectLoader`) Besides being fewer lines of code overall, this also avoids one `node instanceof Dict` check for both of the `Dict`/`Stream`-cases.	2020-07-17 16:32:24 +02:00
Jonas Jenwald	ea8e432c45	Add a `getRawValues` method, to `Dict` instances, to provide an easier way of getting all raw values When the old `Dict.getAll()` method was removed, it was replaced with a `Dict.getKeys()` call and `Dict.get(...)` calls (in a loop). While this pattern obviously makes a lot of sense in many cases, there's some instances where we actually want the raw `Dict` values (i.e. `Ref`s where applicable). In those cases, `Dict.getRaw(...)` calls are instead used within the loop. However, by introducing a new `Dict.getRawValues()` method we can reduce the number of (strictly unnecessary) function calls by simply getting the raw `Dict` values directly.	2020-07-17 16:32:00 +02:00
Jonas Jenwald	6381b5b08f	Add a `size` getter, to `Dict` instances, to provide an easier way of checking the number of entries This removes the need to manually call `Dict.getKeys()` and check its length.	2020-07-17 16:06:11 +02:00
Tim van der Meij	e63d1ebff5	Merge pull request #12087 from Snuffleupagus/LocalGStateCache Add local caching of "simple" Graphics State (ExtGState) data in `PartialEvaluator.{getOperatorList, getTextContent}` (issue 2813)	2020-07-17 16:02:45 +02:00
Tim van der Meij	c55122c828	Merge pull request #12089 from timvandermeij/refsetcache Convert `RefSetCache` to a proper class and to use a `Map` internally	2020-07-17 15:38:09 +02:00
Tim van der Meij	b19a1796ac	Convert `RefSetCache` to a proper class and to use a `Map` internally Using a `Map` instead of an `Object` provides some advantages such as cheaper ways to get the size of the cache, to find out if an entry is contained in the cache and to iterate over the cache. Moreover, we can clear and re-use the same `Map` object now instead of creating a new one.	2020-07-17 13:35:29 +02:00
Tim van der Meij	29adbb7cd7	Implement unit tests for the `RefSetCache` primitive This primitive did not have unit test coverage yet, which is important for upcoming refactoring of the primitive.	2020-07-17 13:35:29 +02:00
Tim van der Meij	a604973cc7	Merge pull request #12085 from tamuratak/fix_isnodejs Make the detection of Node.js environments on Electron strict.	2020-07-17 13:29:59 +02:00
Tim van der Meij	7d26f4147f	Merge pull request #12099 from Snuffleupagus/hasBlendModes-RefSet Use a `RefSet`, rather than a plain Object, for tracking already processed nodes in `PartialEvaluator.hasBlendModes`	2020-07-17 12:37:37 +02:00
Jonas Jenwald	b3480842b3	Use a `RefSet`, rather than a plain Object, for tracking already processed nodes in `PartialEvaluator.hasBlendModes`	2020-07-17 09:52:36 +02:00
Jonas Jenwald	03547b5633	Change `PartialEvaluator.setGState` to an `async` method Since this method calls `Dict.get` to fetch data, there could thus be `Error`s thrown in corrupt PDF documents when attempting to resolve an indirect object. To ensure that this won't ever become a problem, we change the method to be `async` such that a rejected Promise would be returned and general OperatorList parsing won't break.	2020-07-15 14:27:18 +02:00
Jonas Jenwald	f20aeb9343	Slightly simplify the code in `PartialEvaluator.hasBlendModes`, e.g. by using `for...of` loops - Replace the existing loops with `for...of` variants instead. - Make use of `continue`, to reduce indentation and to make the code (slightly) easier to follow, when checking `/Resources` entries.	2020-07-15 12:47:11 +02:00
Jonas Jenwald	15fa3f8518	Remove a redundant `/XObject` stream dictionary `objId` check in `PartialEvaluator.hasBlendModes` (PR 6971 follow-up) This case should no longer happen, given the `instanceof Ref` branch just above (added in PR 6971). Also, I've run the entire test-suite locally with `continue` replaced by `throw new Error(...)` and didn't find any problems.	2020-07-15 12:47:11 +02:00
Jonas Jenwald	84476da26e	Handle lookup errors "silently" in `PartialEvaluator.hasBlendModes` (PR 11680 follow-up) Given that this method is used during what's essentially a pre-parsing stage, before the actual OperatorList parsing occurs, on second thought it doesn't seem at all necessary to warn and trigger fallback in cases where there's lookup errors. Please note: Any any errors will still be either suppressed or thrown, according to the `ignoreErrors` option, during the actual OperatorList parsing.	2020-07-15 12:47:07 +02:00
Jonas Jenwald	981ff41b5f	Add local caching of non-font Graphics State (ExtGState) data in `PartialEvaluator.getTextContent` It turns out that `getTextContent` suffers from similar problems with repeated GStates as `getOperatorList`; please see the previous patch. While only `/ExtGState` resources containing Fonts will actually be parsed by `PartialEvaluator.getTextContent`, we're still forced to fetch/validate repeated `/ExtGState` resources even though most of them won't affect the textContent (since they mostly contain purely graphical state). With these changes we also no longer need to immediately reset the current text-state when encountering a `setGState` operator, which may thus improve text-selection in some cases.	2020-07-14 10:34:43 +02:00
Jonas Jenwald	90eb579713	Add local caching of "simple" Graphics State (ExtGState) data in `PartialEvaluator.getOperatorList` (issue 2813) This patch will help pathological cases the most, with issue 2813 being a particularily problematic example. While there's only four `/ExtGState` resources, there's a total `29062` of `setGState` operators. Even though parsing of a single `/ExtGState` resource is quite fast, having to re-parse them thousands of times does add up quite significantly. For simplicity we'll only cache "simple" `/ExtGState` resource, since e.g. the general `SMask` case cannot be easily cached (without re-factoring other code, which may have undesirable effects on general parsing). By caching "simple" `/ExtGState` resource, we thus improve performance by: - Not having to fetch/validate/parse the same `/ExtGState` data over and over. - Handling of repeated `setGState` operators becomes synchronous during the `OperatorList` building, instead of having to defer to the event-loop/microtask-queue since the `/ExtGState` parsing is done asynchronously. --- Obviously I had intended to include (standard) benchmark results with this patch, but for reasons I don't understand the test run-time (even with `master`) of the document in issue 2813 is a lot slower than in the development viewer (making normal benchmarking infeasible). However, testing this manually in the development viewer (using `pdfBug=Stats`) shows a reduction of `~10 %` in the rendering time of the PDF document in issue 2813.	2020-07-14 10:34:43 +02:00
Tim van der Meij	6c39aff374	Merge pull request #12090 from Snuffleupagus/rm-not-xobj Stop special-casing the (very unlikely) "no `/XObject` found"-scenario, when parsing `OPS.paintXObject` operators, in `PartialEvaluator.{getOperatorList, getTextContent}`	2020-07-13 23:50:08 +02:00
Tim van der Meij	cf0ed3c30a	Merge pull request #12092 from Snuffleupagus/update-packages Update packages and translations	2020-07-13 23:45:35 +02:00

... 2 3 4 5 6 ...

12839 Commits