pdf.js

Author	SHA1	Message	Date
Tim van der Meij	29adbb7cd7	Implement unit tests for the `RefSetCache` primitive This primitive did not have unit test coverage yet, which is important for upcoming refactoring of the primitive.	2020-07-17 13:35:29 +02:00
Tim van der Meij	a604973cc7	Merge pull request #12085 from tamuratak/fix_isnodejs Make the detection of Node.js environments on Electron strict.	2020-07-17 13:29:59 +02:00
Tim van der Meij	7d26f4147f	Merge pull request #12099 from Snuffleupagus/hasBlendModes-RefSet Use a `RefSet`, rather than a plain Object, for tracking already processed nodes in `PartialEvaluator.hasBlendModes`	2020-07-17 12:37:37 +02:00
Jonas Jenwald	b3480842b3	Use a `RefSet`, rather than a plain Object, for tracking already processed nodes in `PartialEvaluator.hasBlendModes`	2020-07-17 09:52:36 +02:00
Jonas Jenwald	f20aeb9343	Slightly simplify the code in `PartialEvaluator.hasBlendModes`, e.g. by using `for...of` loops - Replace the existing loops with `for...of` variants instead. - Make use of `continue`, to reduce indentation and to make the code (slightly) easier to follow, when checking `/Resources` entries.	2020-07-15 12:47:11 +02:00
Jonas Jenwald	15fa3f8518	Remove a redundant `/XObject` stream dictionary `objId` check in `PartialEvaluator.hasBlendModes` (PR 6971 follow-up) This case should no longer happen, given the `instanceof Ref` branch just above (added in PR 6971). Also, I've run the entire test-suite locally with `continue` replaced by `throw new Error(...)` and didn't find any problems.	2020-07-15 12:47:11 +02:00
Jonas Jenwald	84476da26e	Handle lookup errors "silently" in `PartialEvaluator.hasBlendModes` (PR 11680 follow-up) Given that this method is used during what's essentially a pre-parsing stage, before the actual OperatorList parsing occurs, on second thought it doesn't seem at all necessary to warn and trigger fallback in cases where there's lookup errors. Please note: Any any errors will still be either suppressed or thrown, according to the `ignoreErrors` option, during the actual OperatorList parsing.	2020-07-15 12:47:07 +02:00
Tim van der Meij	6c39aff374	Merge pull request #12090 from Snuffleupagus/rm-not-xobj Stop special-casing the (very unlikely) "no `/XObject` found"-scenario, when parsing `OPS.paintXObject` operators, in `PartialEvaluator.{getOperatorList, getTextContent}`	2020-07-13 23:50:08 +02:00
Tim van der Meij	cf0ed3c30a	Merge pull request #12092 from Snuffleupagus/update-packages Update packages and translations	2020-07-13 23:45:35 +02:00
Jonas Jenwald	8256981aca	Update l10n files	2020-07-13 11:08:28 +02:00
Jonas Jenwald	20ad4337ac	Fix most vulnerabilities reported by `npm audit` This was automatically, by using the `npm audit fix` command.	2020-07-13 11:06:47 +02:00
Jonas Jenwald	fac5b5ff0c	Update `npm` packages	2020-07-13 11:05:50 +02:00
Jonas Jenwald	d4d7ac1b88	Stop special-casing the (very unlikely) "no `/XObject` found"-scenario, when parsing `OPS.paintXObject` operators, in `PartialEvaluator.{getOperatorList, getTextContent}` Originally there weren't any (generally) good ways to handle errors gracefully, on the worker-side, however that's no longer the case and we can simply fallback to the existing `ignoreErrors` functionality instead. Also, please note that the "no `/XObject` found"-scenario should be extremely unlikely in practice and would only occur in corrupt/broken documents. Note that the `PartialEvaluator.getOperatorList` case is especially bad currently, since we'll simply (attempt to) send the data as-is to the main-thread. This is quite bad, since in a corrupt/broken document the data could contain anything and e.g. be unclonable (which would cause breaking errors). Also, we're (obviously) not attempting to do anything with this "raw" `OPS.paintXObject` data on the main-thread and simply ensuring that we never send it definately seems like the correct approach.	2020-07-12 21:59:59 +02:00
Takashi Tamura	473ea1f1a4	Make the detection of Node.js environments on Electron strict. The main process and its child processes should be detected as Node.js environments.	2020-07-12 10:56:17 +09:00
Tim van der Meij	7dabc5ecc8	Merge pull request #12063 from Snuffleupagus/issue-10989 Tweak the heuristic, in `src/core/jpg.js`, that handles JPEG images with a wildly incorrect SOF (Start of Frame) `scanLines` parameter (issue 10989)	2020-07-11 00:05:11 +02:00
Tim van der Meij	c11fc3acfc	Merge pull request #12083 from Snuffleupagus/ColorSpace-rm-IR Remove the IR (internal representation) part of the ColorSpace parsing	2020-07-11 00:00:45 +02:00
Jonas Jenwald	d18cf47419	Remove the special handling, used when creating Indexed ColorSpaces, for the case where the `lookup`-data is a `Stream` This special-case was added in PR 1992, however it became unnecessary with the changes in PR 4824 since all of the ColorSpace parsing is now done on the worker-thread (with only RGB-data being sent to the main-thread).	2020-07-10 17:22:55 +02:00
Jonas Jenwald	ea6a0e4435	Remove the IR (internal representation) part of the ColorSpace parsing Originally ColorSpaces were only partially parsed on the worker-thread, to obtain an IR-format which was sent to the main-thread. This had the somewhat unfortunate side-effect of causing the majority of the (potentially heavy) ColorSpace parsing to happen on the main-thread. Hence PR 4824 which, among other things, changed ColorSpaces to be fully parsed on the worker-thread with only RGB-data being sent to the main-thread. While it thus originally was necessary to have `ColorSpace.{parseToIR, fromIR}` methods, to handle the worker/main-thread split, that's no longer the case and we can thus reduce all of the ColorSpace parsing to one method instead. Currently, when parsing a ColorSpace, we call `ColorSpace.parseToIR` which parses the ColorSpace-data from the document and then creates the IR-format. We then, immediately, call `ColorSpace.fromIR` which parses the IR-format and then finally creates the actual ColorSpace.[1] All-in-all, this leads to a fair amount of unnecessary indirection which also (in my opinion) makes the code less clear. Obviously these changes are not really expected to have a significant effect on performance, especially with the recently added caching of ColorSpaces, however there'll now be strictly fewer function calls, less memory allocated, and overall less parsing required during ColorSpace-handling. --- [1] For ICCBased ColorSpaces, given the validation necessary, this currently even leads to parsing an /Alternate ColorSpace twice.	2020-07-10 17:22:44 +02:00
Tim van der Meij	38cf49bb81	Merge pull request #12070 from Snuffleupagus/createFontId Re-factor the `idFactory` functionality, used in the `core/`-code, and move the `fontID` generation into it	2020-07-09 23:44:33 +02:00
Tim van der Meij	72d71ba6a5	Merge pull request #12056 from Snuffleupagus/_delayedFallback Refactor/simplify the "delayedFallback" handling in the default viewer	2020-07-09 00:10:40 +02:00
Jonas Jenwald	f9157ec243	Unconditionally report telemetry, in the viewer, regardless of build target Given the dummy-methods on `DefaultExternalServices`, there's no longer any compelling reason not to (attempt to) report telemetry unconditionally. The only larger change consists of moving the `KNOWN_VERSIONS` and `KNOWN_GENERATORS` arrays ouf of the `PDFViewerApplication._initializeMetadata` method. Please note: Most of this patch consists of whitespace-only changes.	2020-07-08 16:05:25 +02:00
Jonas Jenwald	625f8a6f51	Refactor/simplify the "delayedFallback" handling in the default viewer There's a few things that could be improved in the current implementation, such as: - It's currently necessary to both manually track the `featureId`s which should trigger delayed fallback, as well as manually report telemetry in affected cases. Obviously there's only two call-sites as of now (forms and javaScript), but it still feels somewhat error-prone especially if more cases were to be added in the future. To address this, this patch adds a new (private) method which abstracts away these details from the call-sites. - Generally, it also seems nice to reduce and simplify the amount of state we need to store on `PDFViewerApplication` in order to support the "delayedFallback" functionality. Also, having to manually work with the "delayedFallback"-array in multiple places feels cumbersome and makes e.g. the `PDFViewerApplication.fallback` method less clear as to its behaviour. - Having code outside of `PDFViewerApplication`, i.e. in the event handlers, directly access properties which are marked as "private" via a leading underscore doesn't seem that great in general. Furthermore, having the event handlers directly deal with that should be internal state also seem unfortunate. To address this, the patch will instead make use of a new `PDFViewerApplication.triggerDelayedFallback` callback. - There's at least one code-path in the viewer, see `PDFViewerApplication.error`, where `fallback` can be called without an argument. It's currently possible (although maybe somewhat unlikely) that such a call could be overridden by the `featureId` of a pending "delayedFallback" call, thus not reporting the correct fallback reason. - The "delayedFallback"-state weren't being reset on document close (which shouldn't affect Firefox, but nonetheless it ought to be fixed).	2020-07-08 15:30:07 +02:00
Tim van der Meij	780faf6b26	Merge pull request #12072 from mozilla/dependabot/npm_and_yarn/npm-6.14.6 Bump npm from 6.14.5 to 6.14.6	2020-07-08 00:14:02 +02:00
dependabot[bot]	82b1930452	Bump npm from 6.14.5 to 6.14.6 Bumps [npm](https://github.com/npm/cli) from 6.14.5 to 6.14.6. - [Release notes](https://github.com/npm/cli/releases) - [Changelog](https://github.com/npm/cli/blob/latest/CHANGELOG.md) - [Commits](https://github.com/npm/cli/compare/v6.14.5...v6.14.6) Signed-off-by: dependabot[bot] <support@github.com>	2020-07-07 19:25:19 +00:00
Jonas Jenwald	4cc6797f17	Re-factor the `idFactory` functionality, used in the `core/`-code, and move the `fontID` generation into it Note how the `getFontID`-method in `src/core/fonts.js` is completely global, rather than properly tied to the current document. This means that if you repeatedly open and parse/render, and then close, even the same PDF document the `fontID`s will still be incremented continuously. For comparison the `createObjId` method, on `idFactory`, will always create a consistent id, assuming of course that the document and its pages are parsed/rendered in the same order. In order to address this inconsistency, it thus seems reasonable to add a new `createFontId` method on the `idFactory` and use that when obtaining `fontID`s. (When the current `getFontID` method was added the `idFactory` didn't actually exist yet, which explains why the code looks the way it does.) Please note: Since the document id is (still) part of the `loadedName`, it's thus not possible for different documents to have identical font names.	2020-07-07 16:33:31 +02:00
Tim van der Meij	cf8daaf78b	Merge pull request #12060 from Snuffleupagus/evaluator-class Convert the code in `src/core/evaluator.js` to use standard classes	2020-07-06 23:50:36 +02:00
Jonas Jenwald	1d66fce781	Tweak the heuristic, in `src/core/jpg.js`, that handles JPEG images with a wildly incorrect SOF (Start of Frame) `scanLines` parameter (issue 10989)	2020-07-06 13:06:49 +02:00
Jonas Jenwald	c95fbb6e21	Convert the code in `src/core/evaluator.js` to use standard classes This removes additional `// eslint-disable-next-line no-shadow` usage, which our old pseudo-classes necessitated. Most of the re-formatting changes, after the `class` definitions and methods were fixed, were done automatically by Prettier. Please note: I'm purposely not doing any `var` to `let`/`const` conversion here, since it's generally better to (if possible) do that automatically on e.g. a directory basis instead.	2020-07-05 16:01:04 +02:00
Jonas Jenwald	32a0b6fa73	Move some constants and helper functions out of the `PartialEvaluator` closure This will simplify the `class` conversion in the next patch, and with modern JavaScript the moved code is still limited to the current module scope. Please note: For improved consistency with our usual formatting, the `TILING_PATTERN`/`SHADING_PATTERN` constants where re-factored slightly.	2020-07-05 15:56:23 +02:00
Tim van der Meij	c4255fdbfd	Merge pull request #12059 from Snuffleupagus/image-class Convert the code in `src/core/image.js` to use ES6 classes	2020-07-05 14:08:55 +02:00
Tim van der Meij	4ef43ca44b	Merge pull request #12057 from Snuffleupagus/BaseLocalCache-onlyRefs Allow `BaseLocalCache` to, optionally, only allocate storage for caching of references (PR 12034 follow-up)	2020-07-05 14:05:45 +02:00
Jonas Jenwald	59da1d5829	Convert the code in `src/core/image.js` to use ES6 classes This removes additional `// eslint-disable-next-line no-shadow` usage, which our old pseudo-classes necessitated. Please note: I'm purposely not doing any `var` to `let`/`const` conversion here, since it's generally better to (if possible) do that automatically on e.g. a directory basis instead.	2020-07-05 09:34:14 +02:00
Jonas Jenwald	85ced3fbfd	Allow `BaseLocalCache` to, optionally, only allocate storage for caching of references (PR 12034 follow-up) Yet another instalment in the never-ending series of things that you think of __after__ a patch has landed. Since `Function`s are only cached by reference, we thus don't need to allocate storage for names in `LocalFunctionCache` instances. Obviously the effect of these changes are really tiny, but it seems reasonable in principle to avoid allocating data structures that are guaranteed to be unused.	2020-07-04 15:01:32 +02:00
Tim van der Meij	29548ad498	Merge pull request #12034 from Snuffleupagus/Function-local-cache-3 Add local caching of `Function`s, by reference, in the `PDFFunctionFactory` (issue 2541)	2020-07-04 12:05:15 +02:00
Jonas Jenwald	ca719ecaa4	Add local caching of `Function`s, by reference, in the `PDFFunctionFactory` (issue 2541) Note that compared other structures, such as e.g. Images and ColorSpaces, `Function`s are not referred to by name, which however does bring the advantage of being able to share the cache for an entire page. Furthermore, similar to ColorSpaces, the parsing of individual `Function`s are generally fast enough to not really warrant trying to cache them in any "smarter" way than by reference. (Hence trying to do caching similar to e.g. Fonts would most likely be a losing proposition, given the amount of data lookup/parsing that'd be required.) Originally I tried implementing this similar to e.g. the recently added ColorSpace caching (and in a couple of different ways), however it unfortunately turned out to be quite ugly/unwieldy given the sheer number of functions/methods where you'd thus need to pass in a `LocalFunctionCache` instance. (Also, the affected functions/methods didn't exactly have short signatures as-is.) After going back and forth on this for a while it seemed to me that the simplest, or least "invasive" if you will, solution would be if each `PartialEvaluator` instance had its own `PDFFunctionFactory` instance (since the latter is already passed to all of the required code). This way each `PDFFunctionFactory` instances could have a local `Function` cache, without it being necessary to provide a `LocalFunctionCache` instance manually at every `PDFFunctionFactory.{create, createFromArray}` call-site. Obviously, with this patch, there's now (potentially) more `PDFFunctionFactory` instances than before when the entire document shared just one. However, each such instance is really quite small and it's also tied to a `PartialEvaluator` instance and those are not kept alive and/or cached. To reduce the impact of these changes, I've tried to make as many of these structures as possible lazily initialized, specifically: - The `PDFFunctionFactory`, on `PartialEvaluator` instances, since not all kinds of general parsing actually requires it. For example: `getTextContent` calls won't cause any `Function` to be parsed, and even some `getOperatorList` calls won't trigger `Function` parsing (if a page contains e.g. no Patterns or "complex" ColorSpaces). - The `LocalFunctionCache`, on `PDFFunctionFactory` instances, since only certain parsing requires it. Generally speaking, only e.g. Patterns, "complex" ColorSpaces, and/or (some) SoftMasks will trigger any `Function` parsing. To put these changes into perspective, when loading/rendering all (14) pages of the default `tracemonkey.pdf` file there's now a total of 6 `PDFFunctionFactory` and 1 `LocalFunctionCache` instances created thanks to the lazy initialization. (If you instead would keep the document-"global" `PDFFunctionFactory` instance and pass around `LocalFunctionCache` instances everywhere, the numbers for the `tracemonkey.pdf` file would be instead be something like 1 `PDFFunctionFactory` and 6 `LocalFunctionCache` instances.) All-in-all, I thus don't think that the `PDFFunctionFactory` changes should be generally problematic. With these changes, we can also modify (some) call-sites to pass in a `Reference` rather than the actual `Function` data. This is nice since `Function`s can also be `Streams`, which are not cached on the `XRef` instance (given their potential size), and this way we can avoid unnecessary lookups and thus save some additional time/resources. Obviously I had intended to include (standard) benchmark results with these changes, but for reasons I don't really understand the test run-time (even with `master`) of the document in issue 2541 is quite a bit slower than in the development viewer. However, logging the time it takes for the relevant `PDFFunctionFactory`/`PDFFunction ` parsing shows that it takes approximately `0.5 ms` for the `Function` in question. Looking up a cached `Function`, on the other hand, is one order of magnitude faster which does add up when the same `Function` is invoked close to 2000 times.	2020-07-04 00:55:18 +02:00
Tim van der Meij	1f7175542e	Merge pull request #12039 from Snuffleupagus/Node-utils [api-minor] Use the `NodeCanvasFactory`/`NodeCMapReaderFactory` classes as defaults in Node.js environments (issue 11900)	2020-07-04 00:42:52 +02:00
Jonas Jenwald	4a7e29865d	[api-minor] Use the `NodeCanvasFactory`/`NodeCMapReaderFactory` classes as defaults in Node.js environments (issue 11900) This moves, and slightly simplifies, code that's currently residing in the unit-test utils into the actual library, such that it's bundled with `GENERIC`-builds and used in e.g. the API-code. As an added bonus, this also brings out-of-the-box support for CMaps in e.g. the Node.js examples.	2020-07-02 04:44:23 +02:00
Brendan Dahl	fe3df495cc	Merge pull request #12040 from wojtekmaj/replace-non-inclusive Replace non-inclusive "whitelist" term with "allowlist"	2020-07-01 15:41:41 -07:00
Tim van der Meij	ec635b822a	Merge pull request #12044 from Snuffleupagus/issue-12010 Adjust the heuristics used when dealing with rectangles, i.e. `re` operators, with zero width/height (issue 12010)	2020-07-02 00:16:28 +02:00
Jonas Jenwald	fef24658e7	Adjust the heuristics used when dealing with rectangles, i.e. `re` operators, with zero width/height (issue 12010)	2020-07-02 00:02:49 +02:00
Tim van der Meij	75fed02630	Merge pull request #12043 from Snuffleupagus/issue-4260-test Add a reduced test-case for issue 4260 (PR 4521 follow-up)	2020-07-01 23:51:21 +02:00
Tim van der Meij	3359739ffe	Merge pull request #12036 from Snuffleupagus/test-renderTaskOnContinue Add at least some test-coverage for the `RenderTask.onContinue` functionality	2020-07-01 23:48:37 +02:00
Tim van der Meij	4097c83602	Merge pull request #12049 from Snuffleupagus/eslint_no-promise-executor-return Enable the `no-promise-executor-return` ESLint rule	2020-07-01 23:44:28 +02:00
Jonas Jenwald	4bc5bc454e	Enable the `no-promise-executor-return` ESLint rule Please see https://eslint.org/docs/rules/no-promise-executor-return for additional information.	2020-07-01 13:01:39 +02:00
Jonas Jenwald	e451cabe37	Add a reduced test-case for issue 4260 (PR 4521 follow-up)	2020-06-30 09:26:41 +02:00
Brendan Dahl	bce306f168	Merge pull request #12041 from sylvestre/patch-1 Replace Mozilla Labs by just Mozilla	2020-06-29 13:36:06 -07:00
Sylvestre Ledru	529062a61b	Replace Mozilla Labs by just Mozilla Mozilla labs isn't a thing	2020-06-29 17:48:35 +02:00
Wojciech Maj	78970bbbe1	Replace non-inclusive "whitelist" term with "allowlist"	2020-06-29 17:15:14 +02:00
Jonas Jenwald	4a5b68e077	Add at least some test-coverage for the `RenderTask.onContinue` functionality The default viewer, and thus Firefox, depends on the `RenderTask.onContinue` functionality to pause/continue rendering (such that the most visible page always renders first). Despite this functionality thus being very important, it has however never actually been tested at all as far as I can tell. Hence this patch which adds a new boolean `renderTaskOnContinue` parameter (`false` by default), that can be used to force a reference-test to use the `RenderTask.onContinue` code-path in the `InternalRenderTask` class. Note that I purposely made this new reference-test behaviour optional, since I didn't want to negatively affect the general runtime of the tests (given that there's a slight delay added to the rendering). Also, for e.g. benchmarking you'd most likely want to stay away from the `RenderTask.onContinue` functionality for similar reasons.	2020-06-29 00:38:34 +02:00
Tim van der Meij	9993397e30	Merge pull request #12032 from Snuffleupagus/update-packages Update packages and translations	2020-06-28 15:24:53 +02:00

1 2 3 4 5 ...

12648 Commits