pdf.js

Author	SHA1	Message	Date
calixteman	faf6b10939	Merge pull request #13394 from calixteman/xml_parser Handle PI with no value in xml parser	2021-05-18 11:14:48 +02:00
Calixte Denizet	4544ebf38a	Handle PI with no value in xml parser - an XML PI contains a target and optionally some content (see https://en.wikipedia.org/wiki/Processing_Instruction) - the parser expected to always have some content and so it could lead to wrong parsing.	2021-05-18 10:22:18 +02:00
Brendan Dahl	239d0097fa	Merge pull request #13390 from calixteman/opentype_and_xfa XFA - Don't move glyphes in private area with non-truetype fonts	2021-05-17 12:39:10 -07:00
Brendan Dahl	46c2eeb19a	Merge pull request #13389 from calixteman/width_in_cff Get any width (if one is present) in CFF parser	2021-05-17 09:13:45 -07:00
Brendan Dahl	17e9cfcd2a	Merge pull request #13328 from calixteman/js_display1 JS - Add support for display property	2021-05-17 08:47:13 -07:00
Calixte Denizet	a74d19262a	XFA - Don't move glyphes in private area with non-truetype fonts - it has been done in PR #13146 but only for truetype fonts.	2021-05-17 16:52:39 +02:00
Calixte Denizet	d394188835	Get any width (if one is present) in CFF parser - in charstring specs at page 21 (section 4.2): "Also, it may appear in the charstring as the difference from nominalWidthX" so the number we've on the stack doesn't have to be positive. - currently this bug has probably no visible effect - but when the font is loaded to be used with XFA, then the rendering is incorrect.	2021-05-17 14:17:08 +02:00
Jonas Jenwald	718f7bf7e1	Fix a few safe ESLint `no-var` failures in `src/core/evaluator.js` (13371 follow-up) As can be seen in PR 13371, some of the `no-var` changes in the `PartialEvaluator.{getOperatorList, getTextContent}` methods caused errors in `gulp server`-mode. However, there's a handful of instances of `var` in other methods which should be completely safe to convert since there's no strange scope-issues present in that code.	2021-05-16 15:22:43 +02:00
Tim van der Meij	a5c74f53c1	Merge pull request #13386 from timvandermeij/src-core-bidi-no-var Enable the `no-var` linting rule in `src/core/bidi.js`	2021-05-16 15:02:18 +02:00
Tim van der Meij	b8a5e797c5	Enable the `no-var` linting rule in `src/core/bidi.js` This is done automatically with `gulp lint --fix` and the following manual changes: ```diff diff --git a/src/core/bidi.js b/src/core/bidi.js index e9e0a7217..32691c0c6 100644 --- a/src/core/bidi.js +++ b/src/core/bidi.js @@ -82,7 +82,8 @@ function isEven(i) { } function findUnequal(arr, start, value) { - for (var j = start, jj = arr.length; j < jj; ++j) { + let j, jj; + for (j = start, jj = arr.length; j < jj; ++j) { if (arr[j] !== value) { return j; } @@ -251,15 +252,14 @@ function bidi(str, startLevel, vertical) { for (i = 0; i < strLength; ++i) { if (types[i] === "EN") { // do before - var j; - for (j = i - 1; j >= 0; --j) { + for (let j = i - 1; j >= 0; --j) { if (types[j] !== "ET") { break; } types[j] = "EN"; } // do after - for (j = i + 1; j < strLength; ++j) { + for (let j = i + 1; j < strLength; ++j) { if (types[j] !== "ET") { break; } ```	2021-05-16 14:14:26 +02:00
Jonas Jenwald	3cfa316d40	Convert `src/core/operator_list.js` to use standard classes With modern JavaScript modules, where only explicitly exported properties are visible to the outside, the `QueueOptimizerClosure` should no longer be necessary. Furthermore, to reduce the possibility of `NullOptimizer` and `QueueOptimizer` getting out of sync (note e.g. the inconsistency fixed in PR 10784), we now let the latter extend the former one.	2021-05-16 13:39:54 +02:00
Jonas Jenwald	8943bcd3c3	Account for formatting changes in Prettier version `2.3.0` With the exception of one tweaked `eslint-disable` comment, in `web/generic_scripting.js`, this patch was generated automatically using `gulp lint --fix`. Please find additional information at: - https://github.com/prettier/prettier/releases/tag/2.3.0 - https://prettier.io/blog/2021/05/09/2.3.0.html	2021-05-16 11:44:05 +02:00
Tim van der Meij	d2e7161f2c	Merge pull request #13377 from Snuffleupagus/pattern-class Re-factor and convert the code in `src/core/pattern.js` to use standard classes	2021-05-14 22:23:44 +02:00
Jonas Jenwald	ebe3ee4f25	Modernize the `Shadings` structure, in `src/core/pattern.js`, to use standard classes This patch replaces the old structure with a abstract base-class, which the new RadialAxial/Mesh-shading classes then inherit from.[1] The old `MeshClosure` can now be removed, since it's not necessary any more, and most of the functions inside of it are now instead methods on the new `MeshShading` class. This is particularly nice, in my opinion, since we previously were manually passing around a reference to the current `Mesh`-instance. --- [1] If we want/need to, in the future, split e.g. the Mesh-handling into multiple classes that should now be easy to do.	2021-05-14 21:44:41 +02:00
Jonas Jenwald	6acb2db4be	Convert `src/core/pattern.js` to use standard classes Note that this patch only covers `Pattern` and `MeshStreamReader`, since the `Shadings`-implementation required additional re-factoring.	2021-05-14 21:42:21 +02:00
Calixte Denizet	f92e1fa160	Replace terminal null char by a endchar command in CFF charstrings to make OTS happy	2021-05-14 18:34:51 +02:00
Jonas Jenwald	612b43852b	Remove unused properties from the `Shadings`-implementations in `src/core/pattern.js` Neither the `type` or the `cs` properties are used outside of the "constructors", and we can thus remove them.[1] Note that a lot of this code is very old, and that it actually predates the main/worker-thread split before which the same file was used on both the main- and worker-threads. --- [1] On the main-thread, a similar `type` property was removed in PR 12591.	2021-05-14 16:11:48 +02:00
Calixte Denizet	1a2cea21a5	Replace command with not enough args by an endchar in CFF font - Right now, a glyph with an erroneous outline is replaced by an empty glyph if the error is far enough from the start there's likely something to render so the idea is to replace a command with args by an endchar when no args are on the stack: this way OTS is likely happy (no remaining args on stack) and we can draw something which is likely better than nothing.	2021-05-14 13:45:45 +02:00
Jonas Jenwald	4248f0745c	Improve the `Page.content` and `Page.getContentStream` methods First of all, by using `Dict.getArray` in the `Page.content` getter we remove the need to manually iterate through and fetch the sub-streams (when they exist) in the `Page.getContentStream` method. Secondly, we can simplify the code in `Page.{getOperatorList, extractTextContent}` by letting `Page.getContentStream` ensure that `content` is available and returning a Promise instead.	2021-05-14 11:47:34 +02:00
Jonas Jenwald	70113131de	Inline the data lookup in the `Dict.getArray` method Similar to the `get`/`getAsync` methods, this should be a tiny bit more efficient which cannot hurt considering that `getArray` is now used a lot more than when initially added.	2021-05-14 11:24:27 +02:00
Jonas Jenwald	75208d36c2	Revert "Fix the remaining `no-var` failures, which couldn't be handled automatically, in the `src/core/evaluator.js` file" (PR 13344 follow-up) This reverts commit 0ef9b5aafc88094f19fec793c174c622e7e15542, since it cases a lot of warnings (see below) locally with e.g. the document from issue 9627. Strangely enough, this only occurs with `gulp server`-mode and the actual builds are apparently fine. It seems that this may be some unfortunate interaction with the old Babel-plugin that's used together with SystemJS. ``` Warning: getTextContent - ignoring ExtGState: "FormatError: ExtGState should be a dictionary.". ``` Rather than taking the risk that this could actually cover a more serious bug, and since I cannot immediately figure out what's wrong, it thus seem safest to revert this for now and we can (carefully) revisit this once SystemJS has been removed (see PR 12563).	2021-05-13 11:19:46 +02:00
Tim van der Meij	ba99e54c66	Merge pull request #13361 from brendandahl/patterns-fixes Fix several issues with radial/axial shadings and tiling patterns.	2021-05-12 20:27:37 +02:00
Jonas Jenwald	757636d519	Convert the remaining functions in `src/core/primitives.js` to use standard classes This patch was tested using the PDF file from issue 2618, i.e. https://bug570667.bugzilla-attachments.gnome.org/attachment.cgi?id=226471, with the following manifest file: ``` [ { "id": "issue2618", "file": "../web/pdfs/issue2618.pdf", "md5": "", "rounds": 50, "type": "eq" } ] ``` which gave the following results when comparing this patch against the `master` branch: ``` -- Grouped By browser, stat -- browser \| stat \| Count \| Baseline(ms) \| Current(ms) \| +/- \| % \| Result(P<.05) ------- \| ------------ \| ----- \| ------------ \| ----------- \| --- \| ---- \| ------------- firefox \| Overall \| 50 \| 3417 \| 3426 \| 9 \| 0.27 \| firefox \| Page Request \| 50 \| 1 \| 1 \| 0 \| 5.41 \| firefox \| Rendering \| 50 \| 3416 \| 3426 \| 9 \| 0.27 \| ``` Based on these results, there's no significant performance regression from using standard classes and this patch should thus be OK.	2021-05-12 09:36:28 +02:00
Brendan Dahl	ac44afa70e	Fix several issues with radial/axial shadings and tiling patterns. Previously, we set the base transformation and pattern matrix directly to the main rendering ctx of the page, however doing this caused the current transform to be lost. This would cause issues with things like shear missing so the pattern was misaligned or when stroke was used the scale of the line width or dash would be wrong. Instead we should leave the current transform and use setTransfrom on the pattern so it is applied correctly. For axial and radial shadings I had to create a temporary canvas to draw the shading so I could in turn use setTransform. Fixes: #13325, #6769, #7847, #11018, #11597, #11473 The following already in the corpus are improved: issue8078-page1 issue1877-page1	2021-05-11 16:32:24 -07:00
Jonas Jenwald	6eef69de22	Export the "raw" `toUnicode`-data from `PartialEvaluator.preEvaluateFont` Compared to other data-structures, such as e.g. `Dict`s, we're purposely not caching Streams on the `XRef`-instance.[1] The, somewhat unfortunate, effect of Streams not being cached is that repeatedly getting the same Stream-data requires re-parsing/re-initializing of a bunch of data; see `XRef.fetch` and related methods. For the font-parsing in particular we're currently fetching the `toUnicode`-data, which is very often a Stream, in `PartialEvaluator.preEvaluateFont` and then again in `PartialEvaluator.extractDataStructures` soon afterwards. By instead letting `PartialEvaluator.preEvaluateFont` export the "raw" `toUnicode`-data, we can avoid some unnecessary re-parsing/re-initializing when handling fonts. Please note: In this particular case, given that `PartialEvaluator.preEvaluateFont` only accesses the "raw" `toUnicode` data, exporting a Stream should be safe. --- [1] The reasons for this include: - Streams, especially `DecodeStream`-instances, can become very large once read. Hence caching them really isn't a good idea simply because of the (potential) memory impact of doing so. - Attempting to read from the same Stream-instance more than once won't work, unless it's `reset` in between, since using any method such as e.g. `getBytes` always starts at the current data position. - Given that parsing, even in the worker-thread, is now fairly asynchronous it's generally impossible to assert that any one Stream-instance isn't being accessed "concurrently" by e.g. different `getOperatorList` calls. Hence `reset`-ing a cached Stream-instance isn't going to work in the general case.	2021-05-08 12:04:13 +02:00
Jonas Jenwald	13fb1654dc	Export the `firstChar`/`lastChar`-data from `PartialEvaluator.preEvaluateFont` Rather than re-fetching/re-parsing these properties immediately in `PartialEvaluator.translateFont`, we can simply export them instead. (Obviously the effect will be really tiny, but there is less parsing overall this way.)	2021-05-08 12:02:49 +02:00
Jonas Jenwald	8a1cb82aee	Ensure that the `Widths` array is parsed correctly in `PartialEvaluator.preEvaluateFont` Please note: While I don't have a document that this patches fixes, the current code is however not entirely correct as far as I can tell. Looking at how the `Widths` array is parsed in `PartialEvaluator.extractWidths`, it's clear that the implementation in `PartialEvaluator.preEvaluateFont` is a bit too simplistic. In particular, by only wrapping the data into a TypedArray, there's no attempt to handle indirect objects which could potentially lead to colliding `hash`es being computed.	2021-05-07 21:23:44 +02:00
Jonas Jenwald	30b2739adf	Ensure that composite/non-composite fonts won't get the same `hash` in `PartialEvaluator.preEvaluateFont` To hopefully help prevent any future bugs, make sure that composite/non-composite fonts cannot accidentally get matching `hash`es. Given the differences between those font types, that's very unlikely to be useful or even correct in general.	2021-05-07 21:22:37 +02:00
Jonas Jenwald	fc59a5f709	Take the `W` array into account when computing the hash, in `PartialEvaluator.preEvaluateFont`, for composite fonts (issue 13343) Without this some composite fonts may incorrectly end up with matching `hash`es, thus breaking rendering since we'll not actually try to load/parse some of the fonts. Please note: Given that the document, in the referenced issue, doesn't embed any of its fonts there's no guarantee that it renders correctly in all configurations even with this patch.	2021-05-07 21:22:36 +02:00
Tim van der Meij	a3632c0f38	Merge pull request #13344 from Snuffleupagus/evaluator-no-var Enable the `no-var` rule in the `src/core/evaluator.js` file	2021-05-07 21:02:46 +02:00
Tim van der Meij	5248d0a77d	Merge pull request #13338 from Snuffleupagus/images-class Convert the `src/core/{jbig2, jpg, jpx}.js` files to use standard classes	2021-05-07 20:59:58 +02:00
Calixte Denizet	af125cd299	JS - Add support for display property - in annotation_layer, move common properties treatment in a common method instead having duplicated code in each widget.	2021-05-06 11:15:38 +02:00
Jonas Jenwald	0ef9b5aafc	Fix the remaining `no-var` failures, which couldn't be handled automatically, in the `src/core/evaluator.js` file The only slight complication here were some of the `switch`-cases, in `getOperatorList`/`getTextContent`, where the parsing is done asynchronously. However, those cases are easy to deal with by wrapping the code within its own block; please see https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Statements/switch#block-scope_variables_within_switch_statements	2021-05-06 10:21:05 +02:00
Jonas Jenwald	f93c3b9aa7	Enable the `no-var` rule in the `src/core/evaluator.js` file These changes were made automatically, using `gulp lint --fix`.	2021-05-06 09:39:21 +02:00
Jonas Jenwald	0a32ad3e42	Remove unnecessary closure in the `src/core/font_renderer.js` file With modern JavaScript modules, where you explicitly list the properties that should be exported, it's no longer necessary to wrap all of the code within one file into a top-level closure.[1] This patch reduces the size, of even the built `pdf.worker.js` file, since there's now a lot less unnecessary whitespace. --- [1] For files which contain different functionality, some closures may however still make sense in order to separate the code. It might be possible to remove some of those cases later, e.g. once private class fields becomes generally available/usable in browsers.	2021-05-05 22:35:52 +02:00
Tim van der Meij	afb8c4fd25	Merge pull request #13327 from Snuffleupagus/split-fonts Split the functionality in `src/core/fonts.js` into multiple files, and use standard classes	2021-05-05 20:16:24 +02:00
Jonas Jenwald	ce14171cf0	Convert `src/core/jpx.js` to use standard classes Please note: Ignoring whitespace-only changes is probably necessary in order to review this.	2021-05-05 14:02:21 +02:00
Jonas Jenwald	cb65b762eb	Fix the remaining `no-var` failures, which couldn't be handled automatically, in the `src/core/jpx.js` file	2021-05-05 14:02:21 +02:00
Jonas Jenwald	a273599a12	Enable the `no-var` rule in the `src/core/jpx.js` file These changes were made automatically, using `gulp lint --fix`.	2021-05-05 14:02:21 +02:00
Jonas Jenwald	69dea39a42	Convert `src/core/jpg.js` to use standard classes Please note: Ignoring whitespace-only changes is probably necessary in order to review this.	2021-05-05 14:02:21 +02:00
Jonas Jenwald	d0a299713c	Fix the remaining `no-var` failures, which couldn't be handled automatically, in the `src/core/jpg.js` file	2021-05-05 14:02:21 +02:00
Jonas Jenwald	1e5a179600	Enable the `no-var` rule in the `src/core/jpg.js` file These changes were made automatically, using `gulp lint --fix`.	2021-05-05 14:02:21 +02:00
Jonas Jenwald	0addf3a0d4	Convert `src/core/jbig2.js` to use standard classes Please note: Ignoring whitespace-only changes is probably necessary in order to review this.	2021-05-05 14:02:21 +02:00
Jonas Jenwald	d59c9ab3ab	Fix the remaining `no-var` failures, which couldn't be handled automatically, in the `src/core/jbig2.js` file	2021-05-05 14:02:21 +02:00
Jonas Jenwald	7ca3a34e1f	Enable the `no-var` rule in the `src/core/jbig2.js` file These changes were made automatically, using `gulp lint --fix`.	2021-05-05 14:02:21 +02:00
Jonas Jenwald	99fae47c8e	[Regression] Move the `super`-call in the `PredictorStream`-constructor to prevent errors (PR 13303) My apologies for breaking this; thankfully PR 13303 hasn't reach mozilla-central yet. It's (obviously) necessary to initialize a `PredictorStream`-instance fully, since otherwise breakage may occur if there's errors during the actual stream parsing. To reproduce this issue, try opening the PDF document from issue 13051 locally and observe the following message in the console: ``` Warning: Invalid stream: "ReferenceError: this hasn't been initialised - super() hasn't been called" ```	2021-05-05 13:24:12 +02:00
Calixte Denizet	549aae6c3d	JS -- add support for page property in field	2021-05-03 15:46:29 +02:00
Jonas Jenwald	5e5daca407	Remove unnecessary `MissingDataException` check from `getHeaderBlock` It shouldn't be possible for the `getBytes`-call to throw a `MissingDataException`, since all resources are loaded before e.g. font-parsing ever starts; see `f0817015bd/src/core/object_loader.js (L111-L126)` Furthermore, even if we'd somehow re-throw a `MissingDataException` here that still won't help considering where the `Type1Font`-instance is created. Note how in the `Font`-constructor we simply catch any errors and fallback to a standard font, which means that a `MissingDataException` would just lead to rendering errors anyway; see `f0817015bd/src/core/fonts.js (L648-L691)` All-in-all, it's not possible for a `MissingDataException` to be thrown in `getHeaderBlock` and this code-path can thus be removed.	2021-05-03 13:57:30 +02:00
Jonas Jenwald	b487edd05d	Convert `src/core/fonts.js` to use standard classes Obviously the `Font`-class is still very large, given particularly how TrueType fonts are handled, however this patch-series at least improves things by moving a number of functions/classes into their own files. As a follow-up it might make sense to try and re-factor/extract the TrueType parsing into its own file, since all of this code is quite old, however that's probably best left for another time. For e.g. `gulp mozcentral`, the built `pdf.worker.js` files decreases from `1 620 332` to `1 617 466` bytes with this patch-series.	2021-05-03 13:57:25 +02:00
Jonas Jenwald	cadc20d8b9	Fix the remaining `no-var` failures, which couldn't be handled automatically, in the `src/core/fonts.js` file	2021-05-02 21:00:29 +02:00

1 2 3 4 5 ...

2148 Commits