pdf.js

Author	SHA1	Message	Date
Brendan Dahl	56e7bb626c	Merge pull request #13660 from calixteman/no_xfaf XFA - Disable xfa rendering for XFAF pdfs	2021-08-23 12:30:29 -07:00
Calixte Denizet	04573d2dc8	XFA - Disable xfa rendering for XFAF pdfs - we'll implement XFAF support later.	2021-08-23 12:18:20 -07:00
Tim van der Meij	83e1064360	Merge pull request #13920 from Snuffleupagus/issue-13916 Extend the glyph maps for standard respectively Calibri fonts (issue 13916)	2021-08-21 15:05:08 +02:00
Tim van der Meij	db11ba024d	Merge pull request #13899 from Snuffleupagus/includeAnnotationStorage-fix-caching [Regression] Re-factor the internal `includeAnnotationStorage` handling, since it's currently subtly wrong	2021-08-21 15:04:28 +02:00
Jonas Jenwald	ac27f96987	Extend the glyph maps for standard respectively Calibri fonts (issue 13916)	2021-08-21 00:48:38 +02:00
Jonas Jenwald	5f25fea0fe	Re-factor the `LocalTilingPatternCache` to cache by Ref rather than Name (PR 12458 follow-up, issue 13780) This way there cannot be any incorrect cache hits, since Refs are guaranteed to be unique. Please note that the reason for caching by Ref rather than doing something along the lines of the `localShadingPatternCache` (which uses a `Map` directly), is that TilingPatterns are streams and those cannot be cached on the `XRef`-instance (this way we avoid unnecessary parsing).	2021-08-18 12:49:01 +02:00
Jonas Jenwald	8ee5acd85d	Tweak handling of the `onlyRefs`-option in the `BaseLocalCache` class	2021-08-18 12:24:51 +02:00
Jonas Jenwald	a7f0301f21	[Regression] Re-factor the internal `includeAnnotationStorage` handling, since it's currently subtly wrong This patch is very similar to the recently fixed `renderInteractiveForms`-options, see PR 13867. As far as I can tell, this subtle bug has existed ever since `AnnotationStorage`-support was first added in PR 12106 (a little over a year ago). The value of the `includeAnnotationStorage`-option, as passed to the `PDFPageProxy.render` method, will (potentially) affect the size/content of the operatorList that's returned from the worker (for documents with forms). Given that operatorLists will generally, unless they contain huge images, be cached in the API, repeated `PDFPageProxy.render` calls where the form-data has been changed by the user in between, can thus wrongly return a cached operatorList. In the viewer we're only using the `includeAnnotationStorage`-option when printing, which is probably why this has gone unnoticed for so long. Note that we, for performance reasons, don't cache printing-operatorLists in the API. However, there's nothing stopping an API-user from using the `includeAnnotationStorage`-option during "normal" rendering, which could thus result in subtle (and difficult to understand) rendering bugs. In order to handle this, we need to know if the `AnnotationStorage`-instance has been updated since the last `PDFPageProxy.render` call. The most "correct" solution would obviously be to create a hash of the `AnnotationStorage` contents, however that would require adding a bunch of code, complexity, and runtime overhead. Given that operatorList caching in the API doesn't have to be perfect[1], but only have to avoid false cache-hits, we can simplify things significantly be only keeping track of the last time that the `AnnotationStorage`-data was modified. Please note: While working on this patch, I also noticed that the `renderInteractiveForms`- and `includeAnnotationStorage`-options in the `PDFPageProxy.render` method are mutually exclusive.[2] Given that the various Annotation-related options in `PDFPageProxy.render` have been added at different times, this has unfortunately led to the current "messy" situation.[3] --- [1] Note how we're already not caching operatorLists for pages with huge images, in order to save memory, hence there's no guarantee that operatorLists will always be cached. [2] Setting both to `true` will result in undefined behaviour, since trying to insert `AnnotationStorage`-values into fields that are being excluded from the operatorList-building will obviously not work, which isn't at all clear from the documentation. [3] My intention is to try and fix this in a follow-up PR, and I've got a WIP patch locally, however it will result in a number of API-observable changes.	2021-08-18 10:09:03 +02:00
Jonas Jenwald	3369f9a783	Move some validation, in `Dict.merge`, used during merging of sub-dictionaries (PR 13775 follow-up) By not adding any additional non-`Dict` entries to the list of candidates for merging of sub-dictionaries, we can very slightly reduce the amount of parsing required by not having to again iterate through unmergeable data.	2021-08-12 11:32:11 +02:00
Jonas Jenwald	6167566f1b	Re-factor the `BaseException.name` handling, and clean-up some code Once we're finally able to get rid of SystemJS, which is unfortunately still blocked on [bug 1247687](https://bugzilla.mozilla.org/show_bug.cgi?id=1247687), we might also want to clean-up (or even completely remove) the `BaseException` abstraction and simply extend `Error` directly instead. At that point we'd need to (explicitly) set the `name` on each class anyway, so this patch is essentially preparing for future clean-up. Furthermore, after the `BaseException` abstraction was added there's been multiple issues filed about third-party minification breaking our code since `this.constructor.name` is not guaranteed to always do what you intended. While hard-coding the strings indeed feels quite unfortunate, it's likely the "best" solution to avoid the problem described above.	2021-08-10 11:27:47 +02:00
Tim van der Meij	952f6366bf	Merge pull request #13867 from Snuffleupagus/RenderingIntentFlag [api-minor] Re-factor the internal renderingIntent, and change the default `intent` value in the `PDFPageProxy.getAnnotations` method	2021-08-07 19:25:51 +02:00
Brendan Dahl	3d18c76a53	Merge pull request #13881 from calixteman/bug_1723734 XFA - Elements under an area must be bound (bug 1723734)	2021-08-06 11:56:58 -07:00
Calixte Denizet	328383ea7a	XFA - Elements under an area must be bound (bug 1723734) - aims to fix https://bugzilla.mozilla.org/show_bug.cgi?id=1723734.	2021-08-06 20:20:19 +02:00
Jonas Jenwald	107efdb178	[Regression] Re-factor the internal `renderInteractiveForms` handling, since it's currently subtly wrong The value of the `renderInteractiveForms` parameter, as passed to the `PDFPageProxy.render` method, will (potentially) affect the size/content of the operatorList that's returned from the worker (for documents with forms). Given that operatorLists will generally, unless they contain huge images, be cached in the API, repeated `PDFPageProxy.render` calls that only change the `renderInteractiveForms` parameter can thus return an incorrect operatorList. As far as I can tell, this subtle bug has existed ever since `renderInteractiveForms`-support was first added in PR 7633 (which is almost five years ago). With the previous patch, fixing this is now really simple by "encoding" the `renderInteractiveForms` parameter in the internal renderingIntent handling.	2021-08-06 00:40:43 +02:00
Jonas Jenwald	47f94235ab	[api-minor] Re-factor the internal renderingIntent, and change the default `intent` value in the `PDFPageProxy.getAnnotations` method With the changes made in PR 13746 the internal renderingIntent handling became somewhat "messy", since we're now having to do string-matching in various spots in order to handle the "oplist"-intent correctly. Hence this patch, which implements the idea from PR 13746 to convert the `intent`-strings, used in various API-methods, into an internal renderingIntent that's implemented using a bit-field instead. Please note: This part of the patch, in itself, does not change the public API (but see below). This patch is tagged `api-minor` for the following reasons: 1. It changes the default value for the `intent` parameter, in the `PDFPageProxy.getAnnotations` method, to "display" in order to be consistent across the API. 2. In order to get all annotations, with the `PDFPageProxy.getAnnotations` method, you now need to explicitly set "any" as the `intent` parameter. 3. The `PDFPageProxy.getOperatorList` method will now also support the new "any" intent, to allow accessing the operatorList of all annotations (limited to those types that have one). 4. Finally, for consistency across the API, the `PDFPageProxy.render` method also support the new "any" intent (although I'm not sure how useful that'll be). Points 1 and 2 above are the significant, and thus breaking, changes in default behaviour here. However, unfortunately I cannot see a good way to improve the overall API while also keeping `PDFPageProxy.getAnnotations` unchanged.	2021-08-06 00:39:42 +02:00
Brendan Dahl	a38d1122d8	XFA - Support aria heading and table structure. (bug 1723421) (bug 1723425) https://bugzilla.mozilla.org/show_bug.cgi?id=1723421 https://bugzilla.mozilla.org/show_bug.cgi?id=1723425	2021-08-05 15:25:04 -07:00
Calixte Denizet	fef939d347	Annotation & XFA: Add focus outlines on different fields (bug 1723615, bug 1718528) - set a default tabindex to be sure they'll be taken into account in the TAB cycle (https://bugzilla.mozilla.org/show_bug.cgi?id=1723615). - show default outline when fields are focused (it was an a11y bug: https://bugzilla.mozilla.org/show_bug.cgi?id=1718528).	2021-08-05 13:33:46 +02:00
Calixte Denizet	71a100a4d0	Annotation & XFA: Scale the font size in choicelist using zoom factor (bug 1715996) - this is an accessibility issue which could be painful for some people with visual disabilities.	2021-08-04 20:36:04 +02:00
calixteman	52ef63f1fe	Merge pull request #13856 from calixteman/xfa_layout_rounding XFA - Avoid to put something in very small areas	2021-08-04 10:09:13 +02:00
Brendan Dahl	3e003245b1	[XFA] Add alt text for images. (bug 1723418) Not many XFA PDFs have alt text. Some examples: bug1723422.pdf xfa_bug1718670_1.pdf xfa_issue13611.pdf xfa_issue13633.pdf xfa_issue13634.pdf	2021-08-03 17:18:58 -07:00
Brendan Dahl	6cf1ee3251	Merge pull request #13858 from brendandahl/xfa-aria-label Add aria-labels to XFA form elements. (bug 1723422)	2021-08-03 17:18:08 -07:00
Brendan Dahl	6ea56f35ab	Add aria-labels to XFA form elements. (bug 1723422)	2021-08-03 15:58:33 -07:00
Tim van der Meij	ad90fe90ed	Merge pull request #13848 from Snuffleupagus/rm-lgtm Remove the LGTM configuration and inline disable comments (issue 13829)	2021-08-03 23:13:05 +02:00
Jonas Jenwald	766299016f	Remove the `isEOF` helper function and slightly re-factor `EOF` Given how trivial the `isEOF` function is, we can simply inline the check at the various call-sites and remove the function (which ought to be ever so slightly more efficient as well). Furthermore, this patch also changes the `EOF` primitive itself to a `Symbol` instead of an Object since that has the nice benefit of making it unclonable (thus preventing accidentally trying to send `EOF` from the worker-thread).	2021-08-03 20:19:32 +02:00
Calixte Denizet	be1ee155d1	XFA - Avoid to put something in very small areas - it aims to fix #13855.	2021-08-03 17:05:29 +02:00
Jonas Jenwald	8fef8630fe	Remove the LGTM configuration and inline disable comments (issue 13829) Given that the GitHub Advanced Security workflow now covers everything that LGTM does, but generally faster and with better GitHub-integration, there's no longer much point in also running LGTM separately. As a follow-up to this patch, we should also disable/remove the LGTM-integration from the PDF.js repository.	2021-08-03 11:14:49 +02:00
Jonas Jenwald	705d1cfad3	Remove useless assignment of `availableSpace` in the `src/core/xfa/template.js` file (issue 13829, 13835)	2021-08-03 10:58:57 +02:00
Tim van der Meij	10a1db6980	Merge pull request #13824 from Snuffleupagus/issue-13823 When no "V" entry exists, let the fieldValue fallback to the "DV" entry (issue 13823)	2021-07-30 22:48:38 +02:00
Jonas Jenwald	ff71be793d	When no "V" entry exists, let the fieldValue fallback to the "DV" entry (issue 13823)	2021-07-30 16:17:42 +02:00
Calixte Denizet	7bb5331087	XFA - Avoid an error when an exdata is a string (bug 1723114)	2021-07-30 14:43:53 +02:00
Brendan Dahl	4ad5c5d52a	Merge pull request #13808 from brendandahl/pattern-cache-v2 Improve caching of shading patterns. (bug 1721949)	2021-07-28 11:17:16 -07:00
Brendan Dahl	c836e1f0fb	Improve caching of shading patterns. (bug 1721949) The PDF in bug 1721949 uses many unique pattern objects that references the same shading many times. This caused a new canvas pattern to be created and cached many times driving up memory use. To fix, I've changed the cache in the worker to key off the shading object and instead send the shading and matrix separately. While that worked well to fix the above bug, there could be PDFs that use many shading that could cause memory issues, so I've also added a LRU cache on the main thread for canvas patterns. This should prevent memory use from getting too high.	2021-07-28 10:29:20 -07:00
Calixte Denizet	4a4591bd2c	XFA - Fix font scale factors (bug 1720888) - All the scale factors in for the substitution font were wrong because of different glyph positions between Liberation and the other ones: - regenerate all the factors - Text may have polish chars for example and in this case the glyph widths were wrong: - treat substitution font as a composite one - add a map glyphIndex to unicode for Liberation in order to generate width array for cid font	2021-07-28 19:10:42 +02:00
Calixte Denizet	92f4cc52a6	XFA - Add a transparent blue background on all text fields for consistency	2021-07-28 14:47:29 +02:00
Calixte Denizet	76d882b560	XFA - Fix auto-sized fields (bug 1722030) - In order to better compute text fields size, use line height with no gaps (and consequently guessed height for text are slightly better in general). - Fix default background color in fields.	2021-07-28 09:43:15 +02:00
Tim van der Meij	336a74a0e5	Merge pull request #13796 from Snuffleupagus/issue-13794 Allow `StreamsSequenceStream.readBlock` to skip sub-streams with errors (issue 13794)	2021-07-27 22:25:58 +02:00
Calixte Denizet	bd6f55186d	XFA - Get the full value when binding and not only the 1st line (bug 1718725)	2021-07-27 20:25:33 +02:00
Calixte Denizet	959120e6c9	XFA - Elements created outside of XML must have all their properties (bug 1722029) - an Image element was created, attached to its parent but the $globalData property was not set and that led to an error. - the pdf in bug 1722029 has 27 rendered rows (checked in Acrobat) when only one was displayed: this patch some binding issues around the occur element.	2021-07-26 19:38:52 +02:00
Jonas Jenwald	885e7a8aa4	Allow `StreamsSequenceStream.readBlock` to skip sub-streams with errors (issue 13794) This patch makes use of the existing `ignoreErrors` option, thus allowing a page to continue parsing/rendering even if (some of) its sub-streams are corrupt. Obviously this may cause part of a page to be broken/missing, however it should be better than (potentially) rendering nothing. Also, to the best of my knowledge, this is the first bug of its kind that we've encountered. To avoid having to pass in a bunch of, for a `BaseStream`-instance, mostly unrelated parameters when initializing a `StreamsSequenceStream`-instance, I settled on utilizing a callback function instead to allow conditional Error-suppression. Note that the `StreamsSequenceStream`-class is a special stream-implementation that we only use when the `/Contents`-entry, in the `/Page`-dictionary, consists of an Array with streams.	2021-07-26 16:42:50 +02:00
Jonas Jenwald	833f27c677	Disable a LGTM warning, again (PR 13787 follow-up) Apparently I didn't put one of the disable comments on the correct line, since I didn't read the instructions carefully enough, so let's try again. Note that, most unfortunately, disabling of warnings isn't applied until after a patch has been merged.	2021-07-25 10:32:40 +02:00
Tim van der Meij	41a2b5c809	Merge pull request #13787 from Snuffleupagus/lgtm-fix-warnings Fix (most) LGTM warnings	2021-07-24 15:20:07 +02:00
Tim van der Meij	7b6767d415	Merge pull request #13784 from Snuffleupagus/issue-13783 When parsing corrupt documents, avoid inserting obviously broken data in the XRef-table (issue 13783)	2021-07-24 14:37:39 +02:00
Tim van der Meij	687cfcecd4	Merge pull request #13786 from Snuffleupagus/rm-more-src-core-closures Remove a couple of small closures in `src/core/` code	2021-07-24 14:26:57 +02:00
Jonas Jenwald	70bac87fed	Fix (most) LGTM warnings Most of the warnings we don't really care about, and those are simply white-listed using inline comments; however two cases prompted actual code changes: - In `src/display/pattern_helper.js` the branch in question is indeed unreachable, and should thus be safe to remove. (This code originated in PR 4192, which is now over seven years ago.) - In `test/test.js`, the function in question indeed doesn't accept any arguments. (The patch also re-formats a string just above, which didn't seem worthy of a separated patch.) This now leaves only one warning in the LGTM report, however that one is a false positive that we'll need to report upstream.	2021-07-24 14:23:59 +02:00
Tim van der Meij	9854b85dc1	Merge pull request #13775 from Snuffleupagus/Dict-merge-refactor Remove some duplication in the `Dict.merge` method	2021-07-24 14:21:41 +02:00
Jonas Jenwald	ebbbc973a5	Remove the closure used with the `PostScriptToken` class This patch uses the same approach as used in lots of other parts of the code-base, which thus slightly reduces the size of this code.	2021-07-24 13:05:46 +02:00
Jonas Jenwald	81009d42cf	Remove the closure used with the `PostScriptStack` class This patch uses the same approach as used in lots of other parts of the code-base, which thus slightly reduces the size of this code.	2021-07-24 12:59:53 +02:00
Jonas Jenwald	b82c802dff	When parsing corrupt documents, avoid inserting obviously broken data in the XRef-table (issue 13783) In cases where even the very first attempt at reading from an object will throw, simply ignoring such objects will help improve rendering of some corrupt documents. Note that this will lead to more parsing in some cases, but considering that this only applies to corrupt documents that shouldn't be a big deal.	2021-07-23 18:10:53 +02:00
Jonas Jenwald	51f0a81085	Merge pull request #13770 from brendandahl/cache-pattern Improve performance of reused patterns.	2021-07-23 10:43:23 +02:00
Brendan Dahl	da1af02ac8	Improve performance of reused patterns. Bug 1721218 has a shading pattern that was used thousands of times. To improve performance of this PDF: - add a cache for patterns in the evaluator and only send the IR form once to the main thread (this also makes caching in canvas easier) - cache the created canvas radial/axial patterns - for shading fill radial/axial use the pattern directly instead of creating temporary canvas	2021-07-22 16:47:40 -07:00

1 2 3 4 5 ...

2335 Commits