pdf.js

Author	SHA1	Message	Date
Calixte Denizet	be1ee155d1	XFA - Avoid to put something in very small areas - it aims to fix #13855.	2021-08-03 17:05:29 +02:00
Brendan Dahl	4ad5c5d52a	Merge pull request #13808 from brendandahl/pattern-cache-v2 Improve caching of shading patterns. (bug 1721949)	2021-07-28 11:17:16 -07:00
Brendan Dahl	c836e1f0fb	Improve caching of shading patterns. (bug 1721949) The PDF in bug 1721949 uses many unique pattern objects that reference the same shading many times. This caused a new canvas pattern to be created and cached many times, driving up memory use. To fix, I've changed the cache in the worker to key off the shading object and instead send the shading and matrix separately. While that worked well to fix the above bug, there could be PDFs that use many shadings that could cause memory issues, so I've also added an LRU cache on the main thread for canvas patterns. This should prevent memory use from getting too high.	2021-07-28 10:29:20 -07:00
Calixte Denizet	4a4591bd2c	XFA - Fix font scale factors (bug 1720888) - All the scale factors for the substitution font were wrong because of different glyph positions between Liberation and the other ones: - regenerate all the factors - Text may have Polish chars for example and in this case the glyph widths were wrong: - treat substitution font as a composite one - add a map glyphIndex to unicode for Liberation in order to generate width array for cid font	2021-07-28 19:10:42 +02:00
Calixte Denizet	92f4cc52a6	XFA - Add a transparent blue background on all text fields for consistency	2021-07-28 14:47:29 +02:00
Calixte Denizet	76d882b560	XFA - Fix auto-sized fields (bug 1722030) - In order to better compute text field sizes, use line height with no gaps (and consequently guessed heights for text are slightly better in general). - Fix default background color in fields.	2021-07-28 09:43:15 +02:00
Tim van der Meij	336a74a0e5	Merge pull request #13796 from Snuffleupagus/issue-13794 Allow `StreamsSequenceStream.readBlock` to skip sub-streams with errors (issue 13794)	2021-07-27 22:25:58 +02:00
Calixte Denizet	bd6f55186d	XFA - Get the full value when binding and not only the 1st line (bug 1718725)	2021-07-27 20:25:33 +02:00
Calixte Denizet	959120e6c9	XFA - Elements created outside of XML must have all their properties (bug 1722029) - an Image element was created, attached to its parent but the $globalData property was not set and that led to an error. - the pdf in bug 1722029 has 27 rendered rows (checked in Acrobat) when only one was displayed: this patch fixes some binding issues around the occur element.	2021-07-26 19:38:52 +02:00
Jonas Jenwald	885e7a8aa4	Allow `StreamsSequenceStream.readBlock` to skip sub-streams with errors (issue 13794) This patch makes use of the existing `ignoreErrors` option, thus allowing a page to continue parsing/rendering even if (some of) its sub-streams are corrupt. Obviously this may cause part of a page to be broken/missing, however it should be better than (potentially) rendering nothing. Also, to the best of my knowledge, this is the first bug of its kind that we've encountered. To avoid having to pass in a bunch of, for a `BaseStream`-instance, mostly unrelated parameters when initializing a `StreamsSequenceStream`-instance, I settled on utilizing a callback function instead to allow conditional Error-suppression. Note that the `StreamsSequenceStream`-class is a special stream-implementation that we only use when the `/Contents`-entry, in the `/Page`-dictionary, consists of an Array with streams.	2021-07-26 16:42:50 +02:00
Jonas Jenwald	833f27c677	Disable a LGTM warning, again (PR 13787 follow-up) Apparently I didn't put one of the disable comments on the correct line, since I didn't read the instructions carefully enough, so let's try again. Note that, most unfortunately, disabling of warnings isn't applied until after a patch has been merged.	2021-07-25 10:32:40 +02:00
Tim van der Meij	41a2b5c809	Merge pull request #13787 from Snuffleupagus/lgtm-fix-warnings Fix (most) LGTM warnings	2021-07-24 15:20:07 +02:00
Tim van der Meij	7b6767d415	Merge pull request #13784 from Snuffleupagus/issue-13783 When parsing corrupt documents, avoid inserting obviously broken data in the XRef-table (issue 13783)	2021-07-24 14:37:39 +02:00
Tim van der Meij	687cfcecd4	Merge pull request #13786 from Snuffleupagus/rm-more-src-core-closures Remove a couple of small closures in `src/core/` code	2021-07-24 14:26:57 +02:00
Jonas Jenwald	70bac87fed	Fix (most) LGTM warnings Most of the warnings we don't really care about, and those are simply white-listed using inline comments; however two cases prompted actual code changes: - In `src/display/pattern_helper.js` the branch in question is indeed unreachable, and should thus be safe to remove. (This code originated in PR 4192, which is now over seven years ago.) - In `test/test.js`, the function in question indeed doesn't accept any arguments. (The patch also re-formats a string just above, which didn't seem worthy of a separate patch.) This now leaves only one warning in the LGTM report, however that one is a false positive that we'll need to report upstream.	2021-07-24 14:23:59 +02:00
Tim van der Meij	9854b85dc1	Merge pull request #13775 from Snuffleupagus/Dict-merge-refactor Remove some duplication in the `Dict.merge` method	2021-07-24 14:21:41 +02:00
Jonas Jenwald	ebbbc973a5	Remove the closure used with the `PostScriptToken` class This patch uses the same approach as used in lots of other parts of the code-base, which thus slightly reduces the size of this code.	2021-07-24 13:05:46 +02:00
Jonas Jenwald	81009d42cf	Remove the closure used with the `PostScriptStack` class This patch uses the same approach as used in lots of other parts of the code-base, which thus slightly reduces the size of this code.	2021-07-24 12:59:53 +02:00
Jonas Jenwald	b82c802dff	When parsing corrupt documents, avoid inserting obviously broken data in the XRef-table (issue 13783) In cases where even the very first attempt at reading from an object will throw, simply ignoring such objects will help improve rendering of some corrupt documents. Note that this will lead to more parsing in some cases, but considering that this only applies to corrupt documents that shouldn't be a big deal.	2021-07-23 18:10:53 +02:00
Jonas Jenwald	51f0a81085	Merge pull request #13770 from brendandahl/cache-pattern Improve performance of reused patterns.	2021-07-23 10:43:23 +02:00
Brendan Dahl	da1af02ac8	Improve performance of reused patterns. Bug 1721218 has a shading pattern that was used thousands of times. To improve performance of this PDF: - add a cache for patterns in the evaluator and only send the IR form once to the main thread (this also makes caching in canvas easier) - cache the created canvas radial/axial patterns - for shading fill radial/axial use the pattern directly instead of creating temporary canvas	2021-07-22 16:47:40 -07:00
Calixte Denizet	a51c4a3a0f	XFA - A field without an ui must provide a default one (bug 1718245)	2021-07-22 20:31:25 +02:00
Jonas Jenwald	e1ee3835cd	Remove some duplication in the `Dict.merge` method Currently the `!mergeSubDicts` code-path is essentially just duplicated code, which we can easily avoid by simply moving that check. (This may lead to ever so slightly more parsing for this case, but the difference ought to be negligible in practice.)	2021-07-22 14:01:43 +02:00
Jonas Jenwald	2cf90cd9ad	Merge pull request #13766 from Snuffleupagus/issue-13751 XFA - Handle `startIndex` correctly in the `Template.$toHTML` method (issue 13751)	2021-07-21 18:58:29 +02:00
Calixte Denizet	5555114bb3	XFA - Remove namespace from nodes under xfa:data node - in real life some XFA documents contain XML like <xfa:data><xfa:Foo><xfa:Bar>...</xfa:data>; since there is no Foo or Bar in the xfa namespace, the JS representations are empty and that leads to errors. - so the idea is to make all nodes under xfa:data namespace agnostic, which means that namespaces are removed from nodes in the parser, but only for xfa:data descendants.	2021-07-21 17:11:31 +02:00
Jonas Jenwald	7d1c19f8bd	XFA - Handle `startIndex` correctly in the `Template.$toHTML` method (issue 13751) Please note: The PDF document in issue 13751 is dynamically created (in e.g. Adobe Reader), with pages added when certain buttons are clicked, hence this patch simply fixes the breaking error and nothing more. It looks like the current code contains a little bit too much copy-and-paste from the similar `index` branch above, since we cannot set the `startIndex` to a negative value. Note how it's being used to initialize the loop-variable, which is then used to lookup values in an Array and accessing the `-1`th element of an Array obviously makes no sense.	2021-07-21 16:17:13 +02:00
Jonas Jenwald	6c9b6bc599	Merge pull request #13764 from Snuffleupagus/issue-13748 XFA - Add a missing method to `XFAAttribute`, to prevent breaking errors (issue 13748)	2021-07-20 18:55:23 +02:00
Jonas Jenwald	c2fe493abe	XFA - Add a missing method to `XFAAttribute`, to prevent breaking errors (issue 13748) This is yet another case where I've got no idea if the patch is correct, but it does at least fix a breaking error :-) Note how in the [`Binder._bindValue` method](`683ce66a48/src/core/xfa/bind.js (L92-L93)`), we're assuming that if a `data`-value exists then it'll also be possible to actually access it. For the `XFAAttribute`-implementation however, the second method is missing and that's what causes the breaking errors in issue 13748. Please note that another possible way of "fixing" the error would've been to simply change the exists-check to return `false`, and I could see that being a preferred solution. However, the reason for submitting the current patch is that we get fewer warnings about Nodes with mis-matched types this way.	2021-07-20 17:41:05 +02:00
Calixte Denizet	1d07ef597e	XFA - Must use bindItems element even if there is no direct binding (bug 1720907)	2021-07-20 17:07:32 +02:00
Jonas Jenwald	cf7978d507	XFA - Prevent breaking errors in `Binder`, when `searchNode` doesn't return data (issue 13756) As can be seen in the code (see below), the `searchNode` helper function will return `null` in some cases and all of its call-sites should protect against that before attempting to access the returned data. While only one of these changes was necessary to fix the breaking errors in issue 13756, in order to prevent future bugs I've added similar defensive code throughout this file. - `07955fa1d3/src/core/xfa/som.js (L169)` - `07955fa1d3/src/core/xfa/som.js (L239)` - `07955fa1d3/src/core/xfa/som.js (L254)`	2021-07-19 18:07:07 +02:00
Tim van der Meij	07955fa1d3	Merge pull request #13735 from Snuffleupagus/bug-1720411 Ensure that the field value, for checkboxes, refers to an existing appearance state (bug 1720411)	2021-07-18 13:48:34 +02:00
Tim van der Meij	668c58d68d	Merge pull request #13746 from Snuffleupagus/getOperatorList-intent [api-minor] Add `intent` support to the `PDFPageProxy.getOperatorList` method (issue 13704)	2021-07-18 13:28:08 +02:00
Jonas Jenwald	481af097b4	Convert `PDFFunction` to a standard class with `static` methods For e.g. `gulp mozcentral`, the built `pdf.worker.js` file decreases from `1 837 608` to `1 834 907` bytes with this patch-series. The improvement comes first of all from less overall indentation in `PDFFunction`, and secondly from the removal of (now) unnecessary indirection in the code.	2021-07-17 16:46:57 +02:00
Jonas Jenwald	d35fe3e796	Remove the IR (internal representation) part of the `PDFFunction` parsing This follows the exact same principle as PR 12083, but for the `PDFFunction` parsing instead. Given that the IR format is completely unused now, all that the current code does is add a bunch of unnecessary indirection/overhead to the handling of PDF-functions.	2021-07-17 16:44:58 +02:00
Jonas Jenwald	03cf28bf17	[api-minor] Add `intent` support to the `PDFPageProxy.getOperatorList` method (issue 13704) With this patch, the `PDFPageProxy.getOperatorList` method will now return `PDFOperatorList`-instances that also include Annotation-operatorLists (when those exist). Hence this closes a small, but potentially confusing, gap between the `render` and `getOperatorList` methods. Previously we've been somewhat reluctant to do this, as explained below, but the fact that there are actual use-cases where it's required probably means that we'll have to implement it now. Since we still need the ability to separate "normal" rendering operations from direct `getOperatorList` calls in the worker-thread, this API-change unfortunately causes the internal renderingIntent to become a bit "messy" (note the `"oplist-"` strings in various spots). As-is I suppose that it's not all that bad, but we may want to consider changing the internal renderingIntent to e.g. a bitfield in the future. Besides fixing issue 13704, this patch would also be necessary if someone ever tries to implement e.g. issue 10165 (since currently `PDFPageProxy.getOperatorList` doesn't include Annotation-operatorLists). Please note: This patch is also tagged "api-minor" for a second reason, which is that we're now including the Annotation-id in the `beginAnnotation` argument. The reason for this is to allow correlating the Annotation-data returned by `PDFPageProxy.getAnnotations`, with its corresponding operatorList-data (for those Annotations that have it).	2021-07-16 17:16:30 +02:00
Jonas Jenwald	da808aeab3	Ensure that the field value, for checkboxes, refers to an existing appearance state (bug 1720411) Fixes https://bugzilla.mozilla.org/show_bug.cgi?id=1720411	2021-07-16 13:11:48 +02:00
calixteman	4b2e0d0d01	Merge pull request #13732 from calixteman/rect XFA - A rectangle must have the width of its parent but without inner margins	2021-07-15 22:30:25 +02:00
Jonas Jenwald	3838c4e27c	Re-factor the handling of empty `Name`-instances (PR 13612 follow-up) When working on PR 13612, I mostly prioritized a simple solution that didn't require touching a lot of code. However, while working on PR 13735 I started to realize that the static `Name.empty` construction really wasn't a good idea. In particular, having a special `Name`-instance where the `name`-property isn't actually a String is confusing (to put it mildly) and can easily lead to issues elsewhere. The only reason for not simply allowing the `name`-property to be an empty string, in PR 13612, was to avoid having to touch a lot of existing code. However, it turns out that this is only limited to a few methods in the `PartialEvaluator` and a few of the `BaseLocalCache`-implementations, all of which can be easily re-factored to handle empty `Name`-instances. All-in-all, I think that this patch is even an overall improvement since we're now validating (what should always be) `Name`-data better in the `PartialEvaluator`. This is what I ought to have done from the start, sorry about the code churn here!	2021-07-15 12:00:42 +02:00
Calixte Denizet	019699acfb	XFA - Cannot print fields with no names - it was not possible to print pdf file in issue #13500.	2021-07-14 17:38:35 +02:00
Calixte Denizet	5081167e7f	XFA - A rectangle must have the width of its parent but without inner margins - it aims to fix #13584; - to avoid bad rendering because of clipping just set overflow to visible on SVG element.	2021-07-14 16:46:13 +02:00
Calixte Denizet	dd55e76f5d	XFA - Avoid to have containers not pushed in the html - it aims to fix issue #13668.	2021-07-12 21:34:58 +02:00
calixteman	140c2bc563	Revert "XFA - Avoid to have containers not pushed in the html"	2021-07-12 09:46:38 +02:00
calixteman	b6445ddc08	Merge pull request #13716 from calixteman/layout7 XFA - Avoid to have containers not pushed in the html	2021-07-12 09:31:27 +02:00
Calixte Denizet	9bbc194846	XFA - Support assist element	2021-07-11 21:01:18 +02:00
Calixte Denizet	fccc6c2242	XFA - Avoid to have containers not pushed in the html - it aims to fix issue #13668.	2021-07-11 19:14:44 +02:00
Calixte Denizet	690b5d1941	XFA - Use fake MyriadPro as a fallback for missing fonts - aims to fix #13597.	2021-07-11 13:52:13 +02:00
calixteman	d416b23898	Merge pull request #13705 from calixteman/lineheight3 XFA - Fix text positions (bug 1718741)	2021-07-10 14:19:03 +02:00
Jonas Jenwald	700b79a305	XFA - Always compute the transformed BBox values in `checkDimensions` (PR 13691 follow-up) This way we ensure that these BBox values are always defined as expected for every `case`-block, and we also don't need to duplicate the lookup in multiple places. (Also, the patch removes a couple of unnecessary line-breaks in existing comments.) Fixes https://github.com/mozilla/pdf.js/pull/13691#pullrequestreview-702356627, which was flagged by LGTM.	2021-07-10 11:24:05 +02:00
calixteman	a4f60fc417	Merge pull request #13708 from calixteman/xfa_tab XFA - Add support for traversal and traverse element	2021-07-09 21:59:50 +02:00
Calixte Denizet	ccac125623	XFA - Add support for traversal and traverse element - For now, just implement the "next" target in using tabindex attribute of html elements.	2021-07-09 20:50:25 +02:00

2306 Commits
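
Commit c836e1f0fb above mentions adding an LRU cache on the main thread so that cached canvas patterns cannot grow without bound. The following is only a minimal sketch of that general idea, not the pdf.js implementation: the class name `LRUCache`, its methods, the capacity of 2, and the string keys and values are all hypothetical and chosen purely for illustration.

```javascript
// Hypothetical LRU (least recently used) cache. None of these names are
// taken from the pdf.js source; this only illustrates the eviction idea.
class LRUCache {
  constructor(maxSize) {
    if (!Number.isInteger(maxSize) || maxSize < 1) {
      throw new Error("maxSize must be a positive integer");
    }
    this._maxSize = maxSize;
    // A Map iterates in insertion order, so its first key is the oldest.
    this._map = new Map();
  }

  get(key) {
    if (!this._map.has(key)) {
      return undefined;
    }
    // Delete and re-insert so the entry becomes the most recently used.
    const value = this._map.get(key);
    this._map.delete(key);
    this._map.set(key, value);
    return value;
  }

  set(key, value) {
    if (this._map.has(key)) {
      this._map.delete(key);
    } else if (this._map.size >= this._maxSize) {
      // Cache is full: evict the least recently used entry.
      const oldestKey = this._map.keys().next().value;
      this._map.delete(oldestKey);
    }
    this._map.set(key, value);
  }

  has(key) {
    return this._map.has(key);
  }

  get size() {
    return this._map.size;
  }
}

// Usage with placeholder keys/values standing in for shadings and patterns.
const cache = new LRUCache(2);
cache.set("shadingA", "patternA");
cache.set("shadingB", "patternB");
cache.get("shadingA"); // "shadingA" is now the most recently used entry
cache.set("shadingC", "patternC"); // the cache is full, so "shadingB" is evicted
```

Relying on the insertion order of a `Map` keeps the sketch short; a real cache for canvas patterns would likely key on something other than strings and might need to release resources on eviction, which is not modelled here.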