pdf.js

Author	SHA1	Message	Date
Brendan Dahl	3e003245b1	[XFA] Add alt text for images. (bug 1723418) Not many XFA PDFs have alt text. Some examples: bug1723422.pdf xfa_bug1718670_1.pdf xfa_issue13611.pdf xfa_issue13633.pdf xfa_issue13634.pdf	2021-08-03 17:18:58 -07:00
Brendan Dahl	6cf1ee3251	Merge pull request #13858 from brendandahl/xfa-aria-label Add aria-labels to XFA form elements. (bug 1723422)	2021-08-03 17:18:08 -07:00
Brendan Dahl	6ea56f35ab	Add aria-labels to XFA form elements. (bug 1723422)	2021-08-03 15:58:33 -07:00
Tim van der Meij	85be62c684	Merge pull request #13854 from Snuffleupagus/issue-13851 Prevent breaking errors when an optional content group is undefined (issue 13851)	2021-08-03 23:34:34 +02:00
Tim van der Meij	ad90fe90ed	Merge pull request #13848 from Snuffleupagus/rm-lgtm Remove the LGTM configuration and inline disable comments (issue 13829)	2021-08-03 23:13:05 +02:00
Jonas Jenwald	766299016f	Remove the `isEOF` helper function and slightly re-factor `EOF` Given how trivial the `isEOF` function is, we can simply inline the check at the various call-sites and remove the function (which ought to be ever so slightly more efficient as well). Furthermore, this patch also changes the `EOF` primitive itself to a `Symbol` instead of an Object since that has the nice benefit of making it unclonable (thus preventing accidentally trying to send `EOF` from the worker-thread).	2021-08-03 20:19:32 +02:00
Calixte Denizet	be1ee155d1	XFA - Avoid placing content in very small areas - it aims to fix #13855.	2021-08-03 17:05:29 +02:00
Jonas Jenwald	d5e14d3dc3	Prevent breaking errors when an optional content group is undefined (issue 13851) In the referenced PDF document most of the form `/Form` XObjects don't have an `/OC` entry, which thus causes the runtime failure during rendering.	2021-08-03 15:59:29 +02:00
Jonas Jenwald	8fef8630fe	Remove the LGTM configuration and inline disable comments (issue 13829) Given that the GitHub Advanced Security workflow now covers everything that LGTM does, but generally faster and with better GitHub-integration, there's no longer much point in also running LGTM separately. As a follow-up to this patch, we should also disable/remove the LGTM-integration from the PDF.js repository.	2021-08-03 11:14:49 +02:00
Jonas Jenwald	705d1cfad3	Remove useless assignment of `availableSpace` in the `src/core/xfa/template.js` file (issue 13829, 13835)	2021-08-03 10:58:57 +02:00
Tim van der Meij	10a1db6980	Merge pull request #13824 from Snuffleupagus/issue-13823 When no "V" entry exists, let the fieldValue fallback to the "DV" entry (issue 13823)	2021-07-30 22:48:38 +02:00
Tim van der Meij	67f4c34f63	Merge pull request #13822 from Snuffleupagus/ReadableStreams-cancel-no-Uncaught_promise Prevent "Uncaught promise" messages in the console when cancelling (some) `ReadableStream`s	2021-07-30 22:09:29 +02:00
Tim van der Meij	99b14a9da0	Merge pull request #13813 from Snuffleupagus/rm-closure-API Remove a couple of closures in the `src/display/api.js` file	2021-07-30 21:55:45 +02:00
Jonas Jenwald	ff71be793d	When no "V" entry exists, let the fieldValue fallback to the "DV" entry (issue 13823)	2021-07-30 16:17:42 +02:00
Calixte Denizet	7bb5331087	XFA - Avoid an error when an exdata is a string (bug 1723114)	2021-07-30 14:43:53 +02:00
Jonas Jenwald	1df9da949e	Prevent "Uncaught promise" messages in the console when cancelling (some) `ReadableStream`s While fixing issue 13794, I noticed that cancelling the `ReadableStream` returned by the `PDFPageProxy.streamTextContent`-method could lead to "Uncaught promise" messages in the console.[1] Generally speaking, we don't really care about errors when cancelling a `ReadableStream` and it thus seems reasonable to simply suppress any output in those cases. --- [1] Although, after that issue was fixed you'd now need to set the API-option `stopAtErrors = true` to actually trigger this.	2021-07-30 14:27:38 +02:00
Jonas Jenwald	5fac0a4350	Simplify some code related to `fallbackWorkerSrc` and `getMainThreadWorkerMessageHandler`	2021-07-30 11:34:47 +02:00
Jonas Jenwald	4c679d80ac	Remove the closure used with the `InternalRenderTask` class This patch utilizes the same approach as used in lots of other parts of the code-base, which thus slightly reduces the size of this code.	2021-07-30 11:34:47 +02:00
Jonas Jenwald	b18620ac0f	Remove the closure used with the `PDFDocumentLoadingTask` class This patch utilizes the same approach as used in lots of other parts of the code-base, which thus slightly reduces the size of this code. By removing some of the (current) indirection, we can also simplify the JSDocs a little bit. Looking at the `gulp jsdoc` output, this actually seems to improve the documentation for this class.	2021-07-30 11:34:47 +02:00
Brendan Dahl	4ad5c5d52a	Merge pull request #13808 from brendandahl/pattern-cache-v2 Improve caching of shading patterns. (bug 1721949)	2021-07-28 11:17:16 -07:00
Brendan Dahl	c836e1f0fb	Improve caching of shading patterns. (bug 1721949) The PDF in bug 1721949 uses many unique pattern objects that reference the same shading many times. This caused a new canvas pattern to be created and cached many times, driving up memory use. To fix, I've changed the cache in the worker to key off the shading object and instead send the shading and matrix separately. While that worked well to fix the above bug, there could be PDFs that use many shadings that could cause memory issues, so I've also added an LRU cache on the main thread for canvas patterns. This should prevent memory use from getting too high.	2021-07-28 10:29:20 -07:00
Calixte Denizet	4a4591bd2c	XFA - Fix font scale factors (bug 1720888) - All the scale factors for the substitution font were wrong because of different glyph positions between Liberation and the other ones: - regenerate all the factors - Text may have Polish chars for example and in this case the glyph widths were wrong: - treat substitution font as a composite one - add a map glyphIndex to unicode for Liberation in order to generate width array for cid font	2021-07-28 19:10:42 +02:00
Calixte Denizet	92f4cc52a6	XFA - Add a transparent blue background on all text fields for consistency	2021-07-28 14:47:29 +02:00
Calixte Denizet	76d882b560	XFA - Fix auto-sized fields (bug 1722030) - In order to better compute text fields size, use line height with no gaps (and consequently guessed height for text are slightly better in general). - Fix default background color in fields.	2021-07-28 09:43:15 +02:00
Tim van der Meij	336a74a0e5	Merge pull request #13796 from Snuffleupagus/issue-13794 Allow `StreamsSequenceStream.readBlock` to skip sub-streams with errors (issue 13794)	2021-07-27 22:25:58 +02:00
calixteman	45f3804737	Merge pull request #13807 from calixteman/fulltext XFA - Get the full value when binding and not only the 1st line (bug 1718725)	2021-07-27 22:22:37 +02:00
Tim van der Meij	e51cbe63bf	Merge pull request #13801 from Snuffleupagus/AnnotationLayer-check-navigator Access `navigator` safely in the `src/display/annotation_layer.js` file	2021-07-27 22:10:27 +02:00
Calixte Denizet	bd6f55186d	XFA - Get the full value when binding and not only the 1st line (bug 1718725)	2021-07-27 20:25:33 +02:00
Jonas Jenwald	4b3ab1472c	Access `navigator` safely in the `src/display/annotation_layer.js` file For code that's part of the core library, rather than e.g. the `web/`-folder, we should always be careful about directly accessing any DOM methods. The `navigator` is one such structure, which shouldn't be assumed to always be available and we should thus check that it's actually present.[1] Hence this patch re-factors the `navigator.platform` access, in the `AnnotationLayer`-code, to ensure that it's generally safe. Furthermore, to reduce unnecessary repeated string-matching to determine the current platform, we're now using a shadowed getter which is evaluated only once instead (at first access). --- [1] Note e.g. the `isSyncFontLoadingSupported` getter, in the `src/display/font_loader.js` file.	2021-07-27 09:40:42 +02:00
Calixte Denizet	959120e6c9	XFA - Elements created outside of XML must have all their properties (bug 1722029) - an Image element was created, attached to its parent but the $globalData property was not set and that led to an error. - the pdf in bug 1722029 has 27 rendered rows (checked in Acrobat) when only one was displayed: this patch fixes some binding issues around the occur element.	2021-07-26 19:38:52 +02:00
Jonas Jenwald	885e7a8aa4	Allow `StreamsSequenceStream.readBlock` to skip sub-streams with errors (issue 13794) This patch makes use of the existing `ignoreErrors` option, thus allowing a page to continue parsing/rendering even if (some of) its sub-streams are corrupt. Obviously this may cause part of a page to be broken/missing, however it should be better than (potentially) rendering nothing. Also, to the best of my knowledge, this is the first bug of its kind that we've encountered. To avoid having to pass in a bunch of, for a `BaseStream`-instance, mostly unrelated parameters when initializing a `StreamsSequenceStream`-instance, I settled on utilizing a callback function instead to allow conditional Error-suppression. Note that the `StreamsSequenceStream`-class is a special stream-implementation that we only use when the `/Contents`-entry, in the `/Page`-dictionary, consists of an Array with streams.	2021-07-26 16:42:50 +02:00
Jonas Jenwald	e1fa845293	Only define existing methods, when converting the `OPS` format to method-names on the `CanvasGraphics.prototype` There's no good reason, as far as I can tell, to explicitly define a bunch of methods to be `undefined`, which the current unconditional "copying" of methods will do. Note that of the `OPS` ~23 percent don't, for various reasons, have an associated method on the `CanvasGraphics.prototype`.	2021-07-25 13:28:28 +02:00
Jonas Jenwald	fbaafdc4e8	Remove the remaining closure in the `src/display/canvas.js` file For e.g. the `gulp mozcentral` command, the built `pdf.js` file decreases from `304 607` to `301 295` bytes with this patch. The improvement comes mostly from having less overall indentation in the code.	2021-07-25 13:14:58 +02:00
Jonas Jenwald	833f27c677	Disable a LGTM warning, again (PR 13787 follow-up) Apparently I didn't put one of the disable comments on the correct line, since I didn't read the instructions carefully enough, so let's try again. Note that, most unfortunately, disabling of warnings isn't applied until after a patch has been merged.	2021-07-25 10:32:40 +02:00
Tim van der Meij	41a2b5c809	Merge pull request #13787 from Snuffleupagus/lgtm-fix-warnings Fix (most) LGTM warnings	2021-07-24 15:20:07 +02:00
Tim van der Meij	7b6767d415	Merge pull request #13784 from Snuffleupagus/issue-13783 When parsing corrupt documents, avoid inserting obviously broken data in the XRef-table (issue 13783)	2021-07-24 14:37:39 +02:00
Tim van der Meij	687cfcecd4	Merge pull request #13786 from Snuffleupagus/rm-more-src-core-closures Remove a couple of small closures in `src/core/` code	2021-07-24 14:26:57 +02:00
Jonas Jenwald	70bac87fed	Fix (most) LGTM warnings Most of the warnings we don't really care about, and those are simply white-listed using inline comments; however two cases prompted actual code changes: - In `src/display/pattern_helper.js` the branch in question is indeed unreachable, and should thus be safe to remove. (This code originated in PR 4192, which is now over seven years ago.) - In `test/test.js`, the function in question indeed doesn't accept any arguments. (The patch also re-formats a string just above, which didn't seem worthy of a separate patch.) This now leaves only one warning in the LGTM report, however that one is a false positive that we'll need to report upstream.	2021-07-24 14:23:59 +02:00
Tim van der Meij	9854b85dc1	Merge pull request #13775 from Snuffleupagus/Dict-merge-refactor Remove some duplication in the `Dict.merge` method	2021-07-24 14:21:41 +02:00
Jonas Jenwald	ebbbc973a5	Remove the closure used with the `PostScriptToken` class This patch uses the same approach as used in lots of other parts of the code-base, which thus slightly reduces the size of this code.	2021-07-24 13:05:46 +02:00
Jonas Jenwald	81009d42cf	Remove the closure used with the `PostScriptStack` class This patch uses the same approach as used in lots of other parts of the code-base, which thus slightly reduces the size of this code.	2021-07-24 12:59:53 +02:00
Jonas Jenwald	b82c802dff	When parsing corrupt documents, avoid inserting obviously broken data in the XRef-table (issue 13783) In cases where even the very first attempt at reading from an object will throw, simply ignoring such objects will help improve rendering of some corrupt documents. Note that this will lead to more parsing in some cases, but considering that this only applies to corrupt documents that shouldn't be a big deal.	2021-07-23 18:10:53 +02:00
Jonas Jenwald	51f0a81085	Merge pull request #13770 from brendandahl/cache-pattern Improve performance of reused patterns.	2021-07-23 10:43:23 +02:00
Brendan Dahl	da1af02ac8	Improve performance of reused patterns. Bug 1721218 has a shading pattern that was used thousands of times. To improve performance of this PDF: - add a cache for patterns in the evaluator and only send the IR form once to the main thread (this also makes caching in canvas easier) - cache the created canvas radial/axial patterns - for shading fill radial/axial use the pattern directly instead of creating temporary canvas	2021-07-22 16:47:40 -07:00
Calixte Denizet	a51c4a3a0f	XFA - A field without an ui must provide a default one (bug 1718245)	2021-07-22 20:31:25 +02:00
Jonas Jenwald	e1ee3835cd	Remove some duplication in the `Dict.merge` method Currently the `!mergeSubDicts` code-path is essentially just duplicated code, which we can easily avoid by simply moving that check. (This may lead to ever so slightly more parsing for this case, but the difference ought to be negligible in practice.)	2021-07-22 14:01:43 +02:00
Jonas Jenwald	2cf90cd9ad	Merge pull request #13766 from Snuffleupagus/issue-13751 XFA - Handle `startIndex` correctly in the `Template.$toHTML` method (issue 13751)	2021-07-21 18:58:29 +02:00
Calixte Denizet	5555114bb3	XFA - Remove namespace from nodes under xfa:data node - in real life some XFA contains XML like <xfa:data><xfa:Foo><xfa:Bar>...</xfa:data>; since there are no Foo or Bar in the xfa namespace, the JS representations are empty and that leads to errors. - so the idea is to make all nodes under xfa:data namespace-agnostic, which means that namespaces are removed from nodes in the parser, but only for xfa:data descendants.	2021-07-21 17:11:31 +02:00
Jonas Jenwald	7d1c19f8bd	XFA - Handle `startIndex` correctly in the `Template.$toHTML` method (issue 13751) Please note: The PDF document in issue 13751 is dynamically created (in e.g. Adobe Reader), with pages added when certain buttons are clicked, hence this patch simply fixes the breaking error and nothing more. It looks like the current code contains a little bit too much copy-and-paste from the similar `index` branch above, since we cannot set the `startIndex` to a negative value. Note how it's being used to initialize the loop-variable, which is then used to lookup values in an Array and accessing the `-1`th element of an Array obviously makes no sense.	2021-07-21 16:17:13 +02:00
Jonas Jenwald	6c9b6bc599	Merge pull request #13764 from Snuffleupagus/issue-13748 XFA - Add a missing method to `XFAAttribute`, to prevent breaking errors (issue 13748)	2021-07-20 18:55:23 +02:00
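
Commit 766299016f above replaces the `isEOF` helper with inline checks and turns `EOF` into a `Symbol` so that it cannot be cloned. The following is a minimal sketch of that idea, not the PDF.js code itself: `makeReader`, `readAll`, and `tryClone` are hypothetical names introduced only for illustration, and `structuredClone` stands in here for the structured cloning that happens when data is posted to or from a worker.

```javascript
// Illustrative sentinel (assumption: not the actual PDF.js definition).
// A Symbol is unique and cannot be structured-cloned.
const EOF = Symbol("EOF");

// Hypothetical token source: yields each item in turn, then EOF forever.
function makeReader(items) {
  let i = 0;
  return () => (i < items.length ? items[i++] : EOF);
}

// An inline identity check takes the place of a separate `isEOF(obj)` helper.
function readAll(next) {
  const out = [];
  let obj;
  while ((obj = next()) !== EOF) {
    out.push(obj);
  }
  return out;
}

// Hypothetical probe: reports whether a value survives structured cloning.
// Returns "unsupported" on runtimes without a global `structuredClone`.
function tryClone(value) {
  if (typeof structuredClone !== "function") {
    return "unsupported";
  }
  try {
    structuredClone(value);
    return "cloned";
  } catch {
    return "rejected";
  }
}
```

Calling `readAll(makeReader([1, 2, 3]))` collects the three items and stops at the sentinel. On runtimes that provide `structuredClone`, `tryClone(EOF)` reports a rejection, whereas a plain object used as a sentinel would be cloned silently.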


5014 Commits
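
Commit c836e1f0fb above mentions adding an LRU cache on the main thread to bound the memory used by canvas patterns. As a rough sketch of the general technique only (the class name `LRUCache`, its methods, and the `maxSize` parameter are assumptions for illustration and are not taken from the PDF.js source), a least-recently-used cache can be built on a `Map`, which iterates keys in insertion order:

```javascript
// Hypothetical LRU cache; names are illustrative, not the PDF.js API.
// Assumes maxSize >= 1.
class LRUCache {
  constructor(maxSize) {
    this._map = new Map(); // Map iterates keys in insertion order.
    this._maxSize = maxSize;
  }

  get(key) {
    if (!this._map.has(key)) {
      return undefined;
    }
    // Re-insert the entry so it becomes the most recently used one.
    const value = this._map.get(key);
    this._map.delete(key);
    this._map.set(key, value);
    return value;
  }

  set(key, value) {
    if (this._map.has(key)) {
      // Remove first so the key moves to the most recent position.
      this._map.delete(key);
    } else if (this._map.size >= this._maxSize) {
      // Evict the least recently used entry (the first key in the Map).
      const oldest = this._map.keys().next().value;
      this._map.delete(oldest);
    }
    this._map.set(key, value);
  }

  has(key) {
    return this._map.has(key);
  }

  get size() {
    return this._map.size;
  }
}
```

With `maxSize` set to 2, storing keys `a` and `b`, reading `a`, and then storing `c` evicts `b`, because `a` was touched more recently. Keeping the size bounded this way trades occasional re-creation of an evicted entry for a fixed upper limit on cached objects.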