pdf.js

Author	SHA1	Message	Date
Jonas Jenwald	fd1f0f647f	Print a special warning message, in the viewer, for XFA Foreground documents Currently XFAF documents use the same warning message as in the XFA disabled case, which is neither helpful nor correct.	2021-09-23 15:02:24 +02:00
Jonas Jenwald	6cba5509f2	Re-factor `document.getElementsByName` lookups in the AnnotationLayer (issue 14003) This replaces direct `document.getElementsByName` lookups with a helper method which: - Lets the AnnotationLayer use the data returned by the `PDFDocumentProxy.getFieldObjects` API-method, such that we can directly lookup only the necessary DOM elements. - Fallback to using `document.getElementsByName` as before, such that e.g. the standalone viewer components still work. Finally, to fix the problems reported in issue 14003, regardless of the code-path we now also enforce that the DOM elements found were actually created by the AnnotationLayer code. With these changes we'll thus be able to update form elements on all visible pages just as before, but we'll additionally update the AnnotationStorage for not-yet-rendered elements thus fixing a pre-existing bug.	2021-09-23 13:05:18 +02:00
Jonas Jenwald	3e550f392a	Add `PDF_TO_CSS_UNITS` to the `PixelsPerInch`-structure Rather than re-computing this value in a number of different places throughout the code-base[1], we can expose this in the API via the existing `PixelsPerInch`-structure instead. There's also been feature requests asking for the old `CSS_UNITS` viewer constant to be made accessible, such that it could be used in third-party implementations. I suppose that it could be argued that it's somewhat confusing to place a unitless property in `PixelsPerInch`, however given that the `PDF_TO_CSS_UNITS`-property is defined strictly in terms of the existing properties this is hopefully deemed reasonable. --- [1] These include: - The viewer, with the `CSS_UNITS` name. - The reference-tests. - The display-layer, when rendering images; see PR 13991.	2021-09-20 13:20:09 +02:00
Jonas Jenwald	20eb6ca2ec	Merge pull request #14044 from calixteman/bug1719148 Annotations - Avoid empty value in text field when storage contains something for it (bug 1719148)	2021-09-18 16:31:45 +02:00
Tim van der Meij	c870fb489e	Merge pull request #14013 from Snuffleupagus/api-unittest-instanceof Improve the API unit-tests, and try to expose more API-functionality in the TypeScript definitions	2021-09-18 16:08:19 +02:00
Calixte Denizet	eb762ad624	Annotations - Avoid empty value in text field when storage contains something for it (bug 1719148) - it aims to fix https://bugzilla.mozilla.org/show_bug.cgi?id=1719148; - JS can set a property for a non-rendered annotation using the annotationStorage but the other unset default properties must be used when the annotation is finally rendered; - so this patch just adds the properties already set in the annotationStorage to the default value.	2021-09-18 15:08:22 +02:00
Calixte Denizet	e87c12bf34	JS - Avoid the Stay/Leave popup when clicking on a button with a JS action - it aims to fix #14039.	2021-09-17 21:04:07 +02:00
Calixte Denizet	a3aa6dd6ab	Annotation - Checkboxes with the same name and export values must be in unison - it aims to fix #14024. - this patch adds an attribute `acroformExportValue` to the HTML input in order to set the checked attribute in taking into account the exportValue for the checkboxes with the same name.	2021-09-15 15:30:24 +02:00
Jonas Jenwald	d854352cd5	Improve the API unit-tests by checking that `PDFPageProxy.render` returns a `RenderTask`-instance This is similar to existing unit-tests, which checks for `PDFDocumentProxy`- and `PDFPageProxy`-instances.	2021-09-13 13:34:37 +02:00
Jonas Jenwald	fa7a607d33	Improve the API unit-tests by checking that `getDocument` returns a `PDFDocumentLoadingTask`-instance This is similar to existing unit-tests, which checks for `PDFDocumentProxy`- and `PDFPageProxy`-instances.	2021-09-13 13:34:28 +02:00
Jonas Jenwald	0e54f568fb	Re-factor the `CSS_PIXELS_PER_INCH`/`PDF_PIXELS_PER_INCH` exports (PR 13991 follow-up) For improved maintainability, since these constants are being exposed in the official API, this patch moves them into an Object instead.	2021-09-11 11:15:25 +02:00
Jonas Jenwald	bd51bbfd16	Remove `mozImageSmoothingEnabled` fallback in `CanvasGraphics.endGroup` This was added all the way back in PR 2936, however it's been unnecessary ever since Firefox 51 (released on 2017-01-24); please see the MDN compatibility data: https://developer.mozilla.org/en-US/docs/Web/API/CanvasRenderingContext2D/imageSmoothingEnabled#browser_compatibility	2021-09-11 10:30:39 +02:00
Jonas Jenwald	9ce63a6dc6	Merge pull request #13991 from brendandahl/interpolate Enable/disable image smoothing based on image interpolate value. (bug 1722191)	2021-09-11 10:02:53 +02:00
Brendan Dahl	f38fb42b42	Enable/disable image smoothing based on image interpolate value. (bug 1722191) While some of the output looks worse to my eye, this behavior more closely matches what I see when I open the PDFs in Adobe acrobat. Fixes: #4706, #9713, #8245, #1344	2021-09-10 14:23:35 -07:00
Calixte Denizet	623860bf8f	XFA - Remove the checked attribute from the checkbox when unchecked (bug 1729877) - it aims to fix: https://bugzilla.mozilla.org/show_bug.cgi?id=1729877.	2021-09-09 19:14:16 +02:00
Jonas Jenwald	4c1b586dd2	Reduce the size of `TextLayerRenderTask._textDivProperties` in "regular" text-selection mode While these changes will obviously not have a significant effect on overall memory usage, it cannot hurt as far as I'm concerned. This patch makes the following changes: - Clear out `_textDivProperties` once rendering is done, since those properties are only necessary to keep alive when enhanced text-selection is being used. - Reduce the size of the `_textDivProperties`-entries by default, since a majority of the properties are only relevant when enhanced text-selection is being used.	2021-09-05 12:12:34 +02:00
Jonas Jenwald	6318ccf6d2	Treat all content as visible when no optional content groups are defined (issue 13971) In the referenced PDF document the /Contents stream contains MarkedContent-operators, however no optional content dictionary exists; according to [the specification](https://www.adobe.com/content/dam/acom/en/devnet/pdf/pdfs/PDF32000_2008.pdf#G7.3883825): > Null values or references to deleted objects shall be ignored. If this entry is not present, is an empty array, or contains references only to null or deleted objects, the membership dictionary shall have no effect on the visibility of any content.	2021-09-04 08:13:37 +02:00
Jonas Jenwald	1f56451d56	Implement `PDFNetworkStreamRangeRequestReader._onError`, to handle range request errors with XMLHttpRequest (issue 9883) Given that the Fetch API is normally being used now, these changes are probably less important now than they used to be. However, given that it's simple enough to implement this I figured why not just fix issue 9883 (better late than never I suppose).	2021-08-31 10:23:57 +02:00
Jonas Jenwald	bd9a92a161	Use optional chaining more in the `src/display/network.js` file Also changes the different `_onDone`/`_onProgress` methods to use consistent parameter names, and some other small improvements.	2021-08-31 10:23:54 +02:00
Jonas Jenwald	cf0ccc4bab	Merge pull request #13937 from overleaf/jpa-fix-error-handling Fix handling of fetch errors	2021-08-30 15:50:03 +02:00
Jakob Ackermann	291ffd3059	Fix handling of fetch errors Testing: - delete the pdf file while the initial request is inflight - delete the pdf file after the initial request has finished Repeat for a small file and large file, exercising both one-off and chunked transports.	2021-08-30 12:43:28 +01:00
Jonas Jenwald	ce3f5ea2bf	Use `async` a bit more in the API This patch changes the `PDFDocumentLoadingTask.destroy`-method and the `_fetchDocument`-function to be `async`, which slightly simplifies the relevant code. Furthermore, remove the catch-handler from the `WorkerTransport.getPageIndex`-method since it's no longer needed. Given that the `MessageHandler` is nowadays wrapping every possible Exception, it's no longer necessary to try and re-wrap the reason here.	2021-08-29 12:31:28 +02:00
Tim van der Meij	153d058b3a	Merge pull request #13933 from brendandahl/xfa-checkbox2 Fix saving of XFA checkboxes. (bug 1726381)	2021-08-27 22:45:44 +02:00
Brendan Dahl	6d2193a812	Fix saving of XFA checkboxes. (bug 1726381) Previously were were always setting the storage value to the on value.	2021-08-24 15:53:55 -07:00
Jonas Jenwald	2a0ad8e696	Add deprecation warnings for the `renderInteractiveForms` and `includeAnnotationStorage` options, in `PDFPageProxy.render` This is done separately from the previous patch, to make it easier to revert these changes once they've been included in a couple of releases. Please note that because these two options are mutually exclusive, which is a large part of the reason for the previous patch, it's not guaranteed that the fallback-values will always be correct in every situation (but it's the best that we can do).	2021-08-24 01:40:12 +02:00
Jonas Jenwald	41efa3c071	[api-minor] Introduce a new `annotationMode`-option, in `PDFPageProxy.{render, getOperatorList}` This is a follow-up to PRs 13867 and 13899. This patch is tagged `api-minor` for the following reasons: - It replaces the `renderInteractiveForms`/`includeAnnotationStorage`-options, in the `PDFPageProxy.render`-method, with the single `annotationMode`-option that controls which annotations are being rendered and how. Note that the old options were mutually exclusive, and setting both to `true` would result in undefined behaviour. - For improved consistency in the API, the `annotationMode`-option will also work together with the `PDFPageProxy.getOperatorList`-method. - It's now also possible to disable all annotation rendering in both the API and the Viewer, since the other changes meant that this could now be supported with a single added line on the worker-thread[1]; fixes 7282. --- [1] Please note that in order to simplify the overall implementation, we'll purposely only support disabling of all annotations and that the option is being shared between the API and the Viewer. For any more "specialized" use-cases, where e.g. only some annotation-types are being rendered and/or the API and Viewer render different sets of annotations, that'll have to be handled in third-party implementations/forks of the PDF.js code-base.	2021-08-24 01:13:02 +02:00
Brendan Dahl	bf5a45ce6d	Merge pull request #13908 from brendandahl/xfa-find [api-minor] XFA - Support text search in XFA documents.	2021-08-23 08:53:02 -07:00
Brendan Dahl	bb47128864	XFA - Support text search in XFA documents. Moves the logic out of TextLayerBuilder to handle highlighting matches into a new separate class `TextHighlighter` that can be used with regular PDFs and XFA PDFs. To mimic the current find functionality in XFA, two arrays from the XFA rendering are created to get the text content and map those to DOM nodes. Fixes #13878	2021-08-23 08:44:20 -07:00
Jonas Jenwald	a7f0301f21	[Regression] Re-factor the internal `includeAnnotationStorage` handling, since it's currently subtly wrong This patch is very similar to the recently fixed `renderInteractiveForms`-options, see PR 13867. As far as I can tell, this subtle bug has existed ever since `AnnotationStorage`-support was first added in PR 12106 (a little over a year ago). The value of the `includeAnnotationStorage`-option, as passed to the `PDFPageProxy.render` method, will (potentially) affect the size/content of the operatorList that's returned from the worker (for documents with forms). Given that operatorLists will generally, unless they contain huge images, be cached in the API, repeated `PDFPageProxy.render` calls where the form-data has been changed by the user in between, can thus wrongly return a cached operatorList. In the viewer we're only using the `includeAnnotationStorage`-option when printing, which is probably why this has gone unnoticed for so long. Note that we, for performance reasons, don't cache printing-operatorLists in the API. However, there's nothing stopping an API-user from using the `includeAnnotationStorage`-option during "normal" rendering, which could thus result in subtle (and difficult to understand) rendering bugs. In order to handle this, we need to know if the `AnnotationStorage`-instance has been updated since the last `PDFPageProxy.render` call. The most "correct" solution would obviously be to create a hash of the `AnnotationStorage` contents, however that would require adding a bunch of code, complexity, and runtime overhead. Given that operatorList caching in the API doesn't have to be perfect[1], but only have to avoid false cache-hits, we can simplify things significantly be only keeping track of the last time that the `AnnotationStorage`-data was modified. Please note: While working on this patch, I also noticed that the `renderInteractiveForms`- and `includeAnnotationStorage`-options in the `PDFPageProxy.render` method are mutually exclusive.[2] Given that the various Annotation-related options in `PDFPageProxy.render` have been added at different times, this has unfortunately led to the current "messy" situation.[3] --- [1] Note how we're already not caching operatorLists for pages with huge images, in order to save memory, hence there's no guarantee that operatorLists will always be cached. [2] Setting both to `true` will result in undefined behaviour, since trying to insert `AnnotationStorage`-values into fields that are being excluded from the operatorList-building will obviously not work, which isn't at all clear from the documentation. [3] My intention is to try and fix this in a follow-up PR, and I've got a WIP patch locally, however it will result in a number of API-observable changes.	2021-08-18 10:09:03 +02:00
Jonas Jenwald	1465b1670f	[src/display/api.js] Move the `getRenderingIntent` helper function into `WorkerTransport` By doing this re-factoring separately, since it's mostly a mechanical change, the size/scope of the next patch will be reduced somewhat.	2021-08-18 09:58:26 +02:00
Jonas Jenwald	6167566f1b	Re-factor the `BaseException.name` handling, and clean-up some code Once we're finally able to get rid of SystemJS, which is unfortunately still blocked on [bug 1247687](https://bugzilla.mozilla.org/show_bug.cgi?id=1247687), we might also want to clean-up (or even completely remove) the `BaseException` abstraction and simply extend `Error` directly instead. At that point we'd need to (explicitly) set the `name` on each class anyway, so this patch is essentially preparing for future clean-up. Furthermore, after the `BaseException` abstraction was added there's been multiple issues filed about third-party minification breaking our code since `this.constructor.name` is not guaranteed to always do what you intended. While hard-coding the strings indeed feels quite unfortunate, it's likely the "best" solution to avoid the problem described above.	2021-08-10 11:27:47 +02:00
Jonas Jenwald	7f2d524df5	Improve caching of Annotations-data, by using a `Map`, in the API Rather than caching only the last `PDFPageProxy.getAnnotations` call, and having to handle the intent separately, we can instead implement the caching in exactly the same way as done in the `PDFPageProxy.{render, getOperatorList}` methods.	2021-08-08 08:14:51 +02:00
Tim van der Meij	036b81496e	Merge pull request #13882 from Snuffleupagus/PDFWorker-rm-closure [api-minor] Remove the closure from the `PDFWorker` class, in the `src/display/api.js` file	2021-08-07 19:52:39 +02:00
Jonas Jenwald	1cf9405281	[api-minor] Remove the closure from the `PDFWorker` class, in the `src/display/api.js` file This patch removes the only remaining closure in the `src/display/api.js` file, utilizing a similar approach as used in lots of other parts of the code-base, which results in a small decrease in the size of the build `pdf.js` file. Given that `PDFWorker` is exposed through the public API, this complicates things somewhat since there's a couple of worker-related properties that really should stay private. Initially, while working on PR 13813, I believed that we'd need support for private (static) class fields in order to get rid of this closure, however I've managed to come up with what's hopefully deemed an acceptable work-around here. Furthermore, some helper functions were simply moved into the `PDFWorker` class as static methods, thus simplifying the overall implementation (e.g. we don't need to manually cache the Promise in the `PDFWorker._setupFakeWorkerGlobal`-method). Finally, as part of this re-factoring a number of missing JSDoc-comments were added which together with the removal of the closure significantly improves the `gulp jsdoc` output for the `PDFWorker` class. Please note: This patch is tagged with `api-minor` since it deprecates `PDFWorker.getWorkerSrc()` in favor of the shorter `PDFWorker.workerSrc`, with the fallback limited to `GENERIC` builds.	2021-08-07 10:43:39 +02:00
Jonas Jenwald	107efdb178	[Regression] Re-factor the internal `renderInteractiveForms` handling, since it's currently subtly wrong The value of the `renderInteractiveForms` parameter, as passed to the `PDFPageProxy.render` method, will (potentially) affect the size/content of the operatorList that's returned from the worker (for documents with forms). Given that operatorLists will generally, unless they contain huge images, be cached in the API, repeated `PDFPageProxy.render` calls that only change the `renderInteractiveForms` parameter can thus return an incorrect operatorList. As far as I can tell, this subtle bug has existed ever since `renderInteractiveForms`-support was first added in PR 7633 (which is almost five years ago). With the previous patch, fixing this is now really simple by "encoding" the `renderInteractiveForms` parameter in the internal renderingIntent handling.	2021-08-06 00:40:43 +02:00
Jonas Jenwald	47f94235ab	[api-minor] Re-factor the internal renderingIntent, and change the default `intent` value in the `PDFPageProxy.getAnnotations` method With the changes made in PR 13746 the internal renderingIntent handling became somewhat "messy", since we're now having to do string-matching in various spots in order to handle the "oplist"-intent correctly. Hence this patch, which implements the idea from PR 13746 to convert the `intent`-strings, used in various API-methods, into an internal renderingIntent that's implemented using a bit-field instead. Please note: This part of the patch, in itself, does not change the public API (but see below). This patch is tagged `api-minor` for the following reasons: 1. It changes the default value for the `intent` parameter, in the `PDFPageProxy.getAnnotations` method, to "display" in order to be consistent across the API. 2. In order to get all annotations, with the `PDFPageProxy.getAnnotations` method, you now need to explicitly set "any" as the `intent` parameter. 3. The `PDFPageProxy.getOperatorList` method will now also support the new "any" intent, to allow accessing the operatorList of all annotations (limited to those types that have one). 4. Finally, for consistency across the API, the `PDFPageProxy.render` method also support the new "any" intent (although I'm not sure how useful that'll be). Points 1 and 2 above are the significant, and thus breaking, changes in default behaviour here. However, unfortunately I cannot see a good way to improve the overall API while also keeping `PDFPageProxy.getAnnotations` unchanged.	2021-08-06 00:39:42 +02:00
Calixte Denizet	fef939d347	Annotation & XFA: Add focus outlines on different fields (bug 1723615, bug 1718528) - set a default tabindex to be sure they'll be taken into account in the TAB cycle (https://bugzilla.mozilla.org/show_bug.cgi?id=1723615). - show default outline when fields are focused (it was an a11y bug: https://bugzilla.mozilla.org/show_bug.cgi?id=1718528).	2021-08-05 13:33:46 +02:00
Calixte Denizet	71a100a4d0	Annotation & XFA: Scale the font size in choicelist using zoom factor (bug 1715996) - this is an accessibility issue which could be painful for some people with visual disabilities.	2021-08-04 20:36:04 +02:00
Jonas Jenwald	d5e14d3dc3	Prevent breaking errors when an optional content group is undefined (issue 13851) In the referenced PDF document most of the form `/Form` XObjects don't have an `/OC` entry, which thus causes the runtime failure during rendering.	2021-08-03 15:59:29 +02:00
Tim van der Meij	67f4c34f63	Merge pull request #13822 from Snuffleupagus/ReadableStreams-cancel-no-Uncaught_promise Prevent "Uncaught promise" messages in the console when cancelling (some) `ReadableStream`s	2021-07-30 22:09:29 +02:00
Jonas Jenwald	1df9da949e	Prevent "Uncaught promise" messages in the console when cancelling (some) `ReadableStream`s While fixing issue 13794, I noticed that cancelling the `ReadableStream` returned by the `PDFPageProxy.streamTextContent`-method could lead to "Uncaught promise" messages in the console.[1] Generally speaking, we don't really care about errors when cancelling a `ReadableStream` and it thus seems reasonable to simply suppress any output in those cases. --- [1] Although, after that issue was fixed you'd now need to set the API-option `stopAtErrors = true` to actually trigger this.	2021-07-30 14:27:38 +02:00
Jonas Jenwald	5fac0a4350	Simplify some code related to `fallbackWorkerSrc` and `getMainThreadWorkerMessageHandler`	2021-07-30 11:34:47 +02:00
Jonas Jenwald	4c679d80ac	Remove the closure used with the `InternalRenderTask` class This patch utilizes the same approach as used in lots of other parts of the code-base, which thus slightly reduces the size of this code.	2021-07-30 11:34:47 +02:00
Jonas Jenwald	b18620ac0f	Remove the closure used with the `PDFDocumentLoadingTask` class This patch utilizes the same approach as used in lots of other parts of the code-base, which thus slightly reduces the size of this code. By removing some of the (current) indirection, we can also simplify the JSDocs a little bit. Looking at the `gulp jsdoc` output, this actually seem to improve the documentation for this class.	2021-07-30 11:34:47 +02:00
Brendan Dahl	4ad5c5d52a	Merge pull request #13808 from brendandahl/pattern-cache-v2 Improve caching of shading patterns. (bug 1721949)	2021-07-28 11:17:16 -07:00
Brendan Dahl	c836e1f0fb	Improve caching of shading patterns. (bug 1721949) The PDF in bug 1721949 uses many unique pattern objects that references the same shading many times. This caused a new canvas pattern to be created and cached many times driving up memory use. To fix, I've changed the cache in the worker to key off the shading object and instead send the shading and matrix separately. While that worked well to fix the above bug, there could be PDFs that use many shading that could cause memory issues, so I've also added a LRU cache on the main thread for canvas patterns. This should prevent memory use from getting too high.	2021-07-28 10:29:20 -07:00
Jonas Jenwald	4b3ab1472c	Access `navigator` safely in the `src/display/annotation_layer.js` file For code that's part of the core library, rather than e.g. the `web/`-folder, we should always be careful about directly accessing any DOM methods. The `navigator` is one such structure, which shouldn't be assumed to always be available and we should thus check that it's actually present.[1] Hence this patch re-factors the `navigator.platform` access, in the `AnnotationLayer`-code, to ensure that it's generally safe. Furthermore, to reduce unnecessary repeated string-matching to determine the current platform, we're now using a shadowed getter which is evaluated only once instead (at first access). --- [1] Note e.g. the `isSyncFontLoadingSupported` getter, in the `src/display/font_loader.js` file.	2021-07-27 09:40:42 +02:00
Jonas Jenwald	e1fa845293	Only define existing methods, when converting the `OPS` format to method-names on the `CanvasGraphics.prototype` There's no good reason, as far as I can tell, to explicitly define a bunch of methods to be `undefined`, which the current unconditional "copying" of methods will do. Note that of the `OPS` ~23 percent don't, for various reasons, have an associated method on the `CanvasGraphics.prototype`.	2021-07-25 13:28:28 +02:00
Jonas Jenwald	fbaafdc4e8	Remove the remaining closure in the `src/display/canvas.js` file For e.g. the `gulp mozcentral` command, the built `pdf.js` file decreases from `304 607` to `301 295` bytes with this patch. The improvement comes mostly from having less overall indentation in the code.	2021-07-25 13:14:58 +02:00
Jonas Jenwald	70bac87fed	Fix (most) LGTM warnings Most of the warnings we don't really care about, and those are simply white-listed using inline comments; however two cases prompted actual code changes: - In `src/display/pattern_helper.js` the branch in question is indeed unreachable, and should thus be safe to remove. (This code originated in PR 4192, which is now over seven years ago.) - In `test/test.js`, the function in question indeed doesn't accept any arguments. (The patch also re-formats a string just above, which didn't seem worthy of a separated patch.) This now leaves only one warning in the LGTM report, however that one is a false positive that we'll need to report upstream.	2021-07-24 14:23:59 +02:00

1 2 3 4 5 ...

1303 Commits