pdf.js

Author	SHA1	Message	Date
Calixte Denizet	e76a96892a	JS - Add the basic architecture to be able to execute embedded js	2020-10-21 19:00:56 +02:00
Calixte Denizet	d2ef878702	Invalidate an annotation with no quadPoints (when it's required) Some PDF software doesn't remove highlight annotations but makes the QuadPoints array empty instead. The Rect for the annotation can be [-32768, -32768, 32768, 32768], which produces a giant div that catches all mouse events and makes the PDF unusable when there are form elements.	2020-10-21 13:53:19 +02:00
Jonas Jenwald	8431cfe482	Re-name and re-factor the `PDFLinkService.navigateTo` method This modernizes and improves the code, by using `async`/`await` and by extracting the helper function to its own method. To hopefully avoid confusion, given the next patch, the method is also re-named to `goToDestination` to make it slightly clearer what it actually does.	2020-10-18 14:29:59 +02:00
Calixte Denizet	c30a3a94f0	JS - Add a function in api to get the fields ids in AcroForm::CO	2020-10-17 12:56:40 +02:00
Tim van der Meij	ff2631493e	Merge pull request #12481 from calixteman/issue_12475 Get urls if any in AA::D dictionary for pushbuttons	2020-10-16 22:55:43 +02:00
Tim van der Meij	32bceae732	Merge pull request #12483 from Snuffleupagus/formInfo-hasFields Don't store complex data in `PDFDocument.formInfo`, and replace the `fields` object with a `hasFields` boolean instead	2020-10-16 22:40:40 +02:00
Jonas Jenwald	f956d0a96a	Stop caching the parsed Font data on its `Dict` object (PR 7347 follow-up) Given that all fonts are, ever since PR 7347, now cached in the "normal" `fontCache` there's actually no reason for the special `font.translated` construction. (Given how Objects in JavaScript are references, rather than raw values, the old code shouldn't have caused any significant memory overhead.) Instead we can simply store the `cacheKey`, which is a simple string, on only the Font `Dict`s where it's needed and thus look-up all fonts using the `fontCache` instead.	2020-10-16 17:45:01 +02:00
Jonas Jenwald	29af15f37e	Add more validation in the `PDFDocument._hasOnlyDocumentSignatures` method If this method is ever passed invalid/unexpected data, or if during the course of parsing (since it's used recursively) such data is found, it will fail in a non-graceful way. Hence this patch, which ensures that we don't attempt to access non-existent properties and also that errors such as the one fixed in PR 12479 wouldn't have occurred.	2020-10-16 13:03:47 +02:00
Jonas Jenwald	3351d3476d	Don't store complex data in `PDFDocument.formInfo`, and replace the `fields` object with a `hasFields` boolean instead This patch is based on a couple of smaller things that I noticed when working on PR 12479. - Don't store the /Fields on the `formInfo` getter, since that feels like overloading it with unintended (and too complex) data, and utilize a `hasFields` boolean instead. This functionality was originally added in PR 12271, to help determine what kind of form data a PDF document contains, and I think that we should ensure that the return value of `formInfo` only consists of "simple" data. With these changes the `fieldObjects` getter instead has to look-up the /Fields manually, however that shouldn't be a problem since the access is guarded by a `formInfo.hasFields` check which ensures that the data both exists and is valid. Furthermore, most documents don't even have any /AcroForm data anyway. - Determine the `hasFields` property first, to ensure that it's always correct even if there are errors when checking e.g. the /XFA or /SigFlags entries, since the `fieldObjects` getter depends on it. - Simplify a loop in `fieldObjects`, since the object being accessed is a `Map` and those have built-in iteration support. - Use a higher logging level for errors in the `formInfo` getter, and include the actual error message, since that'd have helped with fixing PR 12479 a lot quicker. - Update the JSDoc comment in `src/display/api.js` to list the return values correctly, and also slightly extend/improve the description.	2020-10-16 12:47:27 +02:00
Calixte Denizet	ce3d3a6ff8	Get urls if any in AA::D dictionary for pushbuttons	2020-10-15 19:42:36 +02:00
Jonas Jenwald	bc6b47a50e	Convert `PartialEvaluator.translateFont` to an `async` method This allows us to make a slight simplification in `PartialEvaluator.loadFont`, which thus removes an old TODO-comment from the method. Furthermore, in `PartialEvaluator.translateFont`, the CMap-handling is now limited to only composite fonts to avoid having to wait for a "dummy"-Promise for most fonts.	2020-10-15 09:42:58 +02:00
Tim van der Meij	a373137304	Merge pull request #12429 from calixteman/collect_js [api-minor] Add the possibility to collect Javascript actions	2020-10-14 23:27:47 +02:00
Calixte Denizet	71ecc3129b	Add the possibility to collect Javascript actions	2020-10-14 10:44:16 +02:00
Tim van der Meij	1034769ca1	Merge pull request #12477 from Snuffleupagus/SaveDocument-WorkerTask Handle `WorkerTask`s, and various PDF document properties, correctly in the "SaveDocument" handler in `src/core/worker.js`	2020-10-13 21:11:54 +02:00
Jonas Jenwald	65132ba5d8	Handle `WorkerTask`s, and various PDF document properties, correctly in the "SaveDocument" handler in `src/core/worker.js` - Actually register/unregister the `WorkerTask`s, used when saving each page, correctly. To prevent issues when terminating the Worker, we purposely wait for all running `WorkerTask`s to complete first. Hence we need to actually handle `WorkerTask`s the same way in "SaveDocument" as in the rest of this file, see e.g. "GetOperatorList" and "GetTextContent". - Access `PDFDocument` properties in a generally safe/consistent way. While the current code works fine, given how the PDF document is being loaded, it still seems like a very good idea to be consistent in how we access these kinds of properties (since in general you need to avoid `MissingDataException` everywhere in this file). - Change a variable name, since there's essentially no precedent in the code-base for local variable names to start with an underscore.	2020-10-13 19:30:43 +02:00
Jonas Jenwald	38629c345d	Remove the `scope` parameter from the "GetOperatorList" handler in `src/core/worker.js` (PR 11110 follow-up) Support for the `scope` parameter, in `MessageHandler.on`, was removed in PR 11110 however this particular case was unused/unnecessary for years prior to that change. (From a quick look through the history, I'm not even sure if it was actually needed in the first place.)	2020-10-13 15:58:38 +02:00
Jonas Jenwald	30e8d5dea1	Add local caching of TilingPatterns in `PartialEvaluator.getOperatorList` (issue 2765 and 8473) In practice it's not uncommon for PDF documents to re-use the same TilingPatterns more than once, and parsing them is essentially equal to parsing of a (small) page since a `getOperatorList` call is required. By caching the internal TilingPattern representation we can thus avoid having to re-parse the same data over and over, and there's also less asynchronous parsing required for repeated TilingPatterns. Initially I had intended to include (standard) benchmark results with this patch, however it's not entirely clear that this is actually necessary here given the preliminary results. When testing this manually in the development viewer, using `pdfBug=Stats`, the following (approximate) reductions in rendering times were observed when comparing `master` against this patch: - http://pubs.usgs.gov/sim/3067/pdf/sim3067sheet-2.pdf (from issue 2765): `6800 ms` -> `4100 ms`. - https://github.com/mozilla/pdf.js/files/1046131/stepped.pdf (from issue 8473): `54000 ms` -> `13000 ms` - https://github.com/mozilla/pdf.js/files/1046130/proof.pdf (from issue 8473): `5900 ms` -> `2500 ms` As always, whenever you're dealing with documents which are "slow", there's usually a certain level of subjectivity involved with regards to what's deemed acceptable performance. Hence it's not clear to me that we want to regard any of the referenced issues as fixed, however the improvements are significant enough to warrant caching of TilingPatterns in my opinion.	2020-10-08 18:43:21 +02:00
Jani Pehkonen	935568c2f1	Fix invalid `XUID` entries in CFF fonts In CFF fonts, entry `XUID` should be an array that has no more than 16 elements. In the issue, the length is 20, which causes the fonts to fail. See Appendix B, "Implementation Limits" in PostScript Language Reference Manual https://web.archive.org/web/20170218093716/https://www.adobe.com/products/postscript/pdfs/PLRM.pdf Actually entries `XUID` and `UniqueID` are obsolete altogether. https://blogs.adobe.com/CCJKType/2016/06/no-more-xuid-arrays.html	2020-10-05 17:38:01 +03:00
Jonas Jenwald	9416b14e8b	Re-factor how the ESLint `no-var` rule is enabled in the `src/` folder This simplifies/consolidates the ESLint configuration slightly in the `src/` folder, and prevents the addition of any new files where `var` is being used.[1] Hence we no longer need to manually add `/* eslint no-var: error */` in files, which is easy to forget, and can instead disable the rule in the `src/core/` files where `var` is still in use. --- [1] Obviously the `no-var` rule can, in the same way as every other rule, be disabled on a case-by-case basis where actually necessary.	2020-10-03 20:15:29 +02:00
Tim van der Meij	48e27a1a22	Merge pull request #12437 from Snuffleupagus/src-display-no-var Enable the ESLint `no-var` rule in the `src/display/` folder	2020-10-03 19:59:56 +02:00
Tim van der Meij	6ff1fe4ea9	Merge pull request #12333 from calixteman/tooltip Add tooltip if any in annotations layer	2020-10-03 19:50:39 +02:00
Jonas Jenwald	2a7d1557f9	Enable the ESLint `no-var` rule in the `src/shared/` folder Previously this rule has been enabled in the `web/` folder, and in select files in the `src/` sub-folders. In this case, enabling of this rule didn't actually require any further code changes. Please find additional details about the ESLint rule at https://eslint.org/docs/rules/no-var	2020-10-03 08:27:45 +02:00
Jonas Jenwald	52f6016e6c	Fix the remaining ESLint `no-var` errors in the `src/display/` folder While most of the necessary changes were fixed automatically, see the previous patch, there's a number of cases that needed to be fixed manually.	2020-10-02 16:29:13 +02:00
Jonas Jenwald	e557be5a17	Re-format the `src/display/` files to enforce the ESLint `no-var` rule This was done automatically, using `gulp lint --fix`.	2020-10-02 16:17:28 +02:00
Jonas Jenwald	2a8983d76b	Enable the ESLint `no-var` rule in the `src/display/` folder Previously this rule has been enabled in the `web/` folder, and in select files in the `src/` sub-folders. Note that a number of the files in the `src/display/` folder were already enforcing the `no-var` rule, and thanks to Prettier the necessary re-writing will be (mostly) handled automatically. Please find additional details about the ESLint rule at https://eslint.org/docs/rules/no-var	2020-10-02 16:16:23 +02:00
calixteman	20b12d2bda	Add tooltip if any in annotations layer	2020-10-02 10:11:18 +02:00
Jonas Jenwald	bd3b15b897	Use the `cidToGidMap`, if it exists, when building the glyph mapping for non-embedded composite fonts (issue 12418)	2020-09-28 14:40:43 +02:00
Tim van der Meij	120c5c2261	Merge pull request #12409 from Snuffleupagus/bug-1627030 Compute the `transformOrigin` correctly, for negative values, when rendering `AnnotationElement`s (bug 1627030)	2020-09-24 23:48:21 +02:00
Calixte Denizet	5af352e65a	Need to reset the streams when printing	2020-09-24 19:13:09 +02:00
Jonas Jenwald	fca53a8eb0	Compute the `transformOrigin` correctly, for negative values, when rendering `AnnotationElement`s (bug 1627030) This changes the `transformOrigin` calculations in `AnnotationElement._createContainer` and `PopupAnnotationElement.render`, to ensure that e.g. the clickable area of annotations and/or popups are both positioned correctly. The problem occurs for negative values, since they're not negated correctly because of how the `transformOrigin` strings were built; see issue 12406 for a more in-depth explanation. Previously, for negative values, the `transformOrigin` strings would thus be ignored since they're not valid.	2020-09-24 10:28:29 +02:00
Jonas Jenwald	2497e8eab9	Prevent errors if the `InkList` property, in InkAnnotations, is missing and/or not an Array (issue 12392) To prevent a future bug, the `Vertices` property in PolylineAnnotations is handled the same way.	2020-09-19 15:34:32 +02:00
Calixte Denizet	d51e7e86ff	Use the same kind of strings for radio values	2020-09-16 18:47:25 +02:00
Tim van der Meij	558d3870d3	Merge pull request #12369 from emilio/better-cancelation-follow-up canvas: fix restore() with existing SMask groups and re-land #12363.	2020-09-15 23:19:17 +02:00
Tim van der Meij	374aad77c4	Merge pull request #12375 from Snuffleupagus/emptyDict-set Ensure that the empty dictionary won't be accidentally modified, and slightly improve the "SaveDocument" handler in `src/core/worker.js`	2020-09-15 23:04:57 +02:00
Calixte Denizet	16dd5403c7	Set parent of radio annotation even if there is no 'V' field	2020-09-15 14:41:57 +02:00
Jonas Jenwald	ed4e7cd8a4	A couple of small improvements in the "SaveDocument" handler in `src/core/worker.js` - Check that the "Info"-entry, in the XRef-trailer, is actually a dictionary before accessing it. This is similar to the `PDFDocument.documentInfo` method and follows the general principle of validating data carefully before accessing it, given how often PDF-software may create corrupt PDF files. - Slightly simplify the "XFA"-lookup, since there's no point in trying to fetch something from the empty dictionary.	2020-09-15 09:57:40 +02:00
Jonas Jenwald	a531c98cd2	Ensure that the empty dictionary won't be accidentally modified Currently there's nothing that prevents modification of the `Dict.empty` primitive, which obviously needs to be truly empty to prevent any future (hard to find) bugs.	2020-09-15 09:29:00 +02:00
Tim van der Meij	b0c7a74a0c	Merge pull request #12361 from Snuffleupagus/_getSaveFieldResources Ensure that all necessary /Font resources are included when saving a `WidgetAnnotation`-instance (issue 12294)	2020-09-15 00:09:31 +02:00
Tim van der Meij	9d7b1d89ca	Merge pull request #12370 from timvandermeij/annotation-reset Implement resetting of created streams for annotations	2020-09-14 23:16:17 +02:00
Tim van der Meij	3ecd984758	Implement resetting of created streams for annotations	2020-09-14 23:08:50 +02:00
Calixte Denizet	0c8de5aaf9	Replace \n and \r by \\n and \\r when saving a string	2020-09-14 17:34:39 +02:00
Jonas Jenwald	c992b8e460	Ensure that all necessary /Font resources are included when saving a `WidgetAnnotation`-instance (issue 12294) This patch contains a possible approach for fixing issue 12294, which compared to other PRs is purposely limited to the affected `WidgetAnnotation` code. As mentioned elsewhere, considering that we're (at least for now) trying to fix one specific case, I think that we should avoid modifying the `Dict` primitive[1] and/or avoid a solution that (indirectly) modifies an existing `Dict`-instance[2]. This patch simply fixes the issue at hand, since that seems easiest for now, and I'd suggest that we worry about a more general approach if/when that actually becomes necessary. Hence the solution implemented here, for `WidgetAnnotation`, is to simply use a combination of the local and AcroForm /DR resources during OperatorList-parsing to ensure that things work correctly regardless of where a particular /Font resource is found. For saving of form-data, on the other hand, we want to avoid increasing the file-size unnecessarily and need to be smarter than just merging all of the available resources. To achieve this, a new `WidgetAnnotation._getSaveFieldResources` method will when necessary produce a combined resources `Dict` with only the minimum amount of data from the AcroForm /DR resources included. --- [1] You want to avoid anything that could cause the general `Dict` implementation to become slower, or more complex, just for handling an edge-case in my opinion. [2] If an existing `Dict`-instance is modified unexpectedly, that could very easily lead to problems elsewhere since e.g. `Dict`-instances created during parsing are not expected to be changed.	2020-09-14 15:22:40 +02:00
Emilio Cobos Álvarez	bf8b1adf73	canvas: Properly restore all the remaining items in stateStack in endDrawing. We were correctly finishing the SMask group but not restoring all the extra transformations applied in stateStack, so if somebody ends up drawing to the same context after canceling mid-draw we'd get artifacts. This re-lands #12363 and fixes Mozilla bug 1664178[1]. [1]: https://bugzilla.mozilla.org/show_bug.cgi?id=1664178	2020-09-12 16:37:54 +02:00
Emilio Cobos Álvarez	3a277f3ba5	canvas: restore() should reflect that smask groups are finished when stateStack is empty. This fixes the issue that caused #12363 to get reverted, see #12367. When we end the SMask group and stateStack.length is zero, nothing updates this.current to reflect it.	2020-09-12 16:37:54 +02:00
Jonas Jenwald	f43d1b316b	Revert "canvas: Properly restore all the remaining items in stateStack in endDrawing"	2020-09-12 16:15:33 +02:00
Tim van der Meij	cdac6f4e68	Merge pull request #12363 from emilio/better-cancelation canvas: Properly restore all the remaining items in stateStack in endDrawing	2020-09-12 15:03:34 +02:00
Emilio Cobos Álvarez	ef1e9a1a3e	canvas: Properly restore all the remaining items in stateStack in endDrawing. We were correctly finishing the SMask group but not restoring all the extra transformations applied in stateStack, so if somebody ends up drawing to the same context after canceling mid-draw we'd get artifacts. This fixes Mozilla bug 1664178[1]. [1]: https://bugzilla.mozilla.org/show_bug.cgi?id=1664178	2020-09-12 13:50:56 +02:00
Tim van der Meij	dfebe7b907	Merge pull request #12365 from Snuffleupagus/forbid-DecodeStream.length Ensure that the `length` property won't be accidentally accessed on a `DecodeStream`-instance	2020-09-11 22:18:30 +02:00
Jonas Jenwald	a11b7341a1	Ensure that the `length` property won't be accidentally accessed on a `DecodeStream`-instance For these streams, compared to `Stream` and `ChunkedStream`, there's no well defined concept of length and consequently no `length` getter.[1] However, attempting to access the non-existent `length` won't currently error, but just return `undefined`, which could thus easily lead to bugs elsewhere in the code-base. --- [1] However, note that all stream implementations have an `isEmpty` getter which can be used instead.	2020-09-11 13:25:40 +02:00
Calixte Denizet	fc154590e8	Dict keys need to be escaped too when saving	2020-09-11 12:25:05 +02:00

4182 Commits