pdf.js

Author	SHA1	Message	Date
Tim van der Meij	c493dc96fa	Merge pull request #12516 from Snuffleupagus/fieldObjects-annotation-undefined Prevent issues, in `PDFDocument.fieldObjects`, for invalid Annotations	2020-10-24 15:42:33 +02:00
Jonas Jenwald	b478d3e7b9	Improve argument/name handling when parsing TilingPatterns (PR 12458 follow-up) - Handle the arguments correctly in `PartialEvaluator.handleColorN`. For TilingPatterns with a base-ColorSpace, we're currently using the `args` when computing the color. However, as can be seen we're passing the Array as-is to the `ColorSpace.getRgb` method, which means that the `Name` is included as well.[1] Thankfully this hasn't, as far as I know, caused any actual bugs, but that may be more luck than anything else given how the `ColorSpace` code is implemented. This can be easily fixed though, simply by popping the `Name`-object off of the `args` Array. - Cache TilingPatterns using the `Name`-string, rather than the object directly. This is not only consistent with other caches in `PartialEvaluator`, but importantly it also ensures that the cache lookup always works correctly. Note that since `Name`-objects, similar to other primitives, uses a cache themselves a manually triggered `cleanup`-call could thus (theoretically) cause the `LocalTilingPatternCache` to not find an existing entry. While the likelihood of this happening is extremely small, it's still something that we should fix. --- [1] The `args` Array can e.g. look like this: `[0.043, 0.09, 0.188, 0.004, /P1]`, which means that we're passing in the `Name`-object to the `ColorSpace` method.	2020-10-24 13:49:46 +02:00
Calixte Denizet	37c86b2daa	Fallback font for buttons must be ZapfDingbats. Fix bug https://bugzilla.mozilla.org/show_bug.cgi?id=1669099.	2020-10-24 12:00:03 +02:00
Calixte Denizet	85e6c67cf3	Split highlight annotation div into multiple divs Fix for issue #12504. Highlight annotation may have several rectangles so we must have several divs to add mouse events handlers.	2020-10-23 15:26:16 +02:00
Jonas Jenwald	9f8d9802f9	A couple of small (viewer) tweaks of tooltip-only Annotations (PR 12333 follow-up) Ensure that these tooltip-only Annotations are handled as "internalLink"s, to ensure that they behave as expected in PresentationMode (e.g. they should still use a `pointer`-cursor). Ensure that `PDFLinkService.getDestinationHash` won't create links with empty hashes, since those don't really make a lot of sense in general (this improves things for tooltip-only Annotations). This PDF file can be used for testing: http://mirrors.ctan.org/macros/latex/contrib/pdfcomment/doc/pdfcomment.pdf#page=14	2020-10-23 14:31:45 +02:00
Brendan Dahl	1eaf9c961b	Merge pull request #12432 from calixteman/scripting_api JS - Add the basic architecture to be able to execute embedded js	2020-10-22 19:57:58 -07:00
Tim van der Meij	8cf27494b3	Merge pull request #12503 from calixteman/no_quad Invalidate an annotation with no quadPoints (when it's required)	2020-10-23 00:25:52 +02:00
Jonas Jenwald	b44a975d7c	Prevent issues, in `PDFDocument.fieldObjects`, for invalid Annotations For an invalid Annotation, there's one code-path where `undefined` is returned from `AnnotationFactory._create`. That'd currently, incorrectly, trigger an error during the `PDFDocument._collectFieldObjects` parsing which thus seem good to avoid. Along these lines, the filtering in `PDFDocument.fieldObjects` is also updated to handle both `null` and `undefined` the same way.	2020-10-22 13:24:43 +02:00
Calixte Denizet	e76a96892a	JS - Add the basic architecture to be able to execute embedded js	2020-10-21 19:00:56 +02:00
Calixte Denizet	d2ef878702	Invalidate an annotation with no quadPoints (when it's required) Some pdf softwares don't remove highlight annotations but make the QuadPoints array empty. And the Rect for the annotation can be [-32768, -32768, 32768, 32768] so it leads to have a giant div which catches all the mouse events and make the pdf unusable when there are some forms elements.	2020-10-21 13:53:19 +02:00
Jonas Jenwald	8431cfe482	Re-name and re-factor the `PDFLinkService.navigateTo` method This modernizes and improves the code, by using `async`/`await` and by extracting the helper function to its own method. To hopefully avoid confusion, given the next patch, the method is also re-named to `goToDestination` to make is slightly clearer what it actually does.	2020-10-18 14:29:59 +02:00
Calixte Denizet	c30a3a94f0	JS - Add a function in api to get the fields ids in AcroForm::CO	2020-10-17 12:56:40 +02:00
Tim van der Meij	ff2631493e	Merge pull request #12481 from calixteman/issue_12475 Get urls if any in AA::D dictionary for pushbuttons	2020-10-16 22:55:43 +02:00
Tim van der Meij	32bceae732	Merge pull request #12483 from Snuffleupagus/formInfo-hasFields Don't store complex data in `PDFDocument.formInfo`, and replace the `fields` object with a `hasFields` boolean instead	2020-10-16 22:40:40 +02:00
Jonas Jenwald	f956d0a96a	Stop caching the parsed Font data on its `Dict` object (PR 7347 follow-up) Given that all fonts are, ever since PR 7347, now cached in the "normal" `fontCache` there's actually no reason for the special `font.translated` construction. (Given how Objects in JavaScript are references, rather than raw values, the old code shouldn't have caused any significant memory overhead.) Instead we can simply store the `cacheKey`, which is a simple string, on only the Font `Dict`s where it's needed and thus look-up all fonts using the `fontCache` instead.	2020-10-16 17:45:01 +02:00
Jonas Jenwald	29af15f37e	Add more validation in the `PDFDocument._hasOnlyDocumentSignatures` method If this method is ever passed invalid/unexpected data, or if during the course of parsing (since it's used recursively) such data is found, it will fail in a non-graceful way. Hence this patch, which ensures that we don't attempt to access non-existent properties and also that errors such as the one fixed in PR 12479 wouldn't have occured.	2020-10-16 13:03:47 +02:00
Jonas Jenwald	3351d3476d	Don't store complex data in `PDFDocument.formInfo`, and replace the `fields` object with a `hasFields` boolean instead This patch is based on a couple of smaller things that I noticed when working on PR 12479. - Don't store the /Fields on the `formInfo` getter, since that feels like overloading it with unintended (and too complex) data, and utilize a `hasFields` boolean instead. This functionality was originally added in PR 12271, to help determine what kind of form data a PDF document contains, and I think that we should ensure that the return value of `formInfo` only consists of "simple" data. With these changes the `fieldObjects` getter instead has to look-up the /Fields manually, however that shouldn't be a problem since the access is guarded by a `formInfo.hasFields` check which ensures that the data both exists and is valid. Furthermore, most documents doesn't even have any /AcroForm data anyway. - Determine the `hasFields` property first, to ensure that it's always correct even if there's errors when checking e.g. the /XFA or /SigFlags entires, since the `fieldObjects` getter depends on it. - Simplify a loop in `fieldObjects`, since the object being accessed is a `Map` and those have built-in iteration support. - Use a higher logging level for errors in the `formInfo` getter, and include the actual error message, since that'd have helped with fixing PR 12479 a lot quicker. - Update the JSDoc comment in `src/display/api.js` to list the return values correctly, and also slightly extend/improve the description.	2020-10-16 12:47:27 +02:00
Calixte Denizet	ce3d3a6ff8	Get urls if any in AA::D dictionary for pushbuttons	2020-10-15 19:42:36 +02:00
Jonas Jenwald	bc6b47a50e	Convert `PartialEvaluator.translateFont` to an `async` method This allows us to make a slight simplification in `PartialEvaluator.loadFont`, which thus removes an old TODO-comment from the method. Furthermore, in `PartialEvaluator.translateFont`, the CMap-handling is now limited to only composite fonts to avoid having to wait for a "dummy"-Promise for most fonts.	2020-10-15 09:42:58 +02:00
Tim van der Meij	a373137304	Merge pull request #12429 from calixteman/collect_js [api-minor] Add the possibility to collect Javascript actions	2020-10-14 23:27:47 +02:00
Calixte Denizet	71ecc3129b	Add the possibility to collect Javascript actions	2020-10-14 10:44:16 +02:00
Tim van der Meij	1034769ca1	Merge pull request #12477 from Snuffleupagus/SaveDocument-WorkerTask Handle `WorkerTask`s, and various PDF document properties, correctly in the "SaveDocument" handler in `src/core/worker.js`	2020-10-13 21:11:54 +02:00
Jonas Jenwald	65132ba5d8	Handle `WorkerTask`s, and various PDF document properties, correctly in the "SaveDocument" handler in `src/core/worker.js` - Actually register/unregister the `WorkerTask`s, used when saving each page, correctly. To prevent issues when terminating the Worker, we purposely wait for all running `WorkerTask`s to complete first. Hence we need to actually handle `WorkerTask`s the same way in "SaveDocument" as in the rest of this file, see e.g. "GetOperatorList" and "GetTextContent". - Access `PDFDocument` properties in a generally safe/consistent way. While the current code works fine, given how the PDF document is being loaded, it still seems like a very good idea to be consistent in how we access these kind of properties (since in general you need to avoid `MissingDataException` everywhere in this file). - Change a variable name, since there's essentially no precedent in the code-base for local variable names to start with an underscore.	2020-10-13 19:30:43 +02:00
Jonas Jenwald	38629c345d	Remove the `scope` parameter from the "GetOperatorList" handler in `src/core/worker.js` (PR 11110 follow-up) Support for the `scope` parameter, in `MessageHandler.on`, was removed in PR 11110 however this particular case was unused/unnecessary for years prior to that change. (From a quick look through the history, I'm not even sure if it was actually needed in the first place.)	2020-10-13 15:58:38 +02:00
Jonas Jenwald	30e8d5dea1	Add local caching of TilingPatterns in `PartialEvaluator.getOperatorList` (issue 2765 and 8473) In practice it's not uncommon for PDF documents to re-use the same TilingPatterns more than once, and parsing them is essentially equal to parsing of a (small) page since a `getOperatorList` call is required. By caching the internal TilingPattern representation we can thus avoid having to re-parse the same data over and over, and there's also less asynchronous parsing required for repeated TilingPatterns. Initially I had intended to include (standard) benchmark results with this patch, however it's not entirely clear that this is actually necessary here given the preliminary results. When testing this manually in the development viewer, using `pdfBug=Stats`, the following (approximate) reduction in rendering times were observed when comparing `master` against this patch: - http://pubs.usgs.gov/sim/3067/pdf/sim3067sheet-2.pdf (from issue 2765): `6800 ms` -> `4100 ms`. - https://github.com/mozilla/pdf.js/files/1046131/stepped.pdf (from issue 8473): `54000 ms` -> `13000 ms` - https://github.com/mozilla/pdf.js/files/1046130/proof.pdf (from issue 8473): `5900 ms` -> `2500 ms` As always, whenever you're dealing with documents which are "slow", there's usually a certain level of subjectivity involved with regards to what's deemed acceptable performance. Hence it's not clear to me that we want to regard any of the referenced issues as fixed, however the improvements are significant enough to warrant caching of TilingPatterns in my opinion.	2020-10-08 18:43:21 +02:00
Jani Pehkonen	935568c2f1	Fix invalid `XUID` entries in CFF fonts In CFF fonts, entry `XUID` should be an array that has no more than 16 elements. In the issue, the length is 20, which causes the fonts to fail. See Appendix B, "Implementation Limits" in PostScript Language Reference Manual https://web.archive.org/web/20170218093716/https://www.adobe.com/products/postscript/pdfs/PLRM.pdf Actually entries `XUID` and `UniqueID` are obsolete altogether. https://blogs.adobe.com/CCJKType/2016/06/no-more-xuid-arrays.html	2020-10-05 17:38:01 +03:00
Jonas Jenwald	9416b14e8b	Re-factor how the ESLint `no-var` rule is enabled in the `src/` folder This simplifies/consolidates the ESLint configuration slightly in the `src/` folder, and prevents the addition of any new files where `var` is being used.[1] Hence we no longer need to manually add `/* eslint no-var: error */` in files, which is easy to forget, and can instead disable the rule in the `src/core/` files where `var` is still in use. --- [1] Obviously the `no-var` rule can, in the same way as every other rule, be disabled on a case-by-case basis where actually necessary.	2020-10-03 20:15:29 +02:00
Tim van der Meij	48e27a1a22	Merge pull request #12437 from Snuffleupagus/src-display-no-var Enable the ESLint `no-var` rule in the `src/display/` folder	2020-10-03 19:59:56 +02:00
Tim van der Meij	6ff1fe4ea9	Merge pull request #12333 from calixteman/tooltip Add tooltip if any in annotations layer	2020-10-03 19:50:39 +02:00
Jonas Jenwald	2a7d1557f9	Enable the ESLint `no-var` rule in the `src/shared/` folder Previously this rule has been enabled in the `web/` folder, and in select files in the `src/` sub-folders. In this case, enabling of this rule didn't actually require any further code changes. Please find additional details about the ESLint rule at https://eslint.org/docs/rules/no-var	2020-10-03 08:27:45 +02:00
Jonas Jenwald	52f6016e6c	Fix the remaining ESLint `no-var` errors in the `src/display/` folder While most of necessary changes were fixed automatically, see the previous patch, there's a number of cases that needed to be fixed manually.	2020-10-02 16:29:13 +02:00
Jonas Jenwald	e557be5a17	Re-format the `src/display/` files to enforce the ESLint `no-var` rule This was done automatically, using `gulp lint --fix`.	2020-10-02 16:17:28 +02:00
Jonas Jenwald	2a8983d76b	Enable the ESLint `no-var` rule in the `src/display/` folder Previously this rule has been enabled in the `web/` folder, and in select files in the `src/` sub-folders. Note that a number of the files in the `src/display/` folder were already enforcing the `no-var` rule, and thanks to Prettier the necessary re-writing will be (mostly) handled automatically. Please find additional details about the ESLint rule at https://eslint.org/docs/rules/no-var	2020-10-02 16:16:23 +02:00
calixteman	20b12d2bda	Add tooltip if any in annotations layer	2020-10-02 10:11:18 +02:00
Jonas Jenwald	bd3b15b897	Use the `cidToGidMap`, if it exists, when building the glyph mapping for non-embedded composite fonts (issue 12418)	2020-09-28 14:40:43 +02:00
Tim van der Meij	120c5c2261	Merge pull request #12409 from Snuffleupagus/bug-1627030 Compute the `transformOrigin` correctly, for negative values, when rendering `AnnotationElement`s (bug 1627030)	2020-09-24 23:48:21 +02:00
Calixte Denizet	5af352e65a	Need to reset the streams when printing	2020-09-24 19:13:09 +02:00
Jonas Jenwald	fca53a8eb0	Compute the `transformOrigin` correctly, for negative values, when rendering `AnnotationElement`s (bug 1627030) This changes the `transformOrigin` calculations in `AnnotationElement._createContainer` and `PopupAnnotationElement.render`, to ensure that e.g. the clickable area of annotations and/or popups are both positioned correctly. The problem occurs for negative values, since they're not negated correctly because of how the `transformOrigin` strings were build; see issue 12406 for a more in-depth explanation. Previously, for negative values, the `transformOrigin` strings would thus be ignored since they're not valid.	2020-09-24 10:28:29 +02:00
Jonas Jenwald	2497e8eab9	Prevent errors if the `InkList` property, in InkAnnotations, is missing and/or not an Array (issue 12392) To prevent a future bug, the `Vertices` property in PolylineAnnotations are handled the same way.	2020-09-19 15:34:32 +02:00
Calixte Denizet	d51e7e86ff	Use the same kind of strings for radio values	2020-09-16 18:47:25 +02:00
Tim van der Meij	558d3870d3	Merge pull request #12369 from emilio/better-cancelation-follow-up canvas: fix restore() with existing SMask groups and re-land #12363.	2020-09-15 23:19:17 +02:00
Tim van der Meij	374aad77c4	Merge pull request #12375 from Snuffleupagus/emptyDict-set Ensure that the empty dictionary won't be accidentally modified, and slightly improve the "SaveDocument" handler in `src/core/worker.js`	2020-09-15 23:04:57 +02:00
Calixte Denizet	16dd5403c7	Set parent of radio annotation even if there is no 'V' field	2020-09-15 14:41:57 +02:00
Jonas Jenwald	ed4e7cd8a4	A couple of small improvements in the "SaveDocument" handler in `src/core/worker.js` - Check that the "Info"-entry, in the XRef-trailer, is actually a dictionary before accessing it. This is similar to the `PDFDocument.documentInfo` method and follows the general principal of validating data carefully before accessing it, given how often PDF-software may create corrupt PDF files. - Slightly simplify the "XFA"-lookup, since there's no point in trying to fetch something from the empty dictionary.	2020-09-15 09:57:40 +02:00
Jonas Jenwald	a531c98cd2	Ensure that the empty dictionary won't be accidentally modified Currently there's nothing that prevents modification of the `Dict.empty` primitive, which obviously needs to be truly empty to prevent any future (hard to find) bugs.	2020-09-15 09:29:00 +02:00
Tim van der Meij	b0c7a74a0c	Merge pull request #12361 from Snuffleupagus/_getSaveFieldResources Ensure that all necessary /Font resources are included when saving a `WidgetAnnotation`-instance (issue 12294)	2020-09-15 00:09:31 +02:00
Tim van der Meij	9d7b1d89ca	Merge pull request #12370 from timvandermeij/annotation-reset Implement resetting of created streams for annotations	2020-09-14 23:16:17 +02:00
Tim van der Meij	3ecd984758	Implement resetting of created streams for annotations	2020-09-14 23:08:50 +02:00
Calixte Denizet	0c8de5aaf9	Replace \n and \r by \n and \r when saving a string	2020-09-14 17:34:39 +02:00
Jonas Jenwald	c992b8e460	Ensure that all necessary /Font resources are included when saving a `WidgetAnnotation`-instance (issue 12294) This patch contains a possible approach for fixing issue 12294, which compared to other PRs is purposely limited to the affected `WidgetAnnotation` code. As mentioned elsewhere, considering that we're (at least for now) trying to fix one specific case, I think that we should avoid modifying the `Dict` primitive[1] and/or avoid a solution that (indirectly) modifies an existing `Dict`-instance[2]. This patch simply fixes the issue at hand, since that seems easiest for now, and I'd suggest that we worry about a more general approach if/when that actually becomes necessary. Hence the solution implemented here, for `WidgetAnnotation`, is to simply use a combination of the local and AcroForm /DR resources during OperatorList-parsing to ensure that things work correctly regardless of where a particular /Font resource is found. For saving of form-data, on the other hand, we want to avoid increasing the file-size unnecessarily and need to be smarter than just merging all of the available resources. To achive this, a new `WidgetAnnotation._getSaveFieldResources` method will when necessary produce a combined resources `Dict` with only the minimum amount of data from the AcroForm /DR resources included. --- [1] You want to avoid anything that could cause the general `Dict` implementation to become slower, or more complex, just for handling an edge-case in my opinion. [2] If an existing `Dict`-instance is modified unexpectedly, that could very easily lead to problems elsewhere since e.g. `Dict`-instances created during parsing are not expected to be changed.	2020-09-14 15:22:40 +02:00

1 2 3 4 5 ...

4240 Commits