pdf.js

Author	SHA1	Message	Date
Jonas Jenwald	46e94cad17	Fix some errors reported by the ESLint `no-useless-escape` rule This patch removes unnecessary escape-sequences in (mostly) strings, as a first step, since the ones in regular expressions probably require more careful testing (just in case). The only exception is a regular expression in `src/core/annotation.js`, since we should have both unit- and reference-tests for this code and given [this information on MDN](https://developer.mozilla.org/en-US/docs/Web/JavaScript/Guide/Regular_Expressions/Character_Classes#Types): > Inside a character set, the dot loses its special meaning and matches a literal dot. Please find additional details about the ESLint rule at https://eslint.org/docs/rules/no-useless-escape	2020-10-29 15:40:40 +01:00
Jonas Jenwald	820fb7f969	Update all `Object.fromEntries` call-sites to ensure that a `null` prototype is used Given that `Object.fromEntries` doesn't seem to guarantee that a `null` prototype is used, we thus hack around that by using `Object.assign` with `Object.create(null)`.	2020-10-28 14:43:44 +01:00
Jonas Jenwald	9fc7cdcc9d	Use a `Map`, rather than an `Object`, internally in the `Catalog.openAction` getter (PR 11644 follow-up) This provides a work-around to avoid having to conditionally try to initialize the `openAction`-object in multiple places. Given that `Object.fromEntries` doesn't seem to guarantee that a `null` prototype is used, we thus hack around that by using `Object.assign` with `Object.create(null)`.	2020-10-28 14:43:28 +01:00
Tim van der Meij	ea4d88a330	Merge pull request #12395 from calixteman/checks Render not displayed annotations in using normal appearance when printing	2020-10-28 00:11:10 +01:00
Calixte Denizet	6be2f84b4e	Render not displayed annotations in using normal appearance when printing	2020-10-27 19:00:31 +01:00
Tim van der Meij	71a14be8e7	Merge pull request #12534 from Snuffleupagus/murmurhash-slice Ensure that `MurmurHash3_64.update` handles `ArrayBuffer` input correctly, to avoid hash-collisions (issue 12533)	2020-10-26 23:34:03 +01:00
Jonas Jenwald	f2fa053c51	Ensure that `MurmurHash3_64.update` handles `ArrayBuffer` input correctly, to avoid hash-collisions (issue 12533) Different fonts incorrectly end up with identical hashes, despite having different /ToUnicode data. The issue, and it's very interesting that we've apparently not seen it before, appears to be caused by the fact that different /ToUnicode entries share the same underlying `ArrayBuffer`, which thus becomes problematic at the `const dataUint32 = new Uint32Array(data.buffer, 0, blockCounts);` line. The simplest solution thus seems to be to just copy the input, when it's an `ArrayBuffer`, rather than using it as-is. (Note that if we'd stringified the input, when calling `MurmurHash3_64.update`, the issue would also have been fixed. In this case, we're already creating a unique TypedArray.)	2020-10-26 16:27:33 +01:00
Jonas Jenwald	666535be47	Prevent use of optional chaining and nullish coalescing in the `src/shared/` folder Given that this code is used on the worker-thread, where SystemJS is still used during development, we need to (for now) handle this folder the same way as the `src/core/` one.	2020-10-26 13:16:01 +01:00
Jonas Jenwald	c293fc2b8f	Add (some) optional chaining usage in `src/display/api.js` Since we no longer use SystemJS to load the unit-tests, there's now nothing that prevents us from using optional chaining and nullish coalescing in the `src/display/` directory.	2020-10-26 11:11:48 +01:00
Jonas Jenwald	d9084c0be2	Load the fake worker, in non-`PRODUCTION` mode, with native async `import` This removes the last SystemJS usage from both the API and the default viewer.	2020-10-26 11:11:48 +01:00
Jonas Jenwald	56fa6d414c	Add a `getArrayLookupTableFactory` helper function and use it to re-format `src/core/{glyphlist, unicode}.js` Please note: Once https://bugzilla.mozilla.org/show_bug.cgi?id=1247687 is implemented, and we've removed SystemJS completely, this entire patch can (and even should) be reverted. This is similar to the existing `getLookupTableFactory` helper function, but is implemented as outlined in issue 6774. The re-formatting of the tables was done automatically, by using find-and-replace with regular expressions. For reasons that I don't even pretend to understand, using this particular structure for these very long lookup tables allows SystemJS to process the files correctly/quickly and the development viewer thus works as intended.	2020-10-26 11:08:00 +01:00
Jonas Jenwald	441d9c8cc0	Change `src/core/{glyphlist, unicode}.js` to use standard `import`/`export` statements While the built `pdf.worker.js` file still works correctly with these changes, despite these two files being excluded by Babel[1], the development viewer does not because of issues with SystemJS[2] and/or its Babel-plugin (both of which are old). Furthermore, note also that excluding these two files from Babel-processing isn't generally necessary since e.g. the `gulp mozcentral` command works anyway. The explanation is rather that it's actually the source-map generation which fails for these huge sequences when building the `pdf.worker.js` file. However, not using standard `import`/`export` statements in all files means we also need to use SystemJS when e.g. running the unit-tests. This is very unfortunate, since SystemJS (or its old Babel-version) doesn't support modern ECMAScript features such as e.g. optional chaining and nullish coalescing. Unfortunately it also seems that https://bugzilla.mozilla.org/show_bug.cgi?id=1247687, which tracks the implementation of worker-modules in Firefox, has stalled since there haven't been any updates for six months now. To hopefully address all of the above, this patch is the first in a series that attempts to further reduce our reliance on SystemJS. --- [1] The only difference being how the dependencies are handled, in the Webpack-bundled file. [2] Parsing takes way too long and consumes too much memory, thus rendering the development viewer essentially unusable.	2020-10-26 11:08:00 +01:00
Jonas Jenwald	61ffa9caa9	Tweak the `pdf.scripting.js` bundling, to improve overall consistency This brings the new `pdf.scripting.js` bundling more in-line with the pre-existing handling for the `pdf.js`/`pdf.worker.js` files: - Add a new `src/pdf.scripting.js` file as the entry-point for the build scripts. - Add the version/build numbers at the top of the built `pdf.scripting.js` files, since all other built files include that information given that it's often helpful to be able to easily determine the exact version. - Tweak the `createScriptingBundle` in the gulp-file, since it looks like a little bit too much copy-and-paste in the variable names.	2020-10-25 16:36:56 +01:00
Tim van der Meij	b4ca3d55b8	Merge pull request #12508 from calixteman/button_fallback_font Fallback font for buttons must be ZapfDingbats.	2020-10-24 18:56:12 +02:00
Tim van der Meij	da73537fdb	Merge pull request #12524 from Snuffleupagus/pr-12333-followup A couple of small (viewer) tweaks of tooltip-only Annotations (PR 12333 follow-up)	2020-10-24 16:06:01 +02:00
Tim van der Meij	180f35ee91	Merge pull request #12526 from Snuffleupagus/TilingPattern-args Improve argument/name handling when parsing TilingPatterns (PR 12458 follow-up)	2020-10-24 15:47:57 +02:00
Tim van der Meij	c493dc96fa	Merge pull request #12516 from Snuffleupagus/fieldObjects-annotation-undefined Prevent issues, in `PDFDocument.fieldObjects`, for invalid Annotations	2020-10-24 15:42:33 +02:00
Jonas Jenwald	b478d3e7b9	Improve argument/name handling when parsing TilingPatterns (PR 12458 follow-up) - Handle the arguments correctly in `PartialEvaluator.handleColorN`. For TilingPatterns with a base-ColorSpace, we're currently using the `args` when computing the color. However, as can be seen we're passing the Array as-is to the `ColorSpace.getRgb` method, which means that the `Name` is included as well.[1] Thankfully this hasn't, as far as I know, caused any actual bugs, but that may be more luck than anything else given how the `ColorSpace` code is implemented. This can be easily fixed though, simply by popping the `Name`-object off of the `args` Array. - Cache TilingPatterns using the `Name`-string, rather than the object directly. This is not only consistent with other caches in `PartialEvaluator`, but importantly it also ensures that the cache lookup always works correctly. Note that since `Name`-objects, similar to other primitives, use a cache themselves a manually triggered `cleanup`-call could thus (theoretically) cause the `LocalTilingPatternCache` to not find an existing entry. While the likelihood of this happening is extremely small, it's still something that we should fix. --- [1] The `args` Array can e.g. look like this: `[0.043, 0.09, 0.188, 0.004, /P1]`, which means that we're passing in the `Name`-object to the `ColorSpace` method.	2020-10-24 13:49:46 +02:00
Calixte Denizet	37c86b2daa	Fallback font for buttons must be ZapfDingbats. Fix bug https://bugzilla.mozilla.org/show_bug.cgi?id=1669099.	2020-10-24 12:00:03 +02:00
Calixte Denizet	85e6c67cf3	Split highlight annotation div into multiple divs Fix for issue #12504. Highlight annotation may have several rectangles so we must have several divs to add mouse events handlers.	2020-10-23 15:26:16 +02:00
Jonas Jenwald	9f8d9802f9	A couple of small (viewer) tweaks of tooltip-only Annotations (PR 12333 follow-up) Ensure that these tooltip-only Annotations are handled as "internalLink"s, to ensure that they behave as expected in PresentationMode (e.g. they should still use a `pointer`-cursor). Ensure that `PDFLinkService.getDestinationHash` won't create links with empty hashes, since those don't really make a lot of sense in general (this improves things for tooltip-only Annotations). This PDF file can be used for testing: http://mirrors.ctan.org/macros/latex/contrib/pdfcomment/doc/pdfcomment.pdf#page=14	2020-10-23 14:31:45 +02:00
Brendan Dahl	1eaf9c961b	Merge pull request #12432 from calixteman/scripting_api JS - Add the basic architecture to be able to execute embedded js	2020-10-22 19:57:58 -07:00
Tim van der Meij	8cf27494b3	Merge pull request #12503 from calixteman/no_quad Invalidate an annotation with no quadPoints (when it's required)	2020-10-23 00:25:52 +02:00
Jonas Jenwald	b44a975d7c	Prevent issues, in `PDFDocument.fieldObjects`, for invalid Annotations For an invalid Annotation, there's one code-path where `undefined` is returned from `AnnotationFactory._create`. That'd currently, incorrectly, trigger an error during the `PDFDocument._collectFieldObjects` parsing which thus seems good to avoid. Along these lines, the filtering in `PDFDocument.fieldObjects` is also updated to handle both `null` and `undefined` the same way.	2020-10-22 13:24:43 +02:00
Calixte Denizet	e76a96892a	JS - Add the basic architecture to be able to execute embedded js	2020-10-21 19:00:56 +02:00
Calixte Denizet	d2ef878702	Invalidate an annotation with no quadPoints (when it's required) Some PDF software doesn't remove highlight annotations but makes the QuadPoints array empty. Since the Rect for the annotation can be [-32768, -32768, 32768, 32768], this leads to a giant div that catches all the mouse events and makes the PDF unusable when there are form elements.	2020-10-21 13:53:19 +02:00
Jonas Jenwald	8431cfe482	Re-name and re-factor the `PDFLinkService.navigateTo` method This modernizes and improves the code, by using `async`/`await` and by extracting the helper function to its own method. To hopefully avoid confusion, given the next patch, the method is also re-named to `goToDestination` to make it slightly clearer what it actually does.	2020-10-18 14:29:59 +02:00
Calixte Denizet	c30a3a94f0	JS - Add a function in api to get the fields ids in AcroForm::CO	2020-10-17 12:56:40 +02:00
Tim van der Meij	ff2631493e	Merge pull request #12481 from calixteman/issue_12475 Get urls if any in AA::D dictionary for pushbuttons	2020-10-16 22:55:43 +02:00
Tim van der Meij	32bceae732	Merge pull request #12483 from Snuffleupagus/formInfo-hasFields Don't store complex data in `PDFDocument.formInfo`, and replace the `fields` object with a `hasFields` boolean instead	2020-10-16 22:40:40 +02:00
Jonas Jenwald	f956d0a96a	Stop caching the parsed Font data on its `Dict` object (PR 7347 follow-up) Given that all fonts are, ever since PR 7347, now cached in the "normal" `fontCache` there's actually no reason for the special `font.translated` construction. (Given how Objects in JavaScript are references, rather than raw values, the old code shouldn't have caused any significant memory overhead.) Instead we can simply store the `cacheKey`, which is a simple string, on only the Font `Dict`s where it's needed and thus look-up all fonts using the `fontCache` instead.	2020-10-16 17:45:01 +02:00
Jonas Jenwald	29af15f37e	Add more validation in the `PDFDocument._hasOnlyDocumentSignatures` method If this method is ever passed invalid/unexpected data, or if during the course of parsing (since it's used recursively) such data is found, it will fail in a non-graceful way. Hence this patch, which ensures that we don't attempt to access non-existent properties and also that errors such as the one fixed in PR 12479 wouldn't have occurred.	2020-10-16 13:03:47 +02:00
Jonas Jenwald	3351d3476d	Don't store complex data in `PDFDocument.formInfo`, and replace the `fields` object with a `hasFields` boolean instead This patch is based on a couple of smaller things that I noticed when working on PR 12479. - Don't store the /Fields on the `formInfo` getter, since that feels like overloading it with unintended (and too complex) data, and utilize a `hasFields` boolean instead. This functionality was originally added in PR 12271, to help determine what kind of form data a PDF document contains, and I think that we should ensure that the return value of `formInfo` only consists of "simple" data. With these changes the `fieldObjects` getter instead has to look-up the /Fields manually, however that shouldn't be a problem since the access is guarded by a `formInfo.hasFields` check which ensures that the data both exists and is valid. Furthermore, most documents don't even have any /AcroForm data anyway. - Determine the `hasFields` property first, to ensure that it's always correct even if there are errors when checking e.g. the /XFA or /SigFlags entries, since the `fieldObjects` getter depends on it. - Simplify a loop in `fieldObjects`, since the object being accessed is a `Map` and those have built-in iteration support. - Use a higher logging level for errors in the `formInfo` getter, and include the actual error message, since that'd have helped with fixing PR 12479 a lot quicker. - Update the JSDoc comment in `src/display/api.js` to list the return values correctly, and also slightly extend/improve the description.	2020-10-16 12:47:27 +02:00
Calixte Denizet	ce3d3a6ff8	Get urls if any in AA::D dictionary for pushbuttons	2020-10-15 19:42:36 +02:00
Jonas Jenwald	bc6b47a50e	Convert `PartialEvaluator.translateFont` to an `async` method This allows us to make a slight simplification in `PartialEvaluator.loadFont`, which thus removes an old TODO-comment from the method. Furthermore, in `PartialEvaluator.translateFont`, the CMap-handling is now limited to only composite fonts to avoid having to wait for a "dummy"-Promise for most fonts.	2020-10-15 09:42:58 +02:00
Tim van der Meij	a373137304	Merge pull request #12429 from calixteman/collect_js [api-minor] Add the possibility to collect Javascript actions	2020-10-14 23:27:47 +02:00
Calixte Denizet	71ecc3129b	Add the possibility to collect Javascript actions	2020-10-14 10:44:16 +02:00
Tim van der Meij	1034769ca1	Merge pull request #12477 from Snuffleupagus/SaveDocument-WorkerTask Handle `WorkerTask`s, and various PDF document properties, correctly in the "SaveDocument" handler in `src/core/worker.js`	2020-10-13 21:11:54 +02:00
Jonas Jenwald	65132ba5d8	Handle `WorkerTask`s, and various PDF document properties, correctly in the "SaveDocument" handler in `src/core/worker.js` - Actually register/unregister the `WorkerTask`s, used when saving each page, correctly. To prevent issues when terminating the Worker, we purposely wait for all running `WorkerTask`s to complete first. Hence we need to actually handle `WorkerTask`s the same way in "SaveDocument" as in the rest of this file, see e.g. "GetOperatorList" and "GetTextContent". - Access `PDFDocument` properties in a generally safe/consistent way. While the current code works fine, given how the PDF document is being loaded, it still seems like a very good idea to be consistent in how we access these kind of properties (since in general you need to avoid `MissingDataException` everywhere in this file). - Change a variable name, since there's essentially no precedent in the code-base for local variable names to start with an underscore.	2020-10-13 19:30:43 +02:00
Jonas Jenwald	38629c345d	Remove the `scope` parameter from the "GetOperatorList" handler in `src/core/worker.js` (PR 11110 follow-up) Support for the `scope` parameter, in `MessageHandler.on`, was removed in PR 11110 however this particular case was unused/unnecessary for years prior to that change. (From a quick look through the history, I'm not even sure if it was actually needed in the first place.)	2020-10-13 15:58:38 +02:00
Jonas Jenwald	30e8d5dea1	Add local caching of TilingPatterns in `PartialEvaluator.getOperatorList` (issue 2765 and 8473) In practice it's not uncommon for PDF documents to re-use the same TilingPatterns more than once, and parsing them is essentially equal to parsing of a (small) page since a `getOperatorList` call is required. By caching the internal TilingPattern representation we can thus avoid having to re-parse the same data over and over, and there's also less asynchronous parsing required for repeated TilingPatterns. Initially I had intended to include (standard) benchmark results with this patch, however it's not entirely clear that this is actually necessary here given the preliminary results. When testing this manually in the development viewer, using `pdfBug=Stats`, the following (approximate) reductions in rendering times were observed when comparing `master` against this patch: - http://pubs.usgs.gov/sim/3067/pdf/sim3067sheet-2.pdf (from issue 2765): `6800 ms` -> `4100 ms`. - https://github.com/mozilla/pdf.js/files/1046131/stepped.pdf (from issue 8473): `54000 ms` -> `13000 ms` - https://github.com/mozilla/pdf.js/files/1046130/proof.pdf (from issue 8473): `5900 ms` -> `2500 ms` As always, whenever you're dealing with documents which are "slow", there's usually a certain level of subjectivity involved with regards to what's deemed acceptable performance. Hence it's not clear to me that we want to regard any of the referenced issues as fixed, however the improvements are significant enough to warrant caching of TilingPatterns in my opinion.	2020-10-08 18:43:21 +02:00
Jani Pehkonen	935568c2f1	Fix invalid `XUID` entries in CFF fonts In CFF fonts, entry `XUID` should be an array that has no more than 16 elements. In the issue, the length is 20, which causes the fonts to fail. See Appendix B, "Implementation Limits" in PostScript Language Reference Manual https://web.archive.org/web/20170218093716/https://www.adobe.com/products/postscript/pdfs/PLRM.pdf Actually entries `XUID` and `UniqueID` are obsolete altogether. https://blogs.adobe.com/CCJKType/2016/06/no-more-xuid-arrays.html	2020-10-05 17:38:01 +03:00
Jonas Jenwald	9416b14e8b	Re-factor how the ESLint `no-var` rule is enabled in the `src/` folder This simplifies/consolidates the ESLint configuration slightly in the `src/` folder, and prevents the addition of any new files where `var` is being used.[1] Hence we no longer need to manually add `/* eslint no-var: error */` in files, which is easy to forget, and can instead disable the rule in the `src/core/` files where `var` is still in use. --- [1] Obviously the `no-var` rule can, in the same way as every other rule, be disabled on a case-by-case basis where actually necessary.	2020-10-03 20:15:29 +02:00
Tim van der Meij	48e27a1a22	Merge pull request #12437 from Snuffleupagus/src-display-no-var Enable the ESLint `no-var` rule in the `src/display/` folder	2020-10-03 19:59:56 +02:00
Tim van der Meij	6ff1fe4ea9	Merge pull request #12333 from calixteman/tooltip Add tooltip if any in annotations layer	2020-10-03 19:50:39 +02:00
Jonas Jenwald	2a7d1557f9	Enable the ESLint `no-var` rule in the `src/shared/` folder Previously this rule has been enabled in the `web/` folder, and in select files in the `src/` sub-folders. In this case, enabling of this rule didn't actually require any further code changes. Please find additional details about the ESLint rule at https://eslint.org/docs/rules/no-var	2020-10-03 08:27:45 +02:00
Jonas Jenwald	52f6016e6c	Fix the remaining ESLint `no-var` errors in the `src/display/` folder While most of the necessary changes were fixed automatically, see the previous patch, there's a number of cases that needed to be fixed manually.	2020-10-02 16:29:13 +02:00
Jonas Jenwald	e557be5a17	Re-format the `src/display/` files to enforce the ESLint `no-var` rule This was done automatically, using `gulp lint --fix`.	2020-10-02 16:17:28 +02:00
Jonas Jenwald	2a8983d76b	Enable the ESLint `no-var` rule in the `src/display/` folder Previously this rule has been enabled in the `web/` folder, and in select files in the `src/` sub-folders. Note that a number of the files in the `src/display/` folder were already enforcing the `no-var` rule, and thanks to Prettier the necessary re-writing will be (mostly) handled automatically. Please find additional details about the ESLint rule at https://eslint.org/docs/rules/no-var	2020-10-02 16:16:23 +02:00
calixteman	20b12d2bda	Add tooltip if any in annotations layer	2020-10-02 10:11:18 +02:00
Jonas Jenwald	bd3b15b897	Use the `cidToGidMap`, if it exists, when building the glyph mapping for non-embedded composite fonts (issue 12418)	2020-09-28 14:40:43 +02:00
Tim van der Meij	120c5c2261	Merge pull request #12409 from Snuffleupagus/bug-1627030 Compute the `transformOrigin` correctly, for negative values, when rendering `AnnotationElement`s (bug 1627030)	2020-09-24 23:48:21 +02:00
Calixte Denizet	5af352e65a	Need to reset the streams when printing	2020-09-24 19:13:09 +02:00
Jonas Jenwald	fca53a8eb0	Compute the `transformOrigin` correctly, for negative values, when rendering `AnnotationElement`s (bug 1627030) This changes the `transformOrigin` calculations in `AnnotationElement._createContainer` and `PopupAnnotationElement.render`, to ensure that e.g. the clickable area of annotations and/or popups are both positioned correctly. The problem occurs for negative values, since they're not negated correctly because of how the `transformOrigin` strings were built; see issue 12406 for a more in-depth explanation. Previously, for negative values, the `transformOrigin` strings would thus be ignored since they're not valid.	2020-09-24 10:28:29 +02:00
Jonas Jenwald	2497e8eab9	Prevent errors if the `InkList` property, in InkAnnotations, is missing and/or not an Array (issue 12392) To prevent a future bug, the `Vertices` property in PolylineAnnotations is handled the same way.	2020-09-19 15:34:32 +02:00
Calixte Denizet	d51e7e86ff	Use the same kind of strings for radio values	2020-09-16 18:47:25 +02:00
Tim van der Meij	558d3870d3	Merge pull request #12369 from emilio/better-cancelation-follow-up canvas: fix restore() with existing SMask groups and re-land #12363.	2020-09-15 23:19:17 +02:00
Tim van der Meij	374aad77c4	Merge pull request #12375 from Snuffleupagus/emptyDict-set Ensure that the empty dictionary won't be accidentally modified, and slightly improve the "SaveDocument" handler in `src/core/worker.js`	2020-09-15 23:04:57 +02:00
Calixte Denizet	16dd5403c7	Set parent of radio annotation even if there is no 'V' field	2020-09-15 14:41:57 +02:00
Jonas Jenwald	ed4e7cd8a4	A couple of small improvements in the "SaveDocument" handler in `src/core/worker.js` - Check that the "Info"-entry, in the XRef-trailer, is actually a dictionary before accessing it. This is similar to the `PDFDocument.documentInfo` method and follows the general principle of validating data carefully before accessing it, given how often PDF-software may create corrupt PDF files. - Slightly simplify the "XFA"-lookup, since there's no point in trying to fetch something from the empty dictionary.	2020-09-15 09:57:40 +02:00
Jonas Jenwald	a531c98cd2	Ensure that the empty dictionary won't be accidentally modified Currently there's nothing that prevents modification of the `Dict.empty` primitive, which obviously needs to be truly empty to prevent any future (hard to find) bugs.	2020-09-15 09:29:00 +02:00
Tim van der Meij	b0c7a74a0c	Merge pull request #12361 from Snuffleupagus/_getSaveFieldResources Ensure that all necessary /Font resources are included when saving a `WidgetAnnotation`-instance (issue 12294)	2020-09-15 00:09:31 +02:00
Tim van der Meij	9d7b1d89ca	Merge pull request #12370 from timvandermeij/annotation-reset Implement resetting of created streams for annotations	2020-09-14 23:16:17 +02:00
Tim van der Meij	3ecd984758	Implement resetting of created streams for annotations	2020-09-14 23:08:50 +02:00
Calixte Denizet	0c8de5aaf9	Replace \n and \r by \\n and \\r when saving a string	2020-09-14 17:34:39 +02:00
Jonas Jenwald	c992b8e460	Ensure that all necessary /Font resources are included when saving a `WidgetAnnotation`-instance (issue 12294) This patch contains a possible approach for fixing issue 12294, which compared to other PRs is purposely limited to the affected `WidgetAnnotation` code. As mentioned elsewhere, considering that we're (at least for now) trying to fix one specific case, I think that we should avoid modifying the `Dict` primitive[1] and/or avoid a solution that (indirectly) modifies an existing `Dict`-instance[2]. This patch simply fixes the issue at hand, since that seems easiest for now, and I'd suggest that we worry about a more general approach if/when that actually becomes necessary. Hence the solution implemented here, for `WidgetAnnotation`, is to simply use a combination of the local and AcroForm /DR resources during OperatorList-parsing to ensure that things work correctly regardless of where a particular /Font resource is found. For saving of form-data, on the other hand, we want to avoid increasing the file-size unnecessarily and need to be smarter than just merging all of the available resources. To achieve this, a new `WidgetAnnotation._getSaveFieldResources` method will when necessary produce a combined resources `Dict` with only the minimum amount of data from the AcroForm /DR resources included. --- [1] You want to avoid anything that could cause the general `Dict` implementation to become slower, or more complex, just for handling an edge-case in my opinion. [2] If an existing `Dict`-instance is modified unexpectedly, that could very easily lead to problems elsewhere since e.g. `Dict`-instances created during parsing are not expected to be changed.	2020-09-14 15:22:40 +02:00
Emilio Cobos Álvarez	bf8b1adf73	canvas: Properly restore all the remaining items in stateStack in endDrawing. We were correctly finishing the SMask group but not restoring all the extra transformations applied in stateStack, so if somebody ends up drawing to the same context after canceling mid-draw we'd get artifacts. This re-lands #12363 and fixes Mozilla bug 1664178[1]. [1]: https://bugzilla.mozilla.org/show_bug.cgi?id=1664178	2020-09-12 16:37:54 +02:00
Emilio Cobos Álvarez	3a277f3ba5	canvas: restore() should reflect that smask groups are finished when stateStack is empty. This fixes the issue that caused #12363 to get reverted, see #12367. When we end the SMask group and stateStack.length is zero, nothing updates this.current to reflect it.	2020-09-12 16:37:54 +02:00
Jonas Jenwald	f43d1b316b	Revert "canvas: Properly restore all the remaining items in stateStack in endDrawing"	2020-09-12 16:15:33 +02:00
Tim van der Meij	cdac6f4e68	Merge pull request #12363 from emilio/better-cancelation canvas: Properly restore all the remaining items in stateStack in endDrawing	2020-09-12 15:03:34 +02:00
Emilio Cobos Álvarez	ef1e9a1a3e	canvas: Properly restore all the remaining items in stateStack in endDrawing. We were correctly finishing the SMask group but not restoring all the extra transformations applied in stateStack, so if somebody ends up drawing to the same context after canceling mid-draw we'd get artifacts. This fixes Mozilla bug 1664178[1]. [1]: https://bugzilla.mozilla.org/show_bug.cgi?id=1664178	2020-09-12 13:50:56 +02:00
Tim van der Meij	dfebe7b907	Merge pull request #12365 from Snuffleupagus/forbid-DecodeStream.length Ensure that the `length` property won't be accidentally accessed on a `DecodeStream`-instance	2020-09-11 22:18:30 +02:00
Jonas Jenwald	a11b7341a1	Ensure that the `length` property won't be accidentally accessed on a `DecodeStream`-instance For these streams, compared to `Stream` and `ChunkedStream`, there's no well defined concept of length and consequently no `length` getter.[1] However, attempting to access the non-existent `length` won't currently error, but just return `undefined`, which could thus easily lead to bugs elsewhere in the code-base. --- [1] However, note that all stream implementations have an `isEmpty` getter which can be used instead.	2020-09-11 13:25:40 +02:00
Calixte Denizet	fc154590e8	Dict keys need to be escaped too when saving	2020-09-11 12:25:05 +02:00
Tim van der Meij	8cfcd7a488	Merge pull request #12360 from calixteman/12359 Reset cursor position when focus is out of text field	2020-09-10 23:11:35 +02:00
Calixte Denizet	dc4eb71ff1	PDF names need to be escaped when saving	2020-09-10 16:08:13 +02:00
Calixte Denizet	44b24fcc29	Reset cursor position when focus is out of text field	2020-09-10 10:37:13 +02:00
Tim van der Meij	f9d56320f5	Merge pull request #12349 from calixteman/followup_12344 Follow-up of pr #12344	2020-09-09 23:40:53 +02:00
Calixte Denizet	908e7ae5e4	Set the modification date to the current day when saving	2020-09-09 19:06:39 +02:00
Calixte Denizet	64a6efd95e	Follow-up of pr #12344	2020-09-09 11:46:02 +02:00
Brendan Dahl	e51e9d1f33	Merge pull request #12345 from calixteman/save_btn Don't try to save something for a button which is neither a checkbox nor a radio	2020-09-08 15:44:04 -07:00
calixteman	68b99c59ee	Save form data in XFA datasets when pdf is a mix of acroforms and xfa (#12344 ) * Move display/xml_parser.js in shared to use it in worker * Save form data in XFA datasets when pdf is a mix of acroforms and xfa Co-authored-by: Brendan Dahl <brendan.dahl@gmail.com>	2020-09-08 15:13:52 -07:00
Calixte Denizet	7e5026dfc5	Don't try to save something for a button which is neither a checkbox nor a radio	2020-09-08 20:47:46 +02:00
Tim van der Meij	20c891542b	Merge pull request #12269 from calixteman/highlight Add support for missing appearances for highlights, strikeout, squiggly and underline annotations.	2020-09-06 22:25:36 +02:00
Jonas Jenwald	babeae9448	Remove, manually implemented, DOM polyfills only necessary for IE 11 support Please refer to the following compatibility information: - https://developer.mozilla.org/en-US/docs/Web/API/ChildNode/remove#Browser_compatibility - https://developer.mozilla.org/en-US/docs/Web/API/DOMTokenList/add#Browser_compatibility - https://developer.mozilla.org/en-US/docs/Web/API/DOMTokenList/remove#Browser_compatibility - https://developer.mozilla.org/en-US/docs/Web/API/DOMTokenList/toggle#Browser_compatibility Finally, for the `pushState`/`replaceState` polyfills, please refer to PRs 10461 and 11318 for additional details.	2020-09-06 18:24:17 +02:00
Calixte Denizet	65ecd981fe	Add support for missing appearances for highlights, strikeout, squiggly and underline annotations.	2020-09-06 15:40:15 +02:00
Tim van der Meij	50958c46f7	Merge pull request #12331 from Snuffleupagus/Promise-polyfill [api-minor] Only support browsers/environments that have basic support for `Promise` natively	2020-09-06 14:19:55 +02:00
Jonas Jenwald	449c7763d5	[api-minor] Only support browsers/environments that have basic support for `Promise` natively Based on https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Global_Objects/Promise#Browser_compatibility and https://caniuse.com/#feat=promises, all even remotely modern browsers already support basic `Promise` functionality natively. The only reason for keeping the `Promise` polyfill (at all) is to be able to support recent additions to the specification, such as e.g. `finally` and `allSettled`. Note that this patch will, on its own, remove support for IE 11/Edge (non-Chromium based) in both the general PDF.js library and the default viewer.	2020-09-06 13:45:56 +02:00
Jonas Jenwald	6c8f1f7d6f	Run `gulp lint --fix`, to account for changes in Prettier version `2.1.x`	2020-09-06 12:23:59 +02:00
Jonas Jenwald	784a420027	Add support, in `Dict.merge`, for merging of "sub"-dictionaries This allows for merging of dictionaries one level deeper than previously. This could be useful e.g. for /Resources dictionaries, where you want to e.g. merge their respective /Font dictionaries (and other) together rather than picking just the first one.	2020-08-30 23:18:32 +02:00
Jonas Jenwald	66aabe3ec7	[api-minor] Add support for toggling of Optional Content in the viewer (issue 12096) Besides, obviously, adding viewer support: This patch attempts to improve the general API for Optional Content Groups slightly, by adding a couple of new methods for interacting with the (more complex) data structures of `OptionalContentConfig`-instances. (Thus allowing us to mark some of the data as "private", given that it probably shouldn't be manipulated directly.) By utilizing not just the "raw" Optional Content Groups, but the data from the `/Order` array when available, we can thus display the Layers in a proper tree-structure with collapsible headings for PDF documents that utilizes that feature. Note that it's possible to reset all Optional Content Groups to their default visibility state, simply by double-clicking on the Layers-button in the sidebar. (Currently that's indicated in the Layers-button tooltip, which is obviously easy to overlook, however it's probably the best we can do for now without adding more buttons, or even a dropdown-toolbar, to the sidebar.) Also, the current Layers-button icons are a little rough around the edges, quite literally, but given that the viewer will soon have its UI modernized anyway they hopefully suffice in the meantime. To give users full control of the visibility of the various Optional Content Groups, even those which according to the `/Order` array should not (by default) be toggleable in the UI, this patch will place those under a custom heading which: - Is collapsed by default, and placed at the bottom of the Layers-tree, to be a bit less obtrusive. - Uses a slightly different formatting, compared to the "regular" headings. - Is localizable. Finally, note that the thumbnails are purposely always rendered with all Optional Content Groups at their default visibility state, since that seems the most useful and it's also consistent with other viewers. To ensure that this works as intended, we'll thus disable the `PDFThumbnailView.setImage` functionality when the Optional Content Groups have been changed in the viewer. (This obviously means that we'll re-render thumbnails instead of using the rendered pages. However, this situation ought to be rare enough for this to not really be a problem.)	2020-08-30 16:28:40 +02:00
Jonas Jenwald	2393443e73	Include the `/Order` array, if available, when parsing the Optional Content configuration The `/Order` array is used to improve the display of Optional Content groups in PDF viewers, and it allows a PDF document to e.g. specify that Optional Content groups should be displayed as a (collapsable) tree-structure rather than as just a list. Note that not all available Optional Content groups must be present in the `/Order` array, and PDF viewers will often (by default) hide those toggles in the UI. To allow us to improve the UX around toggling of Optional Content groups, in the default viewer, these hidden-by-default groups are thus appended to the parsed `/Order` array under a custom nesting level (with `name == null`). Finally, the patch also slightly tweaks an `OptionalContentConfig` related JSDoc-comment in the API.	2020-08-30 16:28:40 +02:00
Tim van der Meij	06b53d770a	Merge pull request #12259 from brendandahl/cmap-fix Fix handling of symbolic fonts and unicode cmaps.	2020-08-30 16:01:24 +02:00
Brendan Dahl	45e8a31cc0	Fix handling of symbolic fonts and unicode cmaps. In issue 12120, the font has a 1,0 cmap and is marked symbolic which according to the spec means we should directly use the cmap instead of the extra steps that are defined in 9.6.6.4. However, just fixing that caused bug 1057544 to break. The font in bug 1057544 has a 0,1 cmap (Unicode 1.1) which we were not using, but is easy to support. We're also easily able to support some of the other unicode cmaps, so I added those as well. There was also a second issue with bug 1057544, the cmap doesn't have a mapping for the "quoteright" glyph, but it is defined in the post table. To handle this, I've moved post table as a fallback for any font that has an encoding.	2020-08-27 14:33:11 -07:00
Calixte Denizet	ba94f04ba3	Bug 1661226 - Push button are not rendered with renderInteractiveForms enabled	2020-08-27 10:45:14 +02:00
Tim van der Meij	0f229d537f	Inline the `setup` method in the `parse` method in `src/core/document.js` Now that the `parse` method is simplified we can inline the `setup` method in the `parse` method since it's only two lines of code. This avoids some indirection.	2020-08-25 23:28:55 +02:00
Tim van der Meij	280207c740	Redo the form type detection logic and include unit tests Good form type detection is important to get reliable telemetry and to only show the fallback bar if a form cannot be filled out by the user. PDF.js only supports AcroForm data, so XFA data is explicitly unsupported (tracked in issue #2373). However, the previous form type detection couldn't separate AcroForm and XFA well enough, causing form type telemetry to be incorrect sometimes and the fallback bar to be shown for forms that could in fact be filled out by the user. The solution in this commit is found by studying the specification and the form documents that are available to us. In a nutshell the rules are: - There is XFA data if the `XFA` entry is a non-empty array or stream. - There is AcroForm data if the `Fields` entry is a non-empty array and it doesn't consist of only document signatures. The document signatures part was not handled in the old code, causing a document with only XFA data to also be marked as having AcroForm data. Moreover, the old code didn't check all the data types. Now that AcroForm and XFA can be distinguished, the viewer is configured to only show the fallback bar for documents that only have XFA data. If a document also has AcroForm data, the viewer can use that to render the form. We have not found documents where the XFA data was necessary in that case. Finally, we include unit tests to ensure that all cases are covered and move the form type detection out of the `parse` function so that it's only executed if the document information is actually requested (potentially making initial parsing a tiny bit faster).	2020-08-25 23:28:55 +02:00
Tim van der Meij	f0bf62ff54	Mark the `catDict` member as private in the `Catalog` class Not only is `catDict` never accessed anymore outside of this file, it should also never happen since it's internal to the catalog. If data from it is needed elsewhere, the catalog should provide a getter for it that can do basic data integrity checks and abstract away any unnecessary details.	2020-08-25 23:28:55 +02:00
Tim van der Meij	f20f0bcc78	Move the AcroForm logic from the document to the catalog The `AcroForm` entry is part of the catalog, not of the document, so its logic should be placed there instead. The document should look in the catalog to fetch it, and not have knowledge of `catDict`, which is a member internal to the catalog. Moreover, make the AcroForm member private on the document instance. It's only used internally and was also never intended to be public. For users it's exposed by the `getMetadata` API endpoint as `IsAcroFormPresent`. Only a boolean is exposed, so we now also only store the boolean on the document instance. Finally, the annotation code needs access to the full AcroForm dictionary, so it's updated to fetch the data from the catalog instead of the document that now only holds the boolean.	2020-08-25 23:28:55 +02:00
Tim van der Meij	b41a2f4d5a	Move the collection logic from the document to the catalog The `Collection` entry is part of the catalog, not of the document, so its logic should be placed there instead. The document should look in the catalog to fetch it, and not have knowledge of `catDict`, which is a member internal to the catalog. Moreover, remove the collection member from the document instance. It's only used internally and was also never intended to be public. For users it's exposed by the `getMetadata` API endpoint as `IsCollectionPresent`. Moving this out of the `parse` function makes sure that the getter is only executed if the document information is actually requested (potentially making initial parsing a tiny bit faster).	2020-08-25 23:28:55 +02:00
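
The form type detection rules summarized in commit 280207c740 above can be illustrated with a minimal sketch. This is not the PDF.js implementation: plain objects stand in for PDF dictionaries and streams, and the names `hasXfaData`, `hasAcroFormData`, `detectFormTypes`, the `isStream`/`length` stream model, and the `FT === "Sig"` signature check are all assumptions made for illustration.

```javascript
// Hypothetical sketch of the rules from commit 280207c740.
// None of these function names or data shapes come from the PDF.js source.

// Assumption: a stream is modelled as { isStream: true, length: <bytes> }.
// Rule: there is XFA data if `XFA` is a non-empty array or stream.
function hasXfaData(acroForm) {
  const xfa = acroForm.XFA;
  if (Array.isArray(xfa)) {
    return xfa.length > 0;
  }
  return Boolean(xfa && xfa.isStream && xfa.length > 0);
}

// Assumption: a field counts as a document signature when its FT is "Sig".
// Rule: there is AcroForm data if `Fields` is a non-empty array that does
// not consist of only document signatures.
function hasAcroFormData(acroForm) {
  const fields = acroForm.Fields;
  if (!Array.isArray(fields) || fields.length === 0) {
    return false;
  }
  return !fields.every(field => Boolean(field) && field.FT === "Sig");
}

function detectFormTypes(acroForm) {
  if (!acroForm || typeof acroForm !== "object") {
    return { hasAcroForm: false, hasXFA: false, showFallbackBar: false };
  }
  const hasXFA = hasXfaData(acroForm);
  const hasAcroForm = hasAcroFormData(acroForm);
  // Per the commit message, the fallback bar is only shown for documents
  // that have XFA data and no usable AcroForm data.
  return { hasAcroForm, hasXFA, showFallbackBar: hasXFA && !hasAcroForm };
}
```

For example, `detectFormTypes({ XFA: ["datasets"], Fields: [{ FT: "Sig" }] })` treats the document as XFA-only, because the only field is a signature, so `showFallbackBar` is `true`.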

4206 Commits