pdf.js

Author	SHA1	Message	Date
Tim van der Meij	374aad77c4	Merge pull request #12375 from Snuffleupagus/emptyDict-set Ensure that the empty dictionary won't be accidentally modified, and slightly improve the "SaveDocument" handler in `src/core/worker.js`	2020-09-15 23:04:57 +02:00
Calixte Denizet	16dd5403c7	Set parent of radio annotation even if there is no 'V' field	2020-09-15 14:41:57 +02:00
Jonas Jenwald	ed4e7cd8a4	A couple of small improvements in the "SaveDocument" handler in `src/core/worker.js` - Check that the "Info"-entry, in the XRef-trailer, is actually a dictionary before accessing it. This is similar to the `PDFDocument.documentInfo` method and follows the general principle of validating data carefully before accessing it, given how often PDF-software may create corrupt PDF files. - Slightly simplify the "XFA"-lookup, since there's no point in trying to fetch something from the empty dictionary.	2020-09-15 09:57:40 +02:00
Jonas Jenwald	a531c98cd2	Ensure that the empty dictionary won't be accidentally modified Currently there's nothing that prevents modification of the `Dict.empty` primitive, which obviously needs to be truly empty to prevent any future (hard to find) bugs.	2020-09-15 09:29:00 +02:00
Tim van der Meij	b0c7a74a0c	Merge pull request #12361 from Snuffleupagus/_getSaveFieldResources Ensure that all necessary /Font resources are included when saving a `WidgetAnnotation`-instance (issue 12294)	2020-09-15 00:09:31 +02:00
Tim van der Meij	9d7b1d89ca	Merge pull request #12370 from timvandermeij/annotation-reset Implement resetting of created streams for annotations	2020-09-14 23:16:17 +02:00
Tim van der Meij	3ecd984758	Implement resetting of created streams for annotations	2020-09-14 23:08:50 +02:00
Calixte Denizet	0c8de5aaf9	Replace \n and \r by \\n and \\r when saving a string	2020-09-14 17:34:39 +02:00
Jonas Jenwald	c992b8e460	Ensure that all necessary /Font resources are included when saving a `WidgetAnnotation`-instance (issue 12294) This patch contains a possible approach for fixing issue 12294, which compared to other PRs is purposely limited to the affected `WidgetAnnotation` code. As mentioned elsewhere, considering that we're (at least for now) trying to fix one specific case, I think that we should avoid modifying the `Dict` primitive[1] and/or avoid a solution that (indirectly) modifies an existing `Dict`-instance[2]. This patch simply fixes the issue at hand, since that seems easiest for now, and I'd suggest that we worry about a more general approach if/when that actually becomes necessary. Hence the solution implemented here, for `WidgetAnnotation`, is to simply use a combination of the local and AcroForm /DR resources during OperatorList-parsing to ensure that things work correctly regardless of where a particular /Font resource is found. For saving of form-data, on the other hand, we want to avoid increasing the file-size unnecessarily and need to be smarter than just merging all of the available resources. To achieve this, a new `WidgetAnnotation._getSaveFieldResources` method will when necessary produce a combined resources `Dict` with only the minimum amount of data from the AcroForm /DR resources included. --- [1] You want to avoid anything that could cause the general `Dict` implementation to become slower, or more complex, just for handling an edge-case in my opinion. [2] If an existing `Dict`-instance is modified unexpectedly, that could very easily lead to problems elsewhere since e.g. `Dict`-instances created during parsing are not expected to be changed.	2020-09-14 15:22:40 +02:00
Emilio Cobos Álvarez	bf8b1adf73	canvas: Properly restore all the remaining items in stateStack in endDrawing. We were correctly finishing the SMask group but not restoring all the extra transformations applied in stateStack, so if somebody ends up drawing to the same context after canceling mid-draw we'd get artifacts. This re-lands #12363 and fixes Mozilla bug 1664178[1]. [1]: https://bugzilla.mozilla.org/show_bug.cgi?id=1664178	2020-09-12 16:37:54 +02:00
Emilio Cobos Álvarez	3a277f3ba5	canvas: restore() should reflect that smask groups are finished when stateStack is empty. This fixes the issue that caused #12363 to get reverted, see #12367. When we end the SMask group and stateStack.length is zero, nothing updates this.current to reflect it.	2020-09-12 16:37:54 +02:00
Jonas Jenwald	f43d1b316b	Revert "canvas: Properly restore all the remaining items in stateStack in endDrawing"	2020-09-12 16:15:33 +02:00
Tim van der Meij	cdac6f4e68	Merge pull request #12363 from emilio/better-cancelation canvas: Properly restore all the remaining items in stateStack in endDrawing	2020-09-12 15:03:34 +02:00
Emilio Cobos Álvarez	ef1e9a1a3e	canvas: Properly restore all the remaining items in stateStack in endDrawing. We were correctly finishing the SMask group but not restoring all the extra transformations applied in stateStack, so if somebody ends up drawing to the same context after canceling mid-draw we'd get artifacts. This fixes Mozilla bug 1664178[1]. [1]: https://bugzilla.mozilla.org/show_bug.cgi?id=1664178	2020-09-12 13:50:56 +02:00
Tim van der Meij	dfebe7b907	Merge pull request #12365 from Snuffleupagus/forbid-DecodeStream.length Ensure that the `length` property won't be accidentally accessed on a `DecodeStream`-instance	2020-09-11 22:18:30 +02:00
Jonas Jenwald	a11b7341a1	Ensure that the `length` property won't be accidentally accessed on a `DecodeStream`-instance For these streams, compared to `Stream` and `ChunkedStream`, there's no well defined concept of length and consequently no `length` getter.[1] However, attempting to access the non-existent `length` won't currently error, but just return `undefined`, which could thus easily lead to bugs elsewhere in the code-base. --- [1] However, note that all stream implementations have an `isEmpty` getter which can be used instead.	2020-09-11 13:25:40 +02:00
Calixte Denizet	fc154590e8	Dict keys need to be escaped too when saving	2020-09-11 12:25:05 +02:00
Tim van der Meij	8cfcd7a488	Merge pull request #12360 from calixteman/12359 Reset cursor position when focus is out of text field	2020-09-10 23:11:35 +02:00
Calixte Denizet	dc4eb71ff1	PDF names need to be escaped when saving	2020-09-10 16:08:13 +02:00
Calixte Denizet	44b24fcc29	Reset cursor position when focus is out of text field	2020-09-10 10:37:13 +02:00
Tim van der Meij	f9d56320f5	Merge pull request #12349 from calixteman/followup_12344 Follow-up of pr #12344	2020-09-09 23:40:53 +02:00
Calixte Denizet	908e7ae5e4	Set the modification date to the current day when saving	2020-09-09 19:06:39 +02:00
Calixte Denizet	64a6efd95e	Follow-up of pr #12344	2020-09-09 11:46:02 +02:00
Brendan Dahl	e51e9d1f33	Merge pull request #12345 from calixteman/save_btn Don't try to save something for a button which is neither a checkbox nor a radio	2020-09-08 15:44:04 -07:00
calixteman	68b99c59ee	Save form data in XFA datasets when pdf is a mix of acroforms and xfa (#12344) * Move display/xml_parser.js in shared to use it in worker * Save form data in XFA datasets when pdf is a mix of acroforms and xfa Co-authored-by: Brendan Dahl <brendan.dahl@gmail.com>	2020-09-08 15:13:52 -07:00
Calixte Denizet	7e5026dfc5	Don't try to save something for a button which is neither a checkbox nor a radio	2020-09-08 20:47:46 +02:00
Tim van der Meij	20c891542b	Merge pull request #12269 from calixteman/highlight Add support for missing appearances for highlights, strikeout, squiggly and underline annotations.	2020-09-06 22:25:36 +02:00
Jonas Jenwald	babeae9448	Remove, manually implemented, DOM polyfills only necessary for IE 11 support Please refer to the following compatibility information: - https://developer.mozilla.org/en-US/docs/Web/API/ChildNode/remove#Browser_compatibility - https://developer.mozilla.org/en-US/docs/Web/API/DOMTokenList/add#Browser_compatibility - https://developer.mozilla.org/en-US/docs/Web/API/DOMTokenList/remove#Browser_compatibility - https://developer.mozilla.org/en-US/docs/Web/API/DOMTokenList/toggle#Browser_compatibility Finally, for the `pushState`/`replaceState` polyfills, please refer to PRs 10461 and 11318 for additional details.	2020-09-06 18:24:17 +02:00
Calixte Denizet	65ecd981fe	Add support for missing appearances for highlights, strikeout, squiggly and underline annotations.	2020-09-06 15:40:15 +02:00
Tim van der Meij	50958c46f7	Merge pull request #12331 from Snuffleupagus/Promise-polyfill [api-minor] Only support browsers/environments that have basic support for `Promise` natively	2020-09-06 14:19:55 +02:00
Jonas Jenwald	449c7763d5	[api-minor] Only support browsers/environments that have basic support for `Promise` natively Based on https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Global_Objects/Promise#Browser_compatibility and https://caniuse.com/#feat=promises, all even remotely modern browsers already support basic `Promise` functionality natively. The only reason for keeping the `Promise` polyfill (at all) is to be able to support recent additions to the specification, such as e.g. `finally` and `allSettled`. Note that this patch will, on its own, remove support for IE 11/Edge (non-Chromium based) in both the general PDF.js library and the default viewer.	2020-09-06 13:45:56 +02:00
Jonas Jenwald	6c8f1f7d6f	Run `gulp lint --fix`, to account for changes in Prettier version `2.1.x`	2020-09-06 12:23:59 +02:00
Jonas Jenwald	784a420027	Add support, in `Dict.merge`, for merging of "sub"-dictionaries This allows for merging of dictionaries one level deeper than previously. This could be useful e.g. for /Resources dictionaries, where you want to e.g. merge their respective /Font dictionaries (and other) together rather than picking just the first one.	2020-08-30 23:18:32 +02:00
Jonas Jenwald	66aabe3ec7	[api-minor] Add support for toggling of Optional Content in the viewer (issue 12096) Besides, obviously, adding viewer support: This patch attempts to improve the general API for Optional Content Groups slightly, by adding a couple of new methods for interacting with the (more complex) data structures of `OptionalContentConfig`-instances. (Thus allowing us to mark some of the data as "private", given that it probably shouldn't be manipulated directly.) By utilizing not just the "raw" Optional Content Groups, but the data from the `/Order` array when available, we can thus display the Layers in a proper tree-structure with collapsible headings for PDF documents that utilize that feature. Note that it's possible to reset all Optional Content Groups to their default visibility state, simply by double-clicking on the Layers-button in the sidebar. (Currently that's indicated in the Layers-button tooltip, which is obviously easy to overlook, however it's probably the best we can do for now without adding more buttons, or even a dropdown-toolbar, to the sidebar.) Also, the current Layers-button icons are a little rough around the edges, quite literally, but given that the viewer will soon have its UI modernized anyway they hopefully suffice in the meantime. To give users full control of the visibility of the various Optional Content Groups, even those which according to the `/Order` array should not (by default) be toggleable in the UI, this patch will place those under a custom heading which: - Is collapsed by default, and placed at the bottom of the Layers-tree, to be a bit less obtrusive. - Uses a slightly different formatting, compared to the "regular" headings. - Is localizable. Finally, note that the thumbnails are purposely always rendered with all Optional Content Groups at their default visibility state, since that seems the most useful and it's also consistent with other viewers. To ensure that this works as intended, we'll thus disable the `PDFThumbnailView.setImage` functionality when the Optional Content Groups have been changed in the viewer. (This obviously means that we'll re-render thumbnails instead of using the rendered pages. However, this situation ought to be rare enough for this to not really be a problem.)	2020-08-30 16:28:40 +02:00
Jonas Jenwald	2393443e73	Include the `/Order` array, if available, when parsing the Optional Content configuration The `/Order` array is used to improve the display of Optional Content groups in PDF viewers, and it allows a PDF document to e.g. specify that Optional Content groups should be displayed as a (collapsable) tree-structure rather than as just a list. Note that not all available Optional Content groups must be present in the `/Order` array, and PDF viewers will often (by default) hide those toggles in the UI. To allow us to improve the UX around toggling of Optional Content groups, in the default viewer, these hidden-by-default groups are thus appended to the parsed `/Order` array under a custom nesting level (with `name == null`). Finally, the patch also slightly tweaks an `OptionalContentConfig` related JSDoc-comment in the API.	2020-08-30 16:28:40 +02:00
Tim van der Meij	06b53d770a	Merge pull request #12259 from brendandahl/cmap-fix Fix handling of symbolic fonts and unicode cmaps.	2020-08-30 16:01:24 +02:00
Brendan Dahl	45e8a31cc0	Fix handling of symbolic fonts and unicode cmaps. In issue 12120, the font has a 1,0 cmap and is marked symbolic which according to the spec means we should directly use the cmap instead of the extra steps that are defined in 9.6.6.4. However, just fixing that caused bug 1057544 to break. The font in bug 1057544 has a 0,1 cmap (Unicode 1.1) which we were not using, but is easy to support. We're also easily able to support some of the other unicode cmaps, so I added those as well. There was also a second issue with bug 1057544, the cmap doesn't have a mapping for the "quoteright" glyph, but it is defined in the post table. To handle this, I've moved post table as a fallback for any font that has an encoding.	2020-08-27 14:33:11 -07:00
Calixte Denizet	ba94f04ba3	Bug 1661226 - Push button are not rendered with renderInteractiveForms enabled	2020-08-27 10:45:14 +02:00
Tim van der Meij	0f229d537f	Inline the `setup` method in the `parse` method in `src/core/document.js` Now that the `parse` method is simplified we can inline the `setup` method in the `parse` method since it's only two lines of code. This avoids some indirection.	2020-08-25 23:28:55 +02:00
Tim van der Meij	280207c740	Redo the form type detection logic and include unit tests Good form type detection is important to get reliable telemetry and to only show the fallback bar if a form cannot be filled out by the user. PDF.js only supports AcroForm data, so XFA data is explicitly unsupported (tracked in issue #2373). However, the previous form type detection couldn't separate AcroForm and XFA well enough, causing form type telemetry to be incorrect sometimes and the fallback bar to be shown for forms that could in fact be filled out by the user. The solution in this commit is found by studying the specification and the form documents that are available to us. In a nutshell the rules are: - There is XFA data if the `XFA` entry is a non-empty array or stream. - There is AcroForm data if the `Fields` entry is a non-empty array and it doesn't consist of only document signatures. The document signatures part was not handled in the old code, causing a document with only XFA data to also be marked as having AcroForm data. Moreover, the old code didn't check all the data types. Now that AcroForm and XFA can be distinguished, the viewer is configured to only show the fallback bar for documents that only have XFA data. If a document also has AcroForm data, the viewer can use that to render the form. We have not found documents where the XFA data was necessary in that case. Finally, we include unit tests to ensure that all cases are covered and move the form type detection out of the `parse` function so that it's only executed if the document information is actually requested (potentially making initial parsing a tiny bit faster).	2020-08-25 23:28:55 +02:00
Tim van der Meij	f0bf62ff54	Mark the `catDict` member as private in the `Catalog` class Not only is `catDict` never accessed anymore outside of this file, it should also never happen since it's internal to the catalog. If data from it is needed elsewhere, the catalog should provide a getter for it that can do basic data integrity checks and abstract away any unnecessary details.	2020-08-25 23:28:55 +02:00
Tim van der Meij	f20f0bcc78	Move the AcroForm logic from the document to the catalog The `AcroForm` entry is part of the catalog, not of the document, so its logic should be placed there instead. The document should look in the catalog to fetch it, and not have knowledge of `catDict`, which is a member internal to the catalog. Moreover, make the AcroForm member private on the document instance. It's only used internally and was also never intended to be public. For users it's exposed by the `getMetadata` API endpoint as `IsAcroFormPresent`. Only a boolean is exposed, so we now also only store the boolean on the document instance. Finally, the annotation code needs access to the full AcroForm dictionary, so it's updated to fetch the data from the catalog instead of the document that now only holds the boolean.	2020-08-25 23:28:55 +02:00
Tim van der Meij	b41a2f4d5a	Move the collection logic from the document to the catalog The `Collection` entry is part of the catalog, not of the document, so its logic should be placed there instead. The document should look in the catalog to fetch it, and not have knowledge of `catDict`, which is a member internal to the catalog. Moreover, remove the collection member from the document instance. It's only used internally and was also never intended to be public. For users it's exposed by the `getMetadata` API endpoint as `IsCollectionPresent`. Moving this out of the `parse` function makes sure that the getter is only executed if the document information is actually requested (potentially making initial parsing a tiny bit faster).	2020-08-25 23:28:55 +02:00
Tim van der Meij	935d95b462	Move the version logic from the document to the catalog The `Version` entry is part of the catalog, not of the document, so its logic should be placed there instead. The document should look in the catalog to fetch it, and not have knowledge of `catDict`, which is a member internal to the catalog. Moreover, make the version member private on the document instance. It's only used internally and was also never intended to be public. For users it's exposed by the `getMetadata` API endpoint as `PDFFormatVersion`. Finally, clarify how the version from the header and the version from the catalog are treated using a comment.	2020-08-25 23:28:55 +02:00
Jonas Jenwald	bd16c363ce	Access the `Catalog` data correctly in the "GetPageIndex" handler in `src/core/worker.js` Even though the code obviously works as-is, given that we have unit-tests for it, it still feels incorrect to just assume that the `Catalog`-instance has all of its properties immediately available. Especially when (almost) all of the other handlers, in `src/core/worker.js`, protect their data accesses with appropriate `pdfManager.ensure` calls.	2020-08-25 12:14:14 +02:00
Jonas Jenwald	2e6e2c3b41	Access the `XRef` data correctly in the "GetStats" handler in `src/core/worker.js` Even though the code obviously works as-is, given that we have unit-tests for it, it still feels incorrect to just assume that the `XRef`-instance has all of its properties immediately available. Especially when (almost) all of the other handlers, in `src/core/worker.js`, protect their data accesses with appropriate `pdfManager.ensure` calls.	2020-08-25 12:14:11 +02:00
Jani Pehkonen	e7febbf0f7	Accent positioning in Type1 `seac` glyphs In `display/canvas.js` the accent offsets must be multiplied by `fontSize` to make the offsets large enough. Another problem is in `core/type1_parser.js` when the Type1 command `seac` is handled. There is an error in the Adobe Type1 spec. See chapter 6 in Type1 Font Format Supplement, which provides an errata: The arguments of `seac` specify the offset of the left side bearing (LSB) points, not the offset of origins. This can be fixed in `core/type1_parser.js` by adding the difference of the LSB values.	2020-08-23 21:01:25 +03:00
Tim van der Meij	7df8aa34a5	Merge pull request #12263 from timvandermeij/acroform-fixes Fix AcroForm printing/saving edge cases	2020-08-23 13:37:30 +02:00
Tim van der Meij	a8efc0296b	Obtain the export values for choice widgets from the normal appearance The down appearance (`D`) is optional and not available in the document from #12233, so the checkboxes are never saved/printed as checked because the checked appearance is based on the export value that is missing because the `D` entry is not available. Instead, we should use the normal appearance (`N`) since that one is required and therefore always available. Finally, the /Off appearance is optional according to section 12.7.4.2.3 of the specification, so that needs to be taken into account to match the specification and to fix reference test failures for the `annotation-button-widget-print` test. That is a file that doesn't specify an /Off appearance in the normal appearance dictionary.	2020-08-23 13:00:02 +02:00
Tim van der Meij	1b82ad8fff	Decode widget form values consistently The helper method `_decodeFormValue` is used to ensure that it happens in one place. Note that form values are field values, display values and export values.	2020-08-23 13:00:01 +02:00
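
The form type detection rules summarized in commit 280207c740 can be sketched roughly as follows. This is a simplified illustration, not the code from `src/core/document.js`: plain objects stand in for PDF dictionaries, a stream is modeled as `{ isStream: true, length }`, a document signature is assumed to be a field whose `FT` entry is `"Sig"`, and the function names `detectFormType` and `shouldShowFallbackBar` are hypothetical.

```javascript
// Hypothetical, simplified model of the rules described in the commit message.
// Plain objects stand in for PDF dictionaries; this is not the pdf.js API.

function hasXfaData(acroForm) {
  const xfa = acroForm.XFA;
  if (Array.isArray(xfa)) {
    return xfa.length > 0; // Non-empty array.
  }
  // Assumption: a stream is modeled as { isStream: true, length: <bytes> }.
  return Boolean(xfa && xfa.isStream && xfa.length > 0);
}

function hasAcroFormData(acroForm) {
  const fields = acroForm.Fields;
  if (!Array.isArray(fields) || fields.length === 0) {
    return false; // `Fields` must be a non-empty array.
  }
  // Assumption: a document signature is a field whose type (FT) is "Sig".
  // A form consisting only of such fields does not count as AcroForm data.
  return !fields.every(field => Boolean(field) && field.FT === "Sig");
}

function detectFormType(acroForm) {
  if (!acroForm || typeof acroForm !== "object") {
    return { hasAcroForm: false, hasXfa: false };
  }
  return {
    hasAcroForm: hasAcroFormData(acroForm),
    hasXfa: hasXfaData(acroForm),
  };
}

// The viewer only needs the fallback bar when XFA is the sole form data.
function shouldShowFallbackBar(formType) {
  return formType.hasXfa && !formType.hasAcroForm;
}
```

Under these assumptions, a document whose `Fields` array holds only signature fields alongside XFA data is reported as XFA-only, which is the case the commit message says the old code got wrong.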


4249 Commits