pdf.js

Author	SHA1	Message	Date
Jonas Jenwald	7025b9f859	[src/core/writer.js] Support `null` values in the `writeValue` function This fixes something that I noticed, having recently looked at both the `Lexer.getObj` and `writeValue` code. Please note that I unfortunately don't have an example of a form where saving fails without this patch. However, given its overall simplicity and that unit-tests are added, it's hopefully deemed useful to fix this potential issue pro-actively rather than waiting for a bug report. At this point one might, and rightly so, wonder if there's actually any real-world PDF documents where a `null` value is being used? Unfortunately the answer is yes, and we have a couple of examples in the test-suite (although none of those are related to forms); please see: `issue1015`, `issue2642`, `issue10402`, `issue12823`, `issue13823`, and `pr12564`.	2021-09-12 18:24:37 +02:00
Jonas Jenwald	761519ef3f	Merge pull request #13998 from calixteman/bug1729971 Write boolean value when saving a form (bug 1729971)	2021-09-12 15:38:10 +02:00
Jonas Jenwald	a47844d1fc	Let `Lexer.getObj` return a dummy-`Cmd` for commands that start with a non-visible ASCII character (issue 13999) This way we avoid breaking badly generated PDF documents where a non-visible ASCII character is "glued" to a valid command.	2021-09-11 19:54:13 +02:00
Calixte Denizet	474ab7c86d	Write boolean value when saving a form (bug 1729971) - it aims to fix https://bugzilla.mozilla.org/show_bug.cgi?id=1729971#c4.	2021-09-10 14:10:25 +02:00
Tim van der Meij	680f33c31c	Merge pull request #13961 from Snuffleupagus/simpler-regexp Simplify some regular expressions	2021-09-04 15:39:30 +02:00
Brendan Dahl	a8ce15a2d7	Merge pull request #13966 from calixteman/no_ns XFA - Created data node mustn't belong to datasets namespace	2021-09-03 09:59:40 -07:00
Calixte Denizet	77b9657e57	XFA - Overwrite AcroForm dictionary when saving if no datasets in XFA (bug 1720179) - aims to fix https://bugzilla.mozilla.org/show_bug.cgi?id=1720179 - in some pdfs the XFA array in AcroForm dictionary doesn't contain an entry for 'datasets' (which contains saved data), so basically this patch allows to overwrite the AcroForm dictionary with an updated XFA array when doing an incremental update.	2021-09-03 17:04:03 +02:00
Calixte Denizet	57ae3a5a76	XFA - Created data node mustn't belong to datasets namespace - when some named nodes in the template don't have their counterpart in datasets we create some nodes: the main node mustn't belong to the datasets namespace because it doesn't make sense and Acrobat Reader isn't able to read pdf with such nodes. - so created nodes under a datasets node have a namespaceId set to -1 and consequently when serialized no namespace prefix will appear.	2021-09-03 15:43:25 +02:00
Jonas Jenwald	c42887221a	Simplify some regular expressions There's a fair number of regular expressions througout the code-base which are slightly more verbose than strictly necessary, in particular: - We have a lot of regular expressions that use `[0-9]` explicitly, and those can be simplified to use `\d` instead. - We have one instance of a regular expression containing a `A-Za-z0-9_` sequence, which can be simplified to use `\w` instead.	2021-09-02 11:50:42 +02:00
Calixte Denizet	9619bf92be	Correctly pad strings when saving an encrypted pdf (bug 1726789)	2021-09-02 10:37:21 +02:00
Jonas Jenwald	e69afc6f3d	Re-factor the `setPDFNetworkStreamFactory` usage for the unit-tests (PR 13549 follow-up) This should have been part of PR 13549, since we no longer support browsers without native Fetch API and ReadableStream implementations.	2021-08-29 18:27:53 +02:00
Jonas Jenwald	41efa3c071	[api-minor] Introduce a new `annotationMode`-option, in `PDFPageProxy.{render, getOperatorList}` This is a follow-up to PRs 13867 and 13899. This patch is tagged `api-minor` for the following reasons: - It replaces the `renderInteractiveForms`/`includeAnnotationStorage`-options, in the `PDFPageProxy.render`-method, with the single `annotationMode`-option that controls which annotations are being rendered and how. Note that the old options were mutually exclusive, and setting both to `true` would result in undefined behaviour. - For improved consistency in the API, the `annotationMode`-option will also work together with the `PDFPageProxy.getOperatorList`-method. - It's now also possible to disable all annotation rendering in both the API and the Viewer, since the other changes meant that this could now be supported with a single added line on the worker-thread[1]; fixes 7282. --- [1] Please note that in order to simplify the overall implementation, we'll purposely only support disabling of all annotations and that the option is being shared between the API and the Viewer. For any more "specialized" use-cases, where e.g. only some annotation-types are being rendered and/or the API and Viewer render different sets of annotations, that'll have to be handled in third-party implementations/forks of the PDF.js code-base.	2021-08-24 01:13:02 +02:00
Tim van der Meij	036b81496e	Merge pull request #13882 from Snuffleupagus/PDFWorker-rm-closure [api-minor] Remove the closure from the `PDFWorker` class, in the `src/display/api.js` file	2021-08-07 19:52:39 +02:00
Tim van der Meij	952f6366bf	Merge pull request #13867 from Snuffleupagus/RenderingIntentFlag [api-minor] Re-factor the internal renderingIntent, and change the default `intent` value in the `PDFPageProxy.getAnnotations` method	2021-08-07 19:25:51 +02:00
Jonas Jenwald	1cf9405281	[api-minor] Remove the closure from the `PDFWorker` class, in the `src/display/api.js` file This patch removes the only remaining closure in the `src/display/api.js` file, utilizing a similar approach as used in lots of other parts of the code-base, which results in a small decrease in the size of the build `pdf.js` file. Given that `PDFWorker` is exposed through the public API, this complicates things somewhat since there's a couple of worker-related properties that really should stay private. Initially, while working on PR 13813, I believed that we'd need support for private (static) class fields in order to get rid of this closure, however I've managed to come up with what's hopefully deemed an acceptable work-around here. Furthermore, some helper functions were simply moved into the `PDFWorker` class as static methods, thus simplifying the overall implementation (e.g. we don't need to manually cache the Promise in the `PDFWorker._setupFakeWorkerGlobal`-method). Finally, as part of this re-factoring a number of missing JSDoc-comments were added which together with the removal of the closure significantly improves the `gulp jsdoc` output for the `PDFWorker` class. Please note: This patch is tagged with `api-minor` since it deprecates `PDFWorker.getWorkerSrc()` in favor of the shorter `PDFWorker.workerSrc`, with the fallback limited to `GENERIC` builds.	2021-08-07 10:43:39 +02:00
Brendan Dahl	3d18c76a53	Merge pull request #13881 from calixteman/bug_1723734 XFA - Elements under an area must be bound (bug 1723734)	2021-08-06 11:56:58 -07:00
Calixte Denizet	328383ea7a	XFA - Elements under an area must be bound (bug 1723734) - aims to fix https://bugzilla.mozilla.org/show_bug.cgi?id=1723734.	2021-08-06 20:20:19 +02:00
calixteman	98e893b84f	Merge pull request #13880 from eltociear/patch-5 Fix typo in cff_parser_spec.js	2021-08-06 19:31:52 +02:00
Ikko Ashimine	23236f1b0b	Fix typo in cff_parser_spec.js shoudn't -> shouldn't	2021-08-06 19:30:36 +09:00
Jonas Jenwald	47f94235ab	[api-minor] Re-factor the internal renderingIntent, and change the default `intent` value in the `PDFPageProxy.getAnnotations` method With the changes made in PR 13746 the internal renderingIntent handling became somewhat "messy", since we're now having to do string-matching in various spots in order to handle the "oplist"-intent correctly. Hence this patch, which implements the idea from PR 13746 to convert the `intent`-strings, used in various API-methods, into an internal renderingIntent that's implemented using a bit-field instead. Please note: This part of the patch, in itself, does not change the public API (but see below). This patch is tagged `api-minor` for the following reasons: 1. It changes the default value for the `intent` parameter, in the `PDFPageProxy.getAnnotations` method, to "display" in order to be consistent across the API. 2. In order to get all annotations, with the `PDFPageProxy.getAnnotations` method, you now need to explicitly set "any" as the `intent` parameter. 3. The `PDFPageProxy.getOperatorList` method will now also support the new "any" intent, to allow accessing the operatorList of all annotations (limited to those types that have one). 4. Finally, for consistency across the API, the `PDFPageProxy.render` method also support the new "any" intent (although I'm not sure how useful that'll be). Points 1 and 2 above are the significant, and thus breaking, changes in default behaviour here. However, unfortunately I cannot see a good way to improve the overall API while also keeping `PDFPageProxy.getAnnotations` unchanged.	2021-08-06 00:39:42 +02:00
Brendan Dahl	a38d1122d8	XFA - Support aria heading and table structure. (bug 1723421) (bug 1723425) https://bugzilla.mozilla.org/show_bug.cgi?id=1723421 https://bugzilla.mozilla.org/show_bug.cgi?id=1723425	2021-08-05 15:25:04 -07:00
Brendan Dahl	3e003245b1	[XFA] Add alt text for images. (bug 1723418) Not many XFA PDFs have alt text. Some examples: bug1723422.pdf xfa_bug1718670_1.pdf xfa_issue13611.pdf xfa_issue13633.pdf xfa_issue13634.pdf	2021-08-03 17:18:58 -07:00
Brendan Dahl	6cf1ee3251	Merge pull request #13858 from brendandahl/xfa-aria-label Add aria-labels to XFA form elements. (bug 1723422)	2021-08-03 17:18:08 -07:00
Brendan Dahl	6ea56f35ab	Add aria-labels to XFA form elements. (bug 1723422)	2021-08-03 15:58:33 -07:00
Jonas Jenwald	766299016f	Remove the `isEOF` helper function and slightly re-factor `EOF` Given how trivial the `isEOF` function is, we can simply inline the check at the various call-sites and remove the function (which ought to be ever so slightly more efficient as well). Furthermore, this patch also changes the `EOF` primitive itself to a `Symbol` instead of an Object since that has the nice benefit of making it unclonable (thus preventing accidentally trying to send `EOF` from the worker-thread).	2021-08-03 20:19:32 +02:00
Jonas Jenwald	16a09eaed8	Fix a broken regular expression in the `docId` unit-test (issue 13838, PR 13813 follow-up) The current regular expression contains a typo, leading to intermittent test-failures for certain `docId`s; sorry about that!	2021-08-01 15:18:25 +02:00
Tim van der Meij	d1c0f8f91c	Implement unit tests for the `parseQueryString` utility function Now that these unit tests are in place, we also take the opportunity to slightly modernize the code itself by using a `for ... of` loop.	2021-08-01 14:14:33 +02:00
Jonas Jenwald	b18620ac0f	Remove the closure used with the `PDFDocumentLoadingTask` class This patch utilizes the same approach as used in lots of other parts of the code-base, which thus slightly reduces the size of this code. By removing some of the (current) indirection, we can also simplify the JSDocs a little bit. Looking at the `gulp jsdoc` output, this actually seem to improve the documentation for this class.	2021-07-30 11:34:47 +02:00
Calixte Denizet	1d07ef597e	XFA - Must use bindItems element even if there is no direct binding (bug 1720907)	2021-07-20 17:07:32 +02:00
Tim van der Meij	07955fa1d3	Merge pull request #13735 from Snuffleupagus/bug-1720411 Ensure that the field value, for checkboxes, refers to an existing appearance state (bug 1720411)	2021-07-18 13:48:34 +02:00
Jonas Jenwald	03cf28bf17	[api-minor] Add `intent` support to the `PDFPageProxy.getOperatorList` method (issue 13704) With this patch, the `PDFPageProxy.getOperatorList` method will now return `PDFOperatorList`-instances that also include Annotation-operatorLists (when those exist). Hence this closes a small, but potentially confusing, gap between the `render` and `getOperatorList` methods. Previously we've been somewhat reluctant to do this, as explained below, but given that there's actual use-cases where it's required probably means that we'll have to implement it now. Since we still need the ability to separate "normal" rendering operations from direct `getOperatorList` calls in the worker-thread, this API-change unfortunately causes the internal renderingIntent to become a bit "messy" which is indeed unfortunate (note the `"oplist-"` strings in various spots). As-is I suppose that it's not all that bad, but we may want to consider changing the internal renderingIntent to e.g. a bitfield in the future. Besides fixing issue 13704, this patch would also be necessary if someone ever tries to implement e.g. issue 10165 (since currently `PDFPageProxy.getOperatorList` doesn't include Annotation-operatorLists). Please note: This patch is also tagged "api-minor" for a second reason, which is that we're now including the Annotation-id in the `beginAnnotation` argument. The reason for this is to allow correlating the Annotation-data returned by `PDFPageProxy.getAnnotations`, with its corresponding operatorList-data (for those Annotations that have it).	2021-07-16 17:16:30 +02:00
Jonas Jenwald	da808aeab3	Ensure that the field value, for checkboxes, refers to an existing appearance state (bug 1720411) Fixes https://bugzilla.mozilla.org/show_bug.cgi?id=1720411	2021-07-16 13:11:48 +02:00
Jonas Jenwald	3838c4e27c	Re-factor the handling of empty `Name`-instances (PR 13612 follow-up) When working on PR 13612, I mostly prioritized a simple solution that didn't require touching a lot of code. However, while working on PR 13735 I started to realize that the static `Name.empty` construction really wasn't a good idea. In particular, having a special `Name`-instance where the `name`-property isn't actually a String is confusing (to put it mildly) and can easily lead to issues elsewhere. The only reason for not simply allowing the `name`-property to be an empty string, in PR 13612, was to avoid having to touch a lot of existing code. However, it turns out that this is only limited to a few methods in the `PartialEvaluator` and a few of the `BaseLocalCache`-implementations, all of which can be easily re-factored to handle empty `Name`-instances. All-in-all, I think that this patch is even an overall improvement since we're now validating (what should always be) `Name`-data better in the `PartialEvaluator`. This is what I ought to have done from the start, sorry about the code churn here!	2021-07-15 12:00:42 +02:00
Calixte Denizet	9bbc194846	XFA - Support assist element	2021-07-11 21:01:18 +02:00
Calixte Denizet	58e1f51688	XFA - Fix text positions (bug 1718741) - font line height is taken into account by acrobat when it isn't with masterpdfeditor: I extracted a font from a pdf, modified some ascent/descent properties thanks to ttx and the reinjected the font in the pdf: only Acrobat is taken it into account. So in this patch, line heights for some substituted fonts are added. - it seems that Acrobat is using a line height of 1.2 when the line height in the font is not enough (it's the only way I found to fix correctly bug 1718741). - don't use flex in wrapper container (which was causing an horizontal overflow in the above bug). - consequently, the above fixes introduced a lot of small regressions, so in order to see real improvements on reftests, I fixed the regressions in this patch: - replace margin by padding in some case where padding is a part of a container dimensions; - remove some flex display: some containers are wrongly sized when rendered; - set letter-spacing to 0.01px: it helps to be sure that text is not broken because of not enough width in Firefox.	2021-07-09 18:11:12 +02:00
Jonas Jenwald	661c60ecc9	[api-minor] Support accessing both the original and modified PDF fingerprint The PDF.js API has only ever supported accessing the original file ID, however the second one that (should) exist in modified documents have thus far been completely inaccessible through the API. That seems like a simple oversight, caused e.g. by the viewer not needing it, since it really shouldn't hurt to provide API-users with the ability to check if a PDF document has been modified since its creation.[1] Please refer to https://www.adobe.com/content/dam/acom/en/devnet/pdf/pdfs/PDF32000_2008.pdf#G13.2261661 for additional information. For an example of how to update existing code to use the new API, please see the changes in the `web/app.js` file included in this patch. Please note: While I'm not sure if we'll ever be able to remove the old `PDFDocumentProxy.fingerprint` getter, given that it's existed since "forever", that probably isn't a big deal given that it's now limited to only `GENERIC`-builds. --- [1] Although this obviously depends on the PDF software following the specification, by updating the second file ID as intended.	2021-07-03 13:56:33 +02:00
Calixte Denizet	ff440d13e7	XFA - Remove empty pages - it aims to fix #13583; - fix the switch to breakBefore target; - force the layout of an unsplittable element on an empty page; - don't fail when there is horizontal overflow (except in lr-tb); - handle correctly overflow in the same content area (bug 1717805, bug 1717668); - fix a typo in radial gradient first argument.	2021-06-30 16:32:27 +02:00
Calixte Denizet	429ffdcd2f	XFA - Save filled data in the pdf when downloading the file (Bug 1716288) - when binding (after parsing) we get a map between some template nodes and some data nodes; - so set user data in input handlers in using data node uids in the annotation storage; - to save the form, just put the value we have in the storage in the correct data nodes, serialize the xml as a string and then write the string at the end of the pdf using src/core/writer.js; - fix few bugs around data bindings: - the "Off" issue in Bug 1716980.	2021-06-25 18:57:01 +02:00
Brendan Dahl	f4f00a9bc6	Merge pull request #13618 from calixteman/bind_root XFA - Always bind root subform on root data	2021-06-23 13:14:12 -07:00
Calixte Denizet	b836616667	XFA - Always bind root subform on root data - it partially fixes https://bugzilla.mozilla.org/show_bug.cgi?id=1717805 (on the data side at least but there is still a layout issue).	2021-06-23 20:46:41 +02:00
Jonas Jenwald	6467907318	Support corrupt documents with empty `Name`-entries (issue 13610) Apparently some really bad PDF software can create documents with empty `Name`-entries, which we thus need to somehow deal with. While I don't know if this patch is necessarily the best solution, it should at least ensure that the empty `Name`-instance cannot accidentally match a proper `Name`-instance (and it doesn't require changes to a lot of existing code).[1] --- [1] I briefly considered using a `Symbol` rather than an Object, but quickly decided against that since the former one [is not clonable](https://developer.mozilla.org/en-US/docs/Web/API/Web_Workers_API/Structured_clone_algorithm#supported_types) and `Name`-instances may be sent to the API.	2021-06-22 16:55:44 +02:00
calixteman	56a75f8b26	Revert "Revert "XFA - Fix the way to select page on breaking"" - and fix the error which caused the backout: add an $extra property when creating html. - switch to next content area when breaking on page area.	2021-06-21 17:07:31 +02:00
calixteman	a9385bbb52	Revert "XFA - Fix the way to select page on breaking"	2021-06-21 15:45:04 +02:00
Calixte Denizet	7aea8faa34	XFA - Fix the way to select page on breaking - it aims to fix https://bugzilla.mozilla.org/show_bug.cgi?id=1716838. - some fonts in the pdf in the bug where bold when they shouldn't so write the font properties in the html to avoid to use some wrong inherited ones.	2021-06-21 12:45:23 +02:00
Calixte Denizet	7cdbc98716	XFA - Match font family correctly - partial fix for https://bugzilla.mozilla.org/show_bug.cgi?id=1716980; - some pdf can contain an invalid font family (e.g. 'Windings 3') so in this case remove the space; - the font family in typeface attribute doesn't always match the one defined in the FontDescriptor dictionary.	2021-06-20 15:16:28 +02:00
Calixte Denizet	df08b1548b	XFA - Fix layout issues - PR #13554 is buggy, so this patch aims to fix bugs. - check if a component fits into its parent in taking into account the parent layout. - introduce method isSplittable for template nodes to know if a component can be splitted in case of overflow.	2021-06-17 16:09:22 +02:00
Calixte Denizet	8eeb7ab4a3	XFA - Add the possibily to layout and measure text - some containers doesn't always have their 2 dimensions and those dimensions re based on contents; - so in order to measure text, we must get the glyph widths (for the xfa fonts) before starting the layout; - implement a word-wrap algorithm; - handle font change during text layout.	2021-06-17 14:17:02 +02:00
Calixte Denizet	793a0156ce	XFA - By default a text ui has only one line when in a field element - it aims to fix https://bugzilla.mozilla.org/show_bug.cgi?id=1716809.	2021-06-16 20:18:29 +02:00
Calixte Denizet	d89c429d78	XFA - Handle maxChars property for text fields - it aims to fix https://bugzilla.mozilla.org/show_bug.cgi?id=1716294.	2021-06-14 13:07:06 +02:00
Brendan Dahl	d333af7848	Merge pull request #13527 from calixteman/bind_inf_loop XFA - Avoid infinite loop when creating some nodes in data	2021-06-09 12:37:29 -07:00

... 4 5 6 7 8 ...

1100 Commits