pdf.js

Author	SHA1	Message	Date
Tim van der Meij	06b53d770a	Merge pull request #12259 from brendandahl/cmap-fix Fix handling of symbolic fonts and unicode cmaps.	2020-08-30 16:01:24 +02:00
Brendan Dahl	45e8a31cc0	Fix handling of symbolic fonts and unicode cmaps. In issue 12120, the font has a 1,0 cmap and is marked symbolic, which according to the spec means we should use the cmap directly instead of the extra steps that are defined in 9.6.6.4. However, just fixing that caused bug 1057544 to break. The font in bug 1057544 has a 0,1 cmap (Unicode 1.1), which we were not using but which is easy to support. We're also easily able to support some of the other Unicode cmaps, so I added those as well. There was also a second issue with bug 1057544: the cmap doesn't have a mapping for the "quoteright" glyph, but it is defined in the post table. To handle this, I've made the post table a fallback for any font that has an encoding.	2020-08-27 14:33:11 -07:00
Calixte Denizet	ba94f04ba3	Bug 1661226 - Push buttons are not rendered with renderInteractiveForms enabled	2020-08-27 10:45:14 +02:00
Tim van der Meij	0f229d537f	Inline the `setup` method in the `parse` method in `src/core/document.js` Now that the `parse` method is simplified we can inline the `setup` method in the `parse` method since it's only two lines of code. This avoids some indirection.	2020-08-25 23:28:55 +02:00
Tim van der Meij	280207c740	Redo the form type detection logic and include unit tests Good form type detection is important to get reliable telemetry and to only show the fallback bar if a form cannot be filled out by the user. PDF.js only supports AcroForm data, so XFA data is explicitly unsupported (tracked in issue #2373). However, the previous form type detection couldn't separate AcroForm and XFA well enough, causing form type telemetry to be incorrect sometimes and the fallback bar to be shown for forms that could in fact be filled out by the user. The solution in this commit is found by studying the specification and the form documents that are available to us. In a nutshell the rules are: - There is XFA data if the `XFA` entry is a non-empty array or stream. - There is AcroForm data if the `Fields` entry is a non-empty array and it doesn't consist of only document signatures. The document signatures part was not handled in the old code, causing a document with only XFA data to also be marked as having AcroForm data. Moreover, the old code didn't check all the data types. Now that AcroForm and XFA can be distinguished, the viewer is configured to only show the fallback bar for documents that only have XFA data. If a document also has AcroForm data, the viewer can use that to render the form. We have not found documents where the XFA data was necessary in that case. Finally, we include unit tests to ensure that all cases are covered and move the form type detection out of the `parse` function so that it's only executed if the document information is actually requested (potentially making initial parsing a tiny bit faster).	2020-08-25 23:28:55 +02:00
Tim van der Meij	f0bf62ff54	Mark the `catDict` member as private in the `Catalog` class Not only is `catDict` never accessed anymore outside of this file, it should also never happen since it's internal to the catalog. If data from it is needed elsewhere, the catalog should provide a getter for it that can do basic data integrity checks and abstract away any unnecessary details.	2020-08-25 23:28:55 +02:00
Tim van der Meij	f20f0bcc78	Move the AcroForm logic from the document to the catalog The `AcroForm` entry is part of the catalog, not of the document, so its logic should be placed there instead. The document should look in the catalog to fetch it, and not have knowledge of `catDict`, which is a member internal to the catalog. Moreover, make the AcroForm member private on the document instance. It's only used internally and was also never intended to be public. For users it's exposed by the `getMetadata` API endpoint as `IsAcroFormPresent`. Only a boolean is exposed, so we now also only store the boolean on the document instance. Finally, the annotation code needs access to the full AcroForm dictionary, so it's updated to fetch the data from the catalog instead of the document that now only holds the boolean.	2020-08-25 23:28:55 +02:00
Tim van der Meij	b41a2f4d5a	Move the collection logic from the document to the catalog The `Collection` entry is part of the catalog, not of the document, so its logic should be placed there instead. The document should look in the catalog to fetch it, and not have knowledge of `catDict`, which is a member internal to the catalog. Moreover, remove the collection member from the document instance. It's only used internally and was also never intended to be public. For users it's exposed by the `getMetadata` API endpoint as `IsCollectionPresent`. Moving this out of the `parse` function makes sure that the getter is only executed if the document information is actually requested (potentially making initial parsing a tiny bit faster).	2020-08-25 23:28:55 +02:00
Tim van der Meij	935d95b462	Move the version logic from the document to the catalog The `Version` entry is part of the catalog, not of the document, so its logic should be placed there instead. The document should look in the catalog to fetch it, and not have knowledge of `catDict`, which is a member internal to the catalog. Moreover, make the version member private on the document instance. It's only used internally and was also never intended to be public. For users it's exposed by the `getMetadata` API endpoint as `PDFFormatVersion`. Finally, clarify how the version from the header and the version from the catalog are treated using a comment.	2020-08-25 23:28:55 +02:00
Jonas Jenwald	bd16c363ce	Access the `Catalog` data correctly in the "GetPageIndex" handler in `src/core/worker.js` Even though the code obviously works as-is, given that we have unit-tests for it, it still feels incorrect to just assume that the `Catalog`-instance has all of its properties immediately available. Especially when (almost) all of the other handlers, in `src/core/worker.js`, protect their data accesses with appropriate `pdfManager.ensure` calls.	2020-08-25 12:14:14 +02:00
Jonas Jenwald	2e6e2c3b41	Access the `XRef` data correctly in the "GetStats" handler in `src/core/worker.js` Even though the code obviously works as-is, given that we have unit-tests for it, it still feels incorrect to just assume that the `XRef`-instance has all of its properties immediately available. Especially when (almost) all of the other handlers, in `src/core/worker.js`, protect their data accesses with appropriate `pdfManager.ensure` calls.	2020-08-25 12:14:11 +02:00
Jani Pehkonen	e7febbf0f7	Accent positioning in Type1 `seac` glyphs In `display/canvas.js` the accent offsets must be multiplied by `fontSize` to make the offsets large enough. Another problem is in `core/type1_parser.js` when the Type1 command `seac` is handled. There is an error in the Adobe Type1 spec. See chapter 6 in Type1 Font Format Supplement, which provides an errata: The arguments of `seac` specify the offset of the left side bearing (LSB) points, not the offset of origins. This can be fixed in `core/type1_parser.js` by adding the difference of the LSB values.	2020-08-23 21:01:25 +03:00
Tim van der Meij	7df8aa34a5	Merge pull request #12263 from timvandermeij/acroform-fixes Fix AcroForm printing/saving edge cases	2020-08-23 13:37:30 +02:00
Tim van der Meij	a8efc0296b	Obtain the export values for choice widgets from the normal appearance The down appearance (`D`) is optional and not available in the document from #12233, so the checkboxes are never saved/printed as checked because the checked appearance is based on the export value that is missing because the `D` entry is not available. Instead, we should use the normal appearance (`N`) since that one is required and therefore always available. Finally, the /Off appearance is optional according to section 12.7.4.2.3 of the specification, so that needs to be taken into account to match the specification and to fix reference test failures for the `annotation-button-widget-print` test. That is a file that doesn't specify an /Off appearance in the normal appearance dictionary.	2020-08-23 13:00:02 +02:00
Tim van der Meij	1b82ad8fff	Decode widget form values consistently The helper method `_decodeFormValue` is used to ensure that it happens in one place. Note that form values are field values, display values and export values.	2020-08-23 13:00:01 +02:00
Jonas Jenwald	fa02808f76	Mark the `setModified` method, on `AnnotationStorage`, as "private" (PR 12241 follow-up) Since it shouldn't be called manually, we can just mark it as "private".	2020-08-22 20:04:25 +02:00
Jonas Jenwald	1f5021d76a	Prevent errors if `PDFDocumentProxy.saveDocument` is called without the `annotationStorage` parameter (PR 12241 follow-up) Obviously it doesn't make sense to call that method without providing an `AnnotationStorage`-instance, however we should ensure that doing so won't cause errors. Hence we need to check that `annotationStorage` is actually defined, before attempting to call its `resetModified` method.	2020-08-22 18:09:17 +02:00
Tim van der Meij	36e149800e	Unconditionally set the field value for choice widgets in the annotation storage This commit makes the following improvements: - The code is similar to the other interactive form widgets now, with a clear note for the only difference. - Calling `getOrCreateValue` unconditionally ensures that choice widgets always have a value in the annotation storage. Previously we only inserted a value in the annotation storage when an option matched or when a selection was changed. However, this causes breakage when saving/printing because comboboxes, which we don't fully support yet but are rendered, might not have a value in storage at all. Their field value might not match any option since it allows the user to enter a custom value. - Calling `getOrCreateValue` unconditionally ensures that forms with choice widgets no longer always trigger a warning when the user navigates away from the page. This fixes https://github.com/mozilla/pdf.js/pull/12241#discussion_r474279654	2020-08-22 16:01:33 +02:00
Jonas Jenwald	a8de614a9f	Also enable `renderInteractiveForms` by default in the viewer components (PR 12201 follow-up) Given that `renderInteractiveForms` is now enabled by default in "full" viewer, it seems reasonable to enable it by default in the viewer components as well. Especially considering that it's simple to disable, when creating the affected components, for anyone implementing their own viewer.	2020-08-22 14:24:04 +02:00
Tim van der Meij	5fed7112a2	Use the export value instead of the display value for choice widget option selection The export value is used when the document is saved, so it should also be used when the document is opened to determine which choice widget option is selected. The display value is, as the name implies, only to be used for viewer display purposes and not for other logic. This makes sure that in the document from #12233 the "Favourite colour" choice widget is correctly initialized with "Red" instead of "Black" because the field value is equal to the export value (always the case), but not the display value (not always the case). Moreover, saving now also correctly uses the export value and not the display value.	2020-08-22 14:11:41 +02:00
Tim van der Meij	3c790936c1	Merge pull request #12247 from timvandermeij/acroform-choice-null Improve the field value parsing for choice widgets to handle `null` values	2020-08-21 23:17:20 +02:00
Brendan Dahl	8023175103	Support file save triggered from the Firefox integrated version. Related to https://bugzilla.mozilla.org/show_bug.cgi?id=1659753 This allows Firefox to trigger a "save" event from ctrl/cmd+s or the "Save Page As" context menu, which in turn lets pdf.js generate a new PDF if there is form data to save. I also now use `sourceEventType` on downloads so Firefox can determine if it should launch the "open with" dialog or "save as" dialog.	2020-08-20 18:05:08 -07:00
Aki Sasaki	83365a3756	Confirm if leaving a modified form without saving	2020-08-20 17:23:06 -07:00
Tim van der Meij	12c20772ac	Improve the field value parsing for choice widgets to handle `null` values The specification states that the field value is `null` if no item is selected and we didn't handle this case properly. Even though this did not break the rendering because we always convert the value to an array and the `includes` check in the display layer would simply not match, the field value would be `[null]` which is not expected and strange from an API perspective. This commit fixes that by ensuring that we return an empty array in case the field value is `null`. The API therefore still always gives an array for the field value, but now the code is more specific so that the value is either an empty array or an array of strings.	2020-08-19 23:27:50 +02:00
Jonas Jenwald	1058f16605	Add (basic) support for transfer functions to Images (issue 6931, bug 1149713) This is similar to the existing transfer function support for SMasks, but extended to simple image data. Please note that the extra amount of data now being sent to the worker-thread, for affected /ExtGState entries, is limited to at most 4 `Uint8Array`s each with a length of 256 elements. Refer to https://www.adobe.com/content/dam/acom/en/devnet/acrobat/pdfs/PDF32000_2008.pdf#G9.1658137 for additional details.	2020-08-17 10:34:12 +02:00
Jonas Jenwald	9d3e046a4f	Don't cache /ExtGState entries that contain fonts (PR 12087 follow-up) I completely overlooked the fact that `PartialEvaluator.handleSetFont` also updates the current `state`, which means that currently we're not actually handling font data correctly for cached /ExtGState data. (Thankfully, using /ExtGState to set a font is somewhat rare in practice.)	2020-08-17 08:17:25 +02:00
Jonas Jenwald	b26d736809	Ensure that the "DocException" message handler, in the API, will always either error or warn (depending on the build) if a valid `Error` isn't found Having this present would have made debugging issues 11941 and 12209 so much quicker and easier.	2020-08-13 13:17:30 +02:00
Calixte Denizet	1a6816ba98	Add support for saving forms	2020-08-12 10:32:59 +02:00
Tim van der Meij	57c988853b	Merge pull request #12192 from Snuffleupagus/misc-AnnotationStorage-improvements A couple of (small) tweaks of the `AnnotationStorage` (PR 12173 follow-up)	2020-08-11 23:46:13 +02:00
Brendan Dahl	7fb01f9f2a	Merge pull request #12186 from brendandahl/loca-2 Fix bad truetype loca tables.	2020-08-10 20:34:19 -07:00
Brendan Dahl	f6dff81223	Fix bad truetype loca tables. Some fonts have loca tables that aren't sorted or use 0 as an offset to signal a missing glyph. This fixes the bad loca tables by sorting them and then rewriting the loca table and potentially re-ordering the glyf table to match. Fixes #11131 and bug 1650302.	2020-08-10 14:15:49 -07:00
Jonas Jenwald	4d351eab93	A couple of (small) tweaks of the `AnnotationStorage` (PR 12173 follow-up) - Initialize the `AnnotationStorage`-instance, on `PDFDocumentProxy`, lazily. - Change the `AnnotationStorage` to use a `Map` internally, rather than a regular Object (simplifies the following points). - Let `AnnotationStorage.getAll` return `null` when there's no data stored, to avoid unnecessary parsing on the worker-thread. This ought to "just work", since the worker-thread code should already handle the `!annotationStorage` case everywhere. - Add a new `AnnotationStorage.size` getter, to be able to easily tell if there's any data stored.	2020-08-10 17:07:24 +02:00
Calixte Denizet	88b112ab0c	Support comb textfields for printing	2020-08-09 14:41:26 +02:00
Tim van der Meij	b061c300b4	Merge pull request #12176 from calixteman/multiline Support multiline textfields for printing	2020-08-09 13:37:36 +02:00
Calixte Denizet	cd8bb7293b	Support multiline textfields for printing	2020-08-09 12:14:34 +02:00
Jonathan Grimes	ac723a1760	Allow loading pdf fonts into another document.	2020-08-08 02:52:32 +00:00
Takashi Tamura	4ac62d8787	Fix the type of PDFDocumentLoadingTask.destroy.	2020-08-07 16:10:19 +09:00
Tim van der Meij	8c162f57f7	Merge pull request #12175 from calixteman/textfield Support textfield and choice widgets for printing	2020-08-07 00:20:29 +02:00
Calixte Denizet	1747d259f9	Support textfield and choice widgets for printing	2020-08-06 14:45:23 +02:00
Jonas Jenwald	16fa9dc4ea	Add support for `Object.fromEntries` This provides a simpler way of creating an `Object` from e.g. a `Map`, without having to manually iterate over it. Please see https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Global_Objects/Object/fromEntries	2020-08-06 14:39:51 +02:00
Jonas Jenwald	5e44b241b2	[api-minor] Fix the `annotationStorage` parameter in `PDFPageProxy.render` While the parameter name (clearly) suggests that an `AnnotationStorage`-instance is expected, looking at the only call-sites that include the parameter (i.e. the `PDFPrintServiceFactory` instances) it actually contains just a normal Object. Hence it seems much more reasonable to actually pass a valid `AnnotationStorage`-instance, as the name suggests, and simply have `PDFPageProxy.render` do the `annotationStorage.getAll()` call. (Since we cannot send an `AnnotationStorage`-instance as-is to the worker-thread, given the "structured clone algorithm".)	2020-08-05 23:02:30 +02:00
Takashi Tamura	a0f0ab78f3	Fix the type definition of TypedArray.	2020-08-05 17:01:08 +09:00
Tim van der Meij	56ca027c08	Improve consistency for the API documentation comments Over time we used multiple different formats for JSDoc comments. This commit standardizes those formats to the one we used most often. Moreover, this removes the example in the outline endpoint documentation since it now has a proper type definition and it didn't render correctly in JSDoc.	2020-08-04 23:27:22 +02:00
Tim van der Meij	ba4a07ce07	Fix incorrect types in the API documentation	2020-08-04 23:19:59 +02:00
Tim van der Meij	3116216e1d	Improve the API documentation for `PDFDocumentLoadingTask` This commit: - formats the documentation block according to the standards; - replaces the callback definitions with the `function` type (we have that for other definitions already and the callback type was not rendered correctly by JSDoc); - synchronizes the type documentation and the class documentation; - fixes the documentation by making it easier to read and making sure that all optional properties are indicated as such; - uses the `@link` tag to indicate links to other code. The `typestest` still passes and JSDoc now renders this class correctly.	2020-08-04 23:17:24 +02:00
Brendan Dahl	ac494a2278	Add support for optional marked content. Add a new method to the API to get the optional content configuration. Add a new render task param that accepts the above configuration. For now, the optional content is not controllable by the user in the viewer, but renders with the default configuration in the PDF. All of the test files added exhibit different uses of optional content. Fixes #269. Fix test to work with optional content. - Change the stopAtErrors test to ensure the operator list has something, instead of asserting the exact number of operators.	2020-08-04 09:26:55 -07:00
Tim van der Meij	e68ac05f18	Merge pull request #12160 from tamuratak/worker_options Use typedef to define the type of GlobalWorkerOptions (PR 12102 follow-up)	2020-08-03 22:55:49 +02:00
Tim van der Meij	0b75701012	Merge pull request #12157 from tamuratak/fix_svggraphics Fix the type of SVGGraphics (PR 12102 follow-up)	2020-08-03 22:52:10 +02:00
Tim van der Meij	adc7645a44	Merge pull request #12161 from tamuratak/exported_func Add types to functions exported as API in src/pdf.js (PR 12102 follow-up)	2020-08-03 22:43:50 +02:00
Takashi Tamura	923ba27f1f	Tweak for the type of PageViewportParameters.viewBox	2020-08-03 20:42:42 +09:00
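
Note on commit 280207c740: the two form type rules stated in that message can be sketched roughly as below. This is only an illustration under stated assumptions, not the code in `src/core/document.js`. Plain objects stand in for PDF dictionaries, the function names and the `isStream` flag are hypothetical, and treating a field whose `FT` is `Sig` as a document signature is an assumption made for the example.

```javascript
// Hypothetical sketch of the form type rules from commit 280207c740.
// `acroForm` is a plain object standing in for the AcroForm dictionary.

// Rule 1: there is XFA data if `XFA` is a non-empty array or stream.
function hasXfaData(acroForm) {
  const xfa = acroForm.XFA;
  if (Array.isArray(xfa)) {
    return xfa.length > 0;
  }
  // `isStream` is an assumed marker for stream-like values.
  return Boolean(xfa && xfa.isStream && xfa.length > 0);
}

// Rule 2: there is AcroForm data if `Fields` is a non-empty array that
// does not consist only of document signatures.
function hasAcroFormData(acroForm) {
  const fields = acroForm.Fields;
  if (!Array.isArray(fields) || fields.length === 0) {
    return false;
  }
  // Assumption: a signature field is identified by FT === "Sig".
  return !fields.every(field => field.FT === "Sig");
}

// The viewer shows the fallback bar only for documents that have XFA
// data and no usable AcroForm data.
function needsFallbackBar(acroForm) {
  return hasXfaData(acroForm) && !hasAcroFormData(acroForm);
}
```

With these helpers, a document whose `Fields` array holds only a signature field and whose `XFA` array is non-empty would be classified as XFA-only, which is the case the old detection reportedly got wrong.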

4114 Commits
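
Note on commits 4d351eab93 and 16fa9dc4ea: the `Map`-backed storage, the `null` result from `getAll` when nothing is stored, and the `size` getter described in those messages could look roughly like this. The class name `SketchAnnotationStorage` and the `_storage` field are hypothetical, and this is a simplified stand-in rather than the actual `AnnotationStorage` implementation.

```javascript
// Hypothetical, simplified stand-in for the storage described in
// commit 4d351eab93; not the real PDF.js class.
class SketchAnnotationStorage {
  constructor() {
    // A Map is used internally instead of a regular Object.
    this._storage = new Map();
  }

  // Return the stored value, inserting `defaultValue` first if the key
  // has no value yet.
  getOrCreateValue(key, defaultValue) {
    if (this._storage.has(key)) {
      return this._storage.get(key);
    }
    this._storage.set(key, defaultValue);
    return defaultValue;
  }

  setValue(key, value) {
    this._storage.set(key, value);
  }

  // Return `null` when empty so the receiver can skip any parsing;
  // otherwise convert the Map to a plain Object, which survives the
  // structured clone algorithm.
  getAll() {
    if (this._storage.size === 0) {
      return null;
    }
    return Object.fromEntries(this._storage);
  }

  get size() {
    return this._storage.size;
  }
}
```

Converting with `Object.fromEntries` avoids iterating over the `Map` by hand, which is the convenience commit 16fa9dc4ea mentions.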