pdf.js

Author	SHA1	Message	Date
Calixte Denizet	9619bf92be	Correctly pad strings when saving an encrypted pdf (bug 1726789)	2021-09-02 10:37:21 +02:00
Brendan Dahl	a7f807b059	Only use base encoding if it's populated. (bug 1727053) The font dict in this file has an encoding entry, but only specifies a differences map. The base encoding is empty in this case and shouldn't be used.	2021-08-30 12:51:59 -07:00
Brendan Dahl	306119b12a	Merge pull request #13932 from Snuffleupagus/oc-images Support Optional Content in Image-/XObjects (issue 13931)	2021-08-30 10:10:14 -07:00
Jonas Jenwald	e69afc6f3d	Re-factor the `setPDFNetworkStreamFactory` usage for the unit-tests (PR 13549 follow-up) This should have been part of PR 13549, since we no longer support browsers without native Fetch API and ReadableStream implementations.	2021-08-29 18:27:53 +02:00
Jonas Jenwald	1a1de9bb3e	Add support for specifying non-default Optional Content in the ref-tests	2021-08-26 16:54:16 +02:00
Jonas Jenwald	853b1172a1	Support Optional Content in Image-/XObjects (issue 13931) Currently, in the `PartialEvaluator`, we only support Optional Content in Form-/XObjects. Hence this patch adds support for Image-/XObjects as well, which looks like a simple oversight in PR 12095 since the canvas-implementation already contains the necessary code to support this.	2021-08-26 16:54:15 +02:00
Michael Wu	c08b4ea30d	Fix Viewer API definitions and include in CI The Viewer API definitions do not compile because of missing imports and anonymous objects are typed as `Object`. These issues were not caught during CI because the test project was not compiling anything from the Viewer API. As an example of the first problem: ``` /** * @implements MyInterface / export class MyClass { ... } ``` will generate a broken definition that doesn’t import MyInterface: ``` /* * @implements MyInterface / export class MyClass implements MyInterface { ... } ``` This can be fixed by adding a typedef jsdoc to specify the import: ``` /* @typedef {import("./otherFile").MyInterface} MyInterface / ``` See https://github.com/jsdoc/jsdoc/issues/1537 and https://github.com/microsoft/TypeScript/issues/22160 for more details. As an example of the second problem: ``` /* * Gets the size of the specified page, converted from PDF units to inches. * @param {Object} An Object containing the properties: {Array} `view`, * {number} `userUnit`, and {number} `rotate`. / function getPageSizeInches({ view, userUnit, rotate }) { ... } ``` generates the broken definition: ``` function getPageSizeInches({ view, userUnit, rotate }: Object) { ... } ``` The jsdoc should specify the type of each nested property: ``` /* * Gets the size of the specified page, converted from PDF units to inches. * @param {Object} options An object containing the properties: {Array} `view`, * {number} `userUnit`, and {number} `rotate`. * @param {number[]} options.view * @param {number} options.userUnit * @param {number} options.rotate */ ```	2021-08-25 18:45:46 -04:00
Jonas Jenwald	41efa3c071	[api-minor] Introduce a new `annotationMode`-option, in `PDFPageProxy.{render, getOperatorList}` This is a follow-up to PRs 13867 and 13899. This patch is tagged `api-minor` for the following reasons: - It replaces the `renderInteractiveForms`/`includeAnnotationStorage`-options, in the `PDFPageProxy.render`-method, with the single `annotationMode`-option that controls which annotations are being rendered and how. Note that the old options were mutually exclusive, and setting both to `true` would result in undefined behaviour. - For improved consistency in the API, the `annotationMode`-option will also work together with the `PDFPageProxy.getOperatorList`-method. - It's now also possible to disable all annotation rendering in both the API and the Viewer, since the other changes meant that this could now be supported with a single added line on the worker-thread[1]; fixes 7282. --- [1] Please note that in order to simplify the overall implementation, we'll purposely only support disabling of all annotations and that the option is being shared between the API and the Viewer. For any more "specialized" use-cases, where e.g. only some annotation-types are being rendered and/or the API and Viewer render different sets of annotations, that'll have to be handled in third-party implementations/forks of the PDF.js code-base.	2021-08-24 01:13:02 +02:00
Brendan Dahl	bf5a45ce6d	Merge pull request #13908 from brendandahl/xfa-find [api-minor] XFA - Support text search in XFA documents.	2021-08-23 08:53:02 -07:00
Brendan Dahl	bb47128864	XFA - Support text search in XFA documents. Moves the logic out of TextLayerBuilder to handle highlighting matches into a new separate class `TextHighlighter` that can be used with regular PDFs and XFA PDFs. To mimic the current find functionality in XFA, two arrays from the XFA rendering are created to get the text content and map those to DOM nodes. Fixes #13878	2021-08-23 08:44:20 -07:00
Jonas Jenwald	ac27f96987	Extend the glyph maps for standard respectively Calibri fonts (issue 13916)	2021-08-21 00:48:38 +02:00
Tim van der Meij	036b81496e	Merge pull request #13882 from Snuffleupagus/PDFWorker-rm-closure [api-minor] Remove the closure from the `PDFWorker` class, in the `src/display/api.js` file	2021-08-07 19:52:39 +02:00
Tim van der Meij	952f6366bf	Merge pull request #13867 from Snuffleupagus/RenderingIntentFlag [api-minor] Re-factor the internal renderingIntent, and change the default `intent` value in the `PDFPageProxy.getAnnotations` method	2021-08-07 19:25:51 +02:00
Tim van der Meij	f3960a65d3	Merge pull request #13879 from Snuffleupagus/test-resources-fix-globals Fix the global variable definitions in `test/resources/reftest-analyzer.js` (issue 13862)	2021-08-07 19:00:42 +02:00
Jonas Jenwald	1cf9405281	[api-minor] Remove the closure from the `PDFWorker` class, in the `src/display/api.js` file This patch removes the only remaining closure in the `src/display/api.js` file, utilizing a similar approach as used in lots of other parts of the code-base, which results in a small decrease in the size of the build `pdf.js` file. Given that `PDFWorker` is exposed through the public API, this complicates things somewhat since there's a couple of worker-related properties that really should stay private. Initially, while working on PR 13813, I believed that we'd need support for private (static) class fields in order to get rid of this closure, however I've managed to come up with what's hopefully deemed an acceptable work-around here. Furthermore, some helper functions were simply moved into the `PDFWorker` class as static methods, thus simplifying the overall implementation (e.g. we don't need to manually cache the Promise in the `PDFWorker._setupFakeWorkerGlobal`-method). Finally, as part of this re-factoring a number of missing JSDoc-comments were added which together with the removal of the closure significantly improves the `gulp jsdoc` output for the `PDFWorker` class. Please note: This patch is tagged with `api-minor` since it deprecates `PDFWorker.getWorkerSrc()` in favor of the shorter `PDFWorker.workerSrc`, with the fallback limited to `GENERIC` builds.	2021-08-07 10:43:39 +02:00
Brendan Dahl	3d18c76a53	Merge pull request #13881 from calixteman/bug_1723734 XFA - Elements under an area must be bound (bug 1723734)	2021-08-06 11:56:58 -07:00
Calixte Denizet	328383ea7a	XFA - Elements under an area must be bound (bug 1723734) - aims to fix https://bugzilla.mozilla.org/show_bug.cgi?id=1723734.	2021-08-06 20:20:19 +02:00
calixteman	98e893b84f	Merge pull request #13880 from eltociear/patch-5 Fix typo in cff_parser_spec.js	2021-08-06 19:31:52 +02:00
Ikko Ashimine	23236f1b0b	Fix typo in cff_parser_spec.js shoudn't -> shouldn't	2021-08-06 19:30:36 +09:00
Jonas Jenwald	df79b831f4	Fix the global variable definitions in `test/resources/reftest-analyzer.js` (issue 13862) It shouldn't be necessary to assign these variables to the global scope (as far as I can tell), either explicitly with `window` or implicitly with `var`, and this way we don't need to disable the ESLint `no-undef` rule; fixes another small part of issue 13862. Please note: I wasn't going to put additional work into this code after PR 13869, however these changes looked so simple that I figured trying to get rid of the few remaining "Code scanning alerts" wouldn't hurt. However, this file would still very much benefit from additional clean-up and re-factoring work, since it's quite old and currently contains some dead code (commented out).	2021-08-06 11:45:55 +02:00
Jonas Jenwald	47f94235ab	[api-minor] Re-factor the internal renderingIntent, and change the default `intent` value in the `PDFPageProxy.getAnnotations` method With the changes made in PR 13746 the internal renderingIntent handling became somewhat "messy", since we're now having to do string-matching in various spots in order to handle the "oplist"-intent correctly. Hence this patch, which implements the idea from PR 13746 to convert the `intent`-strings, used in various API-methods, into an internal renderingIntent that's implemented using a bit-field instead. Please note: This part of the patch, in itself, does not change the public API (but see below). This patch is tagged `api-minor` for the following reasons: 1. It changes the default value for the `intent` parameter, in the `PDFPageProxy.getAnnotations` method, to "display" in order to be consistent across the API. 2. In order to get all annotations, with the `PDFPageProxy.getAnnotations` method, you now need to explicitly set "any" as the `intent` parameter. 3. The `PDFPageProxy.getOperatorList` method will now also support the new "any" intent, to allow accessing the operatorList of all annotations (limited to those types that have one). 4. Finally, for consistency across the API, the `PDFPageProxy.render` method also support the new "any" intent (although I'm not sure how useful that'll be). Points 1 and 2 above are the significant, and thus breaking, changes in default behaviour here. However, unfortunately I cannot see a good way to improve the overall API while also keeping `PDFPageProxy.getAnnotations` unchanged.	2021-08-06 00:39:42 +02:00
Brendan Dahl	a38d1122d8	XFA - Support aria heading and table structure. (bug 1723421) (bug 1723425) https://bugzilla.mozilla.org/show_bug.cgi?id=1723421 https://bugzilla.mozilla.org/show_bug.cgi?id=1723425	2021-08-05 15:25:04 -07:00
Jonas Jenwald	39663e730e	Change the `hashParameters` function to return a `Map` rather than an Object (issue 13862) This patch (basically) mirrors the implementation in PR 13831, to get rid of the "Remote property injection" warning.	2021-08-04 15:17:13 +02:00
Jonas Jenwald	5dfdfbc70b	Fix some of the remaining linting issues in `test/resources/reftest-analyzer.js` Given that issue 13862 tracks updating/modernizing the code, this patch purposely limits the scope of the changes. In particular, the following things are still left to address: - The ESLint `no-undef` errors; for now the rule is simply disabled globally in this file. - A couple of unused variables are commented out for now, but could perhaps just be removed.	2021-08-04 14:14:04 +02:00
Jonas Jenwald	92300965a4	Fix most linting/formatting issues in the `test/resources/` folder These changes were done automatically, by using the `gulp lint --fix` command.	2021-08-04 13:59:21 +02:00
calixteman	52ef63f1fe	Merge pull request #13856 from calixteman/xfa_layout_rounding XFA - Avoid to put something in very small areas	2021-08-04 10:09:13 +02:00
Brendan Dahl	3e003245b1	[XFA] Add alt text for images. (bug 1723418) Not many XFA PDFs have alt text. Some examples: bug1723422.pdf xfa_bug1718670_1.pdf xfa_issue13611.pdf xfa_issue13633.pdf xfa_issue13634.pdf	2021-08-03 17:18:58 -07:00
Brendan Dahl	6cf1ee3251	Merge pull request #13858 from brendandahl/xfa-aria-label Add aria-labels to XFA form elements. (bug 1723422)	2021-08-03 17:18:08 -07:00
Brendan Dahl	6ea56f35ab	Add aria-labels to XFA form elements. (bug 1723422)	2021-08-03 15:58:33 -07:00
Tim van der Meij	b317e9311d	Merge pull request #13846 from Snuffleupagus/test-xfa Add a special `gulp xfatest` command, to limit the ref-tests to only XFA-documents (issue 13744)	2021-08-03 23:47:30 +02:00
Jonas Jenwald	844319cdb0	Add a special `gulp xfatest` command, to limit the ref-tests to only XFA-documents (issue 13744) The new command is a variation of the standard `gulp test` command and will run all unit/font/integration-tests just as normal, while only running ref-tests for XFA-documents to speed up development. Given that we currently have (some) unit-tests for XFA-documents, and that we may also (in the future) want to add integration-tests, it thus makes sense to run all test-suites in my opinion. Please note: Once this patch has landed, I'll submit a follow-up patch to https://github.com/mozilla/botio-files-pdfjs such that we can also run the new command on the bots.	2021-08-03 23:41:10 +02:00
Tim van der Meij	85be62c684	Merge pull request #13854 from Snuffleupagus/issue-13851 Prevent breaking errors when an optional content group is undefined (issue 13851)	2021-08-03 23:34:34 +02:00
Tim van der Meij	ad90fe90ed	Merge pull request #13848 from Snuffleupagus/rm-lgtm Remove the LGTM configuration and inline disable comments (issue 13829)	2021-08-03 23:13:05 +02:00
Jonas Jenwald	766299016f	Remove the `isEOF` helper function and slightly re-factor `EOF` Given how trivial the `isEOF` function is, we can simply inline the check at the various call-sites and remove the function (which ought to be ever so slightly more efficient as well). Furthermore, this patch also changes the `EOF` primitive itself to a `Symbol` instead of an Object since that has the nice benefit of making it unclonable (thus preventing accidentally trying to send `EOF` from the worker-thread).	2021-08-03 20:19:32 +02:00
Calixte Denizet	be1ee155d1	XFA - Avoid to put something in very small areas - it aims to fix #13855.	2021-08-03 17:05:29 +02:00
Jonas Jenwald	d5e14d3dc3	Prevent breaking errors when an optional content group is undefined (issue 13851) In the referenced PDF document most of the form `/Form` XObjects don't have an `/OC` entry, which thus causes the runtime failure during rendering.	2021-08-03 15:59:29 +02:00
Jonas Jenwald	8fef8630fe	Remove the LGTM configuration and inline disable comments (issue 13829) Given that the GitHub Advanced Security workflow now covers everything that LGTM does, but generally faster and with better GitHub-integration, there's no longer much point in also running LGTM separately. As a follow-up to this patch, we should also disable/remove the LGTM-integration from the PDF.js repository.	2021-08-03 11:14:49 +02:00
Jonas Jenwald	16a09eaed8	Fix a broken regular expression in the `docId` unit-test (issue 13838, PR 13813 follow-up) The current regular expression contains a typo, leading to intermittent test-failures for certain `docId`s; sorry about that!	2021-08-01 15:18:25 +02:00
Tim van der Meij	d1c0f8f91c	Implement unit tests for the `parseQueryString` utility function Now that these unit tests are in place, we also take the opportunity to slightly modernize the code itself by using a `for ... of` loop.	2021-08-01 14:14:33 +02:00
Tim van der Meij	10a1db6980	Merge pull request #13824 from Snuffleupagus/issue-13823 When no "V" entry exists, let the fieldValue fallback to the "DV" entry (issue 13823)	2021-07-30 22:48:38 +02:00
Tim van der Meij	99b14a9da0	Merge pull request #13813 from Snuffleupagus/rm-closure-API Remove a couple of closures in the `src/display/api.js` file	2021-07-30 21:55:45 +02:00
Jonas Jenwald	ff71be793d	When no "V" entry exists, let the fieldValue fallback to the "DV" entry (issue 13823)	2021-07-30 16:17:42 +02:00
Calixte Denizet	7bb5331087	XFA - Avoid an error when an exdata is a string (bug 1723114)	2021-07-30 14:43:53 +02:00
Jonas Jenwald	b18620ac0f	Remove the closure used with the `PDFDocumentLoadingTask` class This patch utilizes the same approach as used in lots of other parts of the code-base, which thus slightly reduces the size of this code. By removing some of the (current) indirection, we can also simplify the JSDocs a little bit. Looking at the `gulp jsdoc` output, this actually seem to improve the documentation for this class.	2021-07-30 11:34:47 +02:00
Calixte Denizet	4a4591bd2c	XFA - Fix font scale factors (bug 1720888) - All the scale factors in for the substitution font were wrong because of different glyph positions between Liberation and the other ones: - regenerate all the factors - Text may have polish chars for example and in this case the glyph widths were wrong: - treat substitution font as a composite one - add a map glyphIndex to unicode for Liberation in order to generate width array for cid font	2021-07-28 19:10:42 +02:00
Calixte Denizet	76d882b560	XFA - Fix auto-sized fields (bug 1722030) - In order to better compute text fields size, use line height with no gaps (and consequently guessed height for text are slightly better in general). - Fix default background color in fields.	2021-07-28 09:43:15 +02:00
Tim van der Meij	336a74a0e5	Merge pull request #13796 from Snuffleupagus/issue-13794 Allow `StreamsSequenceStream.readBlock` to skip sub-streams with errors (issue 13794)	2021-07-27 22:25:58 +02:00
Calixte Denizet	959120e6c9	XFA - Elements created outside of XML must have all their properties (bug 1722029) - an Image element was created, attached to its parent but the $globalData property was not set and that led to an error. - the pdf in bug 1722029 has 27 rendered rows (checked in Acrobat) when only one was displayed: this patch some binding issues around the occur element.	2021-07-26 19:38:52 +02:00
Jonas Jenwald	885e7a8aa4	Allow `StreamsSequenceStream.readBlock` to skip sub-streams with errors (issue 13794) This patch makes use of the existing `ignoreErrors` option, thus allowing a page to continue parsing/rendering even if (some of) its sub-streams are corrupt. Obviously this may cause part of a page to be broken/missing, however it should be better than (potentially) rendering nothing. Also, to the best of my knowledge, this is the first bug of its kind that we've encountered. To avoid having to pass in a bunch of, for a `BaseStream`-instance, mostly unrelated parameters when initializing a `StreamsSequenceStream`-instance, I settled on utilizing a callback function instead to allow conditional Error-suppression. Note that the `StreamsSequenceStream`-class is a special stream-implementation that we only use when the `/Contents`-entry, in the `/Page`-dictionary, consists of an Array with streams.	2021-07-26 16:42:50 +02:00
Tim van der Meij	41a2b5c809	Merge pull request #13787 from Snuffleupagus/lgtm-fix-warnings Fix (most) LGTM warnings	2021-07-24 15:20:07 +02:00

1 2 3 4 5 ...

2506 Commits