pdf.js

Author	SHA1	Message	Date
Calixte Denizet	ad3fb71a02	[Annotations] Add support for printing/saving choice list with multiple selections - it aims to fix issue #12189.	2022-03-29 18:59:44 +02:00
Jonas Jenwald	0dd6bc9a85	Merge pull request #14703 from calixteman/14627 [text selection] Add the whitespaces present in the pdf in the text chunk	2022-03-27 15:20:19 +02:00
Calixte Denizet	18e79e3c0b	[text selection] Add the whitespaces present in the pdf in the text chunk - it aims to fix issue #14627; - the basic idea of the recent text refactoring was to only consider the rendered visible whitespaces. But sometimes, the heuristics aren't correct and although some whitespaces are in the text stream they weren't in the text chunks because they were too small. Hence we added some exceptions, for example, we always add a whitespace when it is between two non-whitespace chars but only when in the same Tj. So basically, this patch removes the constraint to have the chars in the same Tj (in using a circular buffer to save the two last chars) but don't add a space when the visible space is really too small (hence `NOT_A_SPACE_FACTOR`).	2022-03-27 14:34:56 +02:00
Jonas Jenwald	7f0589c74a	Change the type of the `container` property, in the `TextLayerRenderParameters` typedef (issue 14716) Given that the textLayer-code has been using a `DocumentFragment` ever since PR 3356 (back in 2013), simply updating the type of the `container` property should be fine. This patch also tries to, ever so slightly, improve the grammar of a couple of other properties in the typedef.	2022-03-24 22:42:37 +01:00
Jonas Jenwald	849de5a508	Slightly improve validation of (some) parameters in `getDocument` There's a couple of `getDocument` parameters that should be numbers, but which are currently not fully validated to prevent issues elsewhere in the code-base. Also, improves validation of the `ownerDocument` parameter since we currently accept more-or-less anything here.	2022-03-21 13:32:17 +01:00
Jonas Jenwald	73d2ddac0d	Update npm packages Note that the Prettier update made it possible to move a couple of comments after `default:`-cases back to their original/intended positions, please see https://prettier.io/blog/2022/03/16/2.6.0.html	2022-03-20 10:59:13 +01:00
Calixte Denizet	f0b549c2a2	[JS] - Parse a date in using the given format first and then try the default date parser - it aims to fix #14672.	2022-03-19 16:07:43 +01:00
Tim van der Meij	5de6af4e64	Merge pull request #14683 from Snuffleupagus/sendTest-cleanup [src/display/api.js] Simplify the `sendTest` function, used with Worker initialization (PR 14291 follow-up)	2022-03-19 13:38:05 +01:00
Jonas Jenwald	c0736647f9	Add general iteration support in the `RefSet` and `RefSetCache` classes This patch removes the existing `forEach` methods, in favor of making the classes properly iterable instead. Given that the classes are using a `Set` respectively a `Map` internally, implementing this is very easy/efficient and allows us to simplify some existing code.	2022-03-18 14:27:34 +01:00
Jonas Jenwald	be2b1d5d2a	[src/display/api.js] Simplify the `sendTest` function, used with Worker initialization (PR 14291 follow-up) Given that we now only use Workers when `postMessage` transfers are supported, there's really no point in trying to send a "test" message without transfers present. Hence, if `postMessage` transfers are not supported by the browser, we'll now fallback to "fake" Workers immediately instead. The comment about Opera is also removed, since it was originally added back in PR 983 and mentions Opera `11.60` [which was released in 2011](https://en.wikipedia.org/wiki/History_of_the_Opera_web_browser#Version_11).	2022-03-16 13:25:41 +01:00
Jonas Jenwald	d5c9be341d	[src/display/api.js] Use private static class fields, rather than `shadow`ed getter work-arounds (PR 13813, 13882 follow-up) At the time private static class fields were to new, however that's no longer an issue and we can thus (ever so slightly) simplify the code.	2022-03-16 13:02:34 +01:00
Jonas Jenwald	0c349c701f	Remove the `addLinkAttributes` warnings in the Annotation/XFA-layers (PR 14092 follow-up) These warnings have now been present in three releases, see PR 14092, hence it should (hopefully) be fine to remove them now.	2022-03-13 11:38:56 +01:00
Tim van der Meij	790735eaf1	Merge pull request #14658 from Snuffleupagus/api-validate-cMapUrl-standardFontDataUrl Validate the `cMapUrl`/`standardFontDataUrl` parameters in `getDocument`	2022-03-11 21:09:58 +01:00
Jonas Jenwald	a60b98412f	Validate the `cMapUrl`/`standardFontDataUrl` parameters in `getDocument` These changes make sense for two reasons: - Given that the parameters are potentially passed to the worker-thread, depending on the `useWorkerFetch` parameter, we need to prevent errors if the user provides values that aren't clonable. - By ensuring that the default values are indeed `null`, we'll trigger main-thread fetching (of CMaps and Standard fonts) as intended in the `PartialEvaluator` and thus potentially provide better Error messages.	2022-03-10 16:33:10 +01:00
Jonas Jenwald	537ed37835	Move the `isSameOrigin` helper function This function is currently placed in the `src/shared/util.js` file, which means that the code is duplicated in both of the built `pdf.js` and `pdf.worker.js` files. Furthermore, it only has a single call-site which is also specific to the `GENERIC`-build of the PDF.js library. Hence this helper function is instead moved into the `src/display/api.js` file, in such a way that it's conditionally defined but still can be unit-tested.	2022-03-10 13:51:09 +01:00
Tim van der Meij	e85bb0b599	Merge pull request #14645 from Snuffleupagus/Node-DOMMatrix-polyfill [api-minor] Remove the, in `legacy` builds, bundled `DOMMatrix` polyfill	2022-03-09 20:38:26 +01:00
Tim van der Meij	55a931e454	Merge pull request #14648 from Snuffleupagus/PDFDocument-stream Simplify the `PDFDocument` constructor	2022-03-09 20:36:49 +01:00
Jonas Jenwald	6a78f20b17	Simplify the `PDFDocument` constructor Originally the code in the `src/`-folder was shared between the main/worker-threads, and back then it probably made sense that the `PDFDocument` constructor accepted different arguments. However, for many years we've not been passing anything except Streams to `PDFDocument` and we should thus be able to slightly simplify that code. Note that for e.g. unit-tests of this code, using either a `NullStream` or a `StringStream` works just fine.	2022-03-08 17:13:47 +01:00
Jonas Jenwald	157a71d404	[api-minor] Remove the, in `legacy` builds, bundled `DOMMatrix` polyfill According to the MDN compatibility data, see https://developer.mozilla.org/en-US/docs/Web/API/DOMMatrix/DOMMatrix#browser_compatibility, all browsers that we support have native `DOMMatrix` implementations (since quite some time too). Hence Node.js is the only environment that lack `DOMMatrix` support, which probably isn't that surprising given that it's browser functionality. While the `DOMMatrix` polyfill isn't that large, it nonetheless seems completely unnecessary to bundle it in the `legacy` builds when it's not needed in browsers. However, we can avoid that by simply listing `dommatrix` as a dependency for the `pdfjs-dist` library.	2022-03-08 10:29:11 +01:00
Jonas Jenwald	6f600befdd	Update TypeScript to version `4.6.2` and work-around stricter type checks I'm guessing that we're now running into the class-related improvements mentioned in https://devblogs.microsoft.com/typescript/announcing-typescript-4-6/#target-es2022 To unblock this update, and any future ones, this patch simply tweaks the JSDocs to get `gulp typestest` to run without errors.	2022-03-07 11:55:17 +01:00
Tim van der Meij	5242c38af5	Merge pull request #14628 from Snuffleupagus/issue-14626 When `stopAtErrors` is set, throw rather than warn when exceeding `maxImageSize` (issue 14626)	2022-03-05 13:09:36 +01:00
Tim van der Meij	5d12ac576b	Merge pull request #14631 from Snuffleupagus/typedef-fixes Fix a couple of small typos in JSDoc `typedef` comments	2022-03-05 13:06:53 +01:00
Jonas Jenwald	939e6f0c4c	Fix a couple of small typos in JSDoc `typedef` comments While this doesn't affect the official API documentation, these cases should nonetheless be fixed.	2022-03-04 12:11:52 +01:00
Jonas Jenwald	1a7921dbf0	Compute the loca table `endOffset`, of the "first" glyph, correctly (issue 14618) When there are multiple empty glyphs at the start of the data, ensure that the "first" glyph gets a correct `endOffset` to avoid skipping it during parsing in the `sanitizeGlyph` function.	2022-03-03 14:22:45 +01:00
Jonas Jenwald	d0d5c596fb	When `stopAtErrors` is set, throw rather than warn when exceeding `maxImageSize` (issue 14626) The situation described in issue 14626 seems like a fairly special case, and it thus seem reasonable that we simply follow the same pattern as elsewhere in the `PartialEvaluator` when the `stopAtErrors` API-option is being used.	2022-03-03 13:11:29 +01:00
Brendan Dahl	85ff7b117e	Merge pull request #14536 from calixteman/thin_line Fix some issues with lineWidth < 1 after transform (bug 1753075, bug 1743245, bug 1710019)	2022-03-02 09:46:15 -08:00
Jonas Jenwald	ab55071568	Remove the JSDocs "External: Promise"-page, since `Promise`s are now a standard feature The "External: Promise"-page in the JSDocs pre-dates the introduction of `Promise`s, as a generally available standard JS feature, by a number of years. Hence it now longer seems necessary, as far as I can tell, to include this "special" page in the documentation. Also, while unrelated to the rest of the patch, updates the `test/`-folder description in the documentation.	2022-02-26 23:53:11 +01:00
calixteman	046ff07ee3	Merge pull request #14610 from Snuffleupagus/jpx-resetContextProbabilities [JPEG 2000] Add support for resetContextProbabilities (bug 1731483)	2022-02-26 18:26:39 +01:00
Jonas Jenwald	99cd24ce3e	Remove the `isString` helper function The call-sites are replaced by direct `typeof`-checks instead, which removes unnecessary function calls. Note that in the `src/`-folder we already had more `typeof`-cases than `isString`-calls.	2022-02-26 16:33:41 +01:00
Jonas Jenwald	6bd4e0f5af	Re-factor the `PDFDocument.documentInfo` method This removes the `DocumentInfoValidators` structure, and thus (slightly) simplifies the code overall. With these changes we only have to iterate through, and validate, the actually available Dictionary entries.	2022-02-26 16:33:21 +01:00
Tim van der Meij	f782f5e5bb	Merge pull request #14607 from Snuffleupagus/wrapReason-unreachable Simplify the `wrapReason` helper function	2022-02-26 15:37:29 +01:00
Tim van der Meij	cf7ce0aa7e	Merge pull request #14600 from Snuffleupagus/getPageIndex-more-validation [api-minor] Add validation for the `PDFDocumentProxy.getPageIndex` method	2022-02-26 15:30:00 +01:00
Jeff Muizelaar	9b9609a6d8	[JPEG 2000] Add support for resetContextProbabilities (bug 1731483)	2022-02-26 13:05:23 +01:00
Calixte Denizet	46369e4aa5	Fix some issues with lineWidth < 1 after transform (bug 1753075, bug 1743245, bug 1710019) - it aims to fix: - https://bugzilla.mozilla.org/show_bug.cgi?id=1753075; - https://bugzilla.mozilla.org/show_bug.cgi?id=1743245; - https://bugzilla.mozilla.org/show_bug.cgi?id=1710019; - issue #13211; - issue #14521. - previously we were trying to adjust lineWidth to have something correct after the current transform is applied but this approach was not correct because finally the pixel is rescaled with the same factors in both directions. And sometimes those factors must be different (see bug 1753075). - So the idea of this patch is to apply a scale matrix to the current transform just before setting lineWidth and stroking. This scale matrix is computed in order to ensure that after transform, a pixel will have its two thickness greater than 1.	2022-02-25 18:37:34 +01:00
Jonas Jenwald	28fc8248f0	Simplify the `wrapReason` helper function All call-sites that use `wrapReason` should be passing a (possibly cloned) `Error` to the helper function, hence we shouldn't need to have a fallback code-path for any other data. Note that for the `cancel`/`error` methods on Streams, since PR 11115 we've been asserting that the argument is in fact an `Error` as intended. When calling `wrapReason` from rejected Promises, we should also be guaranteed that an `Error` is provided thanks to the ESLint rules `no-throw-literal` and `prefer-promise-reject-errors`.	2022-02-25 18:31:12 +01:00
Jonas Jenwald	172d007598	[api-minor] Add validation for the `PDFDocumentProxy.getPageIndex` method Currently we'll happily attempt to send any argument passed to this method over to the worker-thread, without doing any sort of validation. That could obviously be quite bad, since there's first of all no protection against sending unclonable data. Secondly, it's also possible to pass data that will cause the `Ref.get` call in the worker-thread to fail immediately. In order to address all of these issues, we'll now properly validate the argument passed to `PDFDocumentProxy.getPageIndex` and when necessary reject already on the main-thread instead.	2022-02-24 12:01:51 +01:00
Jonas Jenwald	2be8036eb7	[api-minor] Reduce duplication in the "gets non-existent page" unit-test	2022-02-24 11:25:21 +01:00
Jonas Jenwald	ec87995050	Ensure that `Cmd`/`Name` is only initialized with string arguments Trying to use a non-string argument in either a `Cmd` or a `Name` is not intended, and would basically be an implementation error. Hence we can add a non-PRODUCTION check to enforce this, similar to the existing one used e.g. in the `Dict.set` method.	2022-02-23 22:39:12 +01:00
Tim van der Meij	2bb96a708c	Merge pull request #14598 from Snuffleupagus/rm-isBool Re-factor the `Catalog.viewerPreferences` method and remove the `isBool` helper function	2022-02-23 20:36:56 +01:00
Tim van der Meij	409cbfc817	Merge pull request #14597 from Snuffleupagus/Dict-set-validate-key Ensure that `Dict.set` only accepts string `key`s	2022-02-23 20:31:36 +01:00
Tim van der Meij	1b51e10c9c	Merge pull request #14595 from Snuffleupagus/structuredClone-comment-support Update the support information for `structuredClone` (PR 14392 follow-up)	2022-02-23 20:27:35 +01:00
Jonas Jenwald	3704283f5b	Remove the `isBool` helper function The call-sites are replaced by direct `typeof`-checks instead, which removes unnecessary function calls.	2022-02-23 13:31:03 +01:00
Jonas Jenwald	82f1ee1755	Re-factor the `Catalog.viewerPreferences` method This removes the `ViewerPreferencesValidators` structure, and thus (slightly) simplifies the code overall. With these changes we only have to iterate through, and validate, the actually available Dictionary entries.	2022-02-23 13:25:56 +01:00
Jonas Jenwald	a2f9031e9a	Ensure that `Dict.set` only accepts string `key`s Trying to use a non-string `key` in a `Dict` is not intended, and would basically be an implementation error. Hence we can add a non-PRODUCTION check to enforce this, complementing the existing `value` check added in PR 11672.	2022-02-22 16:35:20 +01:00
Jonas Jenwald	48985bd221	Update the support information for `structuredClone` (PR 14392 follow-up) When the `structuredClone` polyfill was added, the support information in Safari was unclear. Given that an actual version number is now available, see below, it seems like a good idea to update the comment accordingly. https://developer.mozilla.org/en-US/docs/Web/API/structuredClone#browser_compatibility	2022-02-22 12:30:54 +01:00
Jonas Jenwald	05edd91bdb	Remove the `isNum` helper function The call-sites are replaced by direct `typeof`-checks instead, which removes unnecessary function calls. Note that in the `src/`-folder we already had more `typeof`-cases than `isNum`-calls. These changes were mostly done using regular expression search-and-replace, with two exceptions: - In `Font._charToGlyph` we no longer unconditionally update the `width`, since that seems completely unnecessary. - In `PDFDocument.documentInfo`, when parsing custom entries, we now do the `typeof`-check once.	2022-02-22 11:55:34 +01:00
Jonas Jenwald	b282814e38	Prefer `instanceof Name` rather than calling `isName()` with one argument Unless you actually need to check that something is both a `Name` and also of the correct type, using `instanceof Name` directly should be a tiny bit more efficient since it avoids one function call and an unnecessary `undefined` check. This patch uses ESLint to enforce this, since we obviously still want to keep the `isName` helper function for where it makes sense.	2022-02-21 12:45:00 +01:00
Jonas Jenwald	4df82ad31e	Prefer `instanceof Dict` rather than calling `isDict()` with one argument Unless you actually need to check that something is both a `Dict` and also of the correct type, using `instanceof Dict` directly should be a tiny bit more efficient since it avoids one function call and an unnecessary `undefined` check. This patch uses ESLint to enforce this, since we obviously still want to keep the `isDict` helper function for where it makes sense.	2022-02-21 12:44:56 +01:00
Jonas Jenwald	67b658e8d5	Prefer `instanceof Cmd` rather than calling `isCmd()` with one argument Unless you actually need to check that something is both a `Cmd` and also of the correct type, using `instanceof Cmd` directly should be a tiny bit more efficient since it avoids one function call and an unnecessary `undefined` check. This patch uses ESLint to enforce this, since we obviously still want to keep the `isCmd` helper function for where it makes sense.	2022-02-21 12:44:51 +01:00
Jonas Jenwald	bad15894fc	Improve the JSDocs for the `PDFObjects` class Given that we expose `PDFObjects`-instances, via the `commonObjs` and `objs` properties, on the `PDFPageProxy`-instances this ought to help provide slightly better TypeScript definitions.	2022-02-20 13:02:14 +01:00

1 2 3 4 5 ...

5202 Commits