pdf.js

Author	SHA1	Message	Date
Calixte Denizet	687c9a8710	Improve performance of applyMaskImageData - write some uint32 instead of uint8 to avoid the check before clamping; - unroll the loop to write data in the buffer - but keep a loop for the last element of a line: it likely doesn't hurt that much since it's executed only for one time for each line; - I tested on a macbook with an Apple chip, and on Firefox nightly the new code is almost 3.5x faster than before (~1.8x with Chrome).	2022-04-09 22:19:02 +02:00
Calixte Denizet	040fcae5ab	Improve performance with image masks (bug 857031) - it aims to partially fix performance issue reported: https://bugzilla.mozilla.org/show_bug.cgi?id=857031; - the idea is too avoid to use byte arrays but use ImageBitmap which are a way faster to draw: * an ImageBitmap is Transferable which means that it can be built in the worker instead of in the main thread: - this is achieved in using an OffscreenCanvas when it's available, there is a bug to enable them for pdf.js: https://bugzilla.mozilla.org/show_bug.cgi?id=1763330; - or in using createImageBitmap: in Firefox a task is sent to the main thread to build the bitmap so it's slightly slower than using an OffscreenCanvas. * it's transfered from the worker to the main thread by "reference"; * the byte buffers used to create the image data have a very short lifetime and ergo the memory used is globally less than before. - Use the localImageCache for the mask; - Fix the pdf issue4436r.pdf: it was expected to have a binary stream for the image; - Move the singlePixel trick from operator_list to image: this way we can use this trick even if it isn't in a set as defined in operator_list.	2022-04-09 18:26:26 +02:00
apeltop	a97dd26389	Correct typos	2022-04-09 09:43:18 +09:00
Jonas Jenwald	a919959d83	Slightly simplify the `Catalog._readMarkInfo` method We don't need to first check if the Dictionary contains the key, since trying to get a non-existent key simply returns `undefined` and we're already ensuring that the value is a boolean. Furthermore, we shouldn't need to worry about the `Object.prototype` containing enumerable properties since the checks (in `src/core/worker.js`) done for `Array.prototype` indirectly also cover `Object`s. (Keep in mind that an `Array` is just a special kind of `Object` in JavaScript.)	2022-04-05 16:37:51 +02:00
Jonas Jenwald	1dc4713a0b	Re-factor the `isLittleEndian`/`isEvalSupported` caching This functionality is very old, hence we should be able to improve the caching a little bit with modern JavaScript features.	2022-04-05 16:01:01 +02:00
Calixte Denizet	f4fcb59a5e	Refactor some xfa*** getters in document.js - it's a follow-up of PR #14735.	2022-04-03 20:38:12 +02:00
Jonas Jenwald	f33ce5fc2d	Decode non-ASCII values found in the xfa:datasets (PR 14735 follow-up) Please note: This is possibly bad/wrong in general, but I figured that submitting it for review wouldn't hurt. It seems that even Adobe Reader doesn't handle the non-ASCII characters that appear in some of the fields correctly, however it should be pretty easy to improve things on the PDF.js side.	2022-04-01 11:54:34 +02:00
Jonas Jenwald	36a289d747	Merge pull request #14735 from calixteman/14685 [Annotations] Some annotations can have their values stored in the xfa:datasets	2022-04-01 11:30:16 +02:00
Calixte Denizet	0b597304c1	[Annotations] Some annotations can have their values stored in the xfa:datasets - it aims to fix #14685; - add a basic object to get values from the parsed datasets; - these annotations don't have an appearance so we must create one when printing or saving.	2022-04-01 10:28:04 +02:00
Jonas Jenwald	addb4cb12b	Use `String.prototype.repeat()` in a couple of spots Rather than using a temporary Array to manually create repeated strings, we can use `String.prototype.repeat()` instead. The reason that we didn't use this from the start is most likely because some browsers, notably IE, didn't support this; note https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Global_Objects/String/repeat#browser_compatibility	2022-03-30 15:42:40 +02:00
Calixte Denizet	ad3fb71a02	[Annotations] Add support for printing/saving choice list with multiple selections - it aims to fix issue #12189.	2022-03-29 18:59:44 +02:00
Jonas Jenwald	0dd6bc9a85	Merge pull request #14703 from calixteman/14627 [text selection] Add the whitespaces present in the pdf in the text chunk	2022-03-27 15:20:19 +02:00
Calixte Denizet	18e79e3c0b	[text selection] Add the whitespaces present in the pdf in the text chunk - it aims to fix issue #14627; - the basic idea of the recent text refactoring was to only consider the rendered visible whitespaces. But sometimes, the heuristics aren't correct and although some whitespaces are in the text stream they weren't in the text chunks because they were too small. Hence we added some exceptions, for example, we always add a whitespace when it is between two non-whitespace chars but only when in the same Tj. So basically, this patch removes the constraint to have the chars in the same Tj (in using a circular buffer to save the two last chars) but don't add a space when the visible space is really too small (hence `NOT_A_SPACE_FACTOR`).	2022-03-27 14:34:56 +02:00
Jonas Jenwald	7f0589c74a	Change the type of the `container` property, in the `TextLayerRenderParameters` typedef (issue 14716) Given that the textLayer-code has been using a `DocumentFragment` ever since PR 3356 (back in 2013), simply updating the type of the `container` property should be fine. This patch also tries to, ever so slightly, improve the grammar of a couple of other properties in the typedef.	2022-03-24 22:42:37 +01:00
Jonas Jenwald	849de5a508	Slightly improve validation of (some) parameters in `getDocument` There's a couple of `getDocument` parameters that should be numbers, but which are currently not fully validated to prevent issues elsewhere in the code-base. Also, improves validation of the `ownerDocument` parameter since we currently accept more-or-less anything here.	2022-03-21 13:32:17 +01:00
Jonas Jenwald	73d2ddac0d	Update npm packages Note that the Prettier update made it possible to move a couple of comments after `default:`-cases back to their original/intended positions, please see https://prettier.io/blog/2022/03/16/2.6.0.html	2022-03-20 10:59:13 +01:00
Calixte Denizet	f0b549c2a2	[JS] - Parse a date in using the given format first and then try the default date parser - it aims to fix #14672.	2022-03-19 16:07:43 +01:00
Tim van der Meij	5de6af4e64	Merge pull request #14683 from Snuffleupagus/sendTest-cleanup [src/display/api.js] Simplify the `sendTest` function, used with Worker initialization (PR 14291 follow-up)	2022-03-19 13:38:05 +01:00
Jonas Jenwald	c0736647f9	Add general iteration support in the `RefSet` and `RefSetCache` classes This patch removes the existing `forEach` methods, in favor of making the classes properly iterable instead. Given that the classes are using a `Set` respectively a `Map` internally, implementing this is very easy/efficient and allows us to simplify some existing code.	2022-03-18 14:27:34 +01:00
Jonas Jenwald	be2b1d5d2a	[src/display/api.js] Simplify the `sendTest` function, used with Worker initialization (PR 14291 follow-up) Given that we now only use Workers when `postMessage` transfers are supported, there's really no point in trying to send a "test" message without transfers present. Hence, if `postMessage` transfers are not supported by the browser, we'll now fallback to "fake" Workers immediately instead. The comment about Opera is also removed, since it was originally added back in PR 983 and mentions Opera `11.60` [which was released in 2011](https://en.wikipedia.org/wiki/History_of_the_Opera_web_browser#Version_11).	2022-03-16 13:25:41 +01:00
Jonas Jenwald	d5c9be341d	[src/display/api.js] Use private static class fields, rather than `shadow`ed getter work-arounds (PR 13813, 13882 follow-up) At the time private static class fields were to new, however that's no longer an issue and we can thus (ever so slightly) simplify the code.	2022-03-16 13:02:34 +01:00
Jonas Jenwald	0c349c701f	Remove the `addLinkAttributes` warnings in the Annotation/XFA-layers (PR 14092 follow-up) These warnings have now been present in three releases, see PR 14092, hence it should (hopefully) be fine to remove them now.	2022-03-13 11:38:56 +01:00
Tim van der Meij	790735eaf1	Merge pull request #14658 from Snuffleupagus/api-validate-cMapUrl-standardFontDataUrl Validate the `cMapUrl`/`standardFontDataUrl` parameters in `getDocument`	2022-03-11 21:09:58 +01:00
Jonas Jenwald	a60b98412f	Validate the `cMapUrl`/`standardFontDataUrl` parameters in `getDocument` These changes make sense for two reasons: - Given that the parameters are potentially passed to the worker-thread, depending on the `useWorkerFetch` parameter, we need to prevent errors if the user provides values that aren't clonable. - By ensuring that the default values are indeed `null`, we'll trigger main-thread fetching (of CMaps and Standard fonts) as intended in the `PartialEvaluator` and thus potentially provide better Error messages.	2022-03-10 16:33:10 +01:00
Jonas Jenwald	537ed37835	Move the `isSameOrigin` helper function This function is currently placed in the `src/shared/util.js` file, which means that the code is duplicated in both of the built `pdf.js` and `pdf.worker.js` files. Furthermore, it only has a single call-site which is also specific to the `GENERIC`-build of the PDF.js library. Hence this helper function is instead moved into the `src/display/api.js` file, in such a way that it's conditionally defined but still can be unit-tested.	2022-03-10 13:51:09 +01:00
Tim van der Meij	e85bb0b599	Merge pull request #14645 from Snuffleupagus/Node-DOMMatrix-polyfill [api-minor] Remove the, in `legacy` builds, bundled `DOMMatrix` polyfill	2022-03-09 20:38:26 +01:00
Tim van der Meij	55a931e454	Merge pull request #14648 from Snuffleupagus/PDFDocument-stream Simplify the `PDFDocument` constructor	2022-03-09 20:36:49 +01:00
Jonas Jenwald	6a78f20b17	Simplify the `PDFDocument` constructor Originally the code in the `src/`-folder was shared between the main/worker-threads, and back then it probably made sense that the `PDFDocument` constructor accepted different arguments. However, for many years we've not been passing anything except Streams to `PDFDocument` and we should thus be able to slightly simplify that code. Note that for e.g. unit-tests of this code, using either a `NullStream` or a `StringStream` works just fine.	2022-03-08 17:13:47 +01:00
Jonas Jenwald	157a71d404	[api-minor] Remove the, in `legacy` builds, bundled `DOMMatrix` polyfill According to the MDN compatibility data, see https://developer.mozilla.org/en-US/docs/Web/API/DOMMatrix/DOMMatrix#browser_compatibility, all browsers that we support have native `DOMMatrix` implementations (since quite some time too). Hence Node.js is the only environment that lack `DOMMatrix` support, which probably isn't that surprising given that it's browser functionality. While the `DOMMatrix` polyfill isn't that large, it nonetheless seems completely unnecessary to bundle it in the `legacy` builds when it's not needed in browsers. However, we can avoid that by simply listing `dommatrix` as a dependency for the `pdfjs-dist` library.	2022-03-08 10:29:11 +01:00
Jonas Jenwald	6f600befdd	Update TypeScript to version `4.6.2` and work-around stricter type checks I'm guessing that we're now running into the class-related improvements mentioned in https://devblogs.microsoft.com/typescript/announcing-typescript-4-6/#target-es2022 To unblock this update, and any future ones, this patch simply tweaks the JSDocs to get `gulp typestest` to run without errors.	2022-03-07 11:55:17 +01:00
Tim van der Meij	5242c38af5	Merge pull request #14628 from Snuffleupagus/issue-14626 When `stopAtErrors` is set, throw rather than warn when exceeding `maxImageSize` (issue 14626)	2022-03-05 13:09:36 +01:00
Tim van der Meij	5d12ac576b	Merge pull request #14631 from Snuffleupagus/typedef-fixes Fix a couple of small typos in JSDoc `typedef` comments	2022-03-05 13:06:53 +01:00
Jonas Jenwald	939e6f0c4c	Fix a couple of small typos in JSDoc `typedef` comments While this doesn't affect the official API documentation, these cases should nonetheless be fixed.	2022-03-04 12:11:52 +01:00
Jonas Jenwald	1a7921dbf0	Compute the loca table `endOffset`, of the "first" glyph, correctly (issue 14618) When there are multiple empty glyphs at the start of the data, ensure that the "first" glyph gets a correct `endOffset` to avoid skipping it during parsing in the `sanitizeGlyph` function.	2022-03-03 14:22:45 +01:00
Jonas Jenwald	d0d5c596fb	When `stopAtErrors` is set, throw rather than warn when exceeding `maxImageSize` (issue 14626) The situation described in issue 14626 seems like a fairly special case, and it thus seem reasonable that we simply follow the same pattern as elsewhere in the `PartialEvaluator` when the `stopAtErrors` API-option is being used.	2022-03-03 13:11:29 +01:00
Brendan Dahl	85ff7b117e	Merge pull request #14536 from calixteman/thin_line Fix some issues with lineWidth < 1 after transform (bug 1753075, bug 1743245, bug 1710019)	2022-03-02 09:46:15 -08:00
Jonas Jenwald	ab55071568	Remove the JSDocs "External: Promise"-page, since `Promise`s are now a standard feature The "External: Promise"-page in the JSDocs pre-dates the introduction of `Promise`s, as a generally available standard JS feature, by a number of years. Hence it now longer seems necessary, as far as I can tell, to include this "special" page in the documentation. Also, while unrelated to the rest of the patch, updates the `test/`-folder description in the documentation.	2022-02-26 23:53:11 +01:00
calixteman	046ff07ee3	Merge pull request #14610 from Snuffleupagus/jpx-resetContextProbabilities [JPEG 2000] Add support for resetContextProbabilities (bug 1731483)	2022-02-26 18:26:39 +01:00
Jonas Jenwald	99cd24ce3e	Remove the `isString` helper function The call-sites are replaced by direct `typeof`-checks instead, which removes unnecessary function calls. Note that in the `src/`-folder we already had more `typeof`-cases than `isString`-calls.	2022-02-26 16:33:41 +01:00
Jonas Jenwald	6bd4e0f5af	Re-factor the `PDFDocument.documentInfo` method This removes the `DocumentInfoValidators` structure, and thus (slightly) simplifies the code overall. With these changes we only have to iterate through, and validate, the actually available Dictionary entries.	2022-02-26 16:33:21 +01:00
Tim van der Meij	f782f5e5bb	Merge pull request #14607 from Snuffleupagus/wrapReason-unreachable Simplify the `wrapReason` helper function	2022-02-26 15:37:29 +01:00
Tim van der Meij	cf7ce0aa7e	Merge pull request #14600 from Snuffleupagus/getPageIndex-more-validation [api-minor] Add validation for the `PDFDocumentProxy.getPageIndex` method	2022-02-26 15:30:00 +01:00
Jeff Muizelaar	9b9609a6d8	[JPEG 2000] Add support for resetContextProbabilities (bug 1731483)	2022-02-26 13:05:23 +01:00
Calixte Denizet	46369e4aa5	Fix some issues with lineWidth < 1 after transform (bug 1753075, bug 1743245, bug 1710019) - it aims to fix: - https://bugzilla.mozilla.org/show_bug.cgi?id=1753075; - https://bugzilla.mozilla.org/show_bug.cgi?id=1743245; - https://bugzilla.mozilla.org/show_bug.cgi?id=1710019; - issue #13211; - issue #14521. - previously we were trying to adjust lineWidth to have something correct after the current transform is applied but this approach was not correct because finally the pixel is rescaled with the same factors in both directions. And sometimes those factors must be different (see bug 1753075). - So the idea of this patch is to apply a scale matrix to the current transform just before setting lineWidth and stroking. This scale matrix is computed in order to ensure that after transform, a pixel will have its two thickness greater than 1.	2022-02-25 18:37:34 +01:00
Jonas Jenwald	28fc8248f0	Simplify the `wrapReason` helper function All call-sites that use `wrapReason` should be passing a (possibly cloned) `Error` to the helper function, hence we shouldn't need to have a fallback code-path for any other data. Note that for the `cancel`/`error` methods on Streams, since PR 11115 we've been asserting that the argument is in fact an `Error` as intended. When calling `wrapReason` from rejected Promises, we should also be guaranteed that an `Error` is provided thanks to the ESLint rules `no-throw-literal` and `prefer-promise-reject-errors`.	2022-02-25 18:31:12 +01:00
Jonas Jenwald	172d007598	[api-minor] Add validation for the `PDFDocumentProxy.getPageIndex` method Currently we'll happily attempt to send any argument passed to this method over to the worker-thread, without doing any sort of validation. That could obviously be quite bad, since there's first of all no protection against sending unclonable data. Secondly, it's also possible to pass data that will cause the `Ref.get` call in the worker-thread to fail immediately. In order to address all of these issues, we'll now properly validate the argument passed to `PDFDocumentProxy.getPageIndex` and when necessary reject already on the main-thread instead.	2022-02-24 12:01:51 +01:00
Jonas Jenwald	2be8036eb7	[api-minor] Reduce duplication in the "gets non-existent page" unit-test	2022-02-24 11:25:21 +01:00
Jonas Jenwald	ec87995050	Ensure that `Cmd`/`Name` is only initialized with string arguments Trying to use a non-string argument in either a `Cmd` or a `Name` is not intended, and would basically be an implementation error. Hence we can add a non-PRODUCTION check to enforce this, similar to the existing one used e.g. in the `Dict.set` method.	2022-02-23 22:39:12 +01:00
Tim van der Meij	2bb96a708c	Merge pull request #14598 from Snuffleupagus/rm-isBool Re-factor the `Catalog.viewerPreferences` method and remove the `isBool` helper function	2022-02-23 20:36:56 +01:00
Tim van der Meij	409cbfc817	Merge pull request #14597 from Snuffleupagus/Dict-set-validate-key Ensure that `Dict.set` only accepts string `key`s	2022-02-23 20:31:36 +01:00

1 2 3 4 5 ...

5262 Commits