pdf.js

Author	SHA1	Message	Date
calixteman	b10b8dad7d	Merge pull request #14853 from calixteman/white_lines Use integer coordinates when drawing images (bug 1264608, issue #3351)	2022-04-29 18:15:03 +02:00
Calixte Denizet	624d8a8e3e	Use integer coordinates when drawing images (bug 1264608, issue #3351 ) - it aims to fix https://bugzilla.mozilla.org/show_bug.cgi?id=1264608; - it's only a partial fix for #3351; - some tiled images have some spurious white lines between the tiles. When the current transform is applyed the corners of an image can have some non-integer coordinates leading to some extra transparency added to handle that. So with this patch the current transform is applied on the point and on the dimensions in order to have at the end only integer values.	2022-04-29 16:01:34 +02:00
Jonas Jenwald	fbf6dee8ee	[api-minor] Remove the `forceClamped`-functionality in the Streams (issue 14849) As it turns out, most of the code-paths in the `PDFImage`-class won't actually pass the TypedArray (containing the image-data) to the `ColorSpace`-code. Hence we generally don't need to force the image-data to be a `Uint8ClampedArray`, and can just as well directly use a `Uint8Array` instead. In the following cases we're returning the data without any `ColorSpace`-parsing, and the exact TypedArray used shouldn't matter: - `b72a448327/src/core/image.js (L714)` - `b72a448327/src/core/image.js (L751)` In the following cases the image-data is only used internally, and again the exact TypedArray used shouldn't matter: - `b72a448327/src/core/image.js (L762)` with the actual image-data being defined (as `Uint8ClampedArray`) further below - `b72a448327/src/core/image.js (L837)` Please note: This is tagged `api-minor` because it's API-observable, given that some image/mask-data will now be returned as `Uint8Array` rather than using `Uint8ClampedArray` unconditionally. However, that seems like a small price to pay to (slightly) reduce memory usage during image-conversion.	2022-04-29 14:46:30 +02:00
Jonas Jenwald	71370d012b	Support destinations in NameTrees with encoded keys (issue 14847) Initially I considered updating the `NameOrNumberTree`-implementation to handle encoded keys, however that quickly became somewhat messy (especially in the `NameOrNumberTree.get`-method) since only NameTrees using string-keys. Hence the easiest solution, as far as I'm concerned, was thus to just update the `Catalog.destinations`-getter instead. Please note that in the referenced PDF document the `Catalog.destination`-method will thus fallback to fetch all destinations, which should be fine since this is the very first case of encoded keys that we've seen. Also changes the `NameOrNumberTree.getAll`-method to prevent a possible run-time error, although we've so far not seen such a case, for any non-Array Kids-entries found in a NameTree/NumberTree. Finally, to improve overall consistency and to hopefully prevent future bugs, the patch also updates a couple of other `NameTree` call-sites to correctly handle encoded keys. (Note that the `Catalog.attachments`-getter was already doing this.)	2022-04-27 11:19:55 +02:00
Calixte Denizet	314fd83bba	Don't use pref 'browser.download.improvements_to_download_panel' in Firefox (#14822 )	2022-04-25 15:05:43 +02:00
Tim van der Meij	752dee5caa	Merge pull request #14825 from Snuffleupagus/issue-14824 Ensure that worker-thread image caching doesn't break optional content (issue 14824)	2022-04-23 13:19:56 +02:00
Tim van der Meij	f9e54d9226	Merge pull request #14823 from Snuffleupagus/issue-14821 Ignore invalid /Encoding-entries when parsing fonts (issue 14821)	2022-04-23 13:19:26 +02:00
Jonas Jenwald	6c229dffb1	Ensure that worker-thread image caching doesn't break optional content (issue 14824) Currently we only insert optionalContent-data into the operatorList the first time that an image is parsed, which will (in hindsight) obviously cause problems for cached images. Hence we also need to insert the optionalContent-data in the various worker-thread image caches, such that it can be accessed in the fast-paths that are used to skip re-parsing of images. In order to reduce the amount of repeated code, this patch also adds a new `OperatorList`-method that takes care of inserting the necessary data in the operatorList.	2022-04-22 14:49:16 +02:00
Jonas Jenwald	e723da7261	Ignore invalid /Encoding-entries when parsing fonts (issue 14821) In the referenced PDF document the fonts have /Encoding-entries that are Streams (containing completely bogus data), which are thus obviously not valid here. Hence, only when `ignoreErrors` is set, we'll now ignore these corrupt /Encoding-entries and fallback to the existing code to try and infer a usable encoding. Given that this is clearly a case of corrupt PDF documents, there's no guarantee that this will "fix" all such cases, however it's the best that we do here and shouldn't really be worse than ignoring an entire font.	2022-04-22 11:49:03 +02:00
Jonas Jenwald	39d1bdde09	Ignore non-Stream /SMask-entries when parsing images (issue 14814) This is similar to the pre-existing check used in the /Mask-case below, to handle corrupt PDF documents that include non-Stream /SMask-entries in images; please refer to the PDF specification: https://web.archive.org/web/20220309040754if_/https://www.adobe.com/content/dam/acom/en/devnet/pdf/pdfs/PDF32000_2008.pdf#page=216 Please note: Adobe Reader also fails to render the image on the second page, and displays an error message.	2022-04-21 12:14:08 +02:00
Jonas Jenwald	5bc7339c1b	Add support for the /Catalog Base-URI when resolving URLs (issue 14802) As far as I can tell, this is actually the very first time that we've seen a PDF document with a Base-URI specified in the /Catalog; please refer to the specification: https://web.archive.org/web/20220309040754if_/https://www.adobe.com/content/dam/acom/en/devnet/pdf/pdfs/PDF32000_2008.pdf#G11.2097122 To simplify the overall implementation, this new parameter is accessed via the existing `BasePdfManager.docBaseUrl`-getter and will thus override any user-specified `docBaseUrl` API-parameter.	2022-04-19 17:14:52 +02:00
Calixte Denizet	3d74d2c6cb	Don't clip when the clip path is empty (issue #12306 )	2022-04-18 10:33:44 +02:00
Calixte Denizet	f62d961dfe	Improve performances with image masks (bug 857031) - it's the second part of the fix for https://bugzilla.mozilla.org/show_bug.cgi?id=857031; - some image masks can be used several times but at different positions; - an image need to be pre-process before to be rendered: * rescale it; * use the fill color/pattern. - the two operations above are time consuming so we can cache the generated canvas; - the cache key is based on the current transform matrix (without the translation part) and the current fill color when it isn't a pattern. - the rendering of the pdf in the above bug is really faster than without this patch.	2022-04-16 20:48:39 +02:00
Calixte Denizet	040fcae5ab	Improve performance with image masks (bug 857031) - it aims to partially fix performance issue reported: https://bugzilla.mozilla.org/show_bug.cgi?id=857031; - the idea is too avoid to use byte arrays but use ImageBitmap which are a way faster to draw: * an ImageBitmap is Transferable which means that it can be built in the worker instead of in the main thread: - this is achieved in using an OffscreenCanvas when it's available, there is a bug to enable them for pdf.js: https://bugzilla.mozilla.org/show_bug.cgi?id=1763330; - or in using createImageBitmap: in Firefox a task is sent to the main thread to build the bitmap so it's slightly slower than using an OffscreenCanvas. * it's transfered from the worker to the main thread by "reference"; * the byte buffers used to create the image data have a very short lifetime and ergo the memory used is globally less than before. - Use the localImageCache for the mask; - Fix the pdf issue4436r.pdf: it was expected to have a binary stream for the image; - Move the singlePixel trick from operator_list to image: this way we can use this trick even if it isn't in a set as defined in operator_list.	2022-04-09 18:26:26 +02:00
Jonas Jenwald	36a289d747	Merge pull request #14735 from calixteman/14685 [Annotations] Some annotations can have their values stored in the xfa:datasets	2022-04-01 11:30:16 +02:00
Calixte Denizet	0b597304c1	[Annotations] Some annotations can have their values stored in the xfa:datasets - it aims to fix #14685; - add a basic object to get values from the parsed datasets; - these annotations don't have an appearance so we must create one when printing or saving.	2022-04-01 10:28:04 +02:00
Jonas Jenwald	addb4cb12b	Use `String.prototype.repeat()` in a couple of spots Rather than using a temporary Array to manually create repeated strings, we can use `String.prototype.repeat()` instead. The reason that we didn't use this from the start is most likely because some browsers, notably IE, didn't support this; note https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Global_Objects/String/repeat#browser_compatibility	2022-03-30 15:42:40 +02:00
Calixte Denizet	ad3fb71a02	[Annotations] Add support for printing/saving choice list with multiple selections - it aims to fix issue #12189.	2022-03-29 18:59:44 +02:00
Calixte Denizet	18e79e3c0b	[text selection] Add the whitespaces present in the pdf in the text chunk - it aims to fix issue #14627; - the basic idea of the recent text refactoring was to only consider the rendered visible whitespaces. But sometimes, the heuristics aren't correct and although some whitespaces are in the text stream they weren't in the text chunks because they were too small. Hence we added some exceptions, for example, we always add a whitespace when it is between two non-whitespace chars but only when in the same Tj. So basically, this patch removes the constraint to have the chars in the same Tj (in using a circular buffer to save the two last chars) but don't add a space when the visible space is really too small (hence `NOT_A_SPACE_FACTOR`).	2022-03-27 14:34:56 +02:00
Jonas Jenwald	849de5a508	Slightly improve validation of (some) parameters in `getDocument` There's a couple of `getDocument` parameters that should be numbers, but which are currently not fully validated to prevent issues elsewhere in the code-base. Also, improves validation of the `ownerDocument` parameter since we currently accept more-or-less anything here.	2022-03-21 13:32:17 +01:00
Calixte Denizet	f0b549c2a2	[JS] - Parse a date in using the given format first and then try the default date parser - it aims to fix #14672.	2022-03-19 16:07:43 +01:00
Jonas Jenwald	c0736647f9	Add general iteration support in the `RefSet` and `RefSetCache` classes This patch removes the existing `forEach` methods, in favor of making the classes properly iterable instead. Given that the classes are using a `Set` respectively a `Map` internally, implementing this is very easy/efficient and allows us to simplify some existing code.	2022-03-18 14:27:34 +01:00
Jonas Jenwald	fb345ee184	Enable the "gets fieldObjects" unit-test in Node.js (PR 14409 follow-up) Apparently this unit-test works in Node.js now, hence it's possible that the reason it didn't work previously is that there were bugs in our old `structuredClone` polyfill.	2022-03-13 10:40:57 +01:00
Tim van der Meij	bcf453cf14	Merge pull request #14656 from Snuffleupagus/mv-isSameOrigin Move the `isSameOrigin` helper function	2022-03-11 21:08:49 +01:00
Jonas Jenwald	537ed37835	Move the `isSameOrigin` helper function This function is currently placed in the `src/shared/util.js` file, which means that the code is duplicated in both of the built `pdf.js` and `pdf.worker.js` files. Furthermore, it only has a single call-site which is also specific to the `GENERIC`-build of the PDF.js library. Hence this helper function is instead moved into the `src/display/api.js` file, in such a way that it's conditionally defined but still can be unit-tested.	2022-03-10 13:51:09 +01:00
Jonas Jenwald	e08e3f4d37	Replace XMLHttpRequest usage with the Fetch API in `send` (in `test/unit/testreporter.js`) Besides converting the `send` function to use the Fetch API, this patch also changes the method to return a `Promise` to get rid of the callback function. (Although, currently there's no call-site passing in a callback function.)	2022-03-10 12:55:08 +01:00
Tim van der Meij	ee39499a5a	Merge pull request #14651 from Snuffleupagus/Driver-inlineImages-fetch Replace XMLHttpRequest usage with the Fetch API in `inlineImages` (in `test/driver.js`)	2022-03-09 20:47:38 +01:00
Jonas Jenwald	b3f4758183	Replace XMLHttpRequest usage with the Fetch API in `inlineImages` (in `test/driver.js`) This is the final part in a series of patches that try to re-implement PR 14287 in smaller steps. Besides converting `inlineImages` to use the Fetch API, this patch also combines the `inlineImages` and `resolveImages` functions since they are always used together.	2022-03-09 11:32:51 +01:00
Jonas Jenwald	19c2cc8689	Replace XMLHttpRequest usage with the Fetch API in `Driver._send` This is another part in a series of patches that try to re-implement PR 14287 in smaller steps. Besides converting `Driver._send` to use the Fetch API, this also changes the method to return a `Promise` to get rid of the callback function. Please note that I purposely try to maintain the existing behaviour of re-sending the data on failure/unexpected response, including how/where the old callback function was invoked.	2022-03-07 16:00:52 +01:00
Tim van der Meij	3e593cfc1d	Merge pull request #14636 from Snuffleupagus/Driver-quit-fetch Replace XMLHttpRequest usage with the Fetch API in `Driver._quit`	2022-03-06 18:26:55 +01:00
Jonas Jenwald	90445679e8	Replace XMLHttpRequest usage with the Fetch API in the reftest-analyzer	2022-03-06 16:02:34 +01:00
Jonas Jenwald	65d5974192	Replace XMLHttpRequest usage with the Fetch API in `Driver._quit` This is another step in what'll hopefully become a series of patches to implement PR 14287 in smaller steps.	2022-03-06 15:36:48 +01:00
Jonas Jenwald	62e0939ce2	Replace XMLHttpRequest usage with the Fetch API in `loadStyles` (in `test/driver.js`) This is another small step in what'll hopefully become a series of patches to implement PR 14287 in smaller steps.	2022-03-06 13:57:42 +01:00
Jonas Jenwald	151b140eac	Replace XMLHttpRequest usage with the Fetch API in `Driver.run` This is a first step in what'll hopefully become a series of patches to implement PR 14287 in smaller steps.	2022-03-06 12:47:12 +01:00
Jonas Jenwald	1a7921dbf0	Compute the loca table `endOffset`, of the "first" glyph, correctly (issue 14618) When there are multiple empty glyphs at the start of the data, ensure that the "first" glyph gets a correct `endOffset` to avoid skipping it during parsing in the `sanitizeGlyph` function.	2022-03-03 14:22:45 +01:00
Brendan Dahl	85ff7b117e	Merge pull request #14536 from calixteman/thin_line Fix some issues with lineWidth < 1 after transform (bug 1753075, bug 1743245, bug 1710019)	2022-03-02 09:46:15 -08:00
calixteman	046ff07ee3	Merge pull request #14610 from Snuffleupagus/jpx-resetContextProbabilities [JPEG 2000] Add support for resetContextProbabilities (bug 1731483)	2022-02-26 18:26:39 +01:00
Jonas Jenwald	99cd24ce3e	Remove the `isString` helper function The call-sites are replaced by direct `typeof`-checks instead, which removes unnecessary function calls. Note that in the `src/`-folder we already had more `typeof`-cases than `isString`-calls.	2022-02-26 16:33:41 +01:00
Tim van der Meij	cf7ce0aa7e	Merge pull request #14600 from Snuffleupagus/getPageIndex-more-validation [api-minor] Add validation for the `PDFDocumentProxy.getPageIndex` method	2022-02-26 15:30:00 +01:00
Tim van der Meij	0808376a72	Merge pull request #14599 from Snuffleupagus/Cmd-Name-validate-arg Ensure that `Cmd`/`Name` is only initialized with string arguments	2022-02-26 15:25:00 +01:00
Jeff Muizelaar	9b9609a6d8	[JPEG 2000] Add support for resetContextProbabilities (bug 1731483)	2022-02-26 13:05:23 +01:00
Jonas Jenwald	4157d771c0	Merge pull request #14609 from brendandahl/misc-reftest Improvements to the reftest analyzer.	2022-02-26 10:43:36 +01:00
Brendan Dahl	c5404bee0e	Improvements to the reftest analyzer. - Scroll the selected reference into view (makes it easier to tell which pdf you're looking at) - Show the keyboard shortcuts (easier for new people) - Keep the test/ref controls visible (if you scroll you can now tell if you're looking at a test or ref)	2022-02-25 13:23:19 -08:00
Brendan Dahl	a969440af8	Don't close window from test driver. Sometimes I get a "Unable to find target with id XXX closeTarget..." error when running tests which happens when test.js tries to close all the open pages. I haven't been able to fully verify since this is intermittent, but I think this is coming from us closing the window in driver.js and also trying to close it in test.js.	2022-02-25 09:55:52 -08:00
Calixte Denizet	46369e4aa5	Fix some issues with lineWidth < 1 after transform (bug 1753075, bug 1743245, bug 1710019) - it aims to fix: - https://bugzilla.mozilla.org/show_bug.cgi?id=1753075; - https://bugzilla.mozilla.org/show_bug.cgi?id=1743245; - https://bugzilla.mozilla.org/show_bug.cgi?id=1710019; - issue #13211; - issue #14521. - previously we were trying to adjust lineWidth to have something correct after the current transform is applied but this approach was not correct because finally the pixel is rescaled with the same factors in both directions. And sometimes those factors must be different (see bug 1753075). - So the idea of this patch is to apply a scale matrix to the current transform just before setting lineWidth and stroking. This scale matrix is computed in order to ensure that after transform, a pixel will have its two thickness greater than 1.	2022-02-25 18:37:34 +01:00
Jonas Jenwald	f4e78d9b38	Simplify the `decodeFontData`/`encodeFontData` font-test helper functions We can (and in my opinion should) use the standard `atob`/`btoa` functions, rather than manually re-implementing this functionality for the font-tests.	2022-02-25 11:40:03 +01:00
Jonas Jenwald	889b761f22	Merge pull request #14545 from brendandahl/output-scale Generate test images at different output scales.	2022-02-24 21:56:54 +01:00
Brendan Dahl	f5c3abb8f7	Generate test images at different output scales. This will default to generating test images at the device pixel ratio of the machine the tests are created on unless the test explicitly defines and output scale using the `outputScale` setting. This makes the test look visually like they would on the machine they are running on. It also allows us to test different output scales.	2022-02-24 11:27:41 -08:00
Jonas Jenwald	172d007598	[api-minor] Add validation for the `PDFDocumentProxy.getPageIndex` method Currently we'll happily attempt to send any argument passed to this method over to the worker-thread, without doing any sort of validation. That could obviously be quite bad, since there's first of all no protection against sending unclonable data. Secondly, it's also possible to pass data that will cause the `Ref.get` call in the worker-thread to fail immediately. In order to address all of these issues, we'll now properly validate the argument passed to `PDFDocumentProxy.getPageIndex` and when necessary reject already on the main-thread instead.	2022-02-24 12:01:51 +01:00
Jonas Jenwald	2be8036eb7	[api-minor] Reduce duplication in the "gets non-existent page" unit-test	2022-02-24 11:25:21 +01:00

1 2 3 4 5 ...

2778 Commits