pdf.js

Author	SHA1	Message	Date
Jonas Jenwald	876c962235	Ignore Annotations with too large border `width`s, to prevent the `annotationLayer` from rendering it over the surrounding document (bug 1552113) The border `width` will instead fallback to the default value of `1`, rather than ignoring it altoghether, to also ensure that e.g. `LinkAnnotation`s become clickable as intended. Fixes https://bugzilla.mozilla.org/show_bug.cgi?id=1552113	2019-06-01 15:51:22 +02:00
Tim van der Meij	209e42043a	Merge pull request #10873 from Snuffleupagus/worker-terminate-clearPrimitiveCaches Ensure that the `Cmd`/`Name`/`Ref` caches are cleared when terminating the worker (PR 10863 follow-up)	2019-05-31 12:56:53 +02:00
Jonas Jenwald	a3742a9f83	Ensure that the `Cmd`/`Name`/`Ref` caches are cleared when terminating the worker (PR 10863 follow-up) Usually when the worker is terminated it will also be completely destroyed/removed, which means that any global caches (such as the ones in `src/core/primitive.js`) should be automatically cleared in the process. However, for certain ways of loading the `pdf.worker.js` file, e.g. passing in a re-usable worker to `getDocument`, using the `workerPort` functionality, or even disabling workers completely (even though this is never a good idea), the worker file may be kept in memory and these caches will not be cleared as expected.	2019-05-30 20:57:28 +02:00
Jonas Jenwald	8857a81c8d	Re-use, rather than re-creating, some `Array`s when resetting them in `src/display/api.js` Calling `someArray = []` will create a new Array, which seems completely unnecessary when it's sufficient to just call `someArray.length = 0` to achieve the same effect. Even though I cannot imagine these particular cases having any noticeable performance impact, similar changes were made in `core/` code years ago since it's apparently more efficient memory wise.	2019-05-30 16:33:05 +02:00
Jani Pehkonen	343b1381a2	Don't clip if path is undefined in SVG back-end	2019-05-28 18:37:15 +03:00
Jonas Jenwald	5e045bcdba	Ensure that the `Cmd`/`Name`/`Ref` caches are cleared when running other `cleanup` code The purpose of these caches is to reduce peak memory usage, by only ever having a single instance of a particular object. However, as-is these caches are never cleared and they will thus remain until the worker is destroyed. This could very well have a negative effect on total memory usage, particularly for large/long documents, hence it seems to make sense to clear out these caches together with various other ones.	2019-05-26 14:29:59 +02:00
Jonas Jenwald	2fe9f3ff8f	Add caching to reduce the number of `Ref` objects This is similar to the existing caching used to reduced the number of `Cmd` and `Name` objects. With the `tracemonkey.pdf` file, this patch changes the number of `Ref` objects as follows (in the default viewer): \| \| Loading the first page \| Loading all the pages \| \|----------\|------------------------\|-------------------------\| \| `master` \| 332 \| 3265 \| \| `patch` \| 163 \| 996 \|	2019-05-26 12:23:37 +02:00
Tim van der Meij	bc1eb49a77	Implement creation date only for markup annotations The specification states that `CreationDate` is only available for markup annotations instead of for all annotation types. Moreover, popup annotations are not markup annotations according to the specification, so the creation date inheritance from the parent annotation is also removed there (note that only the modification date is used in e.g., the viewer).	2019-05-25 15:31:06 +02:00
Tim van der Meij	cf07918ccb	Implement contents for every annotation type The specification states that `Contents` can be available for every annotation types instead of only for markup annotations.	2019-05-18 15:52:17 +02:00
Tim van der Meij	1421b2f205	Merge pull request #10827 from Snuffleupagus/network-streams-class Convert the (remaining) network streams to ES6 classes	2019-05-16 22:04:29 +02:00
Jonas Jenwald	f9769af365	Convert `network.js` to use ES6 classes	2019-05-16 10:08:51 +02:00
Jonas Jenwald	cc661a4d38	Update `fetch_stream.js` to use `const` in more places	2019-05-16 09:15:43 +02:00
Jonas Jenwald	737705264b	Convert `transport_stream.js` to use ES6 classes	2019-05-16 09:15:39 +02:00
Jonas Jenwald	0784c98172	Remove unused `ref` property from the `parameters` object used when creating annotations in `AnnotationFactory._create` The only use-cases for this property was removed in PRs 7570 and 7775, and it's been completely unused ever since the latter one.	2019-05-16 08:33:38 +02:00
Tim van der Meij	c8c937c257	Merge pull request #10794 from janpe2/cidtogidmap-zero Fix glyph at index zero in CIDFontType2 that has a CIDToGIDMap stream	2019-05-15 00:04:39 +02:00
Jonas Jenwald	173fbef05b	Enable the `consistent-return` ESLint rule This rule is already enabled in mozilla-central, and helps ensure more consistent functions/methods, see https://searchfox.org/mozilla-central/rev/b9da45f63cb567244933c77b2c7e827a057d3f9b/tools/lint/eslint/eslint-plugin-mozilla/lib/configs/recommended.js#119-120 Please see https://eslint.org/docs/rules/consistent-return for additional information.	2019-05-11 14:27:21 +02:00
Jani Pehkonen	05c527f035	Fix glyph 0 in CIDFontType2 that has a CIDToGIDMap stream	2019-05-07 18:44:37 +03:00
Tim van der Meij	be1d6626a7	Implement creation/modification date for annotations This includes the information in the core and display layers. The date parsing logic from the document properties is rewritten according to the specification and now includes unit tests. Moreover, missing unit tests for the color of a popup annotation have been added. Finally the styling of the popup is changed slightly to make the text a bit smaller (it's currently quite large in comparison to other viewers) and to make the drop shadow a bit more subtle. The former is done to be able to easily include the modification date in the popup similar to how other viewers do this.	2019-05-05 14:51:03 +02:00
Jonas Jenwald	007fab6ab5	Change `PartialEvaluator.handleColorN` to throw when no valid pattern is found Currently `handleColorN` will fallback to add a completely unparsed/unvalidated operator when no valid pattern was found. This is unfortunate, since it could very easily lead to a couple of different errors: - `DataCloneError`s when attempting to send the data to the main-thread, e.g. when `args` is `Dict`/`Stream`. - Errors in `getShadingPatternFromIR` on the main-thread, unless `args` just happens to have the expected format. - Errors when actually attempting to render the pattern on the main-thread, since the `args` will most likely not have the expected format. Hence it probably makes sense to error in `PartialEvaluator.handleColorN`, and having invalid patterns fail gracefully via the existing `ignoreErrors` code-paths instead.	2019-05-04 12:53:18 +02:00
Tim van der Meij	155304a0c1	Merge pull request #10756 from Snuffleupagus/issue-10542 Attempt to handle corrupt PDF documents that contains path operators inside of text object (issue 10542)	2019-05-02 22:29:24 +02:00
Jonas Jenwald	96942d4f7f	Ensure that the `OperatorList` constructor actually initializes a `NullOptimizer` when intended (PR 9089 follow-up) It appears that this has been broken ever since PR 9089, which also introduced this code, since the `QueueOptimizer`/`NullOptimizer` choice was made based on the still undefined `this.intent` property. Furthermore, fixing this also uncovered the fact that the `NullOptimizer.reset` method was missing.	2019-05-02 17:37:05 +02:00
Jonas Jenwald	5335285cda	Attempt to handle corrupt PDF documents that contains path operators inside of text object (issue 10542) First of all, while this simple approach appears to work OK in practice I'm not sure if it's the best way of addressing the problem (assuming that you even want to). Second of all, while the solution implemented here only requires tracking/checking one new boolean in order for this to work, I'm nonetheless not entirely happy about this since it will add additional overhead (albeit very small) to the parsing of path operators in PDF documents just for a handful of corrupt ones.	2019-04-30 23:35:33 +02:00
Tim van der Meij	762c58e0fc	Merge pull request #10738 from Snuffleupagus/ViewerPreferences-api [api-minor] Add support for ViewerPreferences in the API (issue 10736)	2019-04-20 18:39:32 +02:00
Jonas Jenwald	34952b732e	Add a `getDocId` method to the `idFactory`, in `Page` instances, to avoid passing around `PDFManager` instances unnecessarily (PR 7941 follow-up) This way we can avoid manually building a "document id" in multiple places in `evaluator.js`, and it also let's us avoid passing in an otherwise unnecessary `PDFManager` instance when creating a `PartialEvaluator`.	2019-04-20 13:11:17 +02:00
Tim van der Meij	55d9b35d37	Merge pull request #10727 from Snuffleupagus/type3-image-resources Support (rare) Type3 fonts which contains image resources (issue 10717)	2019-04-18 23:07:26 +02:00
Jonas Jenwald	5e9b606e7b	[Firefox] Avoid displaying the indeterminate loadingBar when `disableStream=true` is set (PR 10714 follow-up) While PR 10714 did address the `disableRange=true` case, it also managed to "break" the `disableStream=true` case instead since the indeterminate loadingBar is now displayed when it shouldn't; sorry about that! The solution is simple enough though, don't attempt to fallback to `_fullRequestReader.onProgress` when handling "incomplete" loading information.	2019-04-16 15:35:42 +02:00
Jonas Jenwald	311bac3ebb	[api-minor] Add support for ViewerPreferences in the API (issue 10736) Please see the specification, https://www.adobe.com/content/dam/acom/en/devnet/acrobat/pdfs/PDF32000_2008.pdf#M11.9.12864.1Heading.71.Viewer.Preferences Furthermore, note that this patch only adds API support and unit-tests but does not attempt to integrate e.g. the `ViewerPreferences -> Direction` property into the viewer (which would be necessary to address issue 10736). The reason for this is that it's not entirely clear to me exactly if/how that could be implemented; e.g. would it be as simple as setting the `dir` attribute on the `viewerContainer` DOM element, or will it be more complicated? There's also the question of how the `ViewerPreferences -> Direction` value interacts with the `PageMode`, and this will generally require a fair bit of manual testing. Since the direction of the entire viewer depends on the browser locale, there's also a somewhat open question regarding what default value to use for different locales. Finally, if the viewer supports `ViewerPreferences -> Direction` then I'm assuming that it will be necessary to allow users to override the default value, which will require (most likely) new `SecondaryToolbar` buttons and icons for those etc. Hence this patch only lays the necessary foundation for eventually addressing issue 10736, but defers the actual implementation until later. (Time permitting, I'll try to look into the viewer part later.)	2019-04-14 14:20:52 +02:00
Tim van der Meij	ae2a4dc3dd	Implement free text annotations	2019-04-13 18:45:22 +02:00
Jonas Jenwald	be604bd195	Support (rare) Type3 fonts which contains image resources (issue 10717) The Type3 font type is not commonly used in PDF documents, as can be seen from telemetry data such as: https://telemetry.mozilla.org/new-pipeline/dist.html#!cumulative=0&end_date=2019-04-09&include_spill=0&keys=__none__!__none__!__none__&max_channel_version=nightly%252F68&measure=PDF_VIEWER_FONT_TYPES&min_channel_version=nightly%252F57&processType=&product=Firefox&sanitize=1&sort_by_value=0&sort_keys=submissions&start_date=2019-03-18&table=0&trim=1&use_submission_date=0 (see also https://github.com/mozilla/pdf.js/wiki/Enumeration-Assignments-for-the-Telemetry-Histograms#pdf_viewer_font_types). Type3 fonts containing image resources are very* rare in practice, usually they only contain path rendering operators, but as the issue shows they unfortunately do exist. Currently these Type3-related image resources are not handled in any special way, and given that fonts are document rather than page specific rendering breaks since the image resources are thus not available to the entire document. Fortunately fixing this isn't too difficult, but it does require adding a couple of Type3-specific code-paths to the `PartialEvaluator`. In order to keep the implementation simple, particularily on the main-thread, these Type3 image resources are completely decoded on the worker-thread to avoid adding too many special cases. This should not cause any issues, only marginally less efficient code, but given how rare this kind of Type3 font is adding premature optimizations didn't seem at all warranted at this point.	2019-04-13 18:27:50 +02:00
Tim van der Meij	17de90b88a	Merge pull request #10694 from Snuffleupagus/main-thread-progressiveDataLength Avoid dispatching range requests to fetch PDF data that's already loaded with streaming (PR 10675 follow-up)	2019-04-13 17:15:01 +02:00
Tim van der Meij	2d0c38d626	Merge pull request #10696 from Snuffleupagus/makeSubStream-ensureByte Update `ChunkedStream.makeSubStream` to actually check if (some) data exists when the `length` parameter is undefined	2019-04-13 17:12:20 +02:00
Jonas Jenwald	a7273c8efe	Avoid dispatching range requests to fetch PDF data that's already loaded with streaming (PR 10675 follow-up) Please note: This patch purposely ignores `src/display/network.js`, since its support for progressive reading depends on the non-standard `moz-chunked-arraybuffer` responseType which is currently in the process of being removed.	2019-04-13 00:26:13 +02:00
Vlastimil Máca	d96267c30c	Annotations - _preparePopup method replaced with MarkupAnnotation base class. This is just refactoring, so it shouldn't break anything. It should move annotation API closer to PDF spec and enable future expansion.	2019-04-12 11:24:21 +02:00
Tim van der Meij	4055d0a302	Implement caret annotations The file `test/pdfs/annotation-caret-ink.pdf` is already available in the repository as a reference test for this since I supplied it for another patch that implemented ink annotations.	2019-04-09 23:39:56 +02:00
Tim van der Meij	ce62373db3	Merge pull request #10674 from timvandermeij/svg-backend-es6 Convert `src/display/svg.js` to ES6 syntax and implement `setRenderingIntent` and `setFlatness` for the SVG backend	2019-04-06 17:15:14 +02:00
Tim van der Meij	5a03b1c0d7	Optimize `convertOpList` in `svg.js` by computing the operator ID mapping only once There is no need to recompute this for every operator list we encounter.	2019-04-06 16:57:31 +02:00
Tim van der Meij	2b18e5a355	Implement `setRenderingIntent` and `setFlatness` for the SVG backend This mirrors the canvas implementation where we ignore these operators. This avoids console spam regarding unimplemented operators we're not interested in. For the Tracemonkey paper, we're now down to one warning about tiling patterns which is in fact a valid one.	2019-04-06 16:57:30 +02:00
Tim van der Meij	47d3620d5a	Convert `src/display/svg.js` to ES6 syntax In particular, this should reduce intermediate string creation by using template strings and reduce variable lookup times by removing unneeded variables and caching `this.current` in more places.	2019-04-06 16:57:30 +02:00
Jonas Jenwald	f0a28b3c0d	[Firefox] Ensure that loading progress is reported, and the loadingBar updated, when `disableRange=true` is set With PR 10675 having fixed the completely broken `disableRange=true` setting in the Firefox version of PDF.js, I couldn't help but noticing that loading progress is never reported properly in that case. Currently loading progress is only reported for the `rangeProgress` chrome-event, which obviously isn't dispatched with `disableRange=true` set. However, the `progressiveRead` chrome-event includes loading progress as well, but this information isn't being used in any way. Furthermore, the `PDFDataRangeTransport.onDataProgress` method wasn't able to handle "complete" loading information, and neither was `PDFDataTransportStream._onProgress` since that method would only ever attempt to report it through a RangeReader (which won't exist when `disableRange=true` is set).	2019-04-06 12:53:33 +02:00
Tim van der Meij	b161050df4	Merge pull request #10709 from Snuffleupagus/pageLayout [api-minor] Add basic support for PageLayout in the API and the viewer	2019-04-05 23:07:32 +02:00
Tim van der Meij	8c8738ea47	Merge pull request #10678 from Snuffleupagus/rm-moz-chunked-arraybuffer Remove `moz-chunked-arraybuffer` support, and related code, from `src/display/network.js`	2019-04-05 22:52:28 +02:00
Jonas Jenwald	7a999d1d67	[api-minor] Add basic support for PageLayout in the API and the viewer Please see the specification, https://www.adobe.com/content/dam/acom/en/devnet/acrobat/pdfs/PDF32000_2008.pdf#G6.2393749, and refer to the inline comments for additional details.	2019-04-05 11:32:01 +02:00
Tim van der Meij	57abddc9ca	Merge pull request #10713 from Snuffleupagus/rm-JSDoc-annotation Remove `src/core/annotation.js` from the `gulp jsdoc` build target	2019-04-04 23:15:02 +02:00
Tim van der Meij	072c5864fb	Merge pull request #10675 from Snuffleupagus/PDFDataTransportStream-disableRange [Firefox regression] Fix `disableRange=true` bug in `PDFDataTransportStream`	2019-04-04 23:07:45 +02:00
Jonas Jenwald	f666395c24	Remove `src/core/annotation.js` from the `gulp jsdoc` build target Note how at https://mozilla.github.io/pdf.js/api/ it's being described as API docs, however `src/core/annotation.js` is not part of the public API. Furthermore, given that the code residing in the `src/core/` folder is run in a worker-thread, it's not even accessible on the main-thread (since `postMessage` is being used to transfer the data). Hence the different API methods simply returns a "proxy" to the underlying data, but not actually the same objects and data structures as in the worker-thread itself; thus it doesn't make a whole lot of sense to expose this in API docs as far as I'm concerned. Finally, the patch fixes a small JSDoc related typo in `src/display/api.js` when referring to the `TextStyle` typedef.	2019-04-04 18:03:08 +02:00
Jonas Jenwald	b40e6723be	Remove `moz-chunked-arraybuffer` support, and related code, from `src/display/network.js` The `moz-chunked-arraybuffer` responseType is a non-standard property, which has been subsumed by the Fetch API, and it's in the process of being removed from Firefox; please see https://bugzilla.mozilla.org/show_bug.cgi?id=1120171 and https://bugzilla.mozilla.org/show_bug.cgi?id=1411865 Please note: Rather than waiting for both `Fetch` and `ReadableStream` to be available in e.g. a Firefox ESR version (which is probably going to be 68 at the earliest), let's just decide that PDF.js release `2.1.266` will be the last one with `moz-chunked-arraybuffer` support and land this patch (since nothing should outright break without it anyway).	2019-04-01 20:48:51 +02:00
Jonas Jenwald	c6ddbd55e2	Add a `progressiveDataLength` fast-path to `ChunkedStream.ensureByte` This is similar to the existing check using in `ChunkedStream.ensureRange`.	2019-03-29 20:00:28 +01:00
Jonas Jenwald	49e8a270c4	Update `ChunkedStream.makeSubStream` to actually check if (some) data exists when the `length` parameter is undefined Note how `XRef.fetchUncompressed`, which is used a lot for most PDF documents, is calling the `makeSubStream` method without providing a `length` argument. In practice this results in the `makeSubStream` method, on the `ChunkedStream` instance, calling the `ensureRange` method with `NaN` as the end position, thus resulting in no data being requested despite it possibly being necessary. This may be quite bad, since in this particular case it will lead to a new `ChunkedStream` being created and also a new `Parser`/`Lexer` instance. Given that it's quite possible that even the very first `Parser.getObj` call could throw `MissingDataException`, this could thus lead to wasted time/resources (since re-parsing is necessary once the data finally arrives). You obviously need to be very careful to not have `ChunkedStream.makeSubStream` accidentally requesting the entire file, hence its `this.end` property is of no use here, but it should be possible to at least check that the `start` of the data is present before any potentially expensive parsing occurs.	2019-03-29 17:20:31 +01:00
Tim van der Meij	b4c3b94592	Merge pull request #6606 from Rob--W/pattern-scaling Improve performance and correctness of Tiling Patterns	2019-03-29 00:01:38 +01:00
Tim van der Meij	f9c58115fc	Merge pull request #10683 from janpe2/type0-noncid-cmap Use CMap in Type0 fonts when CFF is not a CID font	2019-03-28 00:07:08 +01:00

1 2 3 4 5 ...

3669 Commits