pdf.js

Author	SHA1	Message	Date
Tim van der Meij	87a70f3359	Convert `let` to `const` if possible in `src/display/display_utils.js` Finally, `var` usage is removed.	2019-03-06 23:41:54 +01:00
Tim van der Meij	c43396c2b7	Merge pull request #10590 from janpe2/svg-missing-moveto Fix missing moveTos in SVG paths	2019-03-02 14:43:53 +01:00
Jonas Jenwald	d7d1f23826	Zero the width/height of the temporary canvas used during `TextLayer` rendering The default size of these canvases seem to be `300 x 150` (two orders of magnitude larger than the ones in PR 10597), which probably is sufficient enough to matter since there's one such canvas for each textLayer that's rendered in the viewer. Also fixes the incorrect rejection reason, i.e. one using a string rather than an `Error`, in the `TextLayerRenderTask.cancel` method.	2019-03-01 04:05:37 +01:00
Tim van der Meij	9559d57636	Merge pull request #10595 from Snuffleupagus/JpegDecode-zero-tmpCanvas Zero the width/height of the temporary canvas used during `JpegDecode` (issue 10594)	2019-02-28 23:41:22 +01:00
Tim van der Meij	39fa26ea33	Merge pull request #10597 from Snuffleupagus/isFontSubpixelAAEnabled-canvas-cleanup Ensure that the temporary canvas created in `CanvasGraphics.isFontSubpixelAAEnabled` will be cleared	2019-02-28 23:37:24 +01:00
Tim van der Meij	af5597b7e5	Merge pull request #10573 from Snuffleupagus/type3-avoid-truncation Avoid truncating/breaking some Type3 glyphs in `compileType3Glyph` (bug 1245391, issue 10568)	2019-02-28 23:25:45 +01:00
Jonas Jenwald	b61b4d3229	Ensure that the temporary canvas created in `CanvasGraphics.isFontSubpixelAAEnabled` will be cleared While this particular canvas may be small, there can still be an arbitrarily large number of them (one per page rendered), which can/will eventually add up memory wise. This can be easily avoided by using the `cachedCanvases` abstraction instead, which will ensure that the `isFontSubpixelAAEnabled` canvas is removed together with other temporary canvases in `CanvasGraphics.endDrawing`.	2019-02-28 14:18:38 +01:00
Jonas Jenwald	4687cc85ac	Zero the width/height of the temporary canvas used during `JpegDecode` (issue 10594)	2019-02-28 12:23:34 +01:00
Jonas Jenwald	f664e074c9	Avoid using the Fetch API, in `GENERIC` builds, for unsupported protocols (issue 10587)	2019-02-27 13:04:20 +01:00
Jonas Jenwald	cbc07f985b	Load built-in CMap files using the Fetch API when possible	2019-02-27 13:04:19 +01:00
Jani Pehkonen	52e8e9b059	Fix missing moveTos in SVG paths	2019-02-26 20:00:35 +02:00
Jonas Jenwald	a1f7517996	Rename the `src/display/dom_utils.js` file to `src/display/display_utils.js` This file (currently) contains not only DOM-specific helper functions/classes, but is used generally for various helper code relevant for main-thread functionality.	2019-02-23 16:30:16 +01:00
Jonas Jenwald	fb774a65b0	Avoid truncating/breaking some Type3 glyphs in `compileType3Glyph` (bug 1245391, issue 10568) Hopefully this patch makes sense, since I cannot claim to fully understand this function. With the changes made in PR 3354 some Type3 glyph outlines are no longer rendering correctly, since the final paths were being accidentally ignored. The fact that Type3 fonts are not very common in PDF documents, and that most Type3 glyphs are unaffected by this regression, probably explains why this has gone unnoticed since 2013.	2019-02-21 23:29:43 +01:00
Jonas Jenwald	b6d090cc14	Fallback to the built-in font renderer when font loading fails After PR 9340 all glyphs are now re-mapped to a Private Use Area (PUA) which means that if a font fails to load, for whatever reason[1], all glyphs in the font will now render as Unicode glyph outlines. This obviously doesn't look good, to say the least, and might be seen as a "regression" since previously many glyphs were left in their original positions which provided a slightly better fallback[2]. Hence this patch, which implements a general fallback to the PDF.js built-in font renderer for fonts that fail to load (i.e. are rejected by the sanitizer). One caveat here is that this only works for the Font Loading API, since it's easy to handle errors in that case[3]. The solution implemented in this patch does not in any way delay the loading of valid fonts, which was the problem with my previous attempt at a solution, and will only require a bit of extra work/waiting for those fonts that actually fail to load. Please note: This patch doesn't fix any of the underlying PDF.js font conversion bugs that's responsible for creating corrupt font files, however it does improve rendering in a number of cases; refer to this possibly incomplete list: [Bug 1524888](https://bugzilla.mozilla.org/show_bug.cgi?id=1524888) Issue 10175 Issue 10232 --- [1] Usually because the PDF.js font conversion code wasn't able to parse the font file correctly. [2] Glyphs fell back to some default font, which while not accurate was more useful than the current state. [3] Furthermore I'm not sure how to implement this generally, assuming that's even possible, and don't really have time/interest to look into it either.	2019-02-11 10:27:08 +01:00
Jonas Jenwald	13230a1123	Remove the ability to pass in more than one font to `BaseFontLoader.bind` - The only existing call-site, of this method, is never passing more than one font at a time anyway. - As far as I can remember, this functionality has never actually been used (caveat: I didn't check the git history). - This allows simplification of the method, especially by making use of the fact that it's now asynchronous. - It should be just as easy to call `BaseFontLoader.bind` from within a loop, rather than having the loop in the method itself.	2019-02-10 21:09:57 +01:00
Jonas Jenwald	af3fcca88d	Convert `BaseFontLoader.bind` to be async, and only utilize `BaseFontLoader._queueLoadingCallback` when actually necessary Currently all fonts are using the `_queueLoadingCallback` method to determine when they have been loaded[1]. However in most cases this is just adding unnecessary overhead, especially with `BaseFontLoader.bind` now being asynchronous, given how fonts are loaded: - For fonts loaded using the Font Loading API, it's already possible to easily tell when a font has been loaded simply by checking the `loaded` promise on the FontFace object itself. - For browsers, e.g. Firefox, which support synchronous font loading it's already assumed that fonts are immediately available. Hence the `_queueLoadingCallback` method is moved into the `GenericFontLoader`, such that it's only utilized for fonts which are loaded using CSS. --- [1] In the "fonts loaded using CSS" case, this is already a hack anyway as outlined in the comments.	2019-02-10 21:09:57 +01:00
Jonas Jenwald	614e502227	[api-minor] Remove the `document.currentScript` polyfill This polyfill is currently used in only one file, i.e. `src/display/api.js`, and only when trying to build a fallback `workerSrc` path. Given that the global `workerSrc` should always be set[1] when using the PDF.js library[2], and that the fallback `workerSrc` should only be regarded as a best-effort solution anyway, there isn't a particularily strong reason to keep the compatibility code in my opinion. --- [1] Other supported options include setting the global `workerPort`, or passing in a `PDFWorker` instance as part of the `getDocument` call. [2] Which is clearly mentioned in the JSDocs in `src/display/worker_options.js`.	2019-02-03 14:09:24 +01:00
Jonas Jenwald	5081063b9e	Attempt to clean-up/restore pending rendering operations when errors occurs while a `RenderTask` runs (PR 10202 follow-up) This piggybacks of the existing `cancel` functionality, to ensure that any pending operations are closed and that any temporary canvases are actually being removed. Also simplifies `finishPaintTask` in `PDFPageView.draw` slightly, by converting it to an async function.	2019-01-26 16:02:51 +01:00
Jonas Jenwald	01d624f6a0	Add an `Array.from` polyfill, using core-js, and remove some compatibility hacks from the `src/display/content_disposition.js` file	2019-01-20 08:49:20 +01:00
Jonas Jenwald	9f45f8dfda	When parsing Metadata, attempt to remove "junk" before the first tag (PR 10398 follow-up) This will allow the Metadata to be successfully extracted from the PDF file in issue 10395. Furthermore, this patch also fixes a bug in `Metadata.get` which causes the method to return `null` rather than an empty string or zero (since either ought to be allowed).	2019-01-16 12:44:27 +01:00
Jonas Jenwald	e8f4b47d59	Prevent errors, in `SimpleXMLParser.onEndElement`, when the stack has already been completely parsed (issue 10410) The error was triggered for a particular set of metadata, where an end tag was encountered without the corresponding begin tag being present in the data. (The patch also fixes a minor oversight, from a recent PR, in the `SimpleDOMNode.nextSibling` method.)	2019-01-05 11:15:34 +01:00
Jonas Jenwald	6cd9ff48f3	Prevent errors, because of incorrect scope, in the `XMLParserBase._resolveEntities` method (issue 10407)	2019-01-04 10:13:32 +01:00
Tim van der Meij	1b84b2ed60	Merge pull request #10398 from Snuffleupagus/issue-10395 Prevent errors in various methods in `SimpleDOMNode` when the `childNodes` property is not defined (issue 10395)	2019-01-01 16:22:11 +01:00
Jonas Jenwald	d371d23382	Prevent errors in various methods in `SimpleDOMNode` when the `childNodes` property is not defined (issue 10395) Given that the issue, as filed, is incomplete since no PDF file was provided for debugging, this patch is really the best that we can do here. Please note: This patch will not enable the Metadata to be successfully parsed, but it should at least prevent the errors.	2018-12-31 13:07:15 +01:00
Tim van der Meij	5b57e69da2	Optimize `CanvasGraphics.setFont` to avoid intermediate string creation This method creates quite a few intermediate strings on each call and it's called often, even for smaller documents like the Tracemonkey document. Scrolling from top to bottom in that document resulted in 14126 strings being created in this method. With this commit applied, this is reduced to 2018 strings.	2018-12-30 14:58:32 +01:00
Tim van der Meij	95f9075565	Optimize `TextLayerRenderTask._layoutText` to avoid intermediate string creation This method creates quite a few intermediate strings on each call and it's called often, even for smaller documents like the Tracemonkey document. Scrolling from top to bottom in that document resulted in 12936 strings being created in this method. With this commit applied, this is reduced to 3610 strings.	2018-12-30 14:39:08 +01:00
Tim van der Meij	103f4616ac	Merge pull request #10334 from Snuffleupagus/OpenAction-dest [api-minor] Add support for OpenAction destinations (issue 10332)	2018-12-23 20:49:50 +01:00
Jonas Jenwald	f0719ed565	[api-minor] Change the `getViewport` method, on `PDFPageProxy`, to take a parameter object rather than a bunch of (randomly) ordered parameters If, as PR 10368 suggests, more parameters should be added to `getViewport` I think that it would be a mistake to not change the signature first to avoid needlessly unwieldy call-sites. To not break any existing code and third-party use-cases, this is obviously implemented with a deprecation warning and with a working fallback[1] for the old method signature. --- [1] This is limited to `GENERIC` builds, which should be sufficient.	2018-12-21 11:55:20 +01:00
Jonas Jenwald	b05f053287	[api-minor] Add support for OpenAction destinations (issue 10332) Note that the OpenAction dictionary may contain other information besides just a destination array, e.g. instructions for auto-printing[1]. Given first of all that an arbitrary `Dict` cannot be sent from the Worker (since cloning would fail), and second of all that the data obviously needs to be validated, this patch purposely only adds support for fetching a destination from the OpenAction entry[2]. --- [1] This information is, currently in PDF.js, being included through the `getJavaScript` API method. [2] This significantly reduces the complexity of the implementation, which seems fine for now. If there's ever need for other kinds of OpenAction to be fetched, additional API methods could/should be implemented as necessary (could e.g. follow the `getOpenActionWhatever` naming scheme).	2018-12-19 11:45:16 +01:00
Jani Pehkonen	ddabeb0645	Handle line width of zero in SVG	2018-12-04 16:05:32 +02:00
Jonas Jenwald	ac6b94c9dd	Replace the remaining occurences, in `src/display/api.js`, of `var` with `let`/`const`	2018-11-18 19:08:27 +01:00
Jonas Jenwald	061f7bd2f3	Convert `PDFWorker`, in `src/display/api.js`, to an ES6 class Also changes all occurrences of `var` to `let`/`const` in this code.	2018-11-18 19:08:27 +01:00
Jonas Jenwald	02e77a39ec	Convert `InternalRenderTask`, in `src/display/api.js`, to an ES6 class This changes all occurrences of `var` to `let`/`const` in this code, and updates the signature of the constructor to use object destructuring for better readability (and self documentation). Also, `useRequestAnimationFrame` is changed to a parameter and the `typeof window` check is now done once rather than at every `_scheduleNext` call.	2018-11-18 19:08:27 +01:00
Jonas Jenwald	5a0d64a6de	Convert `PDFPageProxy`, in `src/display/api.js`, to an ES6 class This changes all occurrences of `var` to `let`/`const` in this code, and updates the signatures of a couple of methods to use object destructuring. Finally, when creating `InternalRenderTask` instances only the necessary parameter are now provided, since passing through the `RenderParameters` as-is seems completely unnecessary.	2018-11-18 19:08:25 +01:00
Jonas Jenwald	2c003a82d5	Convert `RenderTask`, in `src/display/api.js`, to an ES6 class Also deprecates the `then` method, in favour of the `promise` getter.	2018-11-18 19:08:00 +01:00
Jonas Jenwald	ef8e5fd77c	Convert `PDFDocumentLoadingTask`, in `src/display/api.js`, to an ES6 class Also deprecates the `then` method, in favour of the `promise` getter.	2018-11-18 19:07:57 +01:00
PalmerAL	5f15dc2023	Use `span` instead of `div` in the text layer This improves copy/pasting text content since it reduces the amount of unnecessary newlines.	2018-11-18 15:54:08 +01:00
Jonas Jenwald	60da2d882b	[api-minor] Refactor/simplify the `PDFObject` class First of all, note how there's currently two methods for checking if a certain object exists, which seems completely unwarranted. Furthermore, the rarely used `getData` method was removed and its only callsite changed to use a combination of `PDFObjects.{has, get}` instead. Finally, the methods were rearranged slightly, to bring the most important ones (for an API user) to the top of the class.	2018-11-08 10:13:39 +01:00
Jonas Jenwald	d32321d84f	Convert `PDFObjects`, in `src/display/api.js`, to an ES6 class Also changes all occurrences of `var` to `const`, and marks internal properties/methods as "private".	2018-11-08 10:11:40 +01:00
Tim van der Meij	ec76aa531e	Merge pull request #10202 from Snuffleupagus/issue-10200 Attempt to clean-up/restore pending rendering operations on `RenderTask.cancel` (issue 10200)	2018-11-02 23:11:47 +01:00
Jonas Jenwald	f23dba1c10	Change `canvasInRendering` to a `WeakSet` instead of a `WeakMap` Note how nowhere in the code `canvasInRendering.get()` is ever called, and that this structure is really only used to store references to `<canvas>` DOM elements. The reason for this being a `WeakMap` is probably because at the time we weren't using `core-js` polyfills yet, and since there already existed a manually implemented `WeakMap` polyfill it was probably simpler to use that.	2018-10-31 18:15:23 +01:00
Jonas Jenwald	f77b463339	Attempt to clean-up/restore pending rendering operations on `RenderTask.cancel` (issue 10200) Please note that, given the lack of a runnable example, I'm not totally sure if this first of all is enough to completely address the issue as filed and second of all if we actually want this new behaviour.	2018-10-31 16:22:17 +01:00
Tim van der Meij	ed4ac1bc67	Merge pull request #10162 from janpe2/svg-normalize-bbox Normalize BBox of form XObjects in SVG back-end	2018-10-28 13:18:48 +01:00
Jani Pehkonen	9cd5f94f03	Normalize the BBox of form XObjects on the /core side	2018-10-22 14:17:05 +03:00
Jonas Jenwald	5bb7f4b615	Convert `PDFDataRangeTransport` to an ES6 class	2018-10-20 17:15:27 +02:00
Jonas Jenwald	327f2eb588	Ensure that `onProgress` is always called when the entire PDF file has been loaded, regardless of how it was fetched (issue 10160) Please note: I'm totally fine with this patch being rejected, and the issue closed as WONTFIX; however these changes should address the issue if that's desired. From a conceptual point of view, reporting loading progress doesn't really make a lot of sense for PDF files opened by passing raw binary data directly to `getDocument` (since obviously all data was loaded). This is compared to PDF files loaded via e.g. `XMLHttpRequest` or the Fetch API, where the entire PDF file isn't available from the start and knowing the loading progress makes total sense. However I can certainly see why the current API could be considered inconsistent, which isn't great, since a registered `onProgress` callback will never be called for certain `getDocument` calls. The simplest solution to this inconsistency thus seem to be to ensure that `onProgress` is always called when handling the `DataLoaded` message, since that will always be dispatched[1] from the worker-thread. --- [1] Note that this isn't guaranteed to happen, since setting `disableAutoFetch = true` often prevents the entire file from ever loading. However, this isn't relevant for the issue at hand, and is a well-known consequence of using `disableAutoFetch = true`; note how the default viewer even has a specialized code-path for hiding the loadingBar.	2018-10-16 13:51:12 +02:00
Tim van der Meij	9e9426c354	Merge pull request #10143 from Snuffleupagus/getMainThreadWorkerMessageHandler-catch-errors Ensure that `getMainThreadWorkerMessageHandler` won't accidentally break `getDocument` (PR 10139 follow-up)	2018-10-11 00:05:01 +02:00
Jonas Jenwald	0e2c6047e4	Ensure that `getMainThreadWorkerMessageHandler` won't accidentally break `getDocument` (PR 10139 follow-up) This should have been part of PR 10139. In the event that a user has attempted to manually load the worker file on the main-thread, but somehow failed to do that correctly, there's a possibility that `getMainThreadWorkerMessageHandler` could throw. Considering how/where that helper function is being called, an error could still prevent `PDFDocumentLoadingTask` from completing (regardless if it's being resolved/rejected).	2018-10-09 15:44:31 +02:00
Jonas Jenwald	21c8dd4842	Combine the `pdfjsFilePath` and fallback `workerSrc` handling in `src/display/api.js` With the way that the `getWorkerSrc()` helper function is implemented now, there's no longer a particularly strong reason for keeping the global `pdfjsFilePath` variable around. With this patch the fallback `workerSrc` will thus, assuming is wasn't already set, be set to the "pdfjsFilePath" which simplifies the `getWorkerSrc()` function and reduces the amount of global state. Finally, the global `workerSrc` variable was renamed to prevent shadowing.	2018-10-09 13:47:48 +02:00
Tim van der Meij	f45e46d7ad	Merge pull request #10133 from kevinleedrum/fix-content-length Set returnValues.suggestedLength to Content-Length if integer	2018-10-09 00:05:57 +02:00

1 2 3 4 5 ...

786 Commits