pdf.js

Author	SHA1	Message	Date
Jonas Jenwald	fb774a65b0	Avoid truncating/breaking some Type3 glyphs in `compileType3Glyph` (bug 1245391, issue 10568) Hopefully this patch makes sense, since I cannot claim to fully understand this function. With the changes made in PR 3354 some Type3 glyph outlines are no longer rendering correctly, since the final paths were being accidentally ignored. The fact that Type3 fonts are not very common in PDF documents, and that most Type3 glyphs are unaffected by this regression, probably explains why this has gone unnoticed since 2013.	2019-02-21 23:29:43 +01:00
Jonas Jenwald	b6d090cc14	Fallback to the built-in font renderer when font loading fails After PR 9340 all glyphs are now re-mapped to a Private Use Area (PUA) which means that if a font fails to load, for whatever reason[1], all glyphs in the font will now render as Unicode glyph outlines. This obviously doesn't look good, to say the least, and might be seen as a "regression" since previously many glyphs were left in their original positions which provided a slightly better fallback[2]. Hence this patch, which implements a general fallback to the PDF.js built-in font renderer for fonts that fail to load (i.e. are rejected by the sanitizer). One caveat here is that this only works for the Font Loading API, since it's easy to handle errors in that case[3]. The solution implemented in this patch does not in any way delay the loading of valid fonts, which was the problem with my previous attempt at a solution, and will only require a bit of extra work/waiting for those fonts that actually fail to load. Please note: This patch doesn't fix any of the underlying PDF.js font conversion bugs that's responsible for creating corrupt font files, however it does improve rendering in a number of cases; refer to this possibly incomplete list: [Bug 1524888](https://bugzilla.mozilla.org/show_bug.cgi?id=1524888) Issue 10175 Issue 10232 --- [1] Usually because the PDF.js font conversion code wasn't able to parse the font file correctly. [2] Glyphs fell back to some default font, which while not accurate was more useful than the current state. [3] Furthermore I'm not sure how to implement this generally, assuming that's even possible, and don't really have time/interest to look into it either.	2019-02-11 10:27:08 +01:00
Jonas Jenwald	13230a1123	Remove the ability to pass in more than one font to `BaseFontLoader.bind` - The only existing call-site, of this method, is never passing more than one font at a time anyway. - As far as I can remember, this functionality has never actually been used (caveat: I didn't check the git history). - This allows simplification of the method, especially by making use of the fact that it's now asynchronous. - It should be just as easy to call `BaseFontLoader.bind` from within a loop, rather than having the loop in the method itself.	2019-02-10 21:09:57 +01:00
Jonas Jenwald	af3fcca88d	Convert `BaseFontLoader.bind` to be async, and only utilize `BaseFontLoader._queueLoadingCallback` when actually necessary Currently all fonts are using the `_queueLoadingCallback` method to determine when they have been loaded[1]. However in most cases this is just adding unnecessary overhead, especially with `BaseFontLoader.bind` now being asynchronous, given how fonts are loaded: - For fonts loaded using the Font Loading API, it's already possible to easily tell when a font has been loaded simply by checking the `loaded` promise on the FontFace object itself. - For browsers, e.g. Firefox, which support synchronous font loading it's already assumed that fonts are immediately available. Hence the `_queueLoadingCallback` method is moved into the `GenericFontLoader`, such that it's only utilized for fonts which are loaded using CSS. --- [1] In the "fonts loaded using CSS" case, this is already a hack anyway as outlined in the comments.	2019-02-10 21:09:57 +01:00
Jonas Jenwald	614e502227	[api-minor] Remove the `document.currentScript` polyfill This polyfill is currently used in only one file, i.e. `src/display/api.js`, and only when trying to build a fallback `workerSrc` path. Given that the global `workerSrc` should always be set[1] when using the PDF.js library[2], and that the fallback `workerSrc` should only be regarded as a best-effort solution anyway, there isn't a particularily strong reason to keep the compatibility code in my opinion. --- [1] Other supported options include setting the global `workerPort`, or passing in a `PDFWorker` instance as part of the `getDocument` call. [2] Which is clearly mentioned in the JSDocs in `src/display/worker_options.js`.	2019-02-03 14:09:24 +01:00
Jonas Jenwald	5081063b9e	Attempt to clean-up/restore pending rendering operations when errors occurs while a `RenderTask` runs (PR 10202 follow-up) This piggybacks of the existing `cancel` functionality, to ensure that any pending operations are closed and that any temporary canvases are actually being removed. Also simplifies `finishPaintTask` in `PDFPageView.draw` slightly, by converting it to an async function.	2019-01-26 16:02:51 +01:00
Jonas Jenwald	01d624f6a0	Add an `Array.from` polyfill, using core-js, and remove some compatibility hacks from the `src/display/content_disposition.js` file	2019-01-20 08:49:20 +01:00
Jonas Jenwald	9f45f8dfda	When parsing Metadata, attempt to remove "junk" before the first tag (PR 10398 follow-up) This will allow the Metadata to be successfully extracted from the PDF file in issue 10395. Furthermore, this patch also fixes a bug in `Metadata.get` which causes the method to return `null` rather than an empty string or zero (since either ought to be allowed).	2019-01-16 12:44:27 +01:00
Jonas Jenwald	e8f4b47d59	Prevent errors, in `SimpleXMLParser.onEndElement`, when the stack has already been completely parsed (issue 10410) The error was triggered for a particular set of metadata, where an end tag was encountered without the corresponding begin tag being present in the data. (The patch also fixes a minor oversight, from a recent PR, in the `SimpleDOMNode.nextSibling` method.)	2019-01-05 11:15:34 +01:00
Jonas Jenwald	6cd9ff48f3	Prevent errors, because of incorrect scope, in the `XMLParserBase._resolveEntities` method (issue 10407)	2019-01-04 10:13:32 +01:00
Tim van der Meij	1b84b2ed60	Merge pull request #10398 from Snuffleupagus/issue-10395 Prevent errors in various methods in `SimpleDOMNode` when the `childNodes` property is not defined (issue 10395)	2019-01-01 16:22:11 +01:00
Jonas Jenwald	d371d23382	Prevent errors in various methods in `SimpleDOMNode` when the `childNodes` property is not defined (issue 10395) Given that the issue, as filed, is incomplete since no PDF file was provided for debugging, this patch is really the best that we can do here. Please note: This patch will not enable the Metadata to be successfully parsed, but it should at least prevent the errors.	2018-12-31 13:07:15 +01:00
Tim van der Meij	5b57e69da2	Optimize `CanvasGraphics.setFont` to avoid intermediate string creation This method creates quite a few intermediate strings on each call and it's called often, even for smaller documents like the Tracemonkey document. Scrolling from top to bottom in that document resulted in 14126 strings being created in this method. With this commit applied, this is reduced to 2018 strings.	2018-12-30 14:58:32 +01:00
Tim van der Meij	95f9075565	Optimize `TextLayerRenderTask._layoutText` to avoid intermediate string creation This method creates quite a few intermediate strings on each call and it's called often, even for smaller documents like the Tracemonkey document. Scrolling from top to bottom in that document resulted in 12936 strings being created in this method. With this commit applied, this is reduced to 3610 strings.	2018-12-30 14:39:08 +01:00
Tim van der Meij	103f4616ac	Merge pull request #10334 from Snuffleupagus/OpenAction-dest [api-minor] Add support for OpenAction destinations (issue 10332)	2018-12-23 20:49:50 +01:00
Jonas Jenwald	f0719ed565	[api-minor] Change the `getViewport` method, on `PDFPageProxy`, to take a parameter object rather than a bunch of (randomly) ordered parameters If, as PR 10368 suggests, more parameters should be added to `getViewport` I think that it would be a mistake to not change the signature first to avoid needlessly unwieldy call-sites. To not break any existing code and third-party use-cases, this is obviously implemented with a deprecation warning and with a working fallback[1] for the old method signature. --- [1] This is limited to `GENERIC` builds, which should be sufficient.	2018-12-21 11:55:20 +01:00
Jonas Jenwald	b05f053287	[api-minor] Add support for OpenAction destinations (issue 10332) Note that the OpenAction dictionary may contain other information besides just a destination array, e.g. instructions for auto-printing[1]. Given first of all that an arbitrary `Dict` cannot be sent from the Worker (since cloning would fail), and second of all that the data obviously needs to be validated, this patch purposely only adds support for fetching a destination from the OpenAction entry[2]. --- [1] This information is, currently in PDF.js, being included through the `getJavaScript` API method. [2] This significantly reduces the complexity of the implementation, which seems fine for now. If there's ever need for other kinds of OpenAction to be fetched, additional API methods could/should be implemented as necessary (could e.g. follow the `getOpenActionWhatever` naming scheme).	2018-12-19 11:45:16 +01:00
Jani Pehkonen	ddabeb0645	Handle line width of zero in SVG	2018-12-04 16:05:32 +02:00
Jonas Jenwald	ac6b94c9dd	Replace the remaining occurences, in `src/display/api.js`, of `var` with `let`/`const`	2018-11-18 19:08:27 +01:00
Jonas Jenwald	061f7bd2f3	Convert `PDFWorker`, in `src/display/api.js`, to an ES6 class Also changes all occurrences of `var` to `let`/`const` in this code.	2018-11-18 19:08:27 +01:00
Jonas Jenwald	02e77a39ec	Convert `InternalRenderTask`, in `src/display/api.js`, to an ES6 class This changes all occurrences of `var` to `let`/`const` in this code, and updates the signature of the constructor to use object destructuring for better readability (and self documentation). Also, `useRequestAnimationFrame` is changed to a parameter and the `typeof window` check is now done once rather than at every `_scheduleNext` call.	2018-11-18 19:08:27 +01:00
Jonas Jenwald	5a0d64a6de	Convert `PDFPageProxy`, in `src/display/api.js`, to an ES6 class This changes all occurrences of `var` to `let`/`const` in this code, and updates the signatures of a couple of methods to use object destructuring. Finally, when creating `InternalRenderTask` instances only the necessary parameter are now provided, since passing through the `RenderParameters` as-is seems completely unnecessary.	2018-11-18 19:08:25 +01:00
Jonas Jenwald	2c003a82d5	Convert `RenderTask`, in `src/display/api.js`, to an ES6 class Also deprecates the `then` method, in favour of the `promise` getter.	2018-11-18 19:08:00 +01:00
Jonas Jenwald	ef8e5fd77c	Convert `PDFDocumentLoadingTask`, in `src/display/api.js`, to an ES6 class Also deprecates the `then` method, in favour of the `promise` getter.	2018-11-18 19:07:57 +01:00
PalmerAL	5f15dc2023	Use `span` instead of `div` in the text layer This improves copy/pasting text content since it reduces the amount of unnecessary newlines.	2018-11-18 15:54:08 +01:00
Jonas Jenwald	60da2d882b	[api-minor] Refactor/simplify the `PDFObject` class First of all, note how there's currently two methods for checking if a certain object exists, which seems completely unwarranted. Furthermore, the rarely used `getData` method was removed and its only callsite changed to use a combination of `PDFObjects.{has, get}` instead. Finally, the methods were rearranged slightly, to bring the most important ones (for an API user) to the top of the class.	2018-11-08 10:13:39 +01:00
Jonas Jenwald	d32321d84f	Convert `PDFObjects`, in `src/display/api.js`, to an ES6 class Also changes all occurrences of `var` to `const`, and marks internal properties/methods as "private".	2018-11-08 10:11:40 +01:00
Tim van der Meij	ec76aa531e	Merge pull request #10202 from Snuffleupagus/issue-10200 Attempt to clean-up/restore pending rendering operations on `RenderTask.cancel` (issue 10200)	2018-11-02 23:11:47 +01:00
Jonas Jenwald	f23dba1c10	Change `canvasInRendering` to a `WeakSet` instead of a `WeakMap` Note how nowhere in the code `canvasInRendering.get()` is ever called, and that this structure is really only used to store references to `<canvas>` DOM elements. The reason for this being a `WeakMap` is probably because at the time we weren't using `core-js` polyfills yet, and since there already existed a manually implemented `WeakMap` polyfill it was probably simpler to use that.	2018-10-31 18:15:23 +01:00
Jonas Jenwald	f77b463339	Attempt to clean-up/restore pending rendering operations on `RenderTask.cancel` (issue 10200) Please note that, given the lack of a runnable example, I'm not totally sure if this first of all is enough to completely address the issue as filed and second of all if we actually want this new behaviour.	2018-10-31 16:22:17 +01:00
Tim van der Meij	ed4ac1bc67	Merge pull request #10162 from janpe2/svg-normalize-bbox Normalize BBox of form XObjects in SVG back-end	2018-10-28 13:18:48 +01:00
Jani Pehkonen	9cd5f94f03	Normalize the BBox of form XObjects on the /core side	2018-10-22 14:17:05 +03:00
Jonas Jenwald	5bb7f4b615	Convert `PDFDataRangeTransport` to an ES6 class	2018-10-20 17:15:27 +02:00
Jonas Jenwald	327f2eb588	Ensure that `onProgress` is always called when the entire PDF file has been loaded, regardless of how it was fetched (issue 10160) Please note: I'm totally fine with this patch being rejected, and the issue closed as WONTFIX; however these changes should address the issue if that's desired. From a conceptual point of view, reporting loading progress doesn't really make a lot of sense for PDF files opened by passing raw binary data directly to `getDocument` (since obviously all data was loaded). This is compared to PDF files loaded via e.g. `XMLHttpRequest` or the Fetch API, where the entire PDF file isn't available from the start and knowing the loading progress makes total sense. However I can certainly see why the current API could be considered inconsistent, which isn't great, since a registered `onProgress` callback will never be called for certain `getDocument` calls. The simplest solution to this inconsistency thus seem to be to ensure that `onProgress` is always called when handling the `DataLoaded` message, since that will always be dispatched[1] from the worker-thread. --- [1] Note that this isn't guaranteed to happen, since setting `disableAutoFetch = true` often prevents the entire file from ever loading. However, this isn't relevant for the issue at hand, and is a well-known consequence of using `disableAutoFetch = true`; note how the default viewer even has a specialized code-path for hiding the loadingBar.	2018-10-16 13:51:12 +02:00
Tim van der Meij	9e9426c354	Merge pull request #10143 from Snuffleupagus/getMainThreadWorkerMessageHandler-catch-errors Ensure that `getMainThreadWorkerMessageHandler` won't accidentally break `getDocument` (PR 10139 follow-up)	2018-10-11 00:05:01 +02:00
Jonas Jenwald	0e2c6047e4	Ensure that `getMainThreadWorkerMessageHandler` won't accidentally break `getDocument` (PR 10139 follow-up) This should have been part of PR 10139. In the event that a user has attempted to manually load the worker file on the main-thread, but somehow failed to do that correctly, there's a possibility that `getMainThreadWorkerMessageHandler` could throw. Considering how/where that helper function is being called, an error could still prevent `PDFDocumentLoadingTask` from completing (regardless if it's being resolved/rejected).	2018-10-09 15:44:31 +02:00
Jonas Jenwald	21c8dd4842	Combine the `pdfjsFilePath` and fallback `workerSrc` handling in `src/display/api.js` With the way that the `getWorkerSrc()` helper function is implemented now, there's no longer a particularly strong reason for keeping the global `pdfjsFilePath` variable around. With this patch the fallback `workerSrc` will thus, assuming is wasn't already set, be set to the "pdfjsFilePath" which simplifies the `getWorkerSrc()` function and reduces the amount of global state. Finally, the global `workerSrc` variable was renamed to prevent shadowing.	2018-10-09 13:47:48 +02:00
Tim van der Meij	f45e46d7ad	Merge pull request #10133 from kevinleedrum/fix-content-length Set returnValues.suggestedLength to Content-Length if integer	2018-10-09 00:05:57 +02:00
Kevin Lee Drum	4cf10ac79d	set returnValues.suggestedLength to Content-Length if integer	2018-10-07 13:26:29 -04:00
Jonas Jenwald	755c6edc5e	Ensure that the `PDFDocumentLoadingTask` is rejected when "setting up fake worker" failed (issue 10135) This should, hopefully, cover all the possible ways[1] in which "fake workers" are loaded. Given the different code-paths, adding unit-tests might not be that simple. Note that in order to make this work, the various `fakeWorkerFilesLoader` functions were converted to return `Promises`. --- [1] Unfortunately there's lots of them, for various build targets and configurations.	2018-10-06 13:18:51 +02:00
Simon Leblanc	b5806735d8	Add support of Ink annotation	2018-10-03 00:28:49 +02:00
Tim van der Meij	138324502c	Merge pull request #10119 from Snuffleupagus/rm-onFileAttachmentAnnotation Attempt to simplify the `fileattachmentannotation` event dispatching	2018-10-02 23:25:22 +02:00
Jonas Jenwald	d60ce998f1	Attempt to simplify the `fileattachmentannotation` event dispatching This attempts to reduced the level of indirection, and the amount of code, when dispatching `fileattachmentannotation` events, by removing the `PDFLinkService.onFileAttachmentAnnotation` method and just accessing `PDFLinkService.eventBus` directly in the `FileAttachmentAnnotationElement` constructor. Given that other properties, such as `externalLinkTarget`/`externalLinkRel`, are already being accessed directly this pattern seems fine here as well.	2018-10-01 15:09:08 +02:00
Jonas Jenwald	435ec6a0d5	Use the Font Loading API in `MOZCENTRAL` builds, and `GENERIC` builds for Firefox version 63 and above (issue 9945)	2018-09-29 16:05:00 +02:00
Jonas Jenwald	05b021bcce	Refactor the `FontLoader` into proper, build-specific, ES6 classes Also changes `var` to `let`/`const` in code already touched in the patch, and makes use of template strings in a few spots.	2018-09-29 16:05:00 +02:00
Jonas Jenwald	45d6651976	Refactor unused `Date.now()` calls in `FontLoader.queueLoadingCallback` The `started` timestamp is completely usused, and the `end` timestamp is currently[1] being used essentially like a boolean value. Hence this code can be simplified to use an actual boolean value instead, which avoids potentially hundreds (or even thousands) of unnecessary `Date.now()` calls. --- [1] Looking briefly at the history of this code, I cannot tell if the timestamps themselves were ever used for anything (except for tracking "boolean" state).	2018-09-29 15:57:04 +02:00
Jonas Jenwald	ad3e937816	Replace the `Font.loading` property with, the already existing, `Font.missingFile` property The `Font.loading` property is only ever used once in the code, whereas `Font.missingFile` is more widely used. Furthermore the name `loading` feels, at least to me, slight less clear than `missingFile`. Finally, note that these two properties are the inverse of each other.	2018-09-29 15:57:04 +02:00
Jonas Jenwald	caf90ff6ee	Convert `FontFaceObject` to an ES6 class Also changes `var` to `let`/`const` in code already touched in the patch, and makes use of template strings in a few spots.	2018-09-29 15:57:04 +02:00
Jonas Jenwald	842e9206c0	Replace `String.prototype.substr()` occurrences with `String.prototype.substring()` As outlined in https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Global_Objects/String/substr, which refers to the ECMA-262 specification, using the `substr` function is advised against. Hence this PR, which replaces all remaining `substr` occurrences with `substring` instead. Please refer to https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Global_Objects/String/substr#Syntax respectively https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Global_Objects/String/substring#Syntax for the differences between the two functions. Note that in most cases in the code-base there's only one argument passed to `substr`, and those require no other changes except replacing "substr" with "substring". For the other cases, the `substr(start, length)` calls are changed to `substring(start, start + length)` instead.	2018-09-28 11:41:07 +02:00
Tim van der Meij	9a115b41de	Merge pull request #10034 from timvandermeij/canvas-workaround Remove `getSinglePixelWidth` workaround	2018-09-09 17:36:04 +02:00

1 2 3 4 5 ...

824 Commits