pdf.js

Author	SHA1	Message	Date
Jonas Jenwald	8ec99b200c	Prevent Metadata/XML parsing from breaking `PDFDocumentProxy.getMetadata` when no XML root document is found (issue 8884) With the new XML parser, see PR 9573, the referenced PDF file now causes `getMetadata` to fail when incomplete XML tags are encountered. This provides a simple, and hopefully generally useful, work-around that may also help prevent future bugs. (Without being able to reproduce nor even understand the other (non XML) errors mentioned in issue 8884, I'd say that this patch is enough to close that one as fixed.)	2018-07-18 11:37:40 +02:00
Jonas Jenwald	a9ce4e8417	Stop exposing the `URL` polyfill in the global scope This moves/exposes the `URL` polyfill similarily to the existing `ReadableStream` polyfill, rather than exposing it globally, to avoid interfering with any "outside" code. Both the `URL` and `ReadableStream` polyfills are now exposed on the `pdfjsLib` object, such that they are accessible to the viewer components. Furthermore, the `no-restricted-globals` ESLint rule is also enabled to prevent accidental usage of the native `URL`/`ReadableStream` implementations directly in the `src/` and `web/` folders; see also https://eslint.org/docs/rules/no-restricted-globals Addresses the remaining TODO in https://github.com/mozilla/pdf.js/projects/6	2018-07-04 09:16:28 +02:00
Jonas Jenwald	bf0aca86d7	Fix re-rendering, using the same canvas, when rendering was previously cancelled (PR 8519 follow-up) Currently if `RenderTask.cancel` is called immediately after rendering was started, then by the time that `InternalRenderTask.initializeGraphics` is called rendering will already have been cancelled. However, we're still inserting the canvas into the `canvasInRendering` map, thus breaking any future attempts at re-rendering using the same canvas. Considering that `InternalRenderTask.cancel` always removes the canvas from the map, I cannot imagine that we'd ever want to re-add it after rendering was cancelled (it was likely just a simple oversight in PR 8519). Fixes 9456.	2018-06-28 22:56:37 +02:00
Jonas Jenwald	74e9999044	Add unit-tests for `PDFPageProxy.stats` (PR 9245 follow-up) This wasn't included in PR 9245, since all the API options were still global at that time. Writing the unit-tests also uncovered an issue with `getOperatorList` not starting the "Page Request" timer.	2018-06-25 14:20:49 +02:00
Jonas Jenwald	275834ae66	Clean-up, and add JSDocs to, the `PDFDocumentProxy.loadingParams` method (PR 9830 follow-up)	2018-06-23 13:33:22 +02:00
eugenesqr	331ac8ae74	removed safari compatibility check	2018-06-21 12:57:56 +03:00
Brendan Dahl	a278c5a8dc	Merge pull request #9795 from timvandermeij/object-assign Replace `Util.extendObj` by `Object.assign`	2018-06-20 10:50:40 -07:00
Tim van der Meij	620da6f4df	Merge pull request #9802 from Snuffleupagus/ColorSpace-PDFImage-Uint8ClampedArray Update `ColorSpace` and `PDFImage` to use `Uint8ClampedArray`s and remove manual clamping/rounding	2018-06-16 17:55:10 +02:00
Jonas Jenwald	0958006713	Send `UnsupportedFeature` notification when errors are ignored in `FontFaceObject.getPathGenerator`	2018-06-13 11:02:10 +02:00
Jonas Jenwald	bf0db0fb72	Pass the `ignoreErrors` API option to the `FontFaceObject` constructor, and utilize it in `getPathGenerator` to ignore missing glyphs Obviously it's still not possible to render non-embedded fonts as paths, but in this way the rest of the page will at least be allowed to continue rendering. Please note: Including the 14 standard fonts in PDF.js probably wouldn't be that difficult to implement. (I'm not a lawyer, but the fonts from PDFium could probably be used given their BSD license.) However, the main blocker ought to be the total size of the necessary font data, since I cannot imagine people being OK with shipping ~5 MB of (additional) font data with Firefox. (Based on the reactions when the CMap files were added, and those are only ~1 MB in size.)	2018-06-13 11:02:06 +02:00
Jonas Jenwald	fe288bb872	Refactor the `FontFaceObject.getPathGenerator` method - Reduce the overall indentation level, by making use of early returns. - Replace `var` with `let`.	2018-06-13 11:02:02 +02:00
Jonas Jenwald	778981ec89	Catch, and propagate, errors in the `requestAnimationFrame` branch of `InternalRenderTask._scheduleNext` To support these changes, `InternalRenderTask._next` now returns a Promise.	2018-06-13 11:01:58 +02:00
Jonas Jenwald	731f2e6dfc	Remove manual clamping/rounding from `ColorSpace` and `PDFImage`, by having their methods use `Uint8ClampedArray`s The built-in image decoders are already using `Uint8ClampedArray` when returning data, and this patch simply extends that to the rest of the image/colorspace code. As far as I can tell, the only reason for using manual clamping/rounding in the first place was because TypedArrays used to be polyfilled (using regular arrays). And trying to polyfill the native clamping/rounding would probably have been had too much overhead, but given that TypedArray support is required in PDF.js version `2.0` that's no longer a concern. Please note: Because of different rounding behaviour, basically `Math.round` in `Uint8ClampedArray` respectively `Math.floor` in the old code, there will be very slight movement in quite a few existing test-cases. However, the changes should be imperceivable to the naked eye, given that the absolute difference is at most `1` for each RGB component when comparing `master` and this patch (see also the updated expectation values in the unit-tests).	2018-06-12 11:01:32 +02:00
Brendan Dahl	3ac638fad3	Merge pull request #9689 from RafaPolit/master Fixed critical unhandled promise that prevented error catching using API	2018-06-11 15:40:30 -06:00
Tim van der Meij	af8e88d00b	Replace `Util.extendObj` by `Object.assign`	2018-06-10 20:11:03 +02:00
Tim van der Meij	903bad1906	Remove `Util.appendToArray` and `Util.prependToArray` The former may be replaced by regular JavaScript array concatenation and the latter is unused. This avoids unnecessary function calls/imports.	2018-06-10 15:24:09 +02:00
Jonas Jenwald	07d610615c	Move, and modernize, `Util.loadScript` from `src/shared/util.js` to `src/display/dom_utils.js` Not only is the `Util.loadScript` helper function unused on the Worker side, even trying to use it there would throw an Error (since `document` isn't defined/available in Workers). Hence this helper function is moved, and its code modernized slightly by having it return a Promise rather than needing a callback function. Finally, to reduced code duplication, the "new" loadScript function is exported and used in the viewer.	2018-06-07 13:52:40 +02:00
Jonas Jenwald	547f119be6	Simplify the error handling slightly in the `src/display/node_stream.js` file The various classes have `this._errored` and `this._reason` properties, where the first one is a boolean indicating if an error was encountered and the second one contains the actual `Error` (or `null` initially). In practice this means that errors are basically tracked twice, rather than just once. This kind of double-bookkeeping is generally a bad idea, since it's quite easy for the properties to (accidentally) get into an inconsistent state whenever the relevant code is modified. Rather than using a separate boolean, we can just as well check the "error" property directly (since `null` is falsy). --- Somewhat unrelated to this patch, but `src/display/node_stream.js` is currently not handling errors in a consistent or even correct way; compared with `src/display/network.js` and `src/display/fetch_stream.js`. Obviously using the `createResponseStatusError` utility function, from `src/display/network_utils.js`, might not make much sense in a Node.js environment. However at the very least it seems that `MissingPDFException`, or `UnknownErrorException` when one cannot tell that the PDF file is "missing", should be manually thrown. As is, the API (i.e. `getDocument`) is not returning the expected errors when loading fails in Node.js environments (as evident from the `pending` API unit-test).	2018-06-06 09:05:45 +02:00
Jonas Jenwald	871bf5c68b	Remove the, now obsolete, handling of the `CMapReaderFactory` parameter in `getDocument` This special handling was added in PR 8567, but was made redundant in PR 8721 which stopped sending everything but the kitchen sink to the Worker side.	2018-06-06 08:52:43 +02:00
Jonas Jenwald	c8e2163bbc	Remove incorrect/unnecessary validation of the `verbosity` parameter in the `PDFWorker` constructor (PR 9480 follow-up)	2018-06-06 08:52:43 +02:00
Jonas Jenwald	b263b702e8	Rename `PDFPageProxy.pageInfo` to `PDFPageProxy._pageInfo` to indicate that the property should be considered "private" Since `PDFPageProxy` already provide getters for all the data returned by `GetPage` (in the Worker), there isn't any compelling reason for accessing the `pageInfo` directly on `PDFPageProxy`. The patch also changes the `GetPage` handler, in `src/core/worker.js`, to use modern JavaScript features.	2018-06-06 08:52:42 +02:00
Jonas Jenwald	4f4b50e01e	Rename `PDFDocumentProxy.pdfInfo` to `PDFDocumentProxy._pdfInfo` to indicate that the property should be considered "private" Since `PDFDocumentProxy` already provide getters for all the data returned by `GetDoc` (in the Worker), there isn't any compelling reason for accessing the `pdfInfo` directly on `PDFDocumentProxy`.	2018-06-06 08:52:42 +02:00
Jonas Jenwald	e89afa5899	Stop sending the `PDFManagerReady` message from the Worker, since it's unused in the API After PR 8617 the `PDFManagerReady` message handler function, in `src/display/api.js`, is now a no-op. Hence it seems completely unnecessary to keep sending this message from `src/core/worker.js`.	2018-06-06 08:52:42 +02:00
Jonas Jenwald	eef53347fe	Ensure that the correct data is sent, with the `test` message, from the worker if typed arrays aren't properly supported With native typed array support now being mandatory in PDF.js, since version 2.0, this probably isn't a huge problem even though the current code seems wrong (it was changed in PR 6571). Note how in the `!(data instanceof Uint8Array)` case we're currently attempting to send `handler.send('test', 'main', false);` to the main-thread, which doesn't really make any sense since the signature of the method reads `send(actionName, data, transfers) {`. Hence the data that's actually being sent here is `'main'`, with `false` as the transferList, which just seems weird. On the main-thread, this means that we're in this case checking `data && data.supportTypedArray`, where `data` contains the string `'main'` rather than being falsy. Since a string doesn't have a `supportTypedArray` property, that check still fails as expected but it doesn't seem great nonetheless.	2018-06-06 08:52:42 +02:00
Jonas Jenwald	dc6e1b4176	Use `Uint8ClampedArray` for the image data returned by `JpegDecode`, in src/display/api.js Since all the built-in PDF.js image decoders now return their data as `Uint8ClampedArray`, for consistency `JpegDecode` on the main-thread should be doing the same thing; follow-up to PR 8778.	2018-06-06 08:52:41 +02:00
Jonas Jenwald	47a9d38280	Add more validation in `PDFWorker.fromPort` The signature of the `PDFWorker.fromPort` method, in addition to the `PDFWorker` constructor, was changed in PR 9480. Hence it's probably a good idea to add a bit more validation to `PDFWorker.fromPort`, to ensure that it won't fail silently for an API consumer that updates to version 2.0 of the PDF.js library.	2018-06-06 08:52:41 +02:00
Jonas Jenwald	3c5c8d2a0b	Remove the typed array check when calling `LoopbackPort` in `PDFWorker._setupFakeWorker` With version 2.0, native support for typed arrays is now a requirement for using the PDF.js library; see PR 9094 where the old polyfills were removed. Hence the `isTypedArraysPresent` check, when setting up fake workers, no longer serves any purpose here and can thus be removed.	2018-06-06 08:52:33 +02:00
Jonas Jenwald	89caaf4071	Use `LoopbackPort` in the "message_handler" unit-tests There's no good reason, as far as I can tell, to duplicate the functionality of the `LoopbackPort` in the unit-tests. The only difference between the implementations is that `LoopbackPort` mimics the (native) structured cloning, however that shouldn't matter here since the tests are only sending "simple" data (strings respectively arrays with numbers). Furthermore the patch also changes `LoopbackPort` to default to using "structured cloning" and deferred invocation of the listeners, since native typed array support is now a requirement for using the PDF.js library.	2018-06-04 12:53:08 +02:00
Jonas Jenwald	44d8afd46b	Move `MessageHandler` into a separate `src/shared/message_handler.js` file The `MessageHandler` itself, and its assorted helper functions, are currently the single largest[1] piece of code in the `src/shared/util.js` file. By moving this code into its own file, `src/shared/util.js` thus becomes smaller and more manageable.	2018-06-04 12:53:08 +02:00
Jonas Jenwald	69f2a77543	Update the signature of the `PageViewport` constructor, and improve the JSDoc comments for the class This changes the constructor to take a parameter object, rather than a string of parameters.	2018-06-04 12:53:07 +02:00
Jonas Jenwald	51673dbc5a	Convert the `PageViewport` to a proper ES6 class Also converts all `var` to `let` for good measure.	2018-06-04 12:53:07 +02:00
Jonas Jenwald	5917b21702	Remove completely unused `fontScale` property from `PageViewport` The `fontScale` property was added in PR 1531, see commit `b312719d7e` in particular, apparently for the sole purpose of supporting the "acroforms" example. However, the `fontScale` property was never used anywhere else in the code-base, and after the modernization of the "acroforms" example in PR 8030 it's been completely unused. Finally, note that there's also a (more suitably named) `scale` property on `PageViewport` instances, which contains the exact same information as the property being removed here.	2018-06-04 12:53:07 +02:00
Jonas Jenwald	08c8f8733d	Move `PageViewport` from `src/shared/util.js` to `src/display/dom_utils.js` Since the `PageViewport` is not used in the worker, duplicating this code on both the main and worker sides seems completely unnecessary.	2018-06-04 12:53:07 +02:00
Tim van der Meij	8ce24744f2	Merge pull request #9769 from Snuffleupagus/node-unittest-rm-console-errors Reduce the amount of errors logged, primarily in Node.js/Travis, when running the unit-tests	2018-06-03 19:50:42 +02:00
Rob Wu	0e4e79169b	Fall back to ISO-8859-1 in content_disposition.js Updates content_disposition.js to include `9b789d9b3b`	2018-06-03 16:17:28 +02:00
Rob Wu	e992480baa	Fix multibyte decoding in content_disposition.js I made some mistakes when trying to make the content_disposition.js compatible with non-modern browsers (IE/Edge). Notably, text decoding was usually skipped because of the inverted logical check at the top of `textdecode`. I verified that this new version works as expected, as follows: 1. Visit `55c71eb44e/test/` and get test-content-disposition.js also get test-content-disposition.node.js if using Node.js, or get test-content-disposition.html if you use a browser. 2. Modify `test-content-disposition.node.js` (or the HTML file) and change `../extension/content-disposition.js` to `PDFJS-content_disposition.js` 3. Copy the `getFilenameFromContentDispositionHeader` function from `content_disposition.js` (i.e. the file without the trailing exports) and save it as `PDFJS-content_disposition.js`. 4. Run the tests (`node test-content-disposition.node.js` or by opening `test-content-disposition.html` in a browser). 5. Confirm that there are no failures: "Finished all tests (0 failures)" The code has a best-efforts fallback for Microsoft Edge, which lacks the TextDecoder API. The fallback only supports the common UTF-8 encoding. To simulate this in a test, modify `PDFJS-content_disposition.js` and deliberately throw an error before `new TextDecoder`. There will be two failures because we don't want to include too much code to support text decoding for non-UTF-8 encodings in Edge ``` test-content-disposition.js:265 Assertion failed: Input: attachment; filename=ISO-8859-1''%c3%a4 Expected: "Ã¤" Actual : "ä" test-content-disposition.js:268 Assertion failed: Input: attachment; filename=ISO-8859-1''%e2%82%ac Expected: "â‚¬" Actual : "€" ```	2018-06-03 15:28:22 +02:00
Jonas Jenwald	ef081a0531	Ensure that the `WorkerTransport._passwordCapability` is always rejected, even when errors are thrown in `PDFDocumentLoadingTask.onPassword` callback Please note that while the current code works, both in the viewer and the unit-tests, it can leave the `WorkerTransport._passwordCapability` Promise in a pending state. In the `PasswordRequest` handler, in src/display/api.js, we're returning the Promise from a `capability` object (rather than just a "plain" Promise). While an error thrown anywhere within this handler was fortunately enough to propagate it to the Worker side, it won't cause the Promise (in `WorkerTransport._passwordCapability`) to actually be rejected. Finally note that while we're now catching errors in the `PasswordRequest` handler, those errors are still propagated to the Worker side via the (now) rejected Promise and the existing `return this._passwordCapability.promise;` line. This prevents warnings about uncaught Promises, with messages such as "Error: Worker was destroyed during onPassword callback", when running the unit-tests both in browsers and in Node.js/Travis.	2018-06-03 00:28:40 +02:00
Jonas Jenwald	0ecc22cb04	Attempt to provide better default values for the `disableFontFace`/`nativeImageDecoderSupport` API options in Node.js This should provide a better out-of-the-box experience when using PDF.js in a Node.js environment, since it's missing native support for both `@font-face` and `Image`. Please note that this change only affects the default values, hence it's still possible for an API consumer to override those values when calling `getDocument`. Also, prevents "ReferenceError: document is not defined" errors, when running the unit-tests in Node.js/Travis.	2018-06-03 00:28:37 +02:00
Mukul Mishra	949c3e9417	Add abort functionality in fetch stream	2018-05-22 12:46:59 +05:30
RafaPolit	d63b17dbe3	Fixed critical unhandled promise that prevented error catching using API	2018-04-24 13:10:00 -05:00
Jani Pehkonen	fe2cf2f73f	SVG clip intersections and operators	2018-04-17 19:20:29 +03:00
Brendan Dahl	e8cf7fd512	Merge pull request #9624 from wojtekmaj/no-warning-on-dependency-operator Prevent warning on unimplemented operator thrown for OPS.dependency	2018-04-03 10:55:29 -07:00
Wojciech Maj	acc0a0fe95	Prevent warning on unimplemented operator thrown for OPS.dependency	2018-04-02 14:29:34 +02:00
Wojciech Maj	ea2850e9a7	Fix typos	2018-04-01 23:20:41 +02:00
Tim van der Meij	8887a09e8f	Merge pull request #9588 from swftvsn/patch-1 Improve node.js support	2018-04-01 12:26:39 +02:00
Jonas Jenwald	8b09f7c34e	Clean-up `getMainThreadWorkerMessageHandler` for non-PRODUCTION mode This is a final piece of clean-up of code that I recently wrote, after which I'm done :-) When the `getMainThreadWorkerMessageHandler` function was added, in PR 9385, it did so by basically introducing a `web/app.js` dependency in `src/display/api.js` through the `window.pdfjsNonProductionPdfWorker` property[1]. Even though this is limited to non-`PRODUCTION` mode, i.e. `gulp server`, it still seems unfortunate to have that sort of viewer dependency in the API code itself. With the new, much nicer and shorter, names introduced in PR 9565 we can remove this non-`PRODUCTION` hack and just use `window.pdfjsWorker` in both the viewer and the API regardless of the build mode. --- [1] It didn't seem correct to piggy-back on the `window.pdfjsDistBuildPdfWorker` property in non-`PRODUCTION` mode.	2018-03-29 11:03:47 +02:00
Tim van der Meij	5c1a16ba6e	Merge pull request #9586 from Snuffleupagus/pageSize-api-rotate Ensure that `PDFPageProxy.pageSizeInches` handles non-default /Rotate entries correctly	2018-03-25 18:03:32 +02:00
Jonas Jenwald	d547936827	Ensure that `PDFPageProxy.pageSizeInches` handles non-default /Rotate entries correctly Without this patch, the pageSize will be incorrectly reported for some PDF files. --- Move pageSizeInches to ui_utils	2018-03-25 16:48:29 +02:00
Brendan Dahl	24f766b14d	Merge pull request #9573 from yurydelendik/xml_parser New XML parser	2018-03-21 17:00:00 -07:00
swftvsn	c20426efef	Improve node.js support This change fixes "Unhandled rejection ReferenceError: HTMLElement is not defined" issue that is discussed in more detail in #8489.	2018-03-21 13:43:53 +02:00

1 2 3 4 5 ...

713 Commits