pdf.js

Author	SHA1	Message	Date
Jonas Jenwald	f317a2cb40	Ensure that the DOM event listeners are removed at the end of the relevant `EventBus` unit-tests, to prevent the tests from interfering with each other	2018-09-20 23:12:01 +02:00
Tim van der Meij	99de25d6cc	Implement unit tests for the `isSameOrigin` and `createValidAbsoluteUrl` utility functions Moreover, mark the `isValidProtocol` function as private since it's only used in the utilities file and is not (meant to be) exported.	2018-09-11 16:17:45 +02:00
Jonas Jenwald	6d804d657f	Add initial support for "Whole words" searching in the viewer As outlined in https://bugzilla.mozilla.org/show_bug.cgi?id=1282759 the internal Firefox name for the feature is `entireWord`, hence that name is used here as well for consistency (with "Whole words" being limited to the UI). Given existing limitations of the PDF.js search functionality, e.g. the existing problems of searching across "new lines", there's some edge-cases where "Whole words" searching will ignore (valid) results. However, considering that this is a pre-existing issue related to the way that the find controller joins text-content together, that shouldn't have to block this new feature in my opionion. Please note: In order to enable this feature in the `MOZCENTRAL` version, a small follow-up patch for [PdfjsChromeUtils.jsm](https://hg.mozilla.org/mozilla-central/file/tip/browser/extensions/pdfjs/content/PdfjsChromeUtils.jsm) will be required once this has landed in `mozilla-central`.	2018-09-10 11:59:29 +02:00
Tim van der Meij	66422eb83e	Merge pull request #9340 from brendandahl/private-use Map all glyphs to the private use area and duplicate the first glyph.	2018-09-08 17:51:04 +02:00
Brendan Dahl	b76cf665ec	Map all glyphs to the private use area and duplicate the first glyph. There have been lots of problems with trying to map glyphs to their unicode values. It's more reliable to just use the private use areas so the browser's font renderer doesn't mess with the glyphs. Using the private use area for all glyphs did highlight other issues that this patch also had to fix: * small private use area - Previously, only the BMP private use area was used which can't map many glyphs. Now, the (much bigger) PUP 16 area can also be used. * glyph zero not shown - Browsers will not use the glyph from a font if it is glyph id = 0. This issue was less prevalent when we mapped to unicode values since the fallback font would be used. However, when using the private use area, the glyph would not be drawn at all. This is illustrated in one of the current test cases (issue #8234) where there's an "ä" glyph at position zero. The PDF looked like it rendered correctly, but it was actually not using the glyph from the font. To properly show the first glyph it is always duplicated and appended to the glyphs and the maps are adjusted. * supplementary characters - The private use area PUP 16 is 4 bytes, so String.fromCodePoint must be used where we previously used String.fromCharCode. This is actually an issue that should have been fixed regardless of this patch. * charset - Freetype fails to load fonts when the charset size doesn't match number of glyphs in the font. We now write out a fake charset with the correct length. This also brought up the issue that glyphs with seac/endchar should only ever write a standard charset, but we now write a custom one. To get around this the seac analysis is permanently enabled so those glyphs are instead always drawn as two glyphs.	2018-09-05 14:04:54 -07:00
Tim van der Meij	e812c6e7ac	Use shorter code for failing a test in `test/unit/api_spec.js`	2018-09-02 21:23:09 +02:00
Tim van der Meij	959ed3705b	Implement a permissions API	2018-09-02 21:23:09 +02:00
Jonas Jenwald	0b1f41c5b3	Add general support for re-dispatching events, on `EventBus` instances, to the DOM This patch is the first step to be able to eventually get rid of the `attachDOMEventsToEventBus` function, by allowing `EventBus` instances to simply re-dispatch most[1] events to the DOM. Note that the re-dispatching is purposely implemented to occur after all registered `EventBus` listeners have been serviced, to prevent the ordering issues that necessitated the duplicated page/scale-change events. The DOM events are currently necessary for the `mozilla-central` tests, see https://hg.mozilla.org/mozilla-central/file/tip/browser/extensions/pdfjs/test, and perhaps also for custom deployments of the PDF.js default viewer. Once this have landed, and been successfully uplifted to `mozilla-central`, I intent to submit a patch to update the test-code to utilize the new preference. This will thus, eventually, make it possible to remove the `attachDOMEventsToEventBus` functionality. Please note: I've successfully ran all `mozilla-central` tests locally, with these patches applied. --- [1] The exception being events that originated on the `window` or `document`, since those are already globally available anyway.	2018-08-30 17:28:12 +02:00
Tim van der Meij	1268aea2b6	Merge pull request #9975 from Snuffleupagus/getDestination-refactor Re-factor `destinations`/`getDestination` to reduce unnecessary duplication, and reject non-string inputs	2018-08-12 15:51:58 +02:00
Tim van der Meij	af19ed6ee9	Merge pull request #9822 from timvandermeij/annotations [api-minor] Refactor the annotation code to be asynchronous	2018-08-11 20:39:50 +02:00
Tim van der Meij	bbc769cf81	Convert `test/unit/annotation_spec.js` to ES6 syntax	2018-08-11 19:00:29 +02:00
dmitryskey	3741becb9b	[api-minor] Refactor the annotation code to be asynchronous This commit is the first step towards implementing parsing for the appearance streams of annotations. Co-authored-by: Jonas Jenwald <jonas.jenwald@gmail.com> Co-authored-by: Tim van der Meij <timvandermeij@gmail.com>	2018-08-11 19:00:29 +02:00
Jonas Jenwald	1179584fd6	Reject `getDestination`, in the API, for non-string inputs Note how e.g. the `getPage` method does basic validation of the input.	2018-08-11 16:06:35 +02:00
Jonas Jenwald	f78efd883e	Attempt to throw `MissingPDFException` when applicable in `node_stream.js` (issue 9791)	2018-08-06 10:00:03 +02:00
Tim van der Meij	f6eaa99cb2	Reword test reporter message The font tests use Jasmine too, so while they are technically unit tests, it's a bit confusing to see `Started unit tests` when the font tests are run on the bots.	2018-08-05 21:21:46 +02:00
Tim van der Meij	444976bcd5	Merge pull request #9956 from brendandahl/allow-zero-progress Allow loaded progress of 0 in unit tests.	2018-08-04 00:19:02 +02:00
Tim van der Meij	f19ee127a3	Merge pull request #9874 from boundlesshq/master [api-minor] Include export value for checkboxes	2018-08-03 23:43:23 +02:00
Brendan Dahl	d762567bcf	Allow loaded progress of 0 in unit tests.	2018-08-03 10:31:46 -07:00
Jonas Jenwald	f8388710e6	Add (basic) unit-tests for the non-global `URL` constructor (PR 9868 follow-up) This should really have been included in PR 9868, since it will help ensure that the `URL` constructor is correctly imported/exported by `src/shared/util.js`.	2018-08-02 10:32:06 +02:00
bion	c31ddf7edc	[api-minor] Include export value for checkboxes	2018-07-28 00:30:41 -07:00
Jonas Jenwald	928b89382e	[api-minor] Add an `IsLinearized` property to the `PDFDocument.documentInfo` getter, to allow accessing the linearization status through the API (via `PDFDocumentProxy.getMetadata`) There was a (somewhat) recent question on IRC about accessing the linearization status of a PDF document, and this patch contains a simple way to expose that through already existing API methods. Please note that during setup/parsing in `PDFDocument` the linearization data is already being fetched and parsed, provided of course that it exists. Hence this patch will not cause any additional data to be loaded.	2018-07-26 15:54:19 +02:00
Jonas Jenwald	8ec99b200c	Prevent Metadata/XML parsing from breaking `PDFDocumentProxy.getMetadata` when no XML root document is found (issue 8884) With the new XML parser, see PR 9573, the referenced PDF file now causes `getMetadata` to fail when incomplete XML tags are encountered. This provides a simple, and hopefully generally useful, work-around that may also help prevent future bugs. (Without being able to reproduce nor even understand the other (non XML) errors mentioned in issue 8884, I'd say that this patch is enough to close that one as fixed.)	2018-07-18 11:37:40 +02:00
Jonas Jenwald	61186698c3	Replace the remaining occurences of `instanceof Array` with `Array.isArray()` Follow-up to PRs 8864 and 8813. As explained in https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Global_Objects/Array/isArray, `instanceof Array` can have inconsistent behavior. To ensure that only `Array.isArray` is used, an ESLint plugin/rule is added to enforce this.	2018-07-09 13:17:41 +02:00
Tim van der Meij	99f8f2c275	Merge pull request #9853 from Snuffleupagus/re-render-after-cancel Fix re-rendering, using the same canvas, when rendering was previously cancelled (PR 8519 follow-up)	2018-06-29 23:25:43 +02:00
Tim van der Meij	6fa2c779b5	Merge pull request #9838 from Snuffleupagus/invalid-path-OPS Error, rather than warn, once a number of invalid path operators are encountered in `EvaluatorPreprocessor.read` (bug 1443140)	2018-06-28 23:15:25 +02:00
Jonas Jenwald	bf0aca86d7	Fix re-rendering, using the same canvas, when rendering was previously cancelled (PR 8519 follow-up) Currently if `RenderTask.cancel` is called immediately after rendering was started, then by the time that `InternalRenderTask.initializeGraphics` is called rendering will already have been cancelled. However, we're still inserting the canvas into the `canvasInRendering` map, thus breaking any future attempts at re-rendering using the same canvas. Considering that `InternalRenderTask.cancel` always removes the canvas from the map, I cannot imagine that we'd ever want to re-add it after rendering was cancelled (it was likely just a simple oversight in PR 8519). Fixes 9456.	2018-06-28 22:56:37 +02:00
Jonas Jenwald	74e9999044	Add unit-tests for `PDFPageProxy.stats` (PR 9245 follow-up) This wasn't included in PR 9245, since all the API options were still global at that time. Writing the unit-tests also uncovered an issue with `getOperatorList` not starting the "Page Request" timer.	2018-06-25 14:20:49 +02:00
Jonas Jenwald	7f21e38787	Error, rather than warn, once a number of invalid path operators are encountered in `EvaluatorPreprocessor.read` (bug 1443140) Incomplete path operators, in particular, can result in fairly chaotic rendering artifacts, as can be observed on page four of the referenced PDF file. The initial (naive) solution that was attempted, was to simply throw a `FormatError` as soon as any invalid (i.e. too short) operator was found and rely on the existing `ignoreErrors` code-paths. However, doing so would have caused regressions in some files; see the existing `issue2391-1` test-case, which was promoted to an `eq` test to help prevent future bugs. Hence this patch, which adds special handling for invalid path operators since those may cause quite bad rendering artifacts. You could, in all fairness, argue that the patch is a handwavy solution and I wouldn't object. However, given that this only concerns corrupt PDF files, the way that PDF viewers (PDF.js included) try to gracefully deal with those could probably be described as a best-effort solution anyway. This patch also adjusts the existing `warn`/`info` messages to print the command name according to the PDF specification, rather than an internal PDF.js enumeration value. The former should be much more useful for debugging purposes. Fixes https://bugzilla.mozilla.org/show_bug.cgi?id=1443140.	2018-06-24 16:05:08 +02:00
Jonas Jenwald	6bbcafcd26	Let `Lexer.getNumber` treat a single decimal point as zero (issue 9252) This is consistent with the behaviour in Adobe Reader.	2018-06-20 13:41:21 +02:00
Jonas Jenwald	df4799a12a	Ensure that line-breaks are only skipped after operators in `Lexer.getNumber` (PR 8359 follow-up) With the current code line-breaks are accepted not just after an operator, but after a decimal point as well. When looking at this again, the latter case seems prone to cause false positives and might also interfere with subsequent patches. Hence this is code is adjusted to actually do what the original commit message says, and nothing more.	2018-06-20 13:41:15 +02:00
Jonas Jenwald	731f2e6dfc	Remove manual clamping/rounding from `ColorSpace` and `PDFImage`, by having their methods use `Uint8ClampedArray`s The built-in image decoders are already using `Uint8ClampedArray` when returning data, and this patch simply extends that to the rest of the image/colorspace code. As far as I can tell, the only reason for using manual clamping/rounding in the first place was because TypedArrays used to be polyfilled (using regular arrays). And trying to polyfill the native clamping/rounding would probably have been had too much overhead, but given that TypedArray support is required in PDF.js version `2.0` that's no longer a concern. Please note: Because of different rounding behaviour, basically `Math.round` in `Uint8ClampedArray` respectively `Math.floor` in the old code, there will be very slight movement in quite a few existing test-cases. However, the changes should be imperceivable to the naked eye, given that the absolute difference is at most `1` for each RGB component when comparing `master` and this patch (see also the updated expectation values in the unit-tests).	2018-06-12 11:01:32 +02:00
Jonas Jenwald	32367c5968	Make the `getBytes`/`peekBytes` methods of `Stream`/`DecodeStream`/`ChunkedStream` able to return `Uint8ClampedArray`s The built-in image decoders are already returning data as `Uint8ClampedArray`, and subsequently the JPEG/JBIG2/JPX streams are as well. However, for general streams we obviously don't want to force the use of `Uint8ClampedArray` unless an "Image" is actually being decoded. Hence this patch, which adds a parameter that allows the caller of the `getBytes`/`peekBytes` methods to force a `Uint8ClampedArray` (rather than a `Uint8Array`) to be returned.	2018-06-12 11:01:32 +02:00
Jonas Jenwald	547f119be6	Simplify the error handling slightly in the `src/display/node_stream.js` file The various classes have `this._errored` and `this._reason` properties, where the first one is a boolean indicating if an error was encountered and the second one contains the actual `Error` (or `null` initially). In practice this means that errors are basically tracked twice, rather than just once. This kind of double-bookkeeping is generally a bad idea, since it's quite easy for the properties to (accidentally) get into an inconsistent state whenever the relevant code is modified. Rather than using a separate boolean, we can just as well check the "error" property directly (since `null` is falsy). --- Somewhat unrelated to this patch, but `src/display/node_stream.js` is currently not handling errors in a consistent or even correct way; compared with `src/display/network.js` and `src/display/fetch_stream.js`. Obviously using the `createResponseStatusError` utility function, from `src/display/network_utils.js`, might not make much sense in a Node.js environment. However at the very least it seems that `MissingPDFException`, or `UnknownErrorException` when one cannot tell that the PDF file is "missing", should be manually thrown. As is, the API (i.e. `getDocument`) is not returning the expected errors when loading fails in Node.js environments (as evident from the `pending` API unit-test).	2018-06-06 09:05:45 +02:00
Jonas Jenwald	89caaf4071	Use `LoopbackPort` in the "message_handler" unit-tests There's no good reason, as far as I can tell, to duplicate the functionality of the `LoopbackPort` in the unit-tests. The only difference between the implementations is that `LoopbackPort` mimics the (native) structured cloning, however that shouldn't matter here since the tests are only sending "simple" data (strings respectively arrays with numbers). Furthermore the patch also changes `LoopbackPort` to default to using "structured cloning" and deferred invocation of the listeners, since native typed array support is now a requirement for using the PDF.js library.	2018-06-04 12:53:08 +02:00
Jonas Jenwald	44d8afd46b	Move `MessageHandler` into a separate `src/shared/message_handler.js` file The `MessageHandler` itself, and its assorted helper functions, are currently the single largest[1] piece of code in the `src/shared/util.js` file. By moving this code into its own file, `src/shared/util.js` thus becomes smaller and more manageable.	2018-06-04 12:53:08 +02:00
Jonas Jenwald	11b4613e20	Reduce the amount of console "spam", by ignoring `info`/`warn` calls, when running the unit-tests in Node.js/Travis Compared to running the unit-tests in "regular" browsers, where any console output won't get mixed up with test output, in Node.js/Travis the test output looks quite noisy. By ignoring `info`/`warn` calls, when running unit-tests in Node.js/Travis, the test output is a lot smaller not to mention that any actual failures are more easily spotted.	2018-06-03 00:28:40 +02:00
Ryan Hendrickson	91cbc185da	Add scrolling modes to web viewer In addition to the default scrolling mode (vertical), this commit adds horizontal and wrapped scrolling, implemented primarily with CSS.	2018-05-14 23:10:32 -04:00
Jonas Jenwald	77d025dc14	Move the `isPortraitOrientation` helper function from `web/base_viewer.js` to `web/ui_utils.js` A couple of basic unit-tests are added, and a manual `isLandscape` check (in `web/base_viewer.js`) is also converted to use the helper function instead.	2018-03-25 18:48:53 +02:00
Tim van der Meij	5c1a16ba6e	Merge pull request #9586 from Snuffleupagus/pageSize-api-rotate Ensure that `PDFPageProxy.pageSizeInches` handles non-default /Rotate entries correctly	2018-03-25 18:03:32 +02:00
Tim van der Meij	6cc0efe1cc	Merge pull request #9576 from timvandermeij/versions Update packages	2018-03-25 17:52:26 +02:00
Tim van der Meij	95de23e6e3	Update packages Jasmine had a major version bump and required a few minor changes in our booting code. Most notably, using `pending` in a `describe` block is no longer supported, so we can only return early there. On the positive side, the unit tests now run in a random order by default, which eliminates any dependencies between unit tests. Note that upgrading to Webpack 4 is out of scope for this patch since the bots cannot work well with the newly generated bundles (both browsers on both bots do not react within 120 seconds). Webpack 4 is not faster for us than Webpack 3, so for now there is no need to upgrade.	2018-03-25 16:59:50 +02:00
Jonas Jenwald	d547936827	Ensure that `PDFPageProxy.pageSizeInches` handles non-default /Rotate entries correctly Without this patch, the pageSize will be incorrectly reported for some PDF files. --- Move pageSizeInches to ui_utils	2018-03-25 16:48:29 +02:00
Jonas Jenwald	e0ae157582	[api-minor] Fix various issues related to the pageSize information The `getPageSizeInches` method was implemented on `PDFDocumentProxy`, which seems conceptually wrong since the size property isn't global to the document but rather specific to each page. Hence the method is moved into `PDFPageProxy`, as `get pageSizeInches` instead to address this. Despite the fact that new API functionality was implemented, no unit-tests were added. To prevent issues later on, we should always ensure that new functionality has at least some test-coverage; something that this patch also takes care of. The new `PDFDocumentProperties._parsePageSize` method seemed unnecessary convoluted. Furthermore, in the "no data provided"-case it even returned incorrect data (an array, rather than the expected object). Finally, the fallback strings didn't actually agree with the `en-US` locale. This inconsistency doesn't look too great, and it's thus addressed here as well.	2018-03-18 09:10:19 +01:00
Tim van der Meij	f308d73d40	Implement a single `getInheritableProperty` utility function This function combines the logic of two separate methods into one. The loop limit is also a good thing to have for the calls in `src/core/annotation.js`. Moreover, since this is important functionality, a set of unit tests and documentation is added.	2018-03-03 19:19:39 +01:00
Jonas Jenwald	b8606abbc1	[api-major] Completely remove the global `PDFJS` object	2018-03-01 18:13:27 +01:00
Jonas Jenwald	3c2fbdffe6	Move the `cMapUrl` and `cMapPacked` options from the global `PDFJS` object and into `getDocument` instead	2018-03-01 18:11:16 +01:00
Jonas Jenwald	83d52518da	[api-major] Refactor `PDFWorker` to be initialized with a parameter object, rather than a bunch of regular parameters	2018-02-16 13:22:35 +01:00
Jonas Jenwald	c3c1fc511d	Move the `workerSrc` option from the global `PDFJS` object and into `GlobalWorkerOptions` instead	2018-02-16 13:22:35 +01:00
Rob Wu	a89071bdef	Merge pull request #9470 from Snuffleupagus/issue-4888 Ensure that `JpegImage.getData` returns the correct data length when `forceRGBoutput == true` (issue 4888)	2018-02-16 13:14:21 +01:00
Jonas Jenwald	11ab3b5c00	Ensure that `JpegImage.getData` returns the correct data length when `forceRGBoutput == true` (issue 4888) With PDF.js version `2.0` we'll only support browsers with built-in `TypedArray` functionality, hence there doesn't seem to be any good reason not to implement this now. Fixes 4888.	2018-02-13 20:44:21 +01:00

1 2 3 4 5 ...

452 Commits