pdf.js

Author	SHA1	Message	Date
Jonas Jenwald	b8606abbc1	[api-major] Completely remove the global `PDFJS` object	2018-03-01 18:13:27 +01:00
Jonas Jenwald	3c2fbdffe6	Move the `cMapUrl` and `cMapPacked` options from the global `PDFJS` object and into `getDocument` instead	2018-03-01 18:11:16 +01:00
Jonas Jenwald	83d52518da	[api-major] Refactor `PDFWorker` to be initialized with a parameter object, rather than a bunch of regular parameters	2018-02-16 13:22:35 +01:00
Jonas Jenwald	c3c1fc511d	Move the `workerSrc` option from the global `PDFJS` object and into `GlobalWorkerOptions` instead	2018-02-16 13:22:35 +01:00
Rob Wu	a89071bdef	Merge pull request #9470 from Snuffleupagus/issue-4888 Ensure that `JpegImage.getData` returns the correct data length when `forceRGBoutput == true` (issue 4888)	2018-02-16 13:14:21 +01:00
Jonas Jenwald	11ab3b5c00	Ensure that `JpegImage.getData` returns the correct data length when `forceRGBoutput == true` (issue 4888) With PDF.js version `2.0` we'll only support browsers with built-in `TypedArray` functionality, hence there doesn't seem to be any good reason not to implement this now. Fixes 4888.	2018-02-13 20:44:21 +01:00
Jonas Jenwald	3a6f6d23d6	Move the `externalLinkTarget` and `externalLinkRel` options to `PDFLinkService` options This removes the `PDFJS.externalLinkTarget`/`PDFJS.externalLinkRel` dependency from the viewer components, but please note that as a temporary solution the default viewer still uses it.	2018-02-13 14:28:40 +01:00
Rob Wu	911659cd70	Add tests for file names with spaces and semicolons	2018-02-04 17:58:10 +01:00
Jonas Jenwald	56a8c934dd	[api-major] Remove the `PDFJS.disableWorker` option Despite this patch removing the `disableWorker` option itself, please note that we'll still fall back to loading the worker file(s) on the main-thread when running in environments without proper Web Worker support. Furthermore it's still possible, even with this patch, to force the use of fake workers by manually loading the necessary file using a `<script>` tag on the main-thread.[1] That way, the functionality of the now removed `SINGLE_FILE` build target and the resulting `build/pdf.combined.js` file can still be achieved simply by adding e.g. `<script src="build/pdf.worker.js"></script>` to the HTML (obviously with the path adjusted as needed). Finally note that the `disableWorker` option is a performance footgun, and unfortunately many existing third-party examples actually use it without providing any sort of warning/justification. --- [1] This approach is used in the default viewer, since certain kinds of debugging may be easier if the code is running directly on the main-thread.	2018-01-31 12:52:10 +01:00
Jonas Jenwald	a5aaf62754	[api-minor] Add a (static) `PDFWorker.getWorkerSrc` method that returns the current `workerSrc` This method returns the currently used `workerSrc`, which thus allows obtaining the fallback `workerSrc` value (e.g. when the option wasn't set by the user).	2018-01-31 12:52:07 +01:00
Jonas Jenwald	42c71cd99f	Utilize `PDFNodeStream` to run more API unit-tests on Node.js/Travis	2018-01-28 17:14:08 +01:00
Jonas Jenwald	f0216484bc	Merge pull request #9383 from Rob--W/better-content-disposition-parser Better content disposition parser	2018-01-21 15:08:14 +01:00
Rob Wu	a4e907169e	Improve correctness of Content-Disposition parser Re-uses logic from `9f5fcae11c/extension/content-disposition.js` which is already covered by tests: `6f3bbb8bbf`	2018-01-21 13:31:12 +01:00
Jonas Jenwald	fe5102a27f	Merge pull request #9363 from Rob--W/fetch-http/s-only Limit PDFFetchStream to http(s) in the Chrome extension	2018-01-21 11:45:09 +01:00
Rob Wu	0ffe9b9289	Remove useless test from network_utils_spec.js Remove "returns null when content disposition is form-data". The name of the test is already misleading: It suggests that the return value is null if the Content-Disposition starts with "form-data". This is not the case, anything with the "filename" parameter is accepted. So, to correct this, one would have to rephrase the test description to "returns null when content disposition has no filename". But this is already tested by the test called "gets the filename from the response header". So, remove the test.	2018-01-19 17:28:47 +01:00
Jonas Jenwald	69a8336cf1	Address the final round of review comments for Content-Disposition filename extraction This patch updates the `IPDFStreamReader` interface and ensures that the interface/implementation of `network.js`, `fetch_stream.js`, `node_stream.js`, and `transport_stream.js` all match properly. The unit-tests are also adjusted, to more closely replicate the actual behaviour of the various actual `IPDFStreamReader` implementations. Finally, this patch adjusts the use of the Content-Disposition filename when setting the title in the viewer, and adds `PDFDocumentProperties` support as well.	2018-01-18 17:39:22 +01:00
Juan Salvador Perez Garcia	eb1f6f4c24	Content disposition filename File name is extracted from headers.	2018-01-18 17:38:44 +01:00
Rob Wu	1c8cacd6b9	Limit PDFFetchStream to http(s) in the Chrome extension The `fetch` API is only supported for http(s), even in Chrome extensions. Because of this limitation, we should use the XMLHttpRequest API when the requested URL is not a http(s) URL. Fixes #9361	2018-01-14 00:34:46 +01:00
Jonas Jenwald	0e1b5589e7	Restore the `btoa`/`atob` polyfills for Node.js These were removed in PR 9170, since they were unused in the browsers that we'll support in PDF.js version `2.0`. However, looking at the output of Travis, where a subset of the unit-tests are run using Node.js, there are warnings about `btoa` being undefined. This doesn't appear to cause any errors, which probably explains why we didn't notice this before (despite PR 9201).	2018-01-13 01:31:05 +01:00
Jonas Jenwald	915e3f4c5f	Merge pull request #9099 from tiriana/allow-dontFlip-in-PDFPageProxy-getViewport Allows 'dontFlip' as third arg in PDFPageProxy.getViewport	2018-01-09 18:27:26 +01:00
Radomir Wojtera	3dfc540d04	Allows 'dontFlip' as third argument in PDFPageProxy.getViewport	2018-01-09 13:08:24 +01:00
Jonas Jenwald	2db75a2a3a	Update the ESLint dependencies, and also tweak the `no-multiple-empty-lines` rules Since multiple empty lines is virtually unused in the code-base, and the few cases that do exist look like "typos", let's enforce greater consistency here; please see https://eslint.org/docs/rules/no-multiple-empty-lines.	2018-01-03 13:32:57 +01:00
Tim van der Meij	c7af2db2ec	Implement unit tests for the encodings and fix missing items Initially I just implemented the unit tests, but quickly found that they were failing my expectation of having a size of 256 items. Some of them did contain 256 items and some did not. I looked up various resources and figured that they indeed all need to have 256 items. One of the good resources is https://github.com/davidben/poppler/blob/master/poppler/FontEncodingTables.cc Aside from some missing `notdef` (empty string) entries at the end of the arrays, which I assume causes issues since it may cause out-of-bounds array access which in JavaScript gives `undefined`, there was a `notdef` entry missing in the `MacExpertEncoding`, causing the entries after that to be shifted. The fix for this is similar to the one in #8589. The unit tests verify that, for known encoding names, the return value is not only an array, but that it is also of the right length and contains only strings.	2017-12-24 18:14:40 +01:00
Tim van der Meij	957e2d420d	Implement unit tests for the network utility code This should provide 100% coverage for the file.	2017-12-23 19:24:11 +01:00
Jonas Jenwald	ad5ed37059	Handle broken, Ghostscript generated, Metadata that contains HTML character names (bug 1424938) Please note that while this could be considered a regression in user-facing behaviour, I'm not convinced that it's really a regression as such since prior to PR 8912 the Metadata would fail to parse (with an XML error) and thus be ignored when setting the viewer title. With the refactored Metadata parsing we're now able to parse this, which uncovered issues with a subset of broken Ghostscript Metadata that uses HTML character names. Fixes https://bugzilla.mozilla.org/show_bug.cgi?id=1424938	2017-12-13 14:32:47 +01:00
Tim van der Meij	c35bbd11b0	Use native `Math` functions in the custom `log2` function It is quite confusing that the custom function is called `log2` while it actually returns the ceiling value and handles zero and negative values differently than the native function. To resolve this, we add a comment that explains these differences and make the function use the native `Math` functions internally instead of using our own custom logic. To verify that the function does what we expect, we add unit tests. All browsers except for IE support `Math.log2` for quite a long time already (see https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Global_Objects/Math/log2). For IE, we use the core-js polyfill. According to the microbenchmark at https://jsperf.com/log2-pdfjs/1, using the native functions should also be faster, in my testing almost six times as fast.	2017-12-10 16:35:17 +01:00
Jonas Jenwald	a5e3261b48	Merge pull request #9062 from mozilla/no_high Move char codes from high surrogate pair range into private use.	2017-12-08 12:31:22 +01:00
Brendan Dahl	306999c325	Move char codes from high surrogate pair range into private use. Fixes #2884	2017-12-07 10:35:50 -08:00
Tim van der Meij	a29c2d9944	Implement unit tests for utility functions that perform type/value checks	2017-12-04 22:36:45 +01:00
Tim van der Meij	70a28ab34f	Implement unit tests for the utility functions `bytesToString` and `stringToBytes`	2017-12-03 12:52:16 +01:00
Tim van der Meij	25b07812b9	Sanitize the display value for choice widget annotations	2017-11-18 20:37:27 +01:00
Tim van der Meij	edaf4b3173	Merge pull request #9037 from Snuffleupagus/refactor-streams-params Re-factor how parameters are passed to the network streams	2017-11-18 15:41:15 +01:00
Tim van der Meij	9686f6652c	Merge pull request #9089 from yurydelendik/rm-chunks Extracts OperatorList class and prepares for streaming	2017-11-13 23:35:40 +01:00
Jonas Jenwald	23699cef1c	Re-factor how parameters are passed to the network streams This patch is the result of me starting to look into moving parameters from `PDFJS` into `getDocument` and other API methods. When familiarizing myself with the code, the signatures of the various network streams seemed to be unnecessarily cumbersome since `disableRange` is currently handled separately from other parameters. I'm assuming that the explanation for this is probably "for historical reasons", as is often the case. Hence I'd like to clean this up before we start the larger, and more invasive, `PDFJS` parameter re-factoring.	2017-11-11 11:23:29 +01:00
Yury Delendik	85f544f55a	Moves OperatorList and QueueOptimizer into separate file.	2017-10-30 13:29:58 -05:00
Yury Delendik	b4e25fb2e8	Merge remote-tracking branch 'mozilla/version-2.0' into v2	2017-10-27 14:01:45 -05:00
Brendan Dahl	6b12612a52	Sanitize name index in compile phase of CFF. Fixes #8960	2017-10-23 17:13:49 -07:00
Tim van der Meij	17cc94db4e	Merge pull request #9034 from Snuffleupagus/javascript-null [api-major] Change `getJavaScript` to return `null`, rather than an empty Array, when no JavaScript exists	2017-10-17 21:58:45 +02:00
Tim van der Meij	7d7edd9cc6	[api-major] Remove the `PDFJS_NEXT` option Nothing uses this option anymore, so setting it is a no-op now. We can safely remove it. Use `SKIP_BABEL` (instead of `PDFJS_NEXT`) now if you want to skip Babel translation for a build.	2017-10-16 23:16:51 +02:00
Jonas Jenwald	1cd1582cb9	[api-major] Change `getJavaScript` to return `null`, rather than an empty Array, when no JavaScript exists Other API methods already return `null`, rather than empty Arrays/Objects, hence it makes sense to change `getJavaScript` to be consistent.	2017-10-15 22:17:14 +02:00
Jonas Jenwald	33b1d1b20a	Fix a `PDFHistory` regression with document hashes of the `nameddest=...` form Unfortunately I've just found out that this isn't working entirely correctly; my apologies for accidentally breaking this in PR 8775. Compare e.g. this link: http://mirrors.ctan.org/info/lshort/english/lshort.pdf#page.157, with this one: http://mirrors.ctan.org/info/lshort/english/lshort.pdf#nameddest=page.157. Notice how in the second case, the history stops working correctly. The various edge-case regressions in the new `PDFHistory` code are reminding me why I put off the rewrite for so long :-(	2017-10-09 21:58:54 +02:00
Jonas Jenwald	b1472cddbb	Allow `getOperatorList`/`getTextContent` to skip errors when parsing broken XObjects (issue 8702, issue 8704) This patch makes use of the existing `ignoreErrors` property in `src/core/evaluator.js`, see PRs 8240 and 8441, thus allowing us to attempt to recover as much as possible of a page even when it contains broken XObjects. Fixes 8702. Fixes 8704.	2017-09-29 17:14:21 +02:00
Jonas Jenwald	b8ec518a1e	Split the existing `PDFFunction` in two classes, a private `PDFFunction` and a public `PDFFunctionFactory`, and utilize the latter in `PDFDocument` to allow various code to access the methods of `PDFFunction` Follow-up to PR 8909. This requires us to pass around `pdfFunctionFactory` to quite a lot of existing code, however I don't see another way of handling this while still guaranteeing that we can access `PDFFunction` as freely as in the old code. Please note that the patch passes all tests locally (unit, font, reference), and I very much hope that we have sufficient test-coverage for the code in question to catch any typos/mistakes in the re-factoring.	2017-09-29 15:30:53 +02:00
Jonas Jenwald	a159c4f357	Check that `this.baseUrl` is defined before attempting to fetch any data in `DOMCMapReaderFactory`/`NodeCMapReaderFactory`	2017-09-28 12:34:57 +02:00
Tim van der Meij	2281061882	Enable metadata unit tests for Travis CI and Node.js	2017-09-19 23:09:07 +02:00
Tim van der Meij	d4309614f9	Replace `DOMParser` with `SimpleXMLParser` The `DOMParser` is most likely overkill and may be less secure. Moreover, it is not supported in Node.js environments. This patch replaces the `DOMParser` with a simple XML parser. This should be faster and gives us Node.js support for free. The simple XML parser is a port of the one that existed in the examples folder with a small regex fix to make the parsing work correctly. The unit tests are extended for increased test coverage of the metadata code. The new method `getAll` is provided so the example does not have to access internal properties of the object anymore.	2017-09-19 23:09:07 +02:00
Tim van der Meij	320779e6ed	Merge pull request #8691 from timvandermeij/square-circle-annotations Implement support for square and circle annotations	2017-09-09 22:56:54 +02:00
Tim van der Meij	f7fd1db52f	Introduce `DOMSVGFactory` This patch provides a new unit tested factory for creating SVG containers and elements. This code is duplicated twice in the codebase, but with upcoming changes this would need to be duplicated even more. Moreover, consolidating this code in one factory allows us to replace it easily for e.g., supporting Node.js. Therefore, move this to a central place and update/ES6-ify the related code. Finally, we replace `setAttributeNS` with `setAttribute` because no namespace is provided.	2017-09-09 21:36:27 +02:00
Jonas Jenwald	5565a6f8bf	Slightly refactor the pages rotation handling code in the viewer This changes both `PDFViewer` and `PDFThumbnailViewer` to return early in the `pagesRotation` setters if the rotation doesn't change. It also fixes an existing issue, in `PDFViewer`, that would cause errors if the rotation changes before the scale has been set to a non-default value. Finally, in preparation for subsequent patches, it also refactors the rotation code in `web/app.js` to update the thumbnails and trigger rendering with the new `rotationchanging` event.	2017-09-09 11:27:05 +02:00
Tim van der Meij	1c9af00bee	Merge pull request #8775 from Snuffleupagus/rewrite-PDFHistory-2 Re-write `PDFHistory` from scratch	2017-09-03 20:38:59 +02:00
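
Note on commit c35bbd11b0 (custom `log2`): the message describes a helper that returns the ceiling of the base-2 logarithm and treats zero and negative input differently from the native `Math.log2`. The following is a minimal sketch of that described behaviour, not the code from the patch. The name `ceilLog2` is hypothetical, and returning 0 for non-positive input is an assumption, since the message only says those inputs are handled differently.

```javascript
// Hypothetical helper illustrating the behaviour described in c35bbd11b0.
// Assumption: non-positive input yields 0 (native Math.log2 would give
// -Infinity for 0 and NaN for negative numbers).
function ceilLog2(x) {
  if (x <= 0) {
    return 0;
  }
  // Use the native function and round up to the next integer.
  return Math.ceil(Math.log2(x));
}

console.log(ceilLog2(3)); // ceiling of log2(3)
console.log(ceilLog2(0)); // non-positive input, handled by the guard above
```

Rounding up gives the number of bits needed to distinguish `x` values, which is the usual reason for wanting a ceiling rather than the raw logarithm.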

408 Commits