pdf.js

Author	SHA1	Message	Date
Jonas Jenwald	1cf116ab88	Enable the `mozilla/use-includes-instead-of-indexOf` ESLint rule globally This rule is available from https://www.npmjs.com/package/eslint-plugin-mozilla, and is enforced in mozilla-central. Note that we have the necessary `Array`/`String` polyfills and that most cases have already been fixed, see PRs 9032 and 9434.	2018-02-10 23:24:50 +01:00
Jonas Jenwald	2eb29409bc	Enable the `mozilla/avoid-removeChild` ESLint rule globally This rule is available from https://www.npmjs.com/package/eslint-plugin-mozilla, and is enforced in mozilla-central. Note that we have a polyfill for `ChildNode.remove()` and that most cases have already been fixed, see PRs 8056 and 8138.	2018-02-10 23:24:50 +01:00
Tim van der Meij	7bb066494f	Merge pull request #9427 from Snuffleupagus/native-JPEG-decoding-fallback Fallback to the built-in JPEG decoder when browser decoding fails, and attempt to handle JPEG images with DNL (Define Number of Lines) markers (issue 8614)	2018-02-09 21:36:08 +01:00
Jonas Jenwald	ad06979cca	Attempt to unify the `disableRange`/`contentLength` handling in the various network streams First of all, note how in both `fetch_stream.js` and `node_stream.js` we always overwrite the `this._contentLength` property even when the response headers doesn't actually contain any (valid) length information. This could thus result in the `length` parameter, as passed to the network stream, being completely ignored despite having no better information available. Secondly, in `node_stream.js` the `this._isRangeSupported` property wasn't always updated correctly based on the response headers.	2018-02-09 13:50:48 +01:00
Jonas Jenwald	25293628ff	Merge pull request #9459 from tonyjin/respect-worker-src Respect workerSrc if set	2018-02-08 13:57:43 +01:00
Jonas Jenwald	a18c65ae9f	Use the correct stream position when reading `maxSizeOfInstructions` from the `maxp` table (issue 9458) Please refer to the `maxp` table specification, found at https://developer.apple.com/fonts/TrueType-Reference-Manual/RM06/Chap6maxp.html. Fixes 9458.	2018-02-07 21:57:43 +01:00
Tony Jin	3c33f32dff	Respect workerSrc if set Respect user-defined workerSrc over internal overrides.	2018-02-07 11:31:18 -08:00
Jonas Jenwald	bf4166e6c9	Attempt to handle DNL (Define Number of Lines) markers when parsing JPEG images (issue 8614) Please refer to the specification, found at https://www.w3.org/Graphics/JPEG/itu-t81.pdf#page=49 Given how the JPEG decoder is currently implemented, we need to know the value of the scanLines parameter (among others) before parsing of the SOS (Start of Scan) data begins. Hence the best solution I could come up with here, is to re-parse the image in the hopefully rare case of JPEG images that include a DNL (Define Number of Lines) marker. Fixes 8614.	2018-02-05 21:05:32 +01:00
Jonas Jenwald	80441346a3	Fallback to the built-in JPEG decoder if 'JpegStream', in `src/display/api.js`, fails to load the image This works by making `PartialEvaluator.buildPaintImageXObject` wait for the success/failure of `loadJpegStream` on the API side before parsing continues. Please note that in practice, it should be quite rare for the browser to fail loading/decoding of a JPEG image. In the general case, it should thus not be completely surprising if even `src/core/jpg.js` will fail to decode the image.	2018-02-05 21:05:31 +01:00
Jonas Jenwald	76afe1018b	Fallback to built-in image decoding if the `NativeImageDecoder` fails In particular this means that if 'JpegDecode', in `src/display/api.js`, fails we'll fallback to the built-in JPEG decoder.	2018-02-05 17:01:35 +01:00
Jonas Jenwald	2570717e77	Inline the code in `loadJpegStream` at the only call-site in `src/display/api.js`.js` Since `loadJpegStream` is only used at a single spot in the code-base, and given that it's very heavily tailored to the calling code (since it relies on the data structure of `PDFObjects`), this patch simply inlines the code in `src/display/api.js` instead.	2018-02-05 17:01:35 +01:00
Jonas Jenwald	7f73fc9ace	Re-factor `PartialEvaluator.buildPaintImageXObject` to make it asynchronous This is necessary for upcoming changes, which will add fallback code-paths to allow graceful handling of native image decoding failures.	2018-02-05 17:01:35 +01:00
Jonas Jenwald	ec85d5c625	Change the signature of `PartialEvaluator.buildPaintImageXObject` to take a parameter object This method currently requires a fair number of parameters, which creates quite unwieldy call-sites. When invoking `buildPaintImageXObject`, you have to remember not only which arguments to supply, but also the correct order, to prevent run-time errors.	2018-02-05 17:01:35 +01:00
Rob Wu	2f19d9d906	Support spaces and semicolons in filename Imports the following changes: `5b1afa7c29` `7e2e35a38b`	2018-02-04 16:19:40 +01:00
Jonas Jenwald	712090eff8	Upstream the changes from: Bug 1339461 - Convert foo.indexOf(...) == -1 to foo.includes() and implement an eslint rule to enforce this Yet another case where PDF.js code was modified in `mozilla-central` without the changes happening in the GitHub repo first; sigh. If we don't upstream at least the changes in `extensions/firefox/`, any future update of PDF.js in `mozilla-central` will be blocked. Please see: - https://bugzilla.mozilla.org/show_bug.cgi?id=1339461 - https://hg.mozilla.org/mozilla-central/rev/d5a5ad1dbbf2	2018-02-04 14:59:27 +01:00
Jonas Jenwald	9ac9ef8ef1	Polyfill `String.prototype.includes` using core-js See https://github.com/zloirock/core-js#ecmascript-6-string.	2018-02-04 14:31:59 +01:00
Tim van der Meij	73436c0d12	Implement the `AESBaseCipher` class and let the `AES128Cipher` and `AES256Cipher` classes extend it	2018-02-03 20:16:33 +01:00
Tim van der Meij	9a959e4df7	Update the `AES128Cipher` and `AES256Cipher` implementations to be more similar This commit is the first step for extracting a base class for the `AES128Cipher` and the `AES256Cipher` classes. The objective here is to make code changes (not altering the logic) to make the implementations as similar as possible as found by creating a diff of both classes. In particular, we extract the key size and cycles of repetitions constants since they are different for AES-128 and AES-256. Moreover, we rename functions to be similar. In the `AES256Cipher` class, there was an additional assignment to `this` in the decryption function. However, this was unnecessary because the assignment would also be done when the loop was exited.	2018-02-03 20:16:29 +01:00
Jonas Jenwald	f4a95de694	Attempt to find the next valid marker when encountering invalid image data in `JpegImage.parse` (issue 9425) In the JPEG images in the referenced PDF file, the DHT (Define Huffman Tables) segments contain more data than expected based on the length parameter. Fixes 9425.	2018-02-03 16:01:19 +01:00
Jonas Jenwald	39c5e1ed1a	Remove the (unnecessary) `WorkerMessageHandler` variable from the `setupFakeWorkerGlobal()` function in the `src/display/api.js` file	2018-01-31 12:52:10 +01:00
Jonas Jenwald	56a8c934dd	[api-major] Remove the `PDFJS.disableWorker` option Despite this patch removing the `disableWorker` option itself, please note that we'll still fallback to loading the worker file(s) on the main-thread when running in environments without proper Web Worker support. Furthermore it's still possible, even with this patch, to force the use of fake workers by manually loading the necessary file using a `<script>` tag on the main-thread.[1] That way, the functionality of the now removed `SINGLE_FILE` build target and the resulting `build/pdf.combined.js` file can still be achieved simply by adding e.g. `<script src="build/pdf.worker.js"></script>` to the HTML (obviously with the path adjusted as needed). Finally note that the `disableWorker` option is a performance footgun, and unfortunately many existing third-party examples actually use it without providing any sort of warning/justification. --- [1] This approach is used in the default viewer, since certain kind of debugging may be easier if the code is running directly on the main-thread.	2018-01-31 12:52:10 +01:00
Jonas Jenwald	a5aaf62754	[api-minor] Add a (static) `PDFWorker.getWorkerSrc` method that returns the current `workerSrc` This method returns the currently used `workerSrc`, which thus allows obtaining the fallback `workerSrc` value (e.g. when the option wasn't set by the user).	2018-01-31 12:52:07 +01:00
Jonas Jenwald	c56f3f04dd	[api-major] Remove the `SINGLE_FILE` build target Please note that this build target, and the resulting `build/pdf.combined.js` file, is equivalent to setting the `PDFJS.disableWorker` option to `true` which is a performance footgun.	2018-01-29 14:44:44 +01:00
Rob Wu	5d1c541702	Enable some polyfills for compat with Chrome 49 Successfully tested with Chrome 49.	2018-01-26 12:31:41 +01:00
Rob Wu	44025a3ec1	Explicitly state intended support in compatibility.js Add comments with supported browser versions where missing. Method: - Use MDN compat tables if available. - Otherwise test in Chrome (31+) otherwise. (the Chrome Web Store does not update older versions of Chrome, so probably nobody is interested in even older versions, even though there is an existing comment for Chrome<29 at `document.currentScript`).	2018-01-26 12:31:41 +01:00
Jonas Jenwald	d33c763dd5	Re-factor resetting of `StatTimer` instances to fix completely broken benchmarking (PR 9245 follow-up) It turns out that PR 9245 unfortunately broke benchmarking completely, sorry about that! The bug is that we were attempting to reset the current instance of `StatTimer`, instead of creating a new one as was previously done. By resetting the current instance, the `StatTimer` data fetched in `test/driver.js` is now wiped out since it points to the same underlying object. This re-use of a `StatTimer` instance was asked for during review, and unfortunately I didn't test this thoroughly enough before submitting the final version of the PR.[1] --- [1] Note that while I did test the benchmarking scripts with that PR before initially submitting it, I did however forget to do that after addressing the review comments which might explain why this problem went unnoticed.	2018-01-25 19:46:03 +01:00
Jani Pehkonen	5593c970e0	Implement Huffman coding in JBIG2	2018-01-23 17:04:07 +02:00
Jonas Jenwald	f0216484bc	Merge pull request #9383 from Rob--W/better-content-disposition-parser Better content disposition parser	2018-01-21 15:08:14 +01:00
Tim van der Meij	9746646511	Merge pull request #9386 from shikhar-scs/remove-parsejbig2-function removed parseJbig2 function	2018-01-21 14:51:59 +01:00
Rob Wu	a4e907169e	Improve correctness of Content-Disposition parser Re-uses logic from `9f5fcae11c/extension/content-disposition.js` which is already covered by tests: `6f3bbb8bbf`	2018-01-21 13:31:12 +01:00
Jonas Jenwald	fe5102a27f	Merge pull request #9363 from Rob--W/fetch-http/s-only Limit PDFFetchStream to http(s) in the Chrome extension	2018-01-21 11:45:09 +01:00
Shikhar Agnihotri	43e003cf5c	removed parseJbig2 function	2018-01-20 19:49:06 +05:30
Jonas Jenwald	69a8336cf1	Address the final round of review comments for Content-Disposition filename extraction This patch updates the `IPDFStreamReader` interface and ensures that the interface/implementation of `network.js`, `fetch_stream.js`, `node_stream.js`, and `transport_stream.js` all match properly. The unit-tests are also adjusted, to more closely replicate the actual behaviour of the various actual `IPDFStreamReader` implementations. Finally, this patch adjusts the use of the Content-Disposition filename when setting the title in the viewer, and adds `PDFDocumentProperties` support as well.	2018-01-18 17:39:22 +01:00
Juan Salvador Perez Garcia	eb1f6f4c24	Content disposition filename File name is extracted from headers.	2018-01-18 17:38:44 +01:00
shikhar-scs	32080f1081	changed decodeURI to decodeURIComponent	2018-01-15 19:31:25 +05:30
Tim van der Meij	237bc2ef9d	Merge pull request #9323 from juncaixinchi/master Get correct path in node_stream on windows platform( issue #9020)	2018-01-14 15:41:56 +01:00
Rob Wu	1c8cacd6b9	Limit PDFFetchStream to http(s) in the Chrome extension The `fetch` API is only supported for http(s), even in Chrome extensions. Because of this limitation, we should use the XMLHttpRequest API when the requested URL is not a http(s) URL. Fixes #9361	2018-01-14 00:34:46 +01:00
juncaixinchi	8e278d9a45	Get correct path in node_stream on windows platform	2018-01-13 21:13:24 +08:00
Jonas Jenwald	0e1b5589e7	Restore the `btoa`/`atob` polyfills for Node.js These were removed in PR 9170, since they were unused in the browsers that we'll support in PDF.js version `2.0`. However looking at the output of Travis, where a subset of the unit-tests are run using Node.js, there's warnings about `btoa` being undefined. This doesn't appear to cause any errors, which probably explains why we didn't notice this before (despite PR 9201).	2018-01-13 01:31:05 +01:00
Brendan Dahl	d77fc8882a	Merge pull request #9352 from Snuffleupagus/issue-9285 Attempt to actually resolve ColourSpace names in accordance with the specification (issue 9285)	2018-01-12 13:01:22 -08:00
Jonas Jenwald	d0c8992e8a	Attempt to actually resolve ColourSpace names in accordance with the specification (issue 9285) Please refer to the PDF specification, in particular http://www.adobe.com/content/dam/acom/en/devnet/acrobat/pdfs/PDF32000_2008.pdf#G7.3801570 > A colour space shall be specified in one of two ways: > - Within a content stream, the CS or cs operator establishes the current colour space parameter in the graphics state. The operand shall always be name object, which either identifies one of the colour spaces that need no additional parameters (DeviceGray, DeviceRGB, DeviceCMYK, or some cases of Pattern) or shall be used as a key in the ColorSpace subdictionary of the current resource dictionary (see 7.8.3, "Resource Dictionaries"). In the latter case, the value of the dictionary entry in turn shall be a colour space array or name. A colour space array shall never be inline within a content stream. > > - Outside a content stream, certain objects, such as image XObjects, shall specify a colour space as an explicit parameter, often associated with the key ColorSpace. In this case, the colour space array or name shall always be defined directly as a PDF object, not by an entry in the ColorSpace resource subdictionary. This convention also applies when colour spaces are defined in terms of other colour spaces.	2018-01-10 20:20:43 +01:00
Jonas Jenwald	5a52ee0a79	Merge pull request #9350 from janpe2/svg-closeEOFillStroke Implement `closeEOFillStroke` in SVG backend	2018-01-09 21:09:11 +01:00
Brendan Dahl	3925aab010	Merge pull request #9282 from Snuffleupagus/TrueType-Collection Add support for TrueType Collection fonts (issue 9262)	2018-01-09 11:22:52 -08:00
Jani Pehkonen	d1e1dbfc14	Implement `closeEOFillStroke` in SVG backend	2018-01-09 19:42:12 +02:00
Jonas Jenwald	915e3f4c5f	Merge pull request #9099 from tiriana/allow-dontFlip-in-PDFPageProxy-getViewport Allows 'dontFlip' as third arg in PDFPageProxy.getViewport	2018-01-09 18:27:26 +01:00
Radomir Wojtera	3dfc540d04	Allows 'dontFlip' as third argument in PDFPageProxy.getViewport	2018-01-09 13:08:24 +01:00
Jonas Jenwald	d6c028b946	Add support for TrueType Collection fonts (issue 9262) The specification can be found at https://www.microsoft.com/typography/otspec/otff.htm, under the "Font Collections" heading. Fixes 9262.	2018-01-08 22:31:08 +01:00
Tim van der Meij	6b2ed504b7	Merge pull request #9336 from Snuffleupagus/jpx-SIZ Correctly extract component data from "Image and tile size" (SIZ) markers in JPEG 2000 images	2018-01-03 23:34:34 +01:00
Jonas Jenwald	873556865b	Correctly extract component data from "Image and tile size" (SIZ) markers in JPEG 2000 images This is something that I noticed while attempting to debug https://bugzilla.mozilla.org/show_bug.cgi?id=1374945. Just looking at the code, the `YRsiz` parameter seemed immediately wrong and the fact that every component used the same data also looked strange. Comparing with the specification, see https://www.itu.int/rec/dologin_pub.asp?lang=e&id=T-REC-T.800-200208-S!!PDF-E&type=items#page=37, confirmed that this is indeed incorrect. Note that I haven't got any example of a PDF file that is fixed by this patch, but that might be more luck than anything else. Manually checking a couple of files with included JPEG 2000 images, the `Csiz`/`XRsiz`/`YRsiz` parameters were `1` which could explain why this hasn't been an issue before. Obviously we shouldn't generally make changes to `core` code without adding tests, but in this case I'm simply not sure how to obtain/create one. However, since the existing code doesn't make sense this patch could hopefully be deemed acceptable anyway.	2018-01-03 16:26:28 +01:00
Jonas Jenwald	2db75a2a3a	Update the ESLint dependencies, and also tweak the `no-multiple-empty-lines` rules Since multiple empty lines is virtually unused in the code-base, and the few cases that do exist look like "typos", let's enforce greater consistency here; please see https://eslint.org/docs/rules/no-multiple-empty-lines.	2018-01-03 13:32:57 +01:00

... 11 12 13 14 15 ...

3757 Commits