pdf.js

Author	SHA1	Message	Date
Rob Wu	44025a3ec1	Explicitly state intended support in compatibility.js Add comments with supported browser versions where missing. Method: - Use MDN compat tables if available. - Otherwise test in Chrome (31+). (The Chrome Web Store does not update older versions of Chrome, so probably nobody is interested in even older versions, even though there is an existing comment for Chrome<29 at `document.currentScript`.)	2018-01-26 12:31:41 +01:00
Jonas Jenwald	d33c763dd5	Re-factor resetting of `StatTimer` instances to fix completely broken benchmarking (PR 9245 follow-up) It turns out that PR 9245 unfortunately broke benchmarking completely, sorry about that! The bug is that we were attempting to reset the current instance of `StatTimer`, instead of creating a new one as was previously done. By resetting the current instance, the `StatTimer` data fetched in `test/driver.js` is now wiped out since it points to the same underlying object. This re-use of a `StatTimer` instance was asked for during review, and unfortunately I didn't test this thoroughly enough before submitting the final version of the PR.[1] --- [1] Note that while I did test the benchmarking scripts with that PR before initially submitting it, I did however forget to do that after addressing the review comments which might explain why this problem went unnoticed.	2018-01-25 19:46:03 +01:00
Jani Pehkonen	5593c970e0	Implement Huffman coding in JBIG2	2018-01-23 17:04:07 +02:00
Jonas Jenwald	f0216484bc	Merge pull request #9383 from Rob--W/better-content-disposition-parser Better content disposition parser	2018-01-21 15:08:14 +01:00
Tim van der Meij	9746646511	Merge pull request #9386 from shikhar-scs/remove-parsejbig2-function removed parseJbig2 function	2018-01-21 14:51:59 +01:00
Rob Wu	a4e907169e	Improve correctness of Content-Disposition parser Re-uses logic from `9f5fcae11c/extension/content-disposition.js` which is already covered by tests: `6f3bbb8bbf`	2018-01-21 13:31:12 +01:00
Jonas Jenwald	fe5102a27f	Merge pull request #9363 from Rob--W/fetch-http/s-only Limit PDFFetchStream to http(s) in the Chrome extension	2018-01-21 11:45:09 +01:00
Shikhar Agnihotri	43e003cf5c	removed parseJbig2 function	2018-01-20 19:49:06 +05:30
Jonas Jenwald	69a8336cf1	Address the final round of review comments for Content-Disposition filename extraction This patch updates the `IPDFStreamReader` interface and ensures that the interface/implementation of `network.js`, `fetch_stream.js`, `node_stream.js`, and `transport_stream.js` all match properly. The unit-tests are also adjusted, to more closely replicate the actual behaviour of the various actual `IPDFStreamReader` implementations. Finally, this patch adjusts the use of the Content-Disposition filename when setting the title in the viewer, and adds `PDFDocumentProperties` support as well.	2018-01-18 17:39:22 +01:00
Juan Salvador Perez Garcia	eb1f6f4c24	Content disposition filename File name is extracted from headers.	2018-01-18 17:38:44 +01:00
shikhar-scs	32080f1081	changed decodeURI to decodeURIComponent	2018-01-15 19:31:25 +05:30
Tim van der Meij	237bc2ef9d	Merge pull request #9323 from juncaixinchi/master Get correct path in node_stream on windows platform( issue #9020)	2018-01-14 15:41:56 +01:00
Rob Wu	1c8cacd6b9	Limit PDFFetchStream to http(s) in the Chrome extension The `fetch` API is only supported for http(s), even in Chrome extensions. Because of this limitation, we should use the XMLHttpRequest API when the requested URL is not a http(s) URL. Fixes #9361	2018-01-14 00:34:46 +01:00
juncaixinchi	8e278d9a45	Get correct path in node_stream on windows platform	2018-01-13 21:13:24 +08:00
Jonas Jenwald	0e1b5589e7	Restore the `btoa`/`atob` polyfills for Node.js These were removed in PR 9170, since they were unused in the browsers that we'll support in PDF.js version `2.0`. However looking at the output of Travis, where a subset of the unit-tests are run using Node.js, there's warnings about `btoa` being undefined. This doesn't appear to cause any errors, which probably explains why we didn't notice this before (despite PR 9201).	2018-01-13 01:31:05 +01:00
Brendan Dahl	d77fc8882a	Merge pull request #9352 from Snuffleupagus/issue-9285 Attempt to actually resolve ColourSpace names in accordance with the specification (issue 9285)	2018-01-12 13:01:22 -08:00
Jonas Jenwald	d0c8992e8a	Attempt to actually resolve ColourSpace names in accordance with the specification (issue 9285) Please refer to the PDF specification, in particular http://www.adobe.com/content/dam/acom/en/devnet/acrobat/pdfs/PDF32000_2008.pdf#G7.3801570 > A colour space shall be specified in one of two ways: > - Within a content stream, the CS or cs operator establishes the current colour space parameter in the graphics state. The operand shall always be name object, which either identifies one of the colour spaces that need no additional parameters (DeviceGray, DeviceRGB, DeviceCMYK, or some cases of Pattern) or shall be used as a key in the ColorSpace subdictionary of the current resource dictionary (see 7.8.3, "Resource Dictionaries"). In the latter case, the value of the dictionary entry in turn shall be a colour space array or name. A colour space array shall never be inline within a content stream. > > - Outside a content stream, certain objects, such as image XObjects, shall specify a colour space as an explicit parameter, often associated with the key ColorSpace. In this case, the colour space array or name shall always be defined directly as a PDF object, not by an entry in the ColorSpace resource subdictionary. This convention also applies when colour spaces are defined in terms of other colour spaces.	2018-01-10 20:20:43 +01:00
Jonas Jenwald	5a52ee0a79	Merge pull request #9350 from janpe2/svg-closeEOFillStroke Implement `closeEOFillStroke` in SVG backend	2018-01-09 21:09:11 +01:00
Brendan Dahl	3925aab010	Merge pull request #9282 from Snuffleupagus/TrueType-Collection Add support for TrueType Collection fonts (issue 9262)	2018-01-09 11:22:52 -08:00
Jani Pehkonen	d1e1dbfc14	Implement `closeEOFillStroke` in SVG backend	2018-01-09 19:42:12 +02:00
Jonas Jenwald	915e3f4c5f	Merge pull request #9099 from tiriana/allow-dontFlip-in-PDFPageProxy-getViewport Allows 'dontFlip' as third arg in PDFPageProxy.getViewport	2018-01-09 18:27:26 +01:00
Radomir Wojtera	3dfc540d04	Allows 'dontFlip' as third argument in PDFPageProxy.getViewport	2018-01-09 13:08:24 +01:00
Jonas Jenwald	d6c028b946	Add support for TrueType Collection fonts (issue 9262) The specification can be found at https://www.microsoft.com/typography/otspec/otff.htm, under the "Font Collections" heading. Fixes 9262.	2018-01-08 22:31:08 +01:00
Tim van der Meij	6b2ed504b7	Merge pull request #9336 from Snuffleupagus/jpx-SIZ Correctly extract component data from "Image and tile size" (SIZ) markers in JPEG 2000 images	2018-01-03 23:34:34 +01:00
Jonas Jenwald	873556865b	Correctly extract component data from "Image and tile size" (SIZ) markers in JPEG 2000 images This is something that I noticed while attempting to debug https://bugzilla.mozilla.org/show_bug.cgi?id=1374945. Just looking at the code, the `YRsiz` parameter seemed immediately wrong and the fact that every component used the same data also looked strange. Comparing with the specification, see https://www.itu.int/rec/dologin_pub.asp?lang=e&id=T-REC-T.800-200208-S!!PDF-E&type=items#page=37, confirmed that this is indeed incorrect. Note that I haven't got any example of a PDF file that is fixed by this patch, but that might be more luck than anything else. Manually checking a couple of files with included JPEG 2000 images, the `Csiz`/`XRsiz`/`YRsiz` parameters were `1` which could explain why this hasn't been an issue before. Obviously we shouldn't generally make changes to `core` code without adding tests, but in this case I'm simply not sure how to obtain/create one. However, since the existing code doesn't make sense this patch could hopefully be deemed acceptable anyway.	2018-01-03 16:26:28 +01:00
Jonas Jenwald	2db75a2a3a	Update the ESLint dependencies, and also tweak the `no-multiple-empty-lines` rules Since multiple empty lines is virtually unused in the code-base, and the few cases that do exist look like "typos", let's enforce greater consistency here; please see https://eslint.org/docs/rules/no-multiple-empty-lines.	2018-01-03 13:32:57 +01:00
Tim van der Meij	d36c46b2c9	Remove the `CustomStyle` class It is only used in a few places to handle prefixing style properties if necessary. However, we used it only for `transform`, `transformOrigin` and `borderRadius`, which according to Can I Use are supported natively (unprefixed) in the browsers that PDF.js 2.0 supports. Therefore, we can remove this class, which should help performance too since this avoids extra function calls in parts of the code that are called often.	2017-12-31 14:22:11 +01:00
Jonas Jenwald	c5700211d6	Adjust `decodeACSuccessive` in src/core/jpg.js to improve the rendering quality of (progressive) JPEG images I've been looking into the remaining point in 8637 about blurry images, to see if we could perhaps improve the rendering quality slightly there. After quite a bit of debugging, it seems that the issue is limited to certain progressive JPEG images. As mentioned previously, I've got no detailed knowledge of the JPEG format, but this patch does seem to improve things quite a bit for the images in question. Squinting at https://searchfox.org/mozilla-central/rev/6c33dde6ca02b389c52e8db3d22494df8b916f33/media/libjpeg/jdphuff.c#492-639, it seems reasonable that we should take the sign of the data into account. Furthermore, looking at the specification in https://www.w3.org/Graphics/JPEG/itu-t81.pdf#page=118, the "F.2.4.3 Decoding the binary decision sequence for non-zero DC differences and AC coefficients" section even contains a description of this (even though I cannot claim to really understand the details).	2017-12-30 15:24:09 +01:00
Jonas Jenwald	d6eed132e5	Correct the indentation in the `switch` statement in `decodeACSuccessive` in src/core/jpg.js	2017-12-30 15:22:30 +01:00
Tim van der Meij	18d82d9c54	Merge pull request #9287 from himanish-star/PDFjs-compatible-Librejs PDFjs now compatible with Librejs	2017-12-30 14:00:25 +01:00
Jonas Jenwald	8c4b7d0439	Avoid truncating JPEG images with DeviceGray ColourSpaces when using the `src/core/jpg.js` built-in decoder The bug that this patch fixes is limited to the built-in JPEG decoder, and was unearthed by PR 9260. The underlying issue has existed since PR 6984, where the contents of this patch ought to have been included (if it weren't for the fact that we had no easy way to test `src/core/jpg.js` back then). Please note: The slight movement in the reference test is a result of using the `src/core/jpg.js` decoder, rather than the native browser one.	2017-12-29 18:44:07 +01:00
Tim van der Meij	25bbff4692	Merge pull request #9320 from Snuffleupagus/pr-9095-followup Avoid rendering errors by passing in the `webGLContext` when creating a new `CanvasGraphics` in `getColorN_Pattern` (PR 9095 follow-up)	2017-12-28 23:17:30 +01:00
Jonas Jenwald	ec21bd9626	Merge pull request #9314 from timvandermeij/encodings Implement unit tests for the encodings and fix missing items	2017-12-27 22:02:38 +01:00
Jonas Jenwald	06605abbc2	Avoid rendering errors by passing in the `webGLContext` when creating a new `CanvasGraphics` in `getColorN_Pattern` (PR 9095 follow-up) This was an oversight in PR 9095, which unfortunately breaks rendering in some PDF files (e.g. the one from issue 6737). It thus appears that we don't have any test-coverage for this code-path, and given the relative complexity of the PDF files affected by this bug I wasn't able to easily create a reduced test-case. Please note: The linked test-case included in this patch is currently not rendered correctly (that'd be the PR 6606), but it at least gives us some test-coverage here.	2017-12-27 13:50:53 +01:00
Tim van der Meij	c7af2db2ec	Implement unit tests for the encodings and fix missing items Initially I just implemented the unit tests, but quickly found that they were failing my expectation of having a size of 256 items. Some of them did contain 256 items and some did not. I looked up various resources and figured that they indeed all need to have 256 items. One of the good resources is https://github.com/davidben/poppler/blob/master/poppler/FontEncodingTables.cc Aside from some missing `notdef` (empty string) entries at the end of the arrays, which I assume causes issues since it may cause out-of-bounds array access which in JavaScript gives `undefined`, there was a `notdef` entry missing in the `MacExpertEncoding`, causing the entries after that to be shifted. This fix for this is similar to the one in #8589. The unit tests verify that, for known encoding names, the return value is not only an array, but that it is also of the right length and contains only strings.	2017-12-24 18:14:40 +01:00
Jonas Jenwald	d4cd44fd16	Add a fallback for non-embedded LucidaSans-Demi fonts (issue 9291) The PDF file in the issue uses a number of embedded versions of Lucida fonts, but for some reason does not embed the LucidaSans-Demi font. According to https://en.wikipedia.org/wiki/Lucida#Usages that one should be bold, so we can at least improve rendering here (even though it won't look perfect). Fixes 9291.	2017-12-24 17:36:58 +01:00
Tim van der Meij	957e2d420d	Implement unit tests for the network utility code This should provide 100% coverage for the file.	2017-12-23 19:24:11 +01:00
Jonas Jenwald	e58f2f513a	[api-major] Remove the unused `encrypted` property from the `pdfInfo` object sent from the worker via the `GetDoc` message I recall being confused as to the purpose of the `encrypted` property all the way back when working on PR 4750. Looking at the history, this property was added in PR 1698 when password support was added to the API/viewer. However, its only purpose seem to have been to facilitate the addition of a `isEncrypted` function in the API. That function never, as far as I can tell, saw any use and was unceremoniously removed in PR 4144. Since we want to avoid sending all non-essential data early during initial document loading (e.g. PR 4750), it seems correct to get rid of the `encrypted` property. Especially since it hasn't even been exposed in the API for over three years, with no complaints that I'm aware of. Finally note that the `encrypt` property on the `XRef` instance isn't tied to the code that's being removed here. Given that we're calling `PDFDocument.parse` during `createDocumentHandler` in the worker which, via `PDFDocument.setup`, calls `XRef.parse` where the `Encrypt` data (if it exists) is always parsed.	2017-12-21 13:10:23 +01:00
Jonas Jenwald	9ff3c6f99d	Remove the `document.readyState` polyfill This is only relevant for browsers that we don't intend to support with PDF.js version `2.0`.	2017-12-19 15:05:19 +01:00
Jonas Jenwald	6af45052c5	Remove the `input.type` polyfill This is only relevant for browsers that we don't intend to support with PDF.js version `2.0`.	2017-12-19 15:05:15 +01:00
Jonas Jenwald	cf88b7b212	Remove the `ImageData.set` polyfill This is only relevant for browsers that we don't intend to support with PDF.js version `2.0`.	2017-12-19 15:05:14 +01:00
Jonas Jenwald	363e517acf	Remove the `HTMLElement.dataset` polyfill This is only relevant for browsers that we don't intend to support with PDF.js version `2.0`.	2017-12-19 14:50:18 +01:00
Jonas Jenwald	4880200cd4	Remove the `XMLHttpRequest.response` polyfill This is only relevant for browsers that we don't intend to support with PDF.js version `2.0`.	2017-12-19 14:48:43 +01:00
Jonas Jenwald	8266cc18e7	Remove the `webkitURL` polyfill This is only relevant for browsers that we don't intend to support with PDF.js version `2.0`.	2017-12-19 14:46:04 +01:00
Soumya Himanish Mohapatra	95ad956f68	PDFjs now compatible with Librejs	2017-12-19 15:13:50 +05:30
Jonas Jenwald	1dc54ddb40	Handle PDF files with missing 'endobj' operators, by searching for the "obj" string rather than "endobj" in `XRef.indexObjects` (issue 9105) This patch refactors the searching for 'endobj', to try and find the next occurrence of "obj" and then check if it was in fact an 'endobj' and continue searching otherwise. This approach is used to avoid having to first find 'endobj', and then re-check the entire contents of the object and having to run (potentially expensive) regular expressions on arbitrarily long strings. Fixes 9105.	2017-12-18 13:17:45 +01:00
Tim van der Meij	6bbe91079b	Merge pull request #9272 from nveenjain/fix/8846 Replaced occurrence of `throw new Error` with `unreachable`	2017-12-15 22:11:32 +01:00
Jonas Jenwald	6515b91118	Merge pull request #9276 from mozilla/loca-fix Fix loca table when offsets aren't in ascending order.	2017-12-15 20:59:42 +01:00
Brendan Dahl	9b51cea724	Fix loca table when offsets aren't in ascending order.	2017-12-15 11:20:28 -06:00
Naveen Jain	1135674647	Replaced occurrence of `throw new Error` with `unreachable` where applicable	2017-12-14 12:58:50 +05:30

3133 Commits