This code was added in PR 1214, but was made obsolete by PRs 1488/1493. Prior to the latter ones, `Dict_get` returned the raw objects. However, afterwards (and currently) `Dict_get` resolves indirect objects, which makes `Parser_fetchIfRef` redundant.
*Potential risks with this patch:*
This patch passes all tests locally, but there's a *small* possibility that it could break some weird PDF files.
In the current code, wrapping `Dict_get` inside `Parser_fetchIfRef` will potentially mean two back-to-back calls of `XRef_fetch`, if a reference points directly to another reference. I'm not sure if this can actually happen in practice, and I'd think that if that were the case we'd already have run into it elsewhere in the code-base, given that `Parser` is the only place where we try to "double" resolve references.
Fixes https://bugzilla.mozilla.org/show_bug.cgi?id=1200096.
The problematic font has a `format 2` cmap, which we've never supported properly. Prior to PR 2606, we were able to fall back to a working state, despite not having proper support for that cmap format.
Obviously the best/correct solution would be to implement actual support for more cmap formats[1]. However, I'm hoping that a simple patch will be OK for now, given that:
- `format 2` cmaps seem to be quite rare in practice, given that this was broken for 2.5 years before anyone noticed.
- Having a simple patch will make potential uplifts a lot easier.
[1] See the specification at https://developer.apple.com/fonts/TrueType-Reference-Manual/RM06/Chap6cmap.html
Having a warning here would have meant that issue 6360 could have been solved in approximately five minutes, instead of an hour. To avoid that happening again, this patch adds a warning whenever we treat a stream as empty.
This patch improves the detection of `xref` in files where it is followed by an arbitrary whitespace character (not just a line-breaking char).
It also adds a check for missing whitespace, e.g. `1 0 obj<<`, to speed up `readToken` for the PDF file in the referenced issue.
Finally, the patch also replaces a bunch of magic numbers with suitably named constants.
Fixes 5752.
Also improves 6243, but there are still issues.
The problem with the PDF files in the issue, besides the obviously broken XRef tables which we're able to recover from, is that many/most of the streams have Dictionaries where the `Length` entry is set to `0`. This causes us to return a `NullStream`, instead of the appropriate stream type, in `Parser_makeFilter`.
Fixes 6360.
In some cases, such as when a CSP header is in use, constructing a function from a
string of JavaScript is not allowed. However, compiling the various commands
that need to be executed on the canvas element is faster than interpreting them.
This patch changes the font renderer to instead emit commands that are compiled
by the font loader. If, during compilation, we receive an EvalError, we fall
back to interpreting them.
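Roughly, the fallback looks like the sketch below; the command objects, helper names, and the assumption of numeric arguments are illustrative, not the patch's actual shapes:
function compileGlyphCommands(cmds) {
  // Build one function body from the recorded canvas commands
  // (assuming numeric arguments). Under a CSP that forbids eval,
  // `new Function` throws an EvalError.
  var body = cmds.map(function (c) {
    return 'ctx.' + c.cmd + '(' + c.args.join(',') + ');';
  }).join('\n');
  return new Function('ctx', body);
}
function getGlyphExecutor(cmds) {
  try {
    return compileGlyphCommands(cmds); // Fast path: compiled.
  } catch (e) {
    if (!(e instanceof EvalError)) {
      throw e;
    }
    // CSP disallows compilation; fall back to interpreting.
    return function (ctx) {
      for (var i = 0; i < cmds.length; i++) {
        ctx[cmds[i].cmd].apply(ctx, cmds[i].args);
      }
    };
  }
}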
Serialize errors to make sure that the callback is still invoked when
an error is thrown.
Firefox:
"DataCloneError: The object could not be cloned."
Chrome:
"DataCloneError: Failed to execute 'postMessage' on 'WorkerGlobalScope': An object could not be cloned."
CMaps may be sparse. Array.prototype.forEach is terribly slow in Chrome
(and also in Firefox) when the sparse array contains a key with a high
value. E.g.
console.time('forEach sparse');
var a = [];
a[0xFFFFFF] = 1;
a.forEach(function(){});
console.timeEnd('forEach sparse');
// Chrome: 2890ms
// Firefox: 1345ms
Switching to CMap.prototype.forEach, which is optimized for such
scenarios, fixes the problem.
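A sketch of such a forEach, assuming the CMap stores its codes in a plain map-like object (`this._map` is illustrative):
CMap.prototype.forEach = function (callback) {
  var map = this._map;
  // Object.keys returns only the entries that actually exist, so one
  // entry at 0xFFFFFF costs one iteration instead of ~16 million.
  var keys = Object.keys(map);
  for (var i = 0; i < keys.length; i++) {
    var key = keys[i];
    callback(key | 0, map[key]);
  }
};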
pi is an index in the stream and is explained on page 201 of the 32000-spec (however 1-based there), and ps is an index to something in PDF.js. I used the code from flag 0 (which works) to understand which is which. It is also important to understand that for flags 1, 2 and 3, the stream is always assigned to the same coordinates and colors. What changes is which "old" coordinates and colors are assigned to what is "missing" in the stream. This is why, for these flags, the code is identical except for the assignments in the first "row" (same principle as in #6304). Note that this change will not improve the lamp_cairo.pdf file, only the two files mentioned in #6305.
Short story: somebody got lost in two different indices. pi is an index in the stream and is explained on page 198 of the 32000-spec (however 1-based there), and ps is an index to something in PDF.js. I used the code from flag 0 (which works) to understand which is which. It is also important to understand that for flags 1, 2 and 3, the stream is always assigned to the same coordinates and colors. What changes is which "old" coordinates and colors are assigned to what is "missing" in the stream. This is why, for these flags, the code is identical except for the assignments in the first "row".
Currently, `src/core/core.js` uses the `fromRef` method on an `Annotation` object to obtain the right annotation type object (such as `LinkAnnotation` or `TextAnnotation`). That method in turn uses a method `getConstructor` to find out which annotation type object must be returned.
Aside from the fact that there is currently a lot of code to achieve this, these methods should not be part of the base `Annotation` class at all. Creation of annotation objects should be done by a factory (as also recommended by @yurydelendik at https://github.com/mozilla/pdf.js/pull/5218#issuecomment-52779659) that handles finding out the correct annotation type object and returning it. This patch implements this separation of concerns.
Doing this allows us to also simplify the code quite a bit and to make it more readable. Additionally, we are now able to get rid of the hardcoded array of supported annotation types. The factory takes care of checking the annotation types and falls back to returning the base annotation type (and issuing a warning, which the current code does not do well either) when an annotation type is unsupported.
I have manually tested this commit with 20 test PDFs with different annotation types, such as /Link, /Text, /Widget, /FileAttachment and /FreeText. All render identically before and after the patch, and unsupported annotation types are now properly indicated with a warning in the console.
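In condensed, sketch form (constructor signatures simplified, and only two types shown):
var AnnotationFactory = {
  create: function (xref, ref) {
    var dict = xref.fetchIfRef(ref);
    var subtype = dict.get('Subtype');
    subtype = isName(subtype) ? subtype.name : '';
    switch (subtype) {
      case 'Link':
        return new LinkAnnotation(dict);
      case 'Text':
        return new TextAnnotation(dict);
      default:
        // Unsupported type: warn and fall back to the base type.
        warn('Unimplemented annotation type: ' + subtype);
        return new Annotation(dict);
    }
  }
};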
This patch refactors the code responsible for setting the annotation's rectangle. Its goal is to:
- Check that the input array actually is an array, and if so, that it contains exactly four elements.
- Only call `normalizeRect` if the input array is valid, i.e., we do not call it for the default rectangle anymore.
Unit tests are provided just like with the other patches in this series.
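For reference, the new logic amounts to roughly the following sketch (method and helper names follow the existing code, but are simplified):
Annotation.prototype.setRectangle = function (rectangle) {
  if (isArray(rectangle) && rectangle.length === 4) {
    // Valid input; normalize it so the rectangle is well-formed.
    this.rectangle = Util.normalizeRect(rectangle);
  } else {
    // Invalid input; use the default rectangle without normalizing.
    this.rectangle = [0, 0, 0, 0];
  }
};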
Fixes #6106.
To avoid future regressions, two new unit tests were added:
1. A new PDF based on the report from #6106, which contains an
OpenAction of type JavaScript and a string "this.print({...}".
2. An existing PDF from https://bugzil.la/1001080 (from #4698).
Although it does not matter, since we don't execute the JavaScript code,
I have also changed "print(true)" to "print({})" since the print method
takes an object (not a boolean). See "Printing PDF documents", page 62:
http://adobe.com/content/dam/Adobe/en/devnet/acrobat/pdfs/js_developer_guide.pdf
Basic mathematics would suggest that a double negative should always become positive, but it appears that Adobe Reader simply ignores that case. Hence I think that it makes sense for us to do the same.
Fixes 6218.
When the parser finds a stream, it retrieves the Length from the stream
dictionary and advances the lexer to the offset as specified in Length.
If this Length is incorrect, the lexer could end up anywhere.
When the lexer gets into an invalid state, it can throw errors. For
example, in issue 6108, the lexer ends up inside the stream data. This
stream has the ASCIIHexDecode filter, so all of its data consists of
ASCII characters, and the lexer interprets it as a command token. Tokens
cannot be longer than 127 bytes, so eventually 128 bytes are consumed
and the lexer throws a "Command token too long" error.
Another possible error is "Illegal character: 41", which occurs when the
lexer happens to end up at a ')' due to the length mismatch.
These problems are solved by catching lexer errors and recovering the
parser via the existing stream length detection branch.
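Schematically the recovery looks as follows, where `skipToEndstream` is a hypothetical stand-in for the existing stream length detection code (a sketch, not the patch's exact structure):
try {
  // If Length was wrong, the lexer may be anywhere by now, and
  // reading the next token can throw (e.g. "Command token too long").
  this.shift(); // Expected to be the "endstream" keyword.
} catch (e) {
  // Recover by scanning the raw bytes for the real "endstream".
  length = skipToEndstream(stream, startPos);
}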
Xref offsets are relative to the start of the PDF data, not to the start
of the PDF file. This is clear if you look at the other code:
- In XRef's readXRefTable and processXRefTable methods, the
offset of an xref entry is set to the bytes as given by the PDF file.
These values are always relative to the start of the PDF file (%PDF-).
- The XRef's readXRef method adds the start offset of the stream to
Xref entry's offset: "stream.pos = startXRef + stream.start".
Clearly, this line assumes that the entry offset excludes the start
offset.
However, when the PDF is parsed in recovery mode, the xref table is
filled with entries whose offset is relative to the start of the stream
rather than the PDF file. This is incorrect, and the fix is to subtract
the start offset of the stream from the entry's byte offset.
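The core of the fix is a one-line adjustment in the recovery code (variable names illustrative):
// Offsets found while scanning the raw data are relative to the start
// of the stream; xref entries must be relative to "%PDF-".
this.entries[num] = {
  offset: position - stream.start,
  gen: 0,
  uncompressed: true
};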
The manually created PDF file serves as a regression test. It is a valid
PDF, except:
- The integer pointing to the start of the xref table and the %%EOF
trailer are missing. This will activate recovery mode in PDF.js.
- Some junk was added before the start of the PDF file. This exposes the
bad offset bug.
The PDF specification (cited below) gives the maximum length of a name
in bytes as a minimum architectural limit. This means that PDF *writers*
should not create names that exceed 127 bytes.
It does not forbid PDF *readers* from accepting such names, though.
These names are only used internally to link PDF objects to other
objects. For these use cases, the lengths of the names do not really
matter. Hence I have changed the implementation to treat long names as
warnings instead of errors.
> (7.3.5) The length of a name shall be subject to an implementation
> limit; see Annex C.
>
> (Annex C.2) Table C.1 describes the minimum architectural limits that
> should be accommodated by conforming readers running on 32-bit
> machines. Because conforming readers may be subject to these limits,
> conforming writers producing PDF files should remain within them.
>
> (Table C.1) name 127 "Maximum length of a name, in bytes."
http://adobe.com/content/dam/Adobe/en/devnet/acrobat/pdfs/PDF32000_2008.pdf
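The change in the lexer is then as small as the following sketch (the buffer name is illustrative):
if (strBuf.length > 127) {
  // Exceeds the spec's minimum architectural limit for readers, but
  // the name is only used internally, so warn instead of throwing.
  warn('Name token is longer than allowed by the spec: ' + strBuf.length);
}
return Name.get(strBuf.join(''));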
For named destinations that are contained in a `Dict`, as opposed to a `NameTree`, we currently iterate through the *entire* dictionary just to fetch *one* destination.
This code appears to simply have been copy-pasted from the `get destinations` method, but in its current form it's quite unnecessary/inefficient, since we can just get the required destination directly instead.
Doing this helped uncover an issue with the `getDestination` implementation.
Currently, if a named destination doesn't exist, the method (in `obj.js`) may return `undefined`, which leads to the promise being stuck in a pending state.
*Note:* returning `null` for this case is consistent with other methods, e.g. `getOutline` and `getAttachments`.
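A sketch of the direct lookup for the `Dict` case (the surrounding method is simplified):
Catalog.prototype.getDestination = function (destinationId) {
  var dests = this.catDict.get('Dests');
  if (isDict(dests) && dests.has(destinationId)) {
    // Fetch only the requested destination, instead of iterating
    // through the entire dictionary.
    return dests.get(destinationId);
  }
  return null; // Consistent with e.g. `getOutline` and `getAttachments`.
};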
This became obsolete in bdeca30fbf. All it does is call the Annotation constructor and add hasHtml. This patch lets the Link and Text annotations directly extend the Annotation class and add hasHtml themselves.
This patch also removes an unused global.
Recently I've landed a number of patches which fixed issues with ColorSpaces. In most of these cases the cause of the failures was, either partially or entirely, that we didn't resolve indirect objects (i.e. the code was missing `xref.fetchIfRef(...)`).
The purpose of this patch is to fix the few remaining cases where indirect objects *could* potentially cause failures.
Given that we have seen how this causes failures in practice, I thus think that it makes sense to try and avoid further issues, instead of waiting for users to file even more bugs for this part of the code-base.
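The recurring pattern, using a ColorSpace defined by an array as an example (the exact spot is illustrative):
// A ColorSpace is often defined by an array, e.g. [/Indexed base
// hival lookup], whose elements may be indirect references; resolve
// them before use:
var base = xref.fetchIfRef(cs[1]);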
Currently non-embedded ArialBlack fonts are not rendered bold enough, compared to e.g. Adobe Reader.
The issue is that we set the font weight to `bolder`, but since that is actually relative to the font weight of the parent, the result is that there's no practical difference from just using `bold`.
This patch attempts to address that, by explicitly setting the font weight to the maximum value instead (see https://developer.mozilla.org/en-US/docs/Web/CSS/font-weight).
*Note:* I expect one test "failure" in `issue5801`, which in this case is an improvement, since that PDF file uses ArialBlack.
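In sketch form, where the canvas font string is built (surrounding details omitted):
// "bolder" is relative to the parent element's weight; the explicit
// numeric 900 always selects the heaviest available face.
var bold = fontObj.black ? '900' : (fontObj.bold ? 'bold' : 'normal');
ctx.font = italic + ' ' + bold + ' ' + size + 'px ' + fontFamily;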
Fixes 6068.
The most notable issue with the font in question is that the `differences` array contains lots of strange entries (of the type `uniXXXX`, instead of proper glyph names).
The 'Version' field of the most recent document catalog, if present, is
intended to supersede the value in the file prologue.
This is significant for incrementally-built PDF documents and generators that
emit a low version in the prologue and later apply a format version based on
PDF features used, such as Apple's CoreGraphics/Quartz PDF backend.
Fixes the internal version variable, as well as the PDFFormatVersion reported
by the API and consumed by viewers.
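In sketch form (property and variable names illustrative):
// `pdfFormatVersion` was parsed from the "%PDF-1.x" prologue earlier.
var version = catalog.catDict.get('Version');
if (isName(version)) {
  // The document catalog's /Version entry supersedes the prologue.
  pdfFormatVersion = version.name;
}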
For passwords where the encoding is already correct, the conversion is a no-op.
Also, since `encodeURIComponent` might throw, we need to make sure that we handle that case too.
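A sketch of the guarded conversion (the `unescape(encodeURIComponent())` idiom yields a UTF-8 byte string; the surrounding logic is simplified):
var passwordBytes;
try {
  // Throws a URIError if the password contains lone surrogates,
  // i.e. is not well-formed UTF-16.
  passwordBytes = stringToBytes(unescape(encodeURIComponent(password)));
} catch (ex) {
  warn('The password is not valid UTF-8; using it as-is.');
  passwordBytes = stringToBytes(password);
}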
Fixes 6010.
Fixes https://bugzilla.mozilla.org/show_bug.cgi?id=1050040.
With this patch the file is completely readable, but given that the font is broken enough to be rejected by OTS the rendering differs slightly from Adobe Reader.
*Note:* the PDF file is sufficiently broken that even Adobe Reader complains about the font, *and* also about another more general issue.
Instead of trying to hack around various browser defects, let's just disable PresentationMode in the affected browsers. This patch:
- Disables PresentationMode in IE11+ when the viewer is embedded; fixes 4711.
Set transformation matrix in (polyfilled) mozPrintCallback when a scale
is applied. Removed _scaleX and _scaleY in favor of _transformMatrix to
emphasize that the caller MUST ensure that the state of the matrix is
correct before `addContextCurrentTransform` is called.
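Usage then looks like this (the scale values are whatever the print code applies):
// The caller MUST set the complete current transform first...
ctx._transformMatrix = [scaleX, 0, 0, scaleY, 0, 0];
// ...so the wrapped transform-tracking methods start from the right state.
addContextCurrentTransform(ctx);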
Currently, if a font contains a `toUnicode` entry, we always create a new `ToUnicodeMap` in evaluator.js. This is done even for `IdentityV/IdentityH`, despite the possibility of using the much more compact `IdentityToUnicodeMap` representation.
This patch refactors the `IdentityH/IdentityV` cases, to:
- Avoid calling `IdentityCMap.getMap`, since this prevents allocating and iterating through an array with 65536 elements.
- Ensure that the handling of `toUnicode` is actually correct in fonts.js.
We rely on `toUnicode instanceof IdentityToUnicodeMap` in a few places, and currently this does not work correctly for `IdentityH/IdentityV`.
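In sketch form, the evaluator.js change amounts to the following (the constructor arguments are an assumption):
var cmapName = cmapObj.name;
if (cmapName === 'Identity-H' || cmapName === 'Identity-V') {
  // Compact representation; avoids materializing and iterating an
  // array with 65536 elements via IdentityCMap.getMap.
  return new IdentityToUnicodeMap(0, 0xFFFF);
}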
This is a very small follow-up to PR 5536; it sets `isStandardFont` to `false` instead of `undefined` (as currently happens for some font names).
Since the patch is so small, I hope it's OK to also fix an unrelated copy-and-paste error in a comment that was added in PR 5260.
This change does the following:
* Address the TODO to remove the getEmptyContainer helper.
* Do not set the container background color. The old code is incorrect,
  causing it to not have any effect: it sets the color to an array
  (item.color) rather than a CSS string. Also, in most cases it would
  set a black background, which is incorrect.
* Only add border instructions when there is actually a border.
* Reduce memory consumption by not creating new 3-element arrays for
  annotation colors. In fact, according to the spec, this would be
  incorrect, as the default should be "transparent" for an empty array.
  Adobe Reader interprets a missing color array as black, however.
Note that only Link annotations were actually setting a border style and
color. While Text annotations might have calculated a border, they did
not color it. This behaviour is now controlled by the boolean flag.
According to practical experiments, falling back to "Helvetica" when we encounter a non-embedded "[Century Gothic](http://en.wikipedia.org/wiki/Century_Gothic)" `CIDFontType2` font seems to work well.
(Also, the section on Wikipedia about "Printer ink usage" *might* provide some anecdotal evidence that Century Gothic is a fairly standard sans-serif font.)
Obviously this patch doesn't make "Century Gothic" fonts render perfectly, as is often the case with non-embedded fonts, but all the text is now legible in the referenced issues.
Fixes 4722.
Fixes https://bugzilla.mozilla.org/show_bug.cgi?id=879561.
Fixes two issues:
- #4456: The first 100 bytes are often not unique, as they can be
filled with standard PDF headers, so we use the first 200 KB instead.
(This may be overkill.)
- Some documents we encountered have invalid xref ids, which were
always coming out as `0000000000000000`, so we detect that and use the
MD5 instead.
This is a tentative patch that adds *very* basic support for non-embedded Wingdings fonts (a Windows version of Dingbats), by falling back to the ZapfDingbats encoding. Obviously this approach will not work perfectly, but in my opinion it seems to work reasonably well in practice.
Instead of this very simple patch, another option would be to try and include more complete glyph data for Wingdings, e.g. a Unicode map and glyph widths, similar to what was done for ZapfDingbats.
However there is, in my opinion, one important difference between Wingdings and ZapfDingbats: ZapfDingbats is one of the 14 standard fonts, which in previous versions of the PDF specification were assumed to be available in PDF readers. To improve compatibility with older files, it thus makes sense for us to include data for ZapfDingbats.
However, Wingdings has never been a standard font in PDF files; hence, PDF files using it *should* contain all the necessary font data.
Given the above, I thus believe that it should be OK to fall back to ZapfDingbats for now. If non-embedded Wingdings fonts turn out to be *a lot* more common, then we can revisit this later.
Fixes 4301 completely.
Fixes 4837 almost completely. With this patch the bullets are displayed correctly, but the arrows are not of the correct type.
Fixes `artofwar.pdf`, pages 14 and 15.
maskData comes out of maskCtx.getImageData, so its values are clamped
to 0..255, and the multiplications used will not create fractions
needing rounding; neither will the additions.
As described in #5444, the evaluator will perform identity checking of
paintImageMaskXObjects to decide if it can use
paintImageMaskXObjectRepeat instead of paintImageMaskXObjectGroup.
This can only ever work if the entry is a cache hit. However, the
previous caching implementation was lazy: it would only consider an
image cache-worthy if it was repeated; only then would the repeated
instance be cached.
As a result, the sequence of identical images A1 A2 A3 A4 would
be seen as A1 A2 A2 A2 by the evaluator, which prevents using the
"repeat" optimization. Also, since only the last encountered image is
cached, A1 B1 A2 B2 would stay A1 B1 A2 B2.
The new implementation drops the "lazy" init of the cache. The threshold
for considering an image cache-worthy is rather small, so the potential
waste in storage and adler32 computation is rather low. Any eligible
image is now cached by its adler32 checksum.
The two examples from above would now be A1 A1 A1 A1 and A1 B1 A1 B1,
which not only saves temporary storage, but also prevents computing
identical masks over and over again (which is the main performance
impact of #2618).
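The cache lookup, in sketch form (the checksum helper and cache shape are illustrative, and `decodeMask` is a hypothetical stand-in for the actual decoding step):
var key = 'mask_' + adler32(imgArray);
var cacheEntry = imageCache[key];
if (cacheEntry) {
  // Identical mask bytes seen before: reuse the exact same object, so
  // the evaluator's identity check can emit
  // paintImageMaskXObjectRepeat.
  args = cacheEntry.args;
} else {
  args = decodeMask(imgArray); // stand-in for the actual decoding
  imageCache[key] = { args: args };
}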
This patch makes the image from #5349 appear correctly; the artefacts
for the last packet are fixed in #5426.
This patch also optimizes some "in-checks" and adds a few header parsings.
After PR 5263, setting `disableAutoFetch = true` in the generic viewer no longer works correctly, since the entire file loads even with `disableStream = true`.
Currently when an exception is thrown, we try to reject `workerReadyCapability` with multiple arguments in src/core/api.js. This obviously doesn't work, hence this patch changes that to instead reject with the exception object as is.
In src/core/worker.js the exception is currently (unnecessarily) wrapped in an object, so this patch also simplifies that to directly send the exception object instead.
This patch avoids creating many intermediate strings when adding dummy width/lsb entries for glyphs where those are missing.
For the relevant PDF files in our test suite, the average number of intermediate strings is well over 1000.
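One way to avoid the churn (a sketch; the actual patch may differ in detail) is to collect the dummy entries in an array and join once, since every `+=` on a string allocates a fresh intermediate string:
var entries = [];
for (var i = 0; i < numMissingGlyphs; i++) {
  entries.push(dummyWidthAndLsbEntry);
}
// A single concatenation, instead of one per missing glyph.
metricsData += entries.join('');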
The scanned, black-and-white document at
https://bugzilla.mozilla.org/show_bug.cgi?id=835380 doesn't benefit from
the critical GRAYSCALE_1BPP optimization because the optimization is
skipped if `needsDecode` is set.
This change addresses that, and reduces both rendering time and memory
usage for that document by almost 10x.
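The idea, sketched under the assumption that `needsDecode` simply means the 1bpp data is inverted: flip the packed bytes up front and keep the fast path.
if (needsDecode) {
  // Invert all bits so the regular GRAYSCALE_1BPP expansion applies.
  for (var i = 0, ii = src.length; i < ii; i++) {
    src[i] ^= 0xff;
  }
}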
setGStateForKey() is a closure that serves no particularly useful
purpose. This change inlines it at the single call site. This avoids 1.7
MiB of allocations (because closures are objects) for the MTA map
mentioned in https://bugzilla.mozilla.org/show_bug.cgi?id=835380#c17.
For the document in #2504, 11% of the ops are `setGState` with a
`gStateObj` that is an empty array, which is a no-op. This is possible
because we ignore various setGState keys (OP, OPM, BG, etc.).
This change prevents these ops from being inserted into the operator
list.
This allows the JS engine to do a better job of allocating the right
number of elements for the array, avoiding some resizings. For the PDF
in #2504, this avoids 100s of MiBs of allocations in Firefox.
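The guard for those no-op `setGState` ops is tiny; in sketch form:
// Only queue the operation if at least one gState key survived.
if (gStateObj.length > 0) {
  operatorList.addOp(OPS.setGState, [gStateObj]);
}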
makeInlineImage() has an "are the next five chars ASCII?" check which is
run after an "EI" sequence has been found. This check involves the
creation of a new object, because peekBytes() calls subarray().
Unfortunately, the check is currently run on whitespace chars even when
an "EI" sequence has not yet been found, i.e. when it's not needed. For
the PDF in #2618, there are over 820,000 such checks.
This change reworks the relevant loop so that the check is only done
once an "EI" sequence has been seen. This reduces the number of checks
to 157,000, and speeds up rendering by somewhere between 2% and 7% (the
measurements are noisy).
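A sketch of the reworked loop (constants inlined; `allAsciiBytes` is a hypothetical stand-in for the real lookahead check):
var E = 0x45, I = 0x49, SPACE = 0x20, LF = 0x0a, CR = 0x0d;
var state = 0; // 0 = scanning, 1 = saw "E", 2 = saw "EI"
var ch;
while ((ch = stream.getByte()) !== -1) {
  if (state === 0) {
    state = (ch === E) ? 1 : 0;
  } else if (state === 1) {
    state = (ch === I) ? 2 : 0;
  } else { // state === 2: only now pay for the expensive lookahead.
    if (ch === SPACE || ch === LF || ch === CR) {
      var followingBytes = stream.peekBytes(5); // the subarray() cost
      if (allAsciiBytes(followingBytes)) {
        break; // Genuine "EI": end of the inline image data.
      }
    }
    state = 0;
  }
}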
The `console.log` statement in evaluator_spec.js is obviously not needed. In obj.js it could have been replaced by `info`, but that seemed unnecessary given the already existing `error`.