pdf.js

Author	SHA1	Message	Date
Jonas Jenwald	50a70429ec	Ignore the /Mask entry in images unless its /ImageMask entry is explicitly set to `true` (issue 6621) Fixes 6621.	2015-11-12 22:49:26 +01:00
Yury Delendik	7381ff9523	Merge pull request #6599 from prometheansacrifice/generate-better-api-docs Generate better API documentation	2015-11-12 14:26:18 -06:00
Manas	dbcb46c8de	Uses @alias to fix missing comments on JSDocs pages	2015-11-13 01:24:15 +05:30
Yury Delendik	3c6df26704	Merge pull request #6608 from Rob--W/improved-error-message-local-file Improve error message for non-existent local files	2015-11-09 15:40:41 -06:00
Rob Wu	c604cc22d1	Improve error message for non-existent local files I received multiple reports about the following cryptic error in the Chrome extension when the user tried to open a local file: > PDF.js v1.1.527 (build: 2096a2a) > Message: Cannot read property 'Symbol(Symbol.iterator)' of null This error most likely originated from core/stream.js: function Stream(arrayBuffer, start, length, dict) { this.bytes = (arrayBuffer instanceof Uint8Array ? arrayBuffer : new Uint8Array(arrayBuffer)); ^^^^^^^^^^^ `arrayBuffer` is `null`, and that in turn is caused by the fact that for non-existing files, there is no data. I've applied two fixes: 1. Never call onDone with a void buffer, but call the error handler instead. 2. Show a sensible error message for local files with status = 0.	2015-11-08 18:03:28 +01:00
Jonas Jenwald	ff64ef0243	Prevent `readCmapTable` from failing if the `cmap` is missing in TrueType fonts Fixes http://arrow.dit.ie/cgi/viewcontent.cgi?article=1000&context=aaschadpoth#page=3.	2015-11-08 16:48:37 +01:00
Yury Delendik	bb29e13307	Merge pull request #6601 from yurydelendik/ascent Fixes incorrect PDF file font metrics.	2015-11-06 20:16:04 -06:00
Brendan Dahl	9a830a7b62	Merge pull request #6590 from yurydelendik/combinechars Combines standalone chars into text groups.	2015-11-06 15:06:41 -08:00
Yury Delendik	cc5bc18728	Fixes incorrect PDF file font metrics.	2015-11-06 14:47:10 -06:00
Yury Delendik	fa423cfab0	Refactors fake space heuristics for speed.	2015-11-06 10:55:43 -06:00
Yury Delendik	376f8bde14	Combines standalone divs into text groups.	2015-11-06 10:20:49 -06:00
Yury Delendik	601d29b14e	Fixes all examples to require workerSrc to be set.	2015-11-06 07:50:21 -06:00
Yury Delendik	28d340679a	Uses document.currentScript for pdf.worker.js path.	2015-11-06 07:50:21 -06:00
Yury Delendik	fa46b73c47	Better spacing in text layer.	2015-11-02 08:54:15 -06:00
Brendan Dahl	b56b41514c	Merge pull request #6578 from yurydelendik/issue6577 Ignore any pending data when worker is terminated.	2015-10-29 11:49:40 -07:00
Yury Delendik	8d15ecb14b	Ignore any pending data when worker is terminated.	2015-10-29 13:06:22 -05:00
Yury Delendik	d26ef21d52	Merge pull request #6568 from tonyjin/api-rangeChunkSize [api-minor] Add an optional param to DocumentInitParameters for speci…	2015-10-28 16:52:52 -05:00
Tony Jin	ef667823dd	[api-minor] Add an optional param to DocumentInitParameters for specifying the range request chunk size to use. Defaults to 2^16 = 65536.	2015-10-26 17:22:11 -07:00
Jonas Jenwald	1c66d4a106	Add a `totalLength` getter to `OperatorList`, since the `length` is zero after flushing In the `RenderPageRequest` handler in `worker.js`, we attempt to print an `info` message containing the rendering time and the length of the operator list. The latter is currently broken (and has been for quite some time), since the `length` of an `OperatorList` is reset when flushing occurs. This patch attempts to rectify this, by adding a getter which keeps track of the total length.	2015-10-26 18:12:14 +01:00
Jonas Jenwald	5bd95df427	Prevent `TypeError: page is undefined` when the document has been destroyed (PR 6546 follow-up) Follow-up to PR 6546. If rendering has already started when the document is destroyed, then `this.pageCache[data.pageIndex]` may already have been cleared when the `StartRenderingPage`/`RenderPageChunk` messages are received in `api.js`, which results in `TypeError`s being thrown.	2015-10-23 22:16:34 +02:00
Yury Delendik	5135aa9bec	Adds deprecation warning for the API calls.	2015-10-23 09:06:32 -05:00
Yury Delendik	58c3ea0820	Adds thread abort capabilities.	2015-10-23 09:06:32 -05:00
Yury Delendik	59c13b32aa	Adds destroy method to the document loading task. Also renames PDFPageProxy.destroy method to cleanup.	2015-10-23 08:57:14 -05:00
Jonas Jenwald	487ba9065a	Fail gracefully, and with a notification, if paintXObject is encountered in canvas.js We should never actually try to execute `paintXObject` in canvas.js, but in some cases where we fail to parse the PDF file correctly it can happen. Currently this will potentially cause an entire page to fail to render, which seems suboptimal. With this patch, we will instead continue rendering with a warning that things might not work correctly.	2015-10-21 21:30:59 +02:00
Jonas Jenwald	2e751199fb	Prevent getOperatorList from failing to correctly parse OPS.paintXObject for TilingPatterns that are missing some /Resources entries (issue 6541) Fixes 6541.	2015-10-21 21:30:56 +02:00
Rob Wu	50ff2d4c2a	Ignore operators that are known to be unsupported `operatorList.addOp` adds the arguments to the list which is then passed as-is by postMessage to the main thread. But since we don't parse these operations, they are raw PDF objects and may therefore cause a serialization error. This is a conservative patch, and only affects operators which are known to be unsupported. We should ignore all unknown operators, but I haven't really looked into the consequences of doing that. Fixes #6549	2015-10-21 15:39:25 +02:00
Brendan Dahl	e4f0e6f2a0	Merge pull request #6531 from covlllp/new_merge Fixes bluebeam password protection issue	2015-10-16 13:47:06 -07:00
Colin VanLang	6d8e883fe6	Fixes bluebeam password protection issue	2015-10-15 21:22:27 -04:00
Jonas Jenwald	49883439a5	Ensure that `Dict_getArray` doesn't fail if `xref` is undefined (PR 6485 follow-up) In PR 6485 I somehow failed to account for the case where `xref` is undefined. Since a dictionary can be initialized without providing a reference to an `xref` instance, `Dict_getArray` can thus fail without this added check.	2015-10-15 11:47:07 +02:00
Jonas Jenwald	9ab896e307	[api-minor] Add an option to PDFJS for specifying the |target| attribute of external links Replaces `PDFJS.openExternalLinksInNewWindow` with a more generic configuration option. Note: `PDFJS.openExternalLinksInNewWindow = true;` is equal to `PDFJS.externalLinkTarget = PDFJS.LinkTarget.BLANK;`.	2015-10-13 21:52:00 +02:00
Brendan Dahl	3eaeacfe19	Merge pull request #6476 from Snuffleupagus/PartialEvaluator_readToUnicode-cmap-length Right-size the `map` array in PartialEvaluator_readToUnicode	2015-10-09 10:31:28 -07:00
Tim van der Meij	5e4910f7b6	Merge pull request #6491 from Snuffleupagus/check-trailer-if-xref-missing Make `XRef_indexObjects` even more robust against bad PDF files, by checking for the existence of 'trailer' if 'xref' is not found	2015-10-04 16:00:00 +02:00
Tim van der Meij	dd9d0b8770	Merge pull request #5480 from CodingFabian/issue-5458 Remove TryCatch in canvas for EvenOdd winding rule.	2015-10-04 15:31:34 +02:00
Jonas Jenwald	9b12c64be5	Cache the regular expression used for finding `obj`s in `XRef_indexObjects`, to avoid unnecessary allocations	2015-10-02 12:46:58 +02:00
Jonas Jenwald	192907e0d2	Make `XRef_indexObjects` even more robust against bad PDF files, by checking for the existence of 'trailer' if 'xref' is not found Fixes http://www.cyjack.com/cognition/Terence%20McKenna%20-%20Lectures%20on%20Alchemy.pdf.	2015-10-01 15:01:25 +02:00
Tim van der Meij	1bdfc47de8	Merge pull request #6411 from Snuffleupagus/remove-Parser_fetchIfRef Remove `Parser_fetchIfRef` since it's obsolete	2015-09-30 00:38:35 +02:00
Jonas Jenwald	1b8cb52555	Prevent `PartialEvaluator_buildFormXObject` from failing if the `Matrix` or `BBox` contains indirect objects This patch fixes yet another instance of bad PDF data, specifically a case where the `BBox` array contains indirect objects (i.e. `Ref`s). Fixes the missing image in http://www.int.washington.edu/talks/WorkShops/int_08_37W/People/Franz_M/Franz.pdf#page=24. Note: There are missing images on a number of the pages in that file.	2015-09-29 10:11:49 +02:00
Jonas Jenwald	75557d27d1	Add `getArray` method to `Dict` This method extends `get`, and will fetch all indirect objects (i.e. `Ref`s) when the result is an `Array`.	2015-09-29 10:11:47 +02:00
Jonas Jenwald	9eab463b6d	Ensure that the `baseTransform` is always defined for TilingPatterns Fixes http://www2.emersonprocess.com/siteadmincenter/PM%20Micro%20Motion%20Documents/High-Pressure-Measurement-WP-001287.pdf#page=3.	2015-09-27 22:49:34 +02:00
Jonas Jenwald	8d831449ab	Right-size the `map` array in PartialEvaluator_readToUnicode We can avoid a lot of intermediate resizings, by directly allocating the required number of elements for the `map` array.	2015-09-24 13:08:53 +02:00
Fabian Lange	2564827503	Fix text spacing with vertical fonts (#6387) According to the PDF spec 5.3.2, a positive value means, in horizontal mode, that the next glyph is further to the left (so narrower), and, in vertical mode, that it is further down (so wider). This change fixes the way PDF.js has interpreted the value.	2015-09-15 09:28:45 +02:00
Tim van der Meij	12b0b9744b	Merge pull request #6427 from Snuffleupagus/slightly-more-robust-get-fingerprint Make `get fingerprint` slightly more robust against corrupt PDF files	2015-09-10 22:07:44 +02:00
Jonas Jenwald	5853553455	Make `get fingerprint` slightly more robust against corrupt PDF files This patch adjusts `get fingerprint` to also check that the `/ID` entry contains (non-empty) strings, to prevent more possible failures when loading corrupt PDF files (follow-up to PR 5602). Note that I've not actually encountered such a PDF file in the wild. However given that `stringToBytes` will assert that the input is a string, and that we'll thus fail to load a document unless `get fingerprint` succeeds, making this more robust seems like a good idea to me.	2015-09-08 13:42:53 +02:00
Jonas Jenwald	29a1cdb6a6	Only choose a (3, 1) cmap table for TrueType fonts that have an encoding specified (issue 6410) For (1, 0) cmaps, we have two different codepaths depending on whether the font has/hasn't got an encoding. But with (3, 1) cmaps we don't have a good fallback when the encoding is missing, hence this patch changes `readCmapTable` to only choose a (3, 1) cmap table if the font is non-symbolic and an encoding exists. Without this, we'll not be able to successfully create a working glyph map for some TrueType fonts with (3, 1) cmap tables. Fixes 6410.	2015-09-07 16:56:05 +02:00
Fabian Lange	063ca95f5f	Remove TryCatch in canvas fill As verified by @Rob--W, the evenodd fill rule works correctly in all supported browsers. This now allows optimization by JS engines. This fixes #5458	2015-09-05 11:10:51 +02:00
Brendan Dahl	238e16feeb	Merge pull request #6407 from Snuffleupagus/bug-1200096 Fallback in `readCmapTable`, instead of using `error`, for TrueType fonts with unsupported cmap formats (bug 1200096)	2015-09-04 18:10:34 -07:00
Jonas Jenwald	cfd5a64df5	Ensure that the clipping path is reset when the state is restored (issue 6413) According to the specification, see `NOTE 2` in http://www.adobe.com/content/dam/Adobe/en/devnet/acrobat/pdfs/PDF32000_2008.pdf#G7.3882161, it appears that we should ensure that the clipping path is reset when the restore (`Q`) operator is encountered. Fixes 6413.	2015-09-03 17:35:32 +02:00
Jonas Jenwald	b1d148a4aa	Remove `Parser_fetchIfRef` since it's obsolete This code was added in PR 1214, but was made obsolete by PRs 1488/1493. Prior to the latter ones, `Dict_get` returned the raw objects. However, afterwards (and currently) `Dict_get` resolves indirect objects, which makes `Parser_fetchIfRef` redundant. Potential risks with this patch: This patch passes all tests locally, but there's a small possibility that it could break some weird PDF files. In the current code, wrapping `Dict_get` inside `Parser_fetchIfRef` will potentially mean two back-to-back calls of `XRef_fetch`, if a reference points directly to another reference. I'm not sure if this can actually happen in practice, and I'd think that if that were the case we'd already have run into it elsewhere in the code-base, given that `Parser` is the only place where we try to "double" resolve references.	2015-09-02 23:11:00 +02:00
Jonas Jenwald	0fb31a4a9e	Fallback in `readCmapTable`, instead of using `error`, for TrueType fonts with unsupported cmap formats (bug 1200096) Fixes https://bugzilla.mozilla.org/show_bug.cgi?id=1200096. The problematic font has a `format 2` cmap, which we've never supported properly. Prior to PR 2606, we were able to fallback to a working state, despite not having proper support for that cmap format. Obviously the best/correct solution would be to implement actual support for more cmap formats[1]. However, I'm hoping that a simple patch will be OK for now, given that: - `format 2` cmaps seem to be quite rare in practice, since this has been broken for 2.5 years before anyone noticed. - Having a simple patch will make potential uplifts a lot easier. [1] See the specification at https://developer.apple.com/fonts/TrueType-Reference-Manual/RM06/Chap6cmap.html	2015-09-01 14:01:19 +02:00
Tim van der Meij	0020f33873	Merge pull request #6357 from Snuffleupagus/bidi-result Avoid more allocations for RTL text in bidi.js	2015-09-01 00:44:33 +02:00

2985 Commits