pdf.js

Author	SHA1	Message	Date
Tim van der Meij	8b79becad6	Improve code structure of the annotation code This patch improves the code structure of the annotation code. - Create the annotation border style object in the `setBorderStyle` method instead of in the constructor. The behavior is the same as the `setBorderStyle` method is always called, thus a border style object is still always available. - Put all data object manipulation lines in one block in the constructor. This improves readability and maintainability as it is more visible which properties are exposed. - Simplify `appendToOperatorList` by removing the promise capability and removing an unused parameter. - Remove some unnecessary newlines/spaces.	2015-11-29 00:04:21 +01:00
Jonas Jenwald	995e1a45b8	Ensure that `Lexer_getName` does not fail if a `Name` contains an invalid usage of the NUMBER SIGN (#) (issue 6692) This is a regression from PR 3424. The PDF file in the referenced issue is using `Type3` fonts. In one of those, the `/CharProcs` dictionary contains an entry with the name `/#`. Before the changes to `Lexer_getName` in PR 3424, we were allowing certain invalid `Name` patterns containing the NUMBER SIGN (#). It's unfortunate that this has been broken for close to two and a half years before the bug surfaced, but it should at least indicate that this is not a widespread issue. Fixes 6692.	2015-11-28 11:59:09 +01:00
Yury Delendik	e4e69e2f05	Set error font for Type3 if its loading failed.	2015-11-27 13:05:51 -06:00
Yury Delendik	8dff301ce1	Worker shall wait for MessageHandler to be created at api side.	2015-11-25 18:21:23 -06:00
Jonas Jenwald	6dfe53b976	[api-minor] Add a parameter to `PDFPageProxy_getTextContent` that enables replacing of all whitespace with standard spaces in the textLayer (issue 6612) This patch goes a bit further than issue 6612 requires, and replaces all kinds of whitespace with standard spaces. When testing this locally, it actually seemed to slightly improve two existing test-cases (`tracemonkey-text` and `taro-text`). Fixes 6612.	2015-11-25 17:28:40 +01:00
Yury Delendik	06c1904675	Refactors FontLoader to group fonts per document.	2015-11-24 13:27:22 -06:00
Yury Delendik	09772e1e15	Creates PDFWorker, separates fetchDocument from transport.	2015-11-24 13:27:22 -06:00
Yury Delendik	acdd49f480	Adds peer communication between MessageHandlers.	2015-11-24 12:16:58 -06:00
Yury Delendik	4b243cdd89	Merge pull request #6675 from Snuffleupagus/getAnnotations-intent [api-minor] Let `getAnnotations` fetch all annotations by default, unless an intent is specified	2015-11-24 12:11:51 -06:00
Jonas Jenwald	a2a5d36d5b	Restore the `data.annotationFlags` parameter for annotations (PR 6672 follow-up)	2015-11-23 10:17:11 +01:00
Jonas Jenwald	b05652ca97	[api-minor] Let `getAnnotations` fetch all annotations by default, unless an intent is specified Currently `getAnnotations` will only fetch annotations that are either `viewable` or `printable`. This is "hidden" inside the `core.js` file, meaning that API consumers might be confused as to why they are not receiving all the annotations present for a page. I thus think that the API should, by default, return all available annotations unless specifically told otherwise. In e.g. the default viewer, we obviously only want to display annotations that are `viewable`, hence this patch adds an `intent` parameter to `getAnnotations` that makes it possible to decide if only `viewable` or `printable` annotations should be fetched.	2015-11-22 15:51:37 +01:00
Yury Delendik	0029000c9f	Merge pull request #6671 from Snuffleupagus/make-stripCommentHeaders-less-gready Make `stripCommentHeaders` less greedy, to ensure that it doesn't eat 'use strict' directive at the top of files (PR 6627 follow-up)	2015-11-22 07:24:20 -06:00
Tim van der Meij	0991c06395	Refactor annotation flags code This patch makes it possible to set and get all possible flags that the PDF specification defines. Even though we do not support all possible annotation types and not all possible annotation flags yet, this general framework makes it easy to access all flags for each annotation such that annotation type implementations can use this information. We add constants for all possible annotation flags such that we do not need to hardcode the flags in the code anymore. The `isViewable()` and `isPrintable()` methods are now easier to read. Additionally, unit tests have been added to ensure correct behavior. This is another part of #5218.	2015-11-22 01:06:37 +01:00
Jonas Jenwald	373da010ac	Move the `globals` comments in bidi.js and metadata.js to after the Copyright comments	2015-11-21 18:43:08 +01:00
Yury Delendik	2f1a626d6a	Merge pull request #6640 from dsprenkels/issue-6006-radial-gradient-size Apply transformation matrix to RadialGradient radiuses	2015-11-17 11:40:13 -06:00
Daan Sprenkels	6ce83d3290	apply transformation matrix to RadialGradient radiuses, not only to circle origin points fix for #6006	2015-11-17 00:20:42 +01:00
Manas	a2ba1b8189	Uses editorconfig to maintain consistent coding styles Removes the following as they are unnecessary: /* -*- Mode: Java; tab-width: 2; indent-tabs-mode: nil; c-basic-offset: 2 -*- */ /* vim: set shiftwidth=2 tabstop=2 autoindent cindent expandtab: */	2015-11-14 07:32:18 +05:30
Jonas Jenwald	50a70429ec	Ignore the /Mask entry in images unless its /ImageMask entry is explicitly set to `true` (issue 6621) Fixes 6621.	2015-11-12 22:49:26 +01:00
Yury Delendik	7381ff9523	Merge pull request #6599 from prometheansacrifice/generate-better-api-docs Generate better API documentation	2015-11-12 14:26:18 -06:00
Manas	dbcb46c8de	Uses @alias to fix missing comments on JSDocs pages	2015-11-13 01:24:15 +05:30
Yury Delendik	3c6df26704	Merge pull request #6608 from Rob--W/improved-error-message-local-file Improve error message for non-existent local files	2015-11-09 15:40:41 -06:00
Rob Wu	c604cc22d1	Improve error message for non-existent local files I received multiple reports about the following cryptic error in the Chrome extension when the user tried to open a local file: > PDF.js v1.1.527 (build: 2096a2a) > Message: Cannot read property 'Symbol(Symbol.iterator)' of null This error most likely originated from core/stream.js: function Stream(arrayBuffer, start, length, dict) { this.bytes = (arrayBuffer instanceof Uint8Array ? arrayBuffer : new Uint8Array(arrayBuffer)); ^^^^^^^^^^^ `arrayBuffer` is `null`, and that in turn is caused by the fact that for non-existing files, there is no data. I've applied two fixes: 1. Never call onDone with a void buffer, but call the error handler instead. 2. Show a sensible error message for local files with status = 0.	2015-11-08 18:03:28 +01:00
Jonas Jenwald	ff64ef0243	Prevent `readCmapTable` from failing if the `cmap` is missing in TrueType fonts Fixes http://arrow.dit.ie/cgi/viewcontent.cgi?article=1000&context=aaschadpoth#page=3.	2015-11-08 16:48:37 +01:00
Yury Delendik	bb29e13307	Merge pull request #6601 from yurydelendik/ascent Fixes incorrect PDF file font metrics.	2015-11-06 20:16:04 -06:00
Yury Delendik	cc5bc18728	Fixes incorrect PDF file font metrics.	2015-11-06 14:47:10 -06:00
Yury Delendik	fa423cfab0	Refactors fake space heuristics for speed.	2015-11-06 10:55:43 -06:00
Yury Delendik	376f8bde14	Combines standalone divs into text groups.	2015-11-06 10:20:49 -06:00
Yury Delendik	fa46b73c47	Better spacing in text layer.	2015-11-02 08:54:15 -06:00
Yury Delendik	d26ef21d52	Merge pull request #6568 from tonyjin/api-rangeChunkSize [api-minor] Add an optional param to DocumentInitParameters for speci…	2015-10-28 16:52:52 -05:00
Tony Jin	ef667823dd	[api-minor] Add an optional param to DocumentInitParameters for specifying the range request chunk size to use. Defaults to 2^16 = 65536.	2015-10-26 17:22:11 -07:00
Jonas Jenwald	1c66d4a106	Add a `totalLength` getter to `OperatorList`, since the `length` is zero after flushing In the `RenderPageRequest` handler in `worker.js`, we attempt to print an `info` message containing the rendering time and the length of the operator list. The latter is currently broken (and has been for quite some time), since the `length` of an `OperatorList` is reset when flushing occurs. This patch attempts to rectify this, by adding a getter which keeps track of the total length.	2015-10-26 18:12:14 +01:00
Yury Delendik	58c3ea0820	Adds thread abort capabilities.	2015-10-23 09:06:32 -05:00
Yury Delendik	59c13b32aa	Adds destroy method to the document loading task. Also renames PDFPageProxy.destroy method to cleanup.	2015-10-23 08:57:14 -05:00
Jonas Jenwald	2e751199fb	Prevent getOperatorList from failing to correctly parse OPS.paintXObject for TilingPatterns that are missing some /Resources entries (issue 6541) Fixes 6541.	2015-10-21 21:30:56 +02:00
Rob Wu	50ff2d4c2a	Ignore operators that are known to be unsupported `operatorList.addOp` adds the arguments to the list which is then passed as-is by postMessage to the main thread. But since we don't parse these operations, they are raw PDF objects and may therefore cause a serialization error. This is a conservative patch, and only affects operators which are known to be unsupported. We should ignore all unknown operators, but I haven't really looked into the consequences of doing that. Fixes #6549	2015-10-21 15:39:25 +02:00
Brendan Dahl	e4f0e6f2a0	Merge pull request #6531 from covlllp/new_merge Fixes bluebeam password protection issue	2015-10-16 13:47:06 -07:00
Colin VanLang	6d8e883fe6	Fixes bluebeam password protection issue	2015-10-15 21:22:27 -04:00
Jonas Jenwald	49883439a5	Ensure that `Dict_getArray` doesn't fail if `xref` is undefined (PR 6485 follow-up) In PR 6485 I somehow failed to account for the case where `xref` is undefined. Since a dictionary can be initialized without providing a reference to an `xref` instance, `Dict_getArray` can thus fail without this added check.	2015-10-15 11:47:07 +02:00
Brendan Dahl	3eaeacfe19	Merge pull request #6476 from Snuffleupagus/PartialEvaluator_readToUnicode-cmap-length Right-size the `map` array in PartialEvaluator_readToUnicode	2015-10-09 10:31:28 -07:00
Jonas Jenwald	9b12c64be5	Cache the regular expression used for finding `obj`s in `XRef_indexObjects`, to avoid unnecessary allocations	2015-10-02 12:46:58 +02:00
Jonas Jenwald	192907e0d2	Make `XRef_indexObjects` even more robust against bad PDF files, by checking for the existence of 'trailer' if 'xref' is not found Fixes http://www.cyjack.com/cognition/Terence%20McKenna%20-%20Lectures%20on%20Alchemy.pdf.	2015-10-01 15:01:25 +02:00
Tim van der Meij	1bdfc47de8	Merge pull request #6411 from Snuffleupagus/remove-Parser_fetchIfRef Remove `Parser_fetchIfRef` since it's obsolete	2015-09-30 00:38:35 +02:00
Jonas Jenwald	1b8cb52555	Prevent `PartialEvaluator_buildFormXObject` from failing if the `Matrix` or `BBox` contains indirect objects This patch fixes yet another instance of bad PDF data, specifically a case where the `BBox` array contains indirect objects (i.e. `Ref`s). Fixes the missing image in http://www.int.washington.edu/talks/WorkShops/int_08_37W/People/Franz_M/Franz.pdf#page=24. Note: There are missing images on a number of the pages in that file.	2015-09-29 10:11:49 +02:00
Jonas Jenwald	75557d27d1	Add `getArray` method to `Dict` This method extends `get`, and will fetch all indirect objects (i.e. `Ref`s) when the result is an `Array`.	2015-09-29 10:11:47 +02:00
Jonas Jenwald	8d831449ab	Right-size the `map` array in PartialEvaluator_readToUnicode We can avoid a lot of intermediate resizings, by directly allocating the required number of elements for the `map` array.	2015-09-24 13:08:53 +02:00
Fabian Lange	2564827503	Fix text spacing with vertical fonts (#6387) According to the PDF spec 5.3.2, a positive value means, in horizontal mode, that the next glyph is further to the left (so narrower), and, in vertical mode, that it is further down (so wider). This change fixes the way PDF.js has interpreted the value.	2015-09-15 09:28:45 +02:00
Tim van der Meij	12b0b9744b	Merge pull request #6427 from Snuffleupagus/slightly-more-robust-get-fingerprint Make `get fingerprint` slightly more robust against corrupt PDF files	2015-09-10 22:07:44 +02:00
Jonas Jenwald	5853553455	Make `get fingerprint` slightly more robust against corrupt PDF files This patch adjusts `get fingerprint` to also check that the `/ID` entry contains (non-empty) strings, to prevent more possible failures when loading corrupt PDF files (follow-up to PR 5602). Note that I've not actually encountered such a PDF file in the wild. However given that `stringToBytes` will assert that the input is a string, and that we'll thus fail to load a document unless `get fingerprint` succeeds, making this more robust seems like a good idea to me.	2015-09-08 13:42:53 +02:00
Jonas Jenwald	29a1cdb6a6	Only choose a (3, 1) cmap table for TrueType fonts that have an encoding specified (issue 6410) For (1, 0) cmaps, we have two different codepaths depending on whether the font has/hasn't got an encoding. But with (3, 1) cmaps we don't have a good fallback when the encoding is missing, hence this patch changes `readCmapTable` to only choose a (3, 1) cmap table if the font is non-symbolic and an encoding exists. Without this, we'll not be able to successfully create a working glyph map for some TrueType fonts with (3, 1) cmap tables. Fixes 6410.	2015-09-07 16:56:05 +02:00
Jonas Jenwald	b1d148a4aa	Remove `Parser_fetchIfRef` since it's obsolete This code was added in PR 1214, but was made obsolete by PRs 1488/1493. Prior to the latter ones, `Dict_get` returned the raw objects. However, afterwards (and currently) `Dict_get` resolves indirect objects, which makes `Parser_fetchIfRef` redundant. Potential risks with this patch: This patch passes all tests locally, but there's a small possibility that it could break some weird PDF files. In the current code, wrapping `Dict_get` inside `Parser_fetchIfRef` will potentially mean two back-to-back calls of `XRef_fetch`, if a reference points directly to another reference. I'm not sure if this can actually happen in practice, and I'd think that if that were the case we'd already have run into it elsewhere in the code-base, given that `Parser` is the only place where we try to "double" resolve references.	2015-09-02 23:11:00 +02:00
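
The annotation commits above (0991c06395 and b05652ca97) describe named flag constants, `isViewable()`/`isPrintable()` checks, and an `intent` parameter that narrows what `getAnnotations` returns. The following is a minimal sketch of that idea, not the code from those patches. The bit values follow the annotation flags table in the PDF specification; the names `AnnotationFlag`, `hasFlag` and `filterByIntent`, the `flags` property, and the intent strings `'display'` and `'print'` are assumptions made for illustration. The sketch also ignores the rule that the Invisible flag only applies to non-standard annotation types.

```javascript
// Hypothetical sketch: names and structure are illustrative, not pdf.js source.
// Bit values follow the annotation flags table in the PDF specification.
const AnnotationFlag = {
  INVISIBLE: 0x01,
  HIDDEN: 0x02,
  PRINT: 0x04,
  NOZOOM: 0x08,
  NOROTATE: 0x10,
  NOVIEW: 0x20,
  READONLY: 0x40,
  LOCKED: 0x80,
  TOGGLENOVIEW: 0x100,
  LOCKEDCONTENTS: 0x200,
};

// True if the given bit is set in the annotation's flags integer.
function hasFlag(flags, flag) {
  return (flags & flag) !== 0;
}

// Viewable: none of the "do not show on screen" bits are set.
function isViewable(flags) {
  return !hasFlag(flags, AnnotationFlag.INVISIBLE) &&
         !hasFlag(flags, AnnotationFlag.HIDDEN) &&
         !hasFlag(flags, AnnotationFlag.NOVIEW);
}

// Printable: the Print bit is set and the annotation is not hidden.
function isPrintable(flags) {
  return hasFlag(flags, AnnotationFlag.PRINT) &&
         !hasFlag(flags, AnnotationFlag.INVISIBLE) &&
         !hasFlag(flags, AnnotationFlag.HIDDEN);
}

// Return every annotation unless an intent is given (assumed intent names).
function filterByIntent(annotations, intent) {
  if (intent === 'display') {
    return annotations.filter((a) => isViewable(a.flags));
  }
  if (intent === 'print') {
    return annotations.filter((a) => isPrintable(a.flags));
  }
  return annotations.slice();
}
```

With this shape, a caller that passes no intent gets every annotation back, which matches the default behaviour described in b05652ca97, while a viewer would pass the display intent to keep only viewable ones.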


1981 Commits