pdf.js

Author	SHA1	Message	Date
Brendan Dahl	cdc79a4721	Don’t skip glyph 0 in cmap.	2017-04-05 15:17:38 -07:00
Yury Delendik	b665b0319a	Merge pull request #8222 from tjgrathwell/ios-fake-cancel-animation-frame ios: Patch cancelAnimationFrame whenever fakeRequestAnimationFrame is used	2017-04-04 10:29:04 -05:00
Yury Delendik	31f8875614	Merge pull request #8157 from Snuffleupagus/api-RenderTask-cancel-Error [api-minor] Reject the `RenderTask` with an actual `Error`, instead of just a `string`, when rendering is cancelled	2017-04-04 09:38:47 -05:00
Travis Grathwell	bd70a73d43	ios: Patch cancelAnimationFrame whenever fakeRequestAnimationFrame is used The existing implementation of fakeRequestAnimationFrame did not return a timer ID, so the frame could not be cancelled if you wanted to cancel it. But if you do want to cancel it, it needs to be cancelled through clearTimeout instead of cancelAnimationFrame, because the timer IDs are different. Signed-off-by: Jonathan Barnes <jbarnes@pivotal.io>	2017-03-31 15:31:04 -07:00
Tim van der Meij	8cee63df5d	Merge pull request #8205 from Snuffleupagus/built-in-CMap-errors Improve the error handling when loading of built-in CMap files fail (PR 8064 follow-up)	2017-03-30 23:01:13 +02:00
Jonas Jenwald	437104969d	Improve the error handling when loading of built-in CMap files fail (PR 8064 follow-up) I happened to notice that the error handling wasn't that great, which I missed previously since there were no unit-tests for failure to load built-in CMap files. Hence this patch, which improves the error handling and adds tests.	2017-03-29 22:38:29 +02:00
Jonas Jenwald	61ee0de29f	Use a simple `RefSetCache` to significantly improve the performance of `Catalog.getPageDict` for certain long documents (PR 8105 follow-up) I found that PR 8105 unfortunately causes a very serious performance regression in long PDF documents where the `Pages` tree only has one level; my apologies for this! Obviously we cannot revert that PR, since that would cause more issues than it solves. Hence it seems to me that the only viable solution here, is to add a simple `RefSetCache` to reduce the amount of redundant lookups. Previously in PR 8105 caching was thought to be unnecessary, but as it turns out I don't think that we really have a choice in the matter any more.	2017-03-28 21:39:55 +02:00
Jonas Jenwald	62eee8c782	Try harder to find the next valid JPEG marker when decoding Scan data (issue 8182, issue 8189) Tentatively fixes 8182 and fixes 8189.	2017-03-27 15:55:21 +02:00
Jonas Jenwald	e229c21ce1	Remove unnecessary `xref` parameters from various method signatures in `PartialEvaluator`, since `this.xref` is already available in the relevant scope For reasons I don't pretend to understand, we're passing around `xref` arguments to a bunch of methods despite `this.xref` being available in `PartialEvaluator`. This patch is a small first small step towards cleaning up the, often unwieldy, signatures of methods in `PartialEvaluator`.	2017-03-26 14:12:53 +02:00
Jonas Jenwald	e40fd63bd3	In `src/core/evaluator.js`, convert a couple of `if (!someVariable) { error(...); }` instances to `assert(someVariable);` instead Rather than, in a number of places, basically duplicating the logic of `assert` we can simply utilize the function directly instead.	2017-03-26 13:53:13 +02:00
Jonas Jenwald	5c0c122a7d	Ensure that the `XMLHttpRequest` is `open`ed before attempting to set the `responseType` in the `DOMCMapReaderFactory`, since IE fails otherwise (issue 8193) I really cannot understand why this change is necessary, since modern browsers such as Firefox and Chrome work just fine with the old code. Hence this is patch is yet another "hack" that's needed just because IE apparently cannot just work like you'd expect. For consistency, the Node factory used in the CMap unit-tests is changed as well. Fixes 8193.	2017-03-25 17:44:48 +01:00
Jonas Jenwald	3705e5e459	Use a proper `MessageHandler` for `PartialEvaluator.getTextContent` to avoid errors for fonts relying on built-in CMap files (PR 8064 follow-up) My apologies for inadvertently breaking this in PR 8064; apparently we don't have any tests that cover this use-case :( Without this patch `getTextContent` will fail if called before `getOperatorList`, since loading of fonts during text-extraction may require fetching of built-in CMap files. Please note: The `text` test added here, which uses an already existing PDF file, fails without this patch.	2017-03-24 17:39:33 +01:00
Rob Wu	49af56f730	Rethrow MissingDataException when needed In core/document.js: `PDFDocument.prototype.parse` accesses a dictionary property, which could throw if the underlying data is not yet available. In core/obj.js: `get Catalog.prototype.metadata` calls `stream.getBytes`, which can throw MissingDataException too when the stream is a ChunkedStream.	2017-03-22 14:55:59 +01:00
Jonas Jenwald	8527d27eae	Ensure that `PDFDocument.documentInfo` doesn't fail during document load, when the entire XRef table hasn't been fetched yet (issue 8180) Similar to other `try-catch` statements in `/core` code, we must re-throw `MissingDataException` to prevent issues with missing data during document loading. Note that I'm not sure if/how we can test this, which is why the patch doesn't include any test(s). Fixes 8180.	2017-03-22 14:14:38 +01:00
Jonas Jenwald	e2e13df4a5	Merge pull request #8164 from Snuffleupagus/issue-7828 Don't read past the EOI marker for JPEG images with non-default restart interval (issue 7828)	2017-03-20 22:17:28 +01:00
Jonas Jenwald	d6d0f778aa	Don't read past the EOI marker for JPEG images with non-default restart interval (issue 7828) After browsing through (a version of) the JPEG specification, see https://www.w3.org/Graphics/JPEG/itu-t81.pdf, I hope that this patch makes sense. Note that while issue 7828 became a problem after PR 7661, it isn't really a regression from than PR. The explanation is rather that we're now relying on `core/jpg.js` instead of the Native Image decoder in more situations than before, which thus exposed an existing issue in our JPEG decoder. Another factor also seems to be that in many JPEG images, the DRI (Define Restart Interval) marker isn't present, in which case this bug won't manifest either. According to https://www.w3.org/Graphics/JPEG/itu-t81.pdf#page=89 (at the bottom of the page): "NOTE – The final restart interval may be smaller than the size specified by the DRI marker segment, as it includes only the number of MCUs remaining in the scan." Furthermore, according to https://www.w3.org/Graphics/JPEG/itu-t81.pdf#page=39 (in the middle of the page): "[...] If restart is enabled and the restart interval is defined to be Ri, each entropy-coded segment except the last one shall contain Ri MCUs. The last one shall contain whatever number of MCUs completes the scan." Based on the above, it thus seem to me that we should simply ensure that we're not attempting to continue to parse Scan data once we've found all MCUs (Minimum Coded Unit) of the image. Fixes 7828.	2017-03-20 17:16:33 +01:00
Jonas Jenwald	be1a6f294f	Try to recover when encountering JPEG markers with too short marker lengths (issue 8169) The issue with the JPEG image in question, is that the COM (Comment) marker has an incorrect length entry. Fixes 8169.	2017-03-20 17:05:51 +01:00
Jonas Jenwald	a7c19d9cbb	Adjust the `yoda` ESLint rule to apply to inequalities as well I happened to notice that some inequalities had the wrong order, and was surprised since I thought that the `yoda` rule should have caught that. However, reading http://eslint.org/docs/rules/yoda#options a bit more closely than previously, it's quite obvious that the `onlyEquality` option does exactly what its name suggests. Hence I think that it makes sense to adjust the options such that only ranges are allowed instead.	2017-03-19 13:27:14 +01:00
Jonas Jenwald	098a56270d	Normalize the `BBox` entry in Tiling Pattern dictionaries (issue 8117) According to the PDF specification, see http://www.adobe.com/content/dam/Adobe/en/devnet/acrobat/pdfs/PDF32000_2008.pdf#G7.3982967, the `BBox` entry should have the form `[left, bottom, right, top]`. Since some PDF generators apparently violates the specification, we normalize the `BBox` to ensure that the pattern is (correctly) rendered. Fixes 8117.	2017-03-16 21:43:11 +01:00
Jonas Jenwald	d37d271afa	[api-minor] Reject the `RenderTask` with an actual `Error`, instead of just a `string`, when rendering is cancelled This patch gets rid of the only case in the code-base where we're throwing a plain `string`, rather than an `Error`, which besides better/more consistent error handling also allows us to enable the [`no-throw-literal`](http://eslint.org/docs/rules/no-throw-literal) ESLint rule.	2017-03-13 18:58:21 +01:00
Jonas Jenwald	6d672c4ba6	[api-minor] Add a `pdfjsNext` parameter, and `PDFJS_NEXT` build flag, to allow backwards incompatible API changes	2017-03-13 18:43:43 +01:00
Jonas Jenwald	224613a511	Merge pull request #8135 from jasonjensen/issue8097 Handle cff fonts with erroneous stackSize (issue 8097)	2017-03-11 09:55:00 +01:00
Tim van der Meij	fc5810c97a	Merge pull request #8144 from timvandermeij/issue-8143 Widget annotations: do not crash if `Parent` is not a dictionary during field name construction (issue 8143)	2017-03-10 00:40:13 +01:00
Tim van der Meij	936d3c0698	Widget annotations: do not crash if `Parent` is not a dictionary during field name construction (issue 8143)	2017-03-09 23:51:52 +01:00
Jason O. Jensen	d230784ac3	Handle cff fonts with erroneous stackSize	2017-03-06 19:28:46 -05:00
Tim van der Meij	4e3e97be8e	Merge pull request #8129 from Snuffleupagus/getInheritedPageProp-undefined Return `undefined` instead of `Dict.empty` from `Page.getInheritedPageProp` for non-existent properties to prevent possible future bugs	2017-03-04 16:18:24 +01:00
Yury Delendik	c290561488	Merge pull request #8120 from yurydelendik/lib Publishes processed sources into pdfjs-dist/lib	2017-03-04 08:48:36 -06:00
Jonas Jenwald	9bed87f5dc	Return `undefined` instead of `Dict.empty` from `Page.getInheritedPageProp` for non-existent properties to prevent possible future bugs This is something that I noticed while working on PR 8126, which is (more) fallout from PR 6065. In general, it's actually not correct to return `Dict.empty` as the default value for non-existent properties. Please note that a prior PR, see https://github.com/mozilla/pdf.js/pull/5957#issuecomment-103112698, asked for that behaviour but I don't think that's right. Obviously for properties that are (or should) be `Dict`s it makes sense, however certain properties can be e.g. Strings or Arrays instead. In the latter case, returning `Dict.empty` is just plain wrong, and it's quite fascinating that this hasn't caused any errors in practice. (The existing validation in the various getters has actually saved us here.) Also, when looking at this code again, it seemed unnecessary to duplicate the `MAX_LOOP_COUNT` check since we could just return immediately instead.	2017-03-04 13:08:39 +01:00
Tim van der Meij	1eb96d7ca9	Merge pull request #8128 from timvandermeij/csp-headers Network: use the current location to prevent errors when using CSP headers	2017-03-04 00:01:50 +01:00
Yury Delendik	e7cc07cc11	Moves checkProblematicCharRanges to font_spec.js	2017-03-03 16:33:35 -06:00
Job van der Weiden	a05115d2ec	Network: use the current location to prevent errors when using CSP headers When using content security headers to restrict connections to the same origin, you may not make connections to `example.com`. This feature detection also works with a request to the current location.	2017-03-03 23:18:51 +01:00
Jonas Jenwald	4a0ff5dbf7	Ensure that we don't ignore `0` values in `Page.getInheritedPageProp` (issue 8125) It appears that I accidentally broke this in PR 6065, sorry about that! The issue in this particular PDF file is that there's `/Rotate` entries on different levels of the `/Pages` tree. We're supposed to use the `/Rotate` entry in the `/Page` dict (which is `0`), but because of an incorrect condition we instead ended up with the one from the `/Pages` dict (which is `180`). Fixes 8125.	2017-03-03 12:27:40 +01:00
Jonas Jenwald	9163a6fba4	Merge pull request #8112 from Snuffleupagus/JS-action-newWindow Support the `newWindow` flag in white-listed `app.launchURL` JavaScript actions (PR 7794 follow-up)	2017-03-01 21:24:34 +01:00
Tim van der Meij	25f772a255	Merge pull request #8050 from yurydelendik/systemjs Replaces RequireJS to SystemJS.	2017-02-27 23:31:41 +01:00
Tim van der Meij	4e201d3787	Merge pull request #8072 from timvandermeij/annotation-append-operator-list Annotations: move operator list addition logic to `src/core/document.js`	2017-02-27 22:50:57 +01:00
Tim van der Meij	0739f90707	Annotations: move operator list addition logic to `src/core/document.js` Ideally, the `Annotation` class should not have anything to do with the page's operator list. How annotations are added to the page's operator list is logic that belongs in `src/core/document.js` instead where the operator list is constructed. Moreover, some comments have been added to clarify the intent of the code.	2017-02-27 22:17:49 +01:00
Tim van der Meij	9db4240b85	Merge pull request #8110 from timvandermeij/interactive-forms-choice-inherit-options Interactive forms: make choice widget options inheritable (issue 8094)	2017-02-27 22:14:25 +01:00
Jonas Jenwald	2a7e5b8a54	Support the `newWindow` flag in white-listed `app.launchURL` JavaScript actions (PR 7794 follow-up) A simple follow-up to PR 7794, which let's us add support for the `newWindow` parameter; refer to https://www.adobe.com/content/dam/Adobe/en/devnet/acrobat/pdfs/js_api_reference.pdf#G5.1507380. The patch also fixes an embarrassing oversight regarding the placement of the case-insensitive flag, and also allows arbitrary white-space at the beginning of JS actions.	2017-02-27 15:58:28 +01:00
Yury Delendik	5b50e0d414	Replaces RequireJS to SystemJS.	2017-02-27 08:32:39 -06:00
Tim van der Meij	8990de8614	Interactive forms: make choice widget options inheritable (issue 8094) Even though the PDF specification does not state that `Opt` fields are inheritable, in practice there are PDF generators that let annotations inherit the options from a parent.	2017-02-25 23:34:26 +01:00
Jonas Jenwald	14cc6acb90	Ensure that `Dict`s found in Object Streams are assigned an `objId` in `XRef.fetch` This fixes something that I noticed while working with the code in `Catalog.getPageDict` when debugging issue 8088. Note that while I don't have an example where this patch really matters, given that e.g. `PartialEvaluator.hasBlendModes` depends on the `objId` to avoid cyclic references this patch could potentially help for some PDF files.	2017-02-25 10:20:19 +01:00
Tim van der Meij	752510ffa0	Merge pull request #8107 from yurydelendik/init-via-port Init PDFWorker via MesssagePort.	2017-02-25 00:13:33 +01:00
Tim van der Meij	59392fd544	Merge pull request #8102 from yurydelendik/mv-compatibilty Move compatibility code to the shared/compatibility.js.	2017-02-24 22:47:49 +01:00
Yury Delendik	51767d63fe	Init PDFWorker via MesssagePort.	2017-02-24 13:33:18 -06:00
Jonas Jenwald	1ce295541c	Always check all Kids nodes, in `Catalog.getPageDict`, to avoid getting stuck in an empty node further down in the Pages tree (issue 8088) As discussed on IRC, we need to check all nodes at the bottom of the tree to ensure that we find the correct `Page` dict. Furthermore, this patch also gets rid of the caching present in a previous version, since it's not clear if that really helps. Note that this patch purposely adds an `eq` test, using a reduced test-case, so that we can be sure that the algorithm actually finds the correct `Page` dict for each `pageIndex`. Fixes 8088.	2017-02-24 12:09:46 +01:00
Yury Delendik	facefb0c79	Move compatibility code to the shared/compatibility.js.	2017-02-23 19:18:44 -06:00
Jonas Jenwald	9082f08e37	Enable running the `cmap` unit-tests on Travis by utilizing a `NodeCMapReaderFactory`	2017-02-17 23:15:36 +01:00
Yury Delendik	cfaa621a05	Merge pull request #8064 from Snuffleupagus/fetchBuiltInCMap [api-minor] Refactor fetching of built-in CMaps to utilize a factory on the `display` side instead, to allow users of the API to provide a custom CMap loading factory (e.g. for use with Node.js)	2017-02-17 15:30:31 -06:00
Brendan Dahl	425ad30912	Merge pull request #8071 from Snuffleupagus/bug-1337429 Always choose a (3, 1) cmap table for TrueType fonts that have an encoding specified, regardless of the Symbolic font flag (bug 1337429)	2017-02-16 15:13:46 -08:00
Jonas Jenwald	111419a64a	Cache built-in binary CMap files in the worker (issue 4794)	2017-02-16 10:55:39 +01:00

1 2 3 4 5 ...

2780 Commits