pdf.js

Author	SHA1	Message	Date
Jonas Jenwald	11408da340	Replace the `isInt` helper function with the native `Number.isInteger` function Follow-up to PR 8643.	2017-09-01 16:52:50 +02:00
Jonas Jenwald	772a5412a4	Avoid some redundant type checks in `XRef.fetchUncompressed` When looking briefly at using `Number.isInteger`/`Number.isNan` rather than `isInt`/`isNaN`, I noticed that there's a couple of not entirely straightforward cases to consider. At first I really couldn't understand why `parseInt` is being used like it is in `XRef.fetchUncompressed`, since the `num` and `gen` properties of an object reference should always be integers. However, doing a bit of code archaeology pointed to PR 4348, and it thus seem that this was a very deliberate change. Since I didn't want to inadvertently introduce any regressions, I've kept the `parseInt` calls intact but moved them to occur only when actually necessary.[1] Secondly, I noticed that there's a redundant `isCmd` check for an edge-case of broken operators. Since we're throwing a `FormatError` if `obj3` isn't a command, we don't need to repeat that check. In practice, this patch could perhaps be considered as a micro-optimization, but considering that `XRef.fetchUncompressed` can be called many thousand times when loading larger PDF documents these changes at least cannot hurt. --- [1] I even ran all tests locally, with an added `assert(Number.isInteger(obj1) && Number.isInteger(obj2));` check, and everything passed with flying colours. However, since it appears that this was in fact necessary at one point, one possible explanation is that the failing test-case(s) have now been replaced by reduced ones.	2017-08-31 16:49:04 +02:00
Tim van der Meij	a4cc85fc5f	Merge pull request #8828 from timvandermeij/es6-annotations Improve the annotation code by converting to ES6 syntax and removing duplicate code	2017-08-31 00:02:07 +02:00
Jonas Jenwald	49b8cd5a6a	Attempt to improve the `EI` detection heuristics, for inline images, in streams containing `NUL` bytes (issue 8823) Since this patch will now treat (some) `NUL` bytes as "ASCII", the number of `followingBytes` checked are thus increased to (hopefully) reduce the risk of introducing new false positives. Fixes 8823.	2017-08-27 12:48:28 +02:00
Tim van der Meij	2512eccbf0	Implement `getOperatorList` method in the `WidgetAnnotation` class to avoid duplication in subclasses	2017-08-27 01:02:41 +02:00
Tim van der Meij	4f02857394	Let the two annotation factories use static methods This corresponds to how other factories are implemented.	2017-08-27 01:02:40 +02:00
Tim van der Meij	24d741d045	Convert `src/core/annotation.js` to ES6 syntax	2017-08-27 00:53:45 +02:00
Jonas Jenwald	42f2d36d1f	Account for broken outlines/annotations, where the destination dictionary contains an invalid `/Dest` entry According to the specification, see http://www.adobe.com/content/dam/Adobe/en/devnet/acrobat/pdfs/PDF32000_2008.pdf#page=377, a `Dest` entry in an outline item should not contain a dictionary. Unsurprisingly there's PDF generators that completely ignore this, treating is an `A` entry instead. The patch also adds a little bit more validation code in `Catalog.parseDestDictionary`.	2017-08-26 17:38:15 +02:00
Jonas Jenwald	4660cf8238	Prevent an infinite loop in `XRef.readXRef` by keeping track of already parsed tables (bug 1393476) With this patch, not only is the infinite loop prevented, but we're also able to actually render the file (which e.g. Adobe Reader isn't able to). Fixes https://bugzilla.mozilla.org/show_bug.cgi?id=1393476.	2017-08-24 19:18:08 +02:00
Tim van der Meij	e9ba54940d	Merge pull request #8800 from Snuffleupagus/issue-8798 Try to recover if we reach the end of the stream when searching for the `EI` marker of an inline image (issue 8798)	2017-08-23 23:47:51 +02:00
Jonas Jenwald	ca936ee0c7	Merge pull request #8491 from janpe2/jbig2Halftone-2 JBIG2 halftone regions and pattern dictionaries	2017-08-23 00:13:43 +02:00
Jonas Jenwald	cb55506b95	Try to recover if we reach the end of the stream when searching for the `EI` marker of an inline image (issue 8798)	2017-08-22 09:33:13 +02:00
Jonas Jenwald	2112999db7	Fix caching of small inline images in `Parser.makeInlineImage` (issue 8790) Follow-up to PR 5445. Using the PDF file from issue 2618, i.e. http://bugzilla-attachments.gnome.org/attachment.cgi?id=226471, with the following manifest file: ```json [ { "id": "issue2618", "file": "../web/pdfs/issue2618.pdf", "md5": "", "rounds": 50, "type": "eq" } ] ``` I get the following results when comparing `master` against this patch: ``` browser \| stat \| Count \| Baseline(ms) \| Current(ms) \| +/- \| % \| Result(P<.05) ------- \| ------------ \| ----- \| ------------ \| ----------- \| ---- \| ------ \| ------------- firefox \| Overall \| 50 \| 4694 \| 3974 \| -721 \| -15.35 \| faster firefox \| Page Request \| 50 \| 2 \| 1 \| 0 \| -22.83 \| firefox \| Rendering \| 50 \| 4692 \| 3972 \| -720 \| -15.35 \| faster ``` So, based on these results, it seems like a fairly clear win to fix this broken caching :-)	2017-08-18 23:08:55 +02:00
Jonas Jenwald	563b68e74d	Remove manual clamping code in `src/core/jpx.js` Since we're now using `Uint8ClampedArray`, rather than `Uint8Array`, doing manual clamping shouldn't be necessary given that that is now handled natively. This shouldn't have any measurable performance impact, but just to sanity check that I've done some quick benchmarking with the following manifest file: ```json [ { "id": "S2-eq", "file": "pdfs/S2.pdf", "md5": "d0b6137846df6e0fe058f234a87fb588", "rounds": 100, "type": "eq" } ] ``` which gave the following results against the current `master` (repeated benchmark runs didn't result in any meaningful differences): ``` -- Grouped By browser, stat -- browser \| stat \| Count \| Baseline(ms) \| Current(ms) \| +/- \| % \| Result(P<.05) ------- \| ------------ \| ----- \| ------------ \| ----------- \| --- \| ----- \| ------------- firefox \| Overall \| 100 \| 592 \| 592 \| 1 \| 0.12 \| firefox \| Page Request \| 100 \| 3 \| 3 \| 0 \| -9.88 \| firefox \| Rendering \| 100 \| 588 \| 589 \| 1 \| 0.18 \| ```	2017-08-16 13:24:28 +02:00
Jonas Jenwald	f6636d6b19	Use `Uint8ClampedArray` when returning image data in `src/core/jbig2.js` and `src/core/jpg.js`	2017-08-16 13:24:28 +02:00
Jonas Jenwald	74ad90cb8f	Update the mask data inversion in `PDFImage.createMask` to be compatible with both `Uint8Array` and `Uint8ClampedArray`	2017-08-16 13:24:21 +02:00
Jonas Jenwald	d6cd5355f0	Use `Uint8ClampedArray`, when returning data, and remove manual clamping in `src/core/jpg.js` (issue 4901) This patch removes the `clamp0to255` helper function, as well as manual clamping code in `src/core/jpg.js`. The adjusted constants in `_convertCmykToRgb` were taken from CMYK to RGB conversion code found in `src/core/colorspace.js`. Please note: There will be some very slight movement in a number of existing test-cases, since `Uint8ClampedArray` appears to use `Math.round` (or equivalent) and the old code used (basically) `Math.floor`.	2017-08-14 16:19:57 +02:00
Jani Pehkonen	9a581ee9ed	Implement JBIG2 halftone regions and pattern dictionaries	2017-08-08 15:38:29 +03:00
Jonas Jenwald	093afd1212	Replace the `coded` property with `isType3Font` when building the font `properties` object in `PartialEvaluator.translateFont` This appears to simply have been forgotten in the re-factoring in PR 4815, where the `coded` property was renamed to the much more descriptive `isType3Font` property.	2017-08-08 14:03:02 +02:00
Jonas Jenwald	4729e96fb7	Remove leftover `args[0].code` checks from the `OPS.paintXObject` cases in evaluator.js From looking at blame, it seems that these checks became obsolete with PR 692 (which landed close to six years ago). Note how, after that PR, there's no longer anything being assigned to the `code` property of an Object.	2017-08-07 10:48:37 +02:00
Jonas Jenwald	ace9de6f7d	Merge pull request #8747 from brendandahl/first-cmap Fix two cmap related issues.	2017-08-04 14:11:12 +02:00
Brendan Dahl	0bef50d56d	Fix two cmap related issues. In issue #8707, there's a char code mapped to a non- existing glyph which shouldn't be drawn. However, we saw it was missing and tried to then use the post table and end up mapping it incorrectly. This illuminated a problem with issue #5704 and bug 893730 where glyphs disappeared after above fix. This was from the cmap returning the wrong glyph id. Which in turn was caused because the font had multiple of the same type of cmap table and we were choosing the last one. Now, we instead default to the first one. I'm unsure if we should instead be merging the multiple cmaps, but using only the first one works.	2017-08-03 22:19:36 -07:00
Yury Delendik	a1dfbec532	Properly cancel streams and guard at getTextContent.	2017-08-03 16:36:46 -05:00
Jonas Jenwald	e20d4a9c21	Merge pull request #8681 from brendandahl/glyph-ids Fix several issues with glyph id mappings (issue 8668, bug 1383504)	2017-08-03 14:25:34 +02:00
Brendan Dahl	5b7f712ca7	Merge pull request #8627 from yurydelendik/issue-8591 Fallback on font widths if CFF data is broken	2017-08-02 10:53:14 -07:00
Apoorv Mishra	a129de7bd1	Add unit-tests for colorspace.js Added unit-tests for DeviceGray, DeviceRGB and DeviceCMYK Added unit-tests for CalGray Added unit-tests for CalRGB Removed redundant code Added unit-tests for LabCS Added unit-tests for IndexedCS Update comment Change lookup to Uint8Array as mentioned in pdf specs(these tests will pass after PR #8666 is merged). Added unit-tests for AlternateCS Resolved code-style issues Fixed code-style issues Addressed issues pointed out in https://github.com/mozilla/pdf.js/pull/8611#pullrequestreview-52865469	2017-07-28 14:24:56 +05:30
Yury Delendik	343b4dc2b6	Merge pull request #8617 from mukulmishra18/network-streaming Adds Streams API support for networking task of PDF.js project.	2017-07-27 16:15:06 -05:00
Mukul Mishra	109106794d	Adds Streams API support for networking task of PDF.js project. network.js file moved to main thread and `PDFNetworkStream` implemented at worker thread, that is used to ask for data whenever worker needs.	2017-07-28 02:32:30 +05:30
Brendan Dahl	ac33358e1f	Fix several issues with glyph id mappings. The initial issue with #8255 was I added a missing glyphs check to adjustMapping, but this caused us to skip re-mapping a glyph if the fontCharCode was a missingGlyph which in turn caused us to overwrite a valid glyph id with an invalid one. While fixing this, I also added a warning if the private use area is full since this also accidentally happened when I made a different mistake. This brought to light a number of issues where we map missing glyphs to notdef, but often the notdef is actually defined and then ends up being drawn. Now the glyphs don't get mapped in toFontChar and so they are not drawn by the canvas. Fixing the above brought up another issue though in bug1050040.pdf. In this PDF, the font fails to load by the browser and before we were still drawing the glyphs because it looked like the font had them, but with the fixes above the glyphs showed up as missing so we didn't attempt draw them. To fix this, I now throw an error when the loca table is in really bad shape and we fall back to trying to use a system font. We now also use this fall back if there are any format errors during converting fonts.	2017-07-26 13:00:55 -07:00
Tim van der Meij	37ac8f8623	Merge pull request #8698 from Snuffleupagus/issue-8697 Add a fallback for non-embedded SegoeUISymbol font (issue 8697)	2017-07-25 22:35:52 +02:00
Tim van der Meij	44a5cec25e	Merge pull request #8666 from apoorv-mishra/fix-colorspace Fix TypeError that occurs in colorspace.js on accidentally passing an 'Array' instead of 'TypedArray'	2017-07-25 22:13:20 +02:00
Yury Delendik	c830021b07	Fixes CFF data glyph widths	2017-07-25 12:29:51 -05:00
Jonas Jenwald	23ec6b16ca	Add a fallback for non-embedded SegoeUISymbol font (issue 8697) The PDF file uses a non-embedded SegoeUISymbol font, which is not a standard font (and is mainly used by Microsoft, see https://en.wikipedia.org/wiki/Segoe). Fixes 8697.	2017-07-25 12:45:11 +02:00
Tim van der Meij	af71ea7a7d	Merge pull request #8673 from Snuffleupagus/api-pageMode [api-minor] Add support for PageMode in the API and viewer (issue 8657)	2017-07-23 13:17:07 +02:00
Tim van der Meij	e7cddcce28	Merge pull request #8684 from Snuffleupagus/rm-assert Remove most `assert()` calls (issue 8506)	2017-07-22 19:42:24 +02:00
Tim van der Meij	7ded895d0c	Merge pull request #8638 from Snuffleupagus/issue-4926-built-in-jpg In `src/core/jpg.js`, ensure that the Adobe JPEG marker always takes precedence, even when the color transform code is zero	2017-07-22 17:25:09 +02:00
Jonas Jenwald	814fa1dee3	Remove most `assert()` calls (issue 8506) This replaces `assert` calls with `throw new FormatError()`/`throw new Error()`. In a few places, throwing an `Error` (which is what `assert` meant) isn't correct since the enclosing function is supposed to return a `Promise`, hence some cases were changed to `Promise.reject(...)` and similarily for `createPromiseCapability` instances.	2017-07-21 18:51:02 +02:00
Apoorv Mishra	d14956d4b8	Fix TypeError that occurs in colorspace.js on accidentally passing an 'Array' instead of 'TypedArray' Fix TypeError that occurs in colorspace.js on accidentally passing an 'Array' instead of 'TypedArray' Changed getRgbItem(...) to getRgbBuffer(...) since this.lookup has values in range[0, 255] whereas getRgbItem(...) expects those to be in range [0, 1] Revert changes for IE9 compatibility	2017-07-21 01:15:05 +05:30
Jonas Jenwald	15f0963f51	Fix a typo, in the `Catalog.numPages` getter, than prevents shadowing from working correctly Looking at the blame, it seems that this typo was present even before PR 700 (almost six years ago). The result of using `'num'`, rather than the correct `'numPages'` string, is that the `Catalog.numPages` getter isn't actually being shadowed.	2017-07-20 12:35:09 +02:00
Jonas Jenwald	16c5d41c5b	[api-minor] Add support for PageMode in the API (issue 8657) Please refer to https://wwwimages2.adobe.com/content/dam/Adobe/en/devnet/pdf/pdfs/PDF32000_2008.pdf#page=82.	2017-07-19 16:40:03 +02:00
Jonas Jenwald	e2ea9b693c	In `src/core/jpg.js`, ensure that the Adobe JPEG marker always takes precedence, even when the color transform code is zero According to the PDF specification, please see http://www.adobe.com/content/dam/Adobe/en/devnet/acrobat/pdfs/PDF32000_2008.pdf#G6.2394361, if an Adobe JPEG marker is present it should always take precedence. This even seem to be consistent with the existing comment that is present in the code. Hence it seems reasonable to interpret `transformCode === 0` as no color conversion being necessary. Fixes the rendering of page 1 in `issue-4926` (from the test-suite), when the built-in `src/core/jpg.js` image decoder is used.	2017-07-11 17:08:30 +02:00
Yury Delendik	d028c26210	Removes error()	2017-07-07 09:40:24 -05:00
Jonas Jenwald	ea71d23f74	Fix a stupid spelling error in the `ASCII85Decode` name in `Parser.makeInlineImage` (issue 8613) This is a trivial follow-up to PR 5383, and it's a bit strange that this has been wrong since late 2014 without anyone noticing (maybe because inline images aren't too common). So, apparently code works better if you actually spell correctly, who knew ;-) Fixes 8613.	2017-07-05 19:43:09 +02:00
Yury Delendik	b3bac5100c	Merge pull request #8596 from mukulmishra18/proper-read-result Fixes wrong structure of fullReader.read() result.	2017-07-05 09:03:57 -05:00
Jonas Jenwald	eff257b820	Merge pull request #8580 from brendandahl/missing-glyf Fix how we detect and handle missing glyph data.	2017-07-04 12:16:07 +02:00
Brendan Dahl	9f5c1550ed	Merge pull request #8592 from brendandahl/cmap-3-0 Only mask char codes of (3, 0) cmap tables in the range of 0xF000 to 0…	2017-07-03 17:58:28 -07:00
Brendan Dahl	efbbd8533f	Only mask char codes of (3, 0) cmap tables in the range of 0xF000 to 0xF0FF.	2017-07-03 13:13:46 -07:00
Brendan Dahl	6d4f748fb1	Fix how we detect and handle missing glyph data.	2017-07-03 13:06:06 -07:00
Mukul Mishra	308a83e5ca	Fixes wrong structure of fullReader.read() result.	2017-07-01 15:52:47 +05:30
Jonas Jenwald	de0e7a9a68	Check that the `MessageHandler` isn't already terminated in the `onFailure` handler in `src/core/worker.js` (issue 8584) All other code-paths already checks that the `MessageHandler` isn't terminated, but apparently `onFailure` was missing that check (compare e.g. with the `onSuccess` function). From what I can tell, this is only an issue if workers are disabled, hence why I didn't bother adding a unit-test. Fixes 8584.	2017-06-30 10:11:13 +02:00

1 2 3 4 5 ...

1239 Commits