pdf.js

Author	SHA1	Message	Date
Jonas Jenwald	a159c4f357	Check that `this.baseUrl` is defined before attempting to fetch any data in `DOMCMapReaderFactory`/`NodeCMapReaderFactory`	2017-09-28 12:34:57 +02:00
Jonas Jenwald	7d3efe43a2	Ensure that the same exact version of PDF.js is used in both the API and the Worker I don't have a good example at hand right now, but I recall seeing custom deployments of PDF.js that bundle a specific version of the `build/pdf.js` file and then set `PDFJS.workerSrc` to point to https://mozilla.github.io/pdf.js/build/pdf.worker.js. That practice seems really bad since, besides (obviously) causing unnecessary server load, it will very quickly result in a version mismatch between the `pdf.js` and `pdf.worker.js` files in those PDF.js deployments. Such a version mismatch could easily lead to either breaking errors, or even worse slightly inconsistent behaviour for an API call (if the API -> Worker interface changes, which does happen from time to time). To avoid the problems described above, I'm thus proposing that we enforce that the versions of the `pdf.js` and `pdf.worker.js` files must always match.	2017-09-27 15:41:57 +02:00
Brendan Dahl	18e2321845	Overwrite maxSizeOfInstructions in maxp with computed value. In issue #7507 the value is less than the actual max size of the glyph instructions causing OTS to fail the font.	2017-09-25 17:53:26 -07:00
Jonas Jenwald	10727572a2	Merge pull request #8950 from timvandermeij/polygon-polyline-annotations Implement support for polyline and polygon annotations	2017-09-24 15:16:14 +02:00
Tim van der Meij	c69a7a83da	Merge pull request #8932 from janpe2/jbig2-sym-offset JBIG2 symbol offsets	2017-09-23 17:11:45 +02:00
Tim van der Meij	8ccad276b2	Implement support for polygon annotations	2017-09-23 16:52:47 +02:00
Tim van der Meij	99b17a494d	Implement support for polyline annotations	2017-09-23 16:37:23 +02:00
Tim van der Meij	40b89e9ba4	Merge pull request #8949 from Snuffleupagus/ColorSpace-rm-instanceof-AlternateCS Remove the `instanceof AlternateCS` check in `ColorSpace.parse` since it's dead code	2017-09-23 16:09:17 +02:00
Tim van der Meij	2aac994171	Merge pull request #8928 from mukulmishra18/decode-file-path Fix #8907: Decode URL to get correct path in node_stream.	2017-09-23 14:44:58 +02:00
Jonas Jenwald	8a084aff0f	Remove the `instanceof AlternateCS` check in `ColorSpace.parse` since it's dead code Looking at `ColorSpace.parseToIR`, it will do one of the following things when called: 1. Return a String. 2. Return an Array. 3. Throw a `FormatError`. 4. In one case, return the result of another `ColorSpace.parseToIR` call. However, under no circumstances will it ever return an `AlternateCS` instance. Since it's often useful to understand why code, which has become unused, existed in the first place, let's grab a hard hat and a shovel and start digging through the history of this code :-) The current condition was introduced in commit `c198ec4323`, in PR 794, but it was actually already obsolete by that time. The preceding `instanceof SeparationCS` condition predates commit `a7278b7fbc`, in PR 700. That condition was originally introduced all the way back in commit `4e3f87b60c`, in PR 692. However, it was made obsolete by commit `9dcefe1efc`, which is included in the very same PR! Hence we're left with the conclusion that not only has this code been unused for almost six years, it was basically never used at all save for a few refactoring commits that're part of PR 692.	2017-09-23 14:36:10 +02:00
Tim van der Meij	d7b37ae745	Merge pull request #8912 from timvandermeij/xml-parser [api-minor] Replace `DOMParser` with `SimpleXMLParser`	2017-09-20 23:45:00 +02:00
Jonas Jenwald	abc864fca9	Merge pull request #8938 from brendandahl/bug1392647 Use font's default width even when 0. (bug 1392647)	2017-09-20 22:38:39 +02:00
Brendan Dahl	10ba292b46	Use font's default width even when 0. Bug 1392647 has a PDF where the default width of the font is 0. It draws some charcodes that don't have glyphs, but we were wrongly using the 1000 default width for these charcodes causing some text to be overlapping.	2017-09-20 11:38:30 -07:00
Tim van der Meij	d4309614f9	Replace `DOMParser` with `SimpleXMLParser` The `DOMParser` is most likely overkill and may be less secure. Moreover, it is not supported in Node.js environments. This patch replaces the `DOMParser` with a simple XML parser. This should be faster and gives us Node.js support for free. The simple XML parser is a port of the one that existed in the examples folder with a small regex fix to make the parsing work correctly. The unit tests are extended for increased test coverage of the metadata code. The new method `getAll` is provided so the example does not have to access internal properties of the object anymore.	2017-09-19 23:09:07 +02:00
Jani Pehkonen	5d1074c110	Fix JBIG2 symbol offsets in text regions	2017-09-19 23:43:23 +03:00
Tim van der Meij	bc9afdf3c4	Convert `src/display/metadata.js` to ES6 syntax	2017-09-19 22:13:59 +02:00
Jani Pehkonen	3d99b8d706	CCITTFaxStream problem when EndOfBlock is false	2017-09-19 22:19:40 +03:00
Mukul Mishra	e4c09c7cba	Fix #8907 : Decode URL to get correct path in node_stream.	2017-09-19 15:10:36 +05:30
Tilman Hausherr	d75a497a6b	support tiff predictor for 16bit (for issue #6289) This does the same for 16 bit as the existing 8 bit tiff predictor code, an addition of the last word to this word. The last two "& 0xFF" may or may not be needed, I see this isn't done in the 8 bit code, but I'm not a JS developer.	2017-09-18 22:24:25 +02:00
Tim van der Meij	400e4aae0e	Implement support for stamp annotations	2017-09-16 16:37:50 +02:00
Tim van der Meij	3be941d982	Merge pull request #8909 from Snuffleupagus/PDFFunction-isEvalSupported Check `isEvalSupported`, and test that `eval` is actually supported, before attempting to use the `PostScriptCompiler` (issue 5573)	2017-09-16 16:11:03 +02:00
Jonas Jenwald	eece66fa3e	For /Filter entries containing `Name`s, ignore the /DecodeParms entry if it contains an Array (issue 8895)	2017-09-15 23:02:16 +02:00
Jonas Jenwald	dc926ffc0f	Check `isEvalSupported`, and test that `eval` is actually supported, before attempting to use the `PostScriptCompiler` (issue 5573) Currently `PDFFunction` is implemented (basically) like a class with only `static` methods. Since it's used directly in a number of different `src/core/` files, attempting to pass in `isEvalSupported` would result in code that's very messy, not to mention difficult to maintain (since every single `PDFFunction` method call would need to include a `isEvalSupported` argument). Rather than having to wait for a possible re-factoring of `PDFFunction` that would avoid the above problems by design, it probably makes sense to at least set `isEvalSupported` globally for `PDFFunction`. Please note that there's one caveat with this solution: If `PDFJS.getDocument` is used to open multiple files simultaneously, with different `PDFJS.isEvalSupported` values set before each call, then the last one will always win. However, that seems like enough of an edge-case that we shouldn't have to worry about it. Besides, since we'll also test that `eval` is actually supported, it should be fine. Fixes 5573.	2017-09-15 12:02:45 +02:00
Mukul Mishra	ef7038fe34	Fix #8888 : Change behaviour of fetch to make it compatible with XHR.	2017-09-14 23:53:06 +05:30
Jonas Jenwald	f2618eb2e4	Merge pull request #8808 from janpe2/issue8741 Fix color of image masks inside uncolored patterns	2017-09-12 14:27:56 +02:00
Tim van der Meij	320779e6ed	Merge pull request #8691 from timvandermeij/square-circle-annotations Implement support for square and circle annotations	2017-09-09 22:56:54 +02:00
Tim van der Meij	44c116ac49	Implement support for circle annotations	2017-09-09 21:36:27 +02:00
Tim van der Meij	cace2e9047	Implement support for square annotations	2017-09-09 21:36:27 +02:00
Tim van der Meij	f7fd1db52f	Introduce `DOMSVGFactory` This patch provides a new unit tested factory for creating SVG containers and elements. This code is duplicated twice in the codebase, but with upcoming changes this would need to be duplicated even more. Moreover, consolidating this code in one factory allows us to replace it easily for e.g., supporting Node.js. Therefore, move this to a central place and update/ES6-ify the related code. Finally, we replace `setAttributeNS` with `setAttribute` because no namespace is provided.	2017-09-09 21:36:27 +02:00
Tim van der Meij	437e9cb056	Merge pull request #8865 from Snuffleupagus/hide-unsupported-LinkAnnotation Hide unsupported `LinkAnnotation`s (issue 3897)	2017-09-09 19:07:43 +02:00
Jonas Jenwald	8686baede5	Replace `value === (value | 0)` checks with `Number.isInteger(value)` in the `src/` folder Rather than doing what (at first) may seem like a fairly obscure comparison, using `Number.isInteger` will clearly indicate the intent of the code.	2017-09-09 14:12:52 +02:00
Jonas Jenwald	7115e136e4	Hide unsupported `LinkAnnotation`s (issue 3897) Rather than displaying links that do nothing when clicked, it probably makes more sense to simply not render them instead. Especially since it turns out that, at least at this point in time, this is very easy to both implement and test. Fixes 3897.	2017-09-06 12:52:56 +02:00
Jani Pehkonen	86020396cb	Fix color of image masks inside uncolored patterns	2017-09-06 13:41:48 +03:00
Jonas Jenwald	41415ba0a2	Correctly validate the response status for non-HTTP fetch requests (PR 8768 follow-up) It seems that the status check, for non-HTTP loads, causes the default viewer to refuse to open local PDF files. *STR:* 1. Make sure that fetch support is enabled in the browser. In Firefox Nightly, set `dom.streams.enabled = true` and `javascript.options.streams = true` in `about:config`. 2. Open https://mozilla.github.io/pdf.js/web/viewer.html. 3. Click on the "Open file" button, and open a new PDF file. *ER:* A new PDF file should open in the viewer. *AR:* The PDF file fails to open, with an error message of the following format: `Message: Unexpected server response (200) while retrieving PDF "blob:https://mozilla.github.io/a4fc455f-bc05-45b5-b6aa-2ecff3cb45ce".`	2017-09-05 17:07:44 +02:00
Jonas Jenwald	cfb4955a92	Replace the `isArray` helper function with the native `Array.isArray` function Follow-up to PR 8813.	2017-09-01 20:27:13 +02:00
Jonas Jenwald	11408da340	Replace the `isInt` helper function with the native `Number.isInteger` function Follow-up to PR 8643.	2017-09-01 16:52:50 +02:00
Tim van der Meij	d332f62d60	Merge pull request #8857 from Snuffleupagus/fetchUncompressed-type-checks Avoid some redundant type checks in `XRef.fetchUncompressed`	2017-08-31 23:33:02 +02:00
Tim van der Meij	51be27853f	Merge pull request #8847 from Snuffleupagus/AnnotationElement-isRenderable-regression Correct the default value for `isRenderable` in the `AnnotationElement` constructor, to fix breaking errors when rendering unsupported annotations	2017-08-31 22:08:10 +02:00
Jonas Jenwald	772a5412a4	Avoid some redundant type checks in `XRef.fetchUncompressed` When looking briefly at using `Number.isInteger`/`Number.isNaN` rather than `isInt`/`isNaN`, I noticed that there's a couple of not entirely straightforward cases to consider. At first I really couldn't understand why `parseInt` is being used like it is in `XRef.fetchUncompressed`, since the `num` and `gen` properties of an object reference should always be integers. However, doing a bit of code archaeology pointed to PR 4348, and it thus seems that this was a very deliberate change. Since I didn't want to inadvertently introduce any regressions, I've kept the `parseInt` calls intact but moved them to occur only when actually necessary.[1] Secondly, I noticed that there's a redundant `isCmd` check for an edge-case of broken operators. Since we're throwing a `FormatError` if `obj3` isn't a command, we don't need to repeat that check. In practice, this patch could perhaps be considered as a micro-optimization, but considering that `XRef.fetchUncompressed` can be called many thousand times when loading larger PDF documents these changes at least cannot hurt. --- [1] I even ran all tests locally, with an added `assert(Number.isInteger(obj1) && Number.isInteger(obj2));` check, and everything passed with flying colours. However, since it appears that this was in fact necessary at one point, one possible explanation is that the failing test-case(s) have now been replaced by reduced ones.	2017-08-31 16:49:04 +02:00
Jonas Jenwald	84fe442b35	Correctly set the `credentials` of a fetch request, when the `withCredentials` parameter was passed to `getDocument` Skimming through https://developer.mozilla.org/en-US/docs/Web/API/Fetch_API/Using_Fetch#Sending_a_request_with_credentials_included, it looks to me like the `credentials` option was accidentally inverted.	2017-08-31 09:20:05 +02:00
Jonas Jenwald	87fc9bafea	Correct the default value for `isRenderable` in the `AnnotationElement` constructor, to fix breaking errors when rendering unsupported annotations This regressed in PR 8828. When attempting to open e.g. http://mirrors.ctan.org/macros/latex/contrib/pdfcomment/doc/example.pdf, the annotation layers are now missing since `Error: Abstract method `AnnotationElement.render` called` is thrown multiple times.	2017-08-31 08:47:36 +02:00
Tim van der Meij	a4cc85fc5f	Merge pull request #8828 from timvandermeij/es6-annotations Improve the annotation code by converting to ES6 syntax and removing duplicate code	2017-08-31 00:02:07 +02:00
Yury Delendik	cd95b426c7	Disables fetch when ReadableStream is not available.	2017-08-30 10:53:59 -05:00
Yury Delendik	3cff7da0e7	Fixes fetch and node behavior when disableAutoFetch and disableStream are used.	2017-08-30 10:53:38 -05:00
Mukul Mishra	3516a59384	Adds fetch stream logic for networking part of PDF.js	2017-08-29 22:56:48 +05:30
Jonas Jenwald	49b8cd5a6a	Attempt to improve the `EI` detection heuristics, for inline images, in streams containing `NUL` bytes (issue 8823) Since this patch will now treat (some) `NUL` bytes as "ASCII", the number of `followingBytes` checked are thus increased to (hopefully) reduce the risk of introducing new false positives. Fixes 8823.	2017-08-27 12:48:28 +02:00
Tim van der Meij	2512eccbf0	Implement `getOperatorList` method in the `WidgetAnnotation` class to avoid duplication in subclasses	2017-08-27 01:02:41 +02:00
Tim van der Meij	4f02857394	Let the two annotation factories use static methods This corresponds to how other factories are implemented.	2017-08-27 01:02:40 +02:00
Tim van der Meij	af10f8b586	Convert `src/display/annotation_layer.js` to ES6 syntax	2017-08-27 01:02:40 +02:00
Tim van der Meij	24d741d045	Convert `src/core/annotation.js` to ES6 syntax	2017-08-27 00:53:45 +02:00
Rob Wu	7cc7260634	Merge pull request #8796 from timvandermeij/svg-text-rise Implement text rise for the SVG back-end	2017-08-26 19:02:51 +02:00
Jonas Jenwald	42f2d36d1f	Account for broken outlines/annotations, where the destination dictionary contains an invalid `/Dest` entry According to the specification, see http://www.adobe.com/content/dam/Adobe/en/devnet/acrobat/pdfs/PDF32000_2008.pdf#page=377, a `Dest` entry in an outline item should not contain a dictionary. Unsurprisingly there are PDF generators that completely ignore this, treating it as an `A` entry instead. The patch also adds a little bit more validation code in `Catalog.parseDestDictionary`.	2017-08-26 17:38:15 +02:00
Tim van der Meij	d368a5baed	Merge pull request #8817 from mukulmishra18/intermittent-error Set this.isCancelled in close method of streamSink.	2017-08-25 00:31:44 +02:00
Jonas Jenwald	88167b5e38	Merge pull request #8824 from Snuffleupagus/bug-1393476 Prevent an infinite loop in `XRef.readXRef` by keeping track of already parsed tables (bug 1393476)	2017-08-24 22:13:48 +02:00
Jonas Jenwald	4660cf8238	Prevent an infinite loop in `XRef.readXRef` by keeping track of already parsed tables (bug 1393476) With this patch, not only is the infinite loop prevented, but we're also able to actually render the file (which e.g. Adobe Reader isn't able to). Fixes https://bugzilla.mozilla.org/show_bug.cgi?id=1393476.	2017-08-24 19:18:08 +02:00
Yury Delendik	e82811adb4	Merge pull request #8712 from mukulmishra18/node_stream Adds node.js logic for networking tasks for PDF.js	2017-08-24 11:35:29 -05:00
Mukul Mishra	efad0c7a40	Set this.isCancelled in close method of streamSink.	2017-08-24 13:51:27 +05:30
Mukul Mishra	d16709f5e4	Adds tests for node_stream	2017-08-24 12:46:44 +05:30
Tim van der Meij	e9ba54940d	Merge pull request #8800 from Snuffleupagus/issue-8798 Try to recover if we reach the end of the stream when searching for the `EI` marker of an inline image (issue 8798)	2017-08-23 23:47:51 +02:00
Yury Delendik	438c0b28f2	Use Array.isArray in the LoopbackPort.	2017-08-23 09:25:25 -05:00
Mukul Mishra	18ede8c65d	Adds http support to node_stream logic	2017-08-23 14:08:41 +05:30
Mukul Mishra	ed78b23ff2	Adds node.js logic for networking tasks for PDF.js	2017-08-23 14:06:43 +05:30
Jonas Jenwald	cb10c03d0a	Merge pull request #8812 from yurydelendik/central-global Moves global scope out of shared/util.	2017-08-23 09:36:13 +02:00
Yury Delendik	57bc3296f4	Moves global scope out of shared/util.	2017-08-22 18:20:52 -05:00
Tim van der Meij	cfc052a515	Implement text rise for the SVG back-end The property and the setter for text rise were already present, but they were never used or called. This patch completes the implementation by calling the setter when the operator is encountered and by using the text rise value when rendering text.	2017-08-23 00:34:39 +02:00
Jonas Jenwald	ca936ee0c7	Merge pull request #8491 from janpe2/jbig2Halftone-2 JBIG2 halftone regions and pattern dictionaries	2017-08-23 00:13:43 +02:00
Jonas Jenwald	cb55506b95	Try to recover if we reach the end of the stream when searching for the `EI` marker of an inline image (issue 8798)	2017-08-22 09:33:13 +02:00
Jonas Jenwald	2112999db7	Fix caching of small inline images in `Parser.makeInlineImage` (issue 8790) Follow-up to PR 5445. Using the PDF file from issue 2618, i.e. http://bugzilla-attachments.gnome.org/attachment.cgi?id=226471, with the following manifest file: ```json [ { "id": "issue2618", "file": "../web/pdfs/issue2618.pdf", "md5": "", "rounds": 50, "type": "eq" } ] ``` I get the following results when comparing `master` against this patch: ``` browser | stat | Count | Baseline(ms) | Current(ms) | +/- | % | Result(P<.05) ------- | ------------ | ----- | ------------ | ----------- | ---- | ------ | ------------- firefox | Overall | 50 | 4694 | 3974 | -721 | -15.35 | faster firefox | Page Request | 50 | 2 | 1 | 0 | -22.83 | firefox | Rendering | 50 | 4692 | 3972 | -720 | -15.35 | faster ``` So, based on these results, it seems like a fairly clear win to fix this broken caching :-)	2017-08-18 23:08:55 +02:00
Tim van der Meij	50e10fdafc	Merge pull request #8785 from Rob--W/svg-ignore-missing-glyph SVG: Don't render missing glyphs	2017-08-17 23:40:57 +02:00
Rob Wu	f07ce2bbc2	SVG: Don't render missing glyphs This bug is similar to the canvas bug of #6721. I found this bug when I tried to run pdf2svg on a SVG file, and the generated SVG could not be viewed in Chrome due to a SVG/XML parsing error: "PCDATA invalid Char value 3" Reduced test case: - https://github.com/mozilla/pdf.js/files/1229507/pcdatainvalidchar.pdf - expected: "hardware performance" - Actual SVG source: "hardware\x03performance" (where "\x03" is a non-printable character, and invalid XML). In terms of rendering, this bug is similar to #6721, where an unexpected glyph appeared in the canvas renderer. This was fixed by #7023, which skips over missing glyphs. This commit follows a similar logic. The test case from #6721 can be used here too: - https://github.com/mozilla/pdf.js/files/52205/issue6721_reduced.pdf expected: "Issue 6721" actual (before this patch): "Issue ààà6721"	2017-08-16 23:49:55 +02:00
Jonas Jenwald	563b68e74d	Remove manual clamping code in `src/core/jpx.js` Since we're now using `Uint8ClampedArray`, rather than `Uint8Array`, doing manual clamping shouldn't be necessary given that that is now handled natively. This shouldn't have any measurable performance impact, but just to sanity check that I've done some quick benchmarking with the following manifest file: ```json [ { "id": "S2-eq", "file": "pdfs/S2.pdf", "md5": "d0b6137846df6e0fe058f234a87fb588", "rounds": 100, "type": "eq" } ] ``` which gave the following results against the current `master` (repeated benchmark runs didn't result in any meaningful differences): ``` -- Grouped By browser, stat -- browser | stat | Count | Baseline(ms) | Current(ms) | +/- | % | Result(P<.05) ------- | ------------ | ----- | ------------ | ----------- | --- | ----- | ------------- firefox | Overall | 100 | 592 | 592 | 1 | 0.12 | firefox | Page Request | 100 | 3 | 3 | 0 | -9.88 | firefox | Rendering | 100 | 588 | 589 | 1 | 0.18 | ```	2017-08-16 13:24:28 +02:00
Jonas Jenwald	f6636d6b19	Use `Uint8ClampedArray` when returning image data in `src/core/jbig2.js` and `src/core/jpg.js`	2017-08-16 13:24:28 +02:00
Jonas Jenwald	74ad90cb8f	Update the mask data inversion in `PDFImage.createMask` to be compatible with both `Uint8Array` and `Uint8ClampedArray`	2017-08-16 13:24:21 +02:00
Jonas Jenwald	d6cd5355f0	Use `Uint8ClampedArray`, when returning data, and remove manual clamping in `src/core/jpg.js` (issue 4901) This patch removes the `clamp0to255` helper function, as well as manual clamping code in `src/core/jpg.js`. The adjusted constants in `_convertCmykToRgb` were taken from CMYK to RGB conversion code found in `src/core/colorspace.js`. Please note: There will be some very slight movement in a number of existing test-cases, since `Uint8ClampedArray` appears to use `Math.round` (or equivalent) and the old code used (basically) `Math.floor`.	2017-08-14 16:19:57 +02:00
Jonas Jenwald	be36c60e0f	Polyfill `Uint8ClampedArray` using core-js https://github.com/zloirock/core-js	2017-08-14 16:19:55 +02:00
Tim van der Meij	903f372e3d	Merge pull request #8762 from Snuffleupagus/evaluator-coded-isType3Font Replace the `coded` property with `isType3Font` when building the font `properties` object in `PartialEvaluator.translateFont`	2017-08-08 22:02:28 +02:00
Tobias Schneider	da44d10af1	Fallback to plain object for globalScope.	2017-08-08 09:17:48 -07:00
Jani Pehkonen	9a581ee9ed	Implement JBIG2 halftone regions and pattern dictionaries	2017-08-08 15:38:29 +03:00
Jonas Jenwald	093afd1212	Replace the `coded` property with `isType3Font` when building the font `properties` object in `PartialEvaluator.translateFont` This appears to simply have been forgotten in the re-factoring in PR 4815, where the `coded` property was renamed to the much more descriptive `isType3Font` property.	2017-08-08 14:03:02 +02:00
Jonas Jenwald	4729e96fb7	Remove leftover `args[0].code` checks from the `OPS.paintXObject` cases in evaluator.js From looking at blame, it seems that these checks became obsolete with PR 692 (which landed close to six years ago). Note how, after that PR, there's no longer anything being assigned to the `code` property of an Object.	2017-08-07 10:48:37 +02:00
Jonas Jenwald	ace9de6f7d	Merge pull request #8747 from brendandahl/first-cmap Fix two cmap related issues.	2017-08-04 14:11:12 +02:00
Jonas Jenwald	bbf5b4d706	Merge pull request #8745 from yurydelendik/cancel-stream Properly cancel streams and guard at getTextContent.	2017-08-04 13:00:31 +02:00
Brendan Dahl	0bef50d56d	Fix two cmap related issues. In issue #8707, there's a char code mapped to a non-existing glyph which shouldn't be drawn. However, we saw it was missing and tried to then use the post table and end up mapping it incorrectly. This illuminated a problem with issue #5704 and bug 893730 where glyphs disappeared after above fix. This was from the cmap returning the wrong glyph id. Which in turn was caused because the font had multiple of the same type of cmap table and we were choosing the last one. Now, we instead default to the first one. I'm unsure if we should instead be merging the multiple cmaps, but using only the first one works.	2017-08-03 22:19:36 -07:00
Yury Delendik	a1dfbec532	Properly cancel streams and guard at getTextContent.	2017-08-03 16:36:46 -05:00
Yury Delendik	5b5781b45d	Merge pull request #8738 from ChenMachluf/remove_workerPort_after_PDFWorker_destroy Delete workerPort to PDFWorker cache after PDFWorker destroy	2017-08-03 15:49:40 -05:00
Yury Delendik	6beb925f0b	Checks Edge support for streams.	2017-08-03 08:48:51 -05:00
Jonas Jenwald	e20d4a9c21	Merge pull request #8681 from brendandahl/glyph-ids Fix several issues with glyph id mappings (issue 8668, bug 1383504)	2017-08-03 14:25:34 +02:00
Chen Machluf	9b1b160d4f	remove PDFWorker from cache after destroy	2017-08-02 23:48:42 +03:00
Brendan Dahl	5b7f712ca7	Merge pull request #8627 from yurydelendik/issue-8591 Fallback on font widths if CFF data is broken	2017-08-02 10:53:14 -07:00
Mukul Mishra	00e026ebcd	Reduces the amount of data sent via GetDocRequest.	2017-07-30 00:00:03 +05:30
Tim van der Meij	783d42ec2b	Merge pull request #8611 from apoorv-mishra/colorspace-tests Add unit-tests for colorspace.js	2017-07-29 00:14:03 +02:00
Yury Delendik	01b47d9012	Use streams-lib as polyfill	2017-07-28 11:54:33 -05:00
Apoorv Mishra	a129de7bd1	Add unit-tests for colorspace.js Added unit-tests for DeviceGray, DeviceRGB and DeviceCMYK Added unit-tests for CalGray Added unit-tests for CalRGB Removed redundant code Added unit-tests for LabCS Added unit-tests for IndexedCS Update comment Change lookup to Uint8Array as mentioned in pdf specs(these tests will pass after PR #8666 is merged). Added unit-tests for AlternateCS Resolved code-style issues Fixed code-style issues Addressed issues pointed out in https://github.com/mozilla/pdf.js/pull/8611#pullrequestreview-52865469	2017-07-28 14:24:56 +05:30
Yury Delendik	343b4dc2b6	Merge pull request #8617 from mukulmishra18/network-streaming Adds Streams API support for networking task of PDF.js project.	2017-07-27 16:15:06 -05:00
Mukul Mishra	109106794d	Adds Streams API support for networking task of PDF.js project. network.js file moved to main thread and `PDFNetworkStream` implemented at worker thread, that is used to ask for data whenever worker needs.	2017-07-28 02:32:30 +05:30
Brendan Dahl	ac33358e1f	Fix several issues with glyph id mappings. The initial issue with #8255 was I added a missing glyphs check to adjustMapping, but this caused us to skip re-mapping a glyph if the fontCharCode was a missingGlyph which in turn caused us to overwrite a valid glyph id with an invalid one. While fixing this, I also added a warning if the private use area is full since this also accidentally happened when I made a different mistake. This brought to light a number of issues where we map missing glyphs to notdef, but often the notdef is actually defined and then ends up being drawn. Now the glyphs don't get mapped in toFontChar and so they are not drawn by the canvas. Fixing the above brought up another issue though in bug1050040.pdf. In this PDF, the font fails to load by the browser and before we were still drawing the glyphs because it looked like the font had them, but with the fixes above the glyphs showed up as missing so we didn't attempt to draw them. To fix this, I now throw an error when the loca table is in really bad shape and we fall back to trying to use a system font. We now also use this fall back if there are any format errors during converting fonts.	2017-07-26 13:00:55 -07:00
Tim van der Meij	37ac8f8623	Merge pull request #8698 from Snuffleupagus/issue-8697 Add a fallback for non-embedded SegoeUISymbol font (issue 8697)	2017-07-25 22:35:52 +02:00
Tim van der Meij	44a5cec25e	Merge pull request #8666 from apoorv-mishra/fix-colorspace Fix TypeError that occurs in colorspace.js on accidentally passing an 'Array' instead of 'TypedArray'	2017-07-25 22:13:20 +02:00
Yury Delendik	c830021b07	Fixes CFF data glyph widths	2017-07-25 12:29:51 -05:00
Jonas Jenwald	23ec6b16ca	Add a fallback for non-embedded SegoeUISymbol font (issue 8697) The PDF file uses a non-embedded SegoeUISymbol font, which is not a standard font (and is mainly used by Microsoft, see https://en.wikipedia.org/wiki/Segoe). Fixes 8697.	2017-07-25 12:45:11 +02:00
Mukul Mishra	568b0b6a42	Adds ready capability rejection logic for stream sink.	2017-07-25 02:07:38 +05:30
Tim van der Meij	af71ea7a7d	Merge pull request #8673 from Snuffleupagus/api-pageMode [api-minor] Add support for PageMode in the API and viewer (issue 8657)	2017-07-23 13:17:07 +02:00
Tim van der Meij	e7cddcce28	Merge pull request #8684 from Snuffleupagus/rm-assert Remove most `assert()` calls (issue 8506)	2017-07-22 19:42:24 +02:00
Tim van der Meij	7ded895d0c	Merge pull request #8638 from Snuffleupagus/issue-4926-built-in-jpg In `src/core/jpg.js`, ensure that the Adobe JPEG marker always takes precedence, even when the color transform code is zero	2017-07-22 17:25:09 +02:00
Jonas Jenwald	814fa1dee3	Remove most `assert()` calls (issue 8506) This replaces `assert` calls with `throw new FormatError()`/`throw new Error()`. In a few places, throwing an `Error` (which is what `assert` meant) isn't correct since the enclosing function is supposed to return a `Promise`, hence some cases were changed to `Promise.reject(...)` and similarly for `createPromiseCapability` instances.	2017-07-21 18:51:02 +02:00
Apoorv Mishra	d14956d4b8	Fix TypeError that occurs in colorspace.js on accidentally passing an 'Array' instead of 'TypedArray' Fix TypeError that occurs in colorspace.js on accidentally passing an 'Array' instead of 'TypedArray' Changed getRgbItem(...) to getRgbBuffer(...) since this.lookup has values in range[0, 255] whereas getRgbItem(...) expects those to be in range [0, 1] Revert changes for IE9 compatibility	2017-07-21 01:15:05 +05:30
Jonas Jenwald	15f0963f51	Fix a typo, in the `Catalog.numPages` getter, that prevents shadowing from working correctly Looking at the blame, it seems that this typo was present even before PR 700 (almost six years ago). The result of using `'num'`, rather than the correct `'numPages'` string, is that the `Catalog.numPages` getter isn't actually being shadowed.	2017-07-20 12:35:09 +02:00
Jonas Jenwald	16c5d41c5b	[api-minor] Add support for PageMode in the API (issue 8657) Please refer to https://wwwimages2.adobe.com/content/dam/Adobe/en/devnet/pdf/pdfs/PDF32000_2008.pdf#page=82.	2017-07-19 16:40:03 +02:00
Yury Delendik	ca3c08f12b	Merge pull request #8620 from Rob--W/issue-8560-improve-png-compression Improve compression of PNG images embedded in generated SVG files	2017-07-14 07:23:38 -05:00
Tim van der Meij	26be1df5f7	Merge pull request #8641 from Snuffleupagus/eslint-version-4-upgrade Update ESLint (and eslint-plugin-mozilla) to the latest version	2017-07-14 14:17:42 +02:00
Jonas Jenwald	f2270252c7	Add `Number.isNaN` and `Number.isInteger` polyfills in compatibility.js, since the Streams polyfill relies on them Without this, the Streams polyfill will fail in Internet Explorer when the code-paths containing these methods are used.	2017-07-13 12:02:14 +02:00
Jonas Jenwald	6f3565e638	Update ESLint (and eslint-plugin-mozilla) to the latest version	2017-07-12 13:14:25 +02:00
Jonas Jenwald	e2ea9b693c	In `src/core/jpg.js`, ensure that the Adobe JPEG marker always takes precedence, even when the color transform code is zero According to the PDF specification, please see http://www.adobe.com/content/dam/Adobe/en/devnet/acrobat/pdfs/PDF32000_2008.pdf#G6.2394361, if an Adobe JPEG marker is present it should always take precedence. This even seem to be consistent with the existing comment that is present in the code. Hence it seems reasonable to interpret `transformCode === 0` as no color conversion being necessary. Fixes the rendering of page 1 in `issue-4926` (from the test-suite), when the built-in `src/core/jpg.js` image decoder is used.	2017-07-11 17:08:30 +02:00
Rob Wu	01f03fe393	Optimize PNG compression in SVG backend on Node.js Use the environment's zlib implementation if available to get reasonably-sized SVG files when an XObject image is converted to PNG. The generated PNG is not optimal because we do not use a PNG predictor. Further, when our SVG backend is run in a browser, the generated PNG images will still be unnecessarily large (though the use of blob:-URLs when available should reduce the impact on memory usage). If we want to optimize PNG images in browsers too, we can either try to use a DEFLATE library such as pako, or re-use our XObject image painting logic in src/display/canvas.js. This potential improvement is not implemented by this commit. Tested with: - Node.js 8.1.3 (uses zlib) - Node.js 0.11.12 (uses zlib) - Node.js 0.10.48 (falls back to inferior existing implementation). - Chrome 59.0.3071.86 - Firefox 54.0 Tests: Unit test on Node.js: ``` $ gulp lib $ JASMINE_CONFIG_PATH=test/unit/clitests.json node ./node_modules/.bin/jasmine --filter=SVG ``` Unit test in browser: Run `gulp server` and open http://localhost:8888/test/unit/unit_test.html?spec=SVGGraphics To verify that the patch works as desired, ``` $ node examples/node/pdf2svg.js test/pdfs/xobject-image.pdf $ du -b svgdump/xobject-image-1.svg # ^ Calculates the file size. Confirm that the size is small # (784 instead of 80664 bytes). ```	2017-07-10 18:56:57 +02:00
Rob Wu	94f1dde07d	Move DEFLATE logic in convertImgDataToPng Move the DEFLATE logic in convertImgDataToPng to a separate function. A later commit will introduce a more efficient deflate algorithm, and fall back to the existing, naive algorithm if needed.	2017-07-10 18:45:47 +02:00
Rob Wu	742ed3d1c9	Remove __pdfjsdev_webpack__, use webpack options `__pdfjsdev_webpack__` was used to skip evaluating part of an AST, in order to not mangle some `require` symbols. This commit removes `__pdfjsdev_webpack__`, and: - Uses `__non_webpack_require__` when one wants the output to contain `require` instead of `__webpack_require__`. - Adds options to the webpack config to prevent "polyfills" for some Node.js-specific APIs to be added. - Use `// eslint-disable-next-line no-undef` instead of `/* globals ... */` for variables that are not meant to be used globally.	2017-07-09 16:35:48 +02:00
Yury Delendik	d028c26210	Removes error()	2017-07-07 09:40:24 -05:00
Jonas Jenwald	ea71d23f74	Fix a stupid spelling error in the `ASCII85Decode` name in `Parser.makeInlineImage` (issue 8613) This is a trivial follow-up to PR 5383, and it's a bit strange that this has been wrong since late 2014 without anyone noticing (maybe because inline images aren't too common). So, apparently code works better if you actually spell correctly, who knew ;-) Fixes 8613.	2017-07-05 19:43:09 +02:00
Yury Delendik	b3bac5100c	Merge pull request #8596 from mukulmishra18/proper-read-result Fixes wrong structure of fullReader.read() result.	2017-07-05 09:03:57 -05:00
Jonas Jenwald	eff257b820	Merge pull request #8580 from brendandahl/missing-glyf Fix how we detect and handle missing glyph data.	2017-07-04 12:16:07 +02:00
Brendan Dahl	9f5c1550ed	Merge pull request #8592 from brendandahl/cmap-3-0 Only mask char codes of (3, 0) cmap tables in the range of 0xF000 to 0…	2017-07-03 17:58:28 -07:00
Brendan Dahl	efbbd8533f	Only mask char codes of (3, 0) cmap tables in the range of 0xF000 to 0xF0FF.	2017-07-03 13:13:46 -07:00
Brendan Dahl	6d4f748fb1	Fix how we detect and handle missing glyph data.	2017-07-03 13:06:06 -07:00
Mukul Mishra	308a83e5ca	Fixes wrong structure of fullReader.read() result.	2017-07-01 15:52:47 +05:30
Jonas Jenwald	de0e7a9a68	Check that the `MessageHandler` isn't already terminated in the `onFailure` handler in `src/core/worker.js` (issue 8584) All other code-paths already check that the `MessageHandler` isn't terminated, but apparently `onFailure` was missing that check (compare e.g. with the `onSuccess` function). From what I can tell, this is only an issue if workers are disabled, hence why I didn't bother adding a unit-test. Fixes 8584.	2017-06-30 10:11:13 +02:00
Brendan Dahl	a8a8909d2d	Fix missing notdef in expert encoding.	2017-06-29 12:12:39 -07:00
Tim van der Meij	f1a87bab10	SVG: move setting the stroke properties to the `stroke` method In general, we may not know the stroke properties when path construction happens. Since we must know the properties when we apply the stroke, we should set the properties at that point. Note that we already do that for the color and opacity, but not yet for the other properties.	2017-06-25 22:28:46 +02:00
Jonas Jenwald	859e3d4dce	Merge pull request #8564 from timvandermeij/svg-opacity SVG: implement fill and stroke opacity	2017-06-24 22:42:18 +02:00
Brendan Dahl	f1f9d98519	Merge pull request #8507 from Snuffleupagus/issue-8480 Only special-case OpenType fonts with `CFF` data if it's both a composite (i.e. Type0) font and also has a non-default CID to GID map (issue 8480)	2017-06-23 13:36:58 -07:00
Yury Delendik	e2ca894fec	Merge pull request #8488 from mukulmishra18/streams-getTextContent Streams get text content	2017-06-23 12:52:13 -05:00
ghetolay	7f79e12383	Fix error when using custom CMapReaderFactory and worker	2017-06-23 15:59:43 +02:00
Tim van der Meij	f9eafefa09	SVG: implement stroke opacity	2017-06-23 00:37:27 +02:00
Tim van der Meij	440914e49b	SVG: implement fill opacity This makes the `eoFill` method similar to the `eoStroke` method and the ones in `src/display/canvas.js`.	2017-06-23 00:37:27 +02:00
Tim van der Meij	c6ee05f7e5	Merge pull request #8542 from Rob--W/svg-clipping Move svg:clipPath generation from clip to endPath	2017-06-22 23:48:06 +02:00
Rob Wu	fc6448d18c	Move svg:clipPath generation from clip to endPath In the PDF from issue 8527, the clip operator (W) shows up before a path is defined. The current SVG backend however expects a path to exist before generating a `<svg:clipPath>` element. In the example, the path was defined after the clip, followed by an endPath operator (n). So this commit fixes the bug by moving the path generation logic from clip to endPath. Our canvas backend appears to use similar logic: `CanvasGraphics_endPath` calls `consumePath`, which in turn draws the clip and resets the `pendingClip` state. The canvas backend calls `consumePath` from multiple other places, so we probably need to check whether doing so is also necessary for the SVG backend. I scanned our corpus of PDF files in test/pdfs, and found that in every instance (except for one), the "W" PDF operator (clip) is immediately followed by "n" (endPath). The new test from this commit (clippath.pdf) starts with "W", followed by a path definition and then "n". # Commands used to find some of the clipping commands: grep -ra '^W$' -C7 | less -S grep -ra '^W ' -C7 | less -S grep -ra ' W$' -C7 | less -S test/pdfs/issue6413.pdf is the only file where "W" (at line 55) is not followed by "n". In fact, the "W" is the last operation of a series of XObject painting operations, and removing it does not have any effect on the rendered PDF (confirmed by looking at the output of PDF.js's canvas backend, and ImageMagick's convert command).	2017-06-22 01:08:17 +02:00
Yury Delendik	679ffc84f6	Merge pull request #8544 from Rob--W/compatiblity-safari-strict-error compatibility.js: Rename parameters in JURL	2017-06-19 10:34:33 -05:00
Rob Wu	f912f89e69	compatibility.js: Rename parameters in JURL	2017-06-19 17:03:41 +02:00
Jonas Jenwald	73234577e1	Rename `map` to `_map` inside of `Dict`, to make it clearer that it should be regarded as a "private" property	2017-06-17 17:32:00 +02:00
Mukul Mishra	0c13d0ff46	Adds Streams API in getTextContent to stream data. This patch adds Streams API support in getTextContent so that we can stream data in chunks instead of fetching whole data from worker thread to main thread. This patch supports Streams API without changing the core functionality of getTextContent. Enqueue textContent directly at getTextContent in partialEvaluator. Adds desiredSize and ready property in streamSink.	2017-06-17 20:03:27 +05:30
Jonas Jenwald	3a20fd165f	Refactor `ObjectLoader` to use `Dict`s correctly, rather than abusing their internal properties The `ObjectLoader` currently takes an Object as input, despite actually working with `Dict`s internally. This means that at the (two) existing call-sites, we're passing in the "private" `Dict.map` property directly. Doing this seems like an anti-pattern, and we could (and even should) simply provide the actual `Dict` when creating an `ObjectLoader` instance. Accessing properties stored in the `Dict` is now done using the intended methods instead, in particular `getRaw` which (as the name suggests) doesn't do any de-referencing, thus maintaining the current functionality of the code. The only functional change in this patch is that `ObjectLoader.load` will now ignore empty nodes, such that `ObjectLoader._walk` only needs to deal with nodes that are known to contain data. (This lets us skip, among other checks, meaningless `addChildren` function calls.)	2017-06-16 22:59:32 +02:00
Jonas Jenwald	f2fc9ee281	Slightly refactor and ES6-ify the code in `ObjectLoader` This patch changes all `var` to `let`, and caches the array lengths in all loops. Also removes two unnecessary temporary variable assignments.	2017-06-16 22:59:32 +02:00
Yury Delendik	0c93dee0de	Merge pull request #8515 from yurydelendik/bloborigin Adds special case for origin of blob to the compatibility URL.	2017-06-16 11:21:45 -05:00
Yury Delendik	631e6bebff	Fixes WeakMap polyfill (and improves PDFWorker port check).	2017-06-13 09:36:58 -05:00
Yury Delendik	b44848b918	Adds special case for origin of blob to the compatibility URL.	2017-06-13 08:19:46 -05:00
Yury Delendik	24f14d44cb	Preventing from using the same canvas for multiple render()	2017-06-12 16:33:49 -05:00
Yury Delendik	db7a770542	Additional check in globalScope detections	2017-06-12 10:14:46 -05:00
Jonas Jenwald	1766fe8184	Merge pull request #8508 from yurydelendik/issue8246 Fixes duplicate creation of PDFWorker for the same port.	2017-06-10 14:56:05 +02:00
Yury Delendik	69c804a0f4	Fixes duplicate creation of PDFWorker for the same port.	2017-06-10 07:02:29 -05:00
Jonas Jenwald	e589834f13	Ensure that `TilingPattern`s have valid (non-zero) /BBox arrays (issue 8330) Fixes 8330.	2017-06-09 21:41:48 +02:00
Jonas Jenwald	8b4a42e5b8	Only special-case OpenType fonts with `CFF` data if it's both a composite (i.e. Type0) font and also has a non-default CID to GID map (issue 8480) As mentioned the last time that I touched this particular part of the font code, I sincerely hope that this doesn't cause any regressions! However, the patch passes all tests added in PRs 5770, 6270, and 7904 (and obviously all other tests as well). Furthermore, I've manually checked all the issues/bugs referenced in those PRs without finding any issues. Fixes 8480.	2017-06-09 21:15:39 +02:00
Jonas Jenwald	999e30723d	Reduce the duplication slightly when detecting an OpenType font (in the `Font` constructor)	2017-06-09 18:26:57 +02:00
Mukul Mishra	bbd9968f76	Added sendWithStream method in MessageHandler. Adds functionality to accept Queueing Strategy in sendWithStream method. Using Queueing Strategy we can control the data that is enqueued into the sink, and hence regulate the flow of chunks from worker to main thread. Adds capability in pull and cancel methods. Adds ready and desiredSize property in streamSink. Adds unit test for ReadableStream and sendWithStream.	2017-06-07 21:05:27 +05:30
Tim van der Meij	a3fae906a6	Merge pull request #8474 from Snuffleupagus/ESLint-object-styles-src-core Fix inconsistent spacing and trailing commas in objects in `src/core/` files, so we can enable the `comma-dangle` and `object-curly-spacing` ESLint rules later on	2017-06-03 22:22:00 +02:00
Jonas Jenwald	f20d2cd2ae	Fix inconsistent spacing and trailing commas in objects in remaining `src/` files, so we can enable the `comma-dangle` and `object-curly-spacing` ESLint rules later on http://eslint.org/docs/rules/comma-dangle http://eslint.org/docs/rules/object-curly-spacing Given that we currently have quite inconsistent object formatting, fixing this in one big patch probably wouldn't be feasible (since I cannot imagine anyone wanting to review that); hence I've opted to try and do this piecewise instead. Please note: This patch was created automatically, using the ESLint `--fix` command line option. In a couple of places this caused lines to become too long, and I've fixed those manually; please refer to the interdiff below for the only hand-edits in this patch. ```diff diff --git a/src/display/canvas.js b/src/display/canvas.js index 5739f6f2..4216b2d2 100644 --- a/src/display/canvas.js +++ b/src/display/canvas.js @@ -2071,7 +2071,7 @@ var CanvasGraphics = (function CanvasGraphicsClosure() { var map = []; for (var i = 0, ii = positions.length; i < ii; i += 2) { map.push({ transform: [scaleX, 0, 0, scaleY, positions[i], - positions[i + 1]], x: 0, y: 0, w: width, h: height, }); + positions[i + 1]], x: 0, y: 0, w: width, h: height, }); } this.paintInlineImageXObjectGroup(imgData, map); }, diff --git a/src/display/svg.js b/src/display/svg.js index 9eb05dfa..2aa21482 100644 --- a/src/display/svg.js +++ b/src/display/svg.js @@ -458,7 +458,11 @@ SVGGraphics = (function SVGGraphicsClosure() { for (var x = 0; x < fnArrayLen; x++) { var fnId = fnArray[x]; - opList.push({ 'fnId': fnId, 'fn': REVOPS[fnId], 'args': argsArray[x], }); + opList.push({ + 'fnId': fnId, + 'fn': REVOPS[fnId], + 'args': argsArray[x], + }); } return opListToTree(opList); }, ```	2017-06-02 12:32:18 +02:00
Jonas Jenwald	a8c87f8019	Fix inconsistent spacing and trailing commas in objects in `src/core/` files, so we can enable the `comma-dangle` and `object-curly-spacing` ESLint rules later on Unfortunately this patch is fairly big, even though it only covers the `src/core` folder, but splitting it even further seemed difficult. http://eslint.org/docs/rules/comma-dangle http://eslint.org/docs/rules/object-curly-spacing Given that we currently have quite inconsistent object formatting, fixing this in one big patch probably wouldn't be feasible (since I cannot imagine anyone wanting to review that); hence I've opted to try and do this piecewise instead. Please note: This patch was created automatically, using the ESLint --fix command line option. In a couple of places this caused lines to become too long, and I've fixed those manually; please refer to the interdiff below for the only hand-edits in this patch. ```diff diff --git a/src/core/evaluator.js b/src/core/evaluator.js index abab9027..dcd3594b 100644 --- a/src/core/evaluator.js +++ b/src/core/evaluator.js @@ -2785,7 +2785,8 @@ var EvaluatorPreprocessor = (function EvaluatorPreprocessorClosure() { t['Tz'] = { id: OPS.setHScale, numArgs: 1, variableArgs: false, }; t['TL'] = { id: OPS.setLeading, numArgs: 1, variableArgs: false, }; t['Tf'] = { id: OPS.setFont, numArgs: 2, variableArgs: false, }; - t['Tr'] = { id: OPS.setTextRenderingMode, numArgs: 1, variableArgs: false, }; + t['Tr'] = { id: OPS.setTextRenderingMode, numArgs: 1, + variableArgs: false, }; t['Ts'] = { id: OPS.setTextRise, numArgs: 1, variableArgs: false, }; t['Td'] = { id: OPS.moveText, numArgs: 2, variableArgs: false, }; t['TD'] = { id: OPS.setLeadingMoveText, numArgs: 2, variableArgs: false, }; diff --git a/src/core/jbig2.js b/src/core/jbig2.js index 5a17d482..71671541 100644 --- a/src/core/jbig2.js +++ b/src/core/jbig2.js @@ -123,19 +123,22 @@ var Jbig2Image = (function Jbig2ImageClosure() { { x: -1, y: -1, }, { x: 0, y: -1, }, { x: 1, y: -1, }, { x: 
-2, y: 0, }, { x: -1, y: 0, }], [{ x: -3, y: -1, }, { x: -2, y: -1, }, { x: -1, y: -1, }, { x: 0, y: -1, }, - { x: 1, y: -1, }, { x: -4, y: 0, }, { x: -3, y: 0, }, { x: -2, y: 0, }, { x: -1, y: 0, }] + { x: 1, y: -1, }, { x: -4, y: 0, }, { x: -3, y: 0, }, { x: -2, y: 0, }, + { x: -1, y: 0, }] ]; var RefinementTemplates = [ { coding: [{ x: 0, y: -1, }, { x: 1, y: -1, }, { x: -1, y: 0, }], - reference: [{ x: 0, y: -1, }, { x: 1, y: -1, }, { x: -1, y: 0, }, { x: 0, y: 0, }, - { x: 1, y: 0, }, { x: -1, y: 1, }, { x: 0, y: 1, }, { x: 1, y: 1, }], + reference: [{ x: 0, y: -1, }, { x: 1, y: -1, }, { x: -1, y: 0, }, + { x: 0, y: 0, }, { x: 1, y: 0, }, { x: -1, y: 1, }, + { x: 0, y: 1, }, { x: 1, y: 1, }], }, { - coding: [{ x: -1, y: -1, }, { x: 0, y: -1, }, { x: 1, y: -1, }, { x: -1, y: 0, }], - reference: [{ x: 0, y: -1, }, { x: -1, y: 0, }, { x: 0, y: 0, }, { x: 1, y: 0, }, - { x: 0, y: 1, }, { x: 1, y: 1, }], + coding: [{ x: -1, y: -1, }, { x: 0, y: -1, }, { x: 1, y: -1, }, + { x: -1, y: 0, }], + reference: [{ x: 0, y: -1, }, { x: -1, y: 0, }, { x: 0, y: 0, }, + { x: 1, y: 0, }, { x: 0, y: 1, }, { x: 1, y: 1, }], } ]; ```	2017-06-02 11:20:19 +02:00
Yury Delendik	bd288df909	Merge pull request #8396 from mukulmishra18/streams-lib Adds streams-lib polyfill and exports ReadableStream from shared/util.	2017-05-31 08:42:48 -05:00
Yury Delendik	66c8893815	Removes last UMDs from the modules.	2017-05-31 07:14:17 -05:00
Jonas Jenwald	982b6aa65b	Convert the files in the `/src/core` folder to ES6 modules Please note that the `glyphlist.js` and `unicode.js` files are converted to CommonJS modules instead, since Babel cannot handle files that large and they are thus excluded from transpilation.	2017-05-30 22:06:21 +02:00
Yury Delendik	b66b705ed7	Using pre-built code for testing.	2017-05-30 22:06:21 +02:00
Jonas Jenwald	4ce5e520fb	Add different code-paths to `{CMap, ToUnicodeMap}.charCodeOf` depending on length, since `Array.prototype.indexOf` can be extremely inefficient for very large arrays (issue 8372) Fixes 8372.	2017-05-24 19:47:04 +02:00
Jonas Jenwald	ac942ac657	Merge pull request #8437 from yurydelendik/default-ctx Resets canvas 2d context to the default state.	2017-05-23 23:31:57 +02:00
Yury Delendik	a67198895f	Resets canvas 2d context to the default state.	2017-05-23 15:10:30 -05:00
Jonas Jenwald	31c24ed631	Don't map glyphs to the HANGUL FILLER (0x3164) Unicode location (issue 8424) This patch follows a similar pattern as previous ones, by skipping certain problematic Unicode locations. According to http://searchfox.org/mozilla-central/rev/6c2dbacbba1d58b8679cee700fd0a54189e0cf1b/gfx/harfbuzz/src/hb-unicode-private.hh#136, it seems that the HANGUL FILLER (0x3164) location is "special". Fixes 8424.	2017-05-23 16:12:45 +02:00
Jonas Jenwald	0ddf52aca5	Remove the special handling for `nameddest`s that look like standard pageNumbers PR 7341 added special handling for `nameddest`s that look like pageNumbers, to prevent issues since we previously incorrectly supported specifying a pageNumber directly in the hash; i.e. `#10` versus the correct `#page=10` format. Since this behaviour wasn't correct, PR 7757 fixed and deprecated the old format, which means that we no longer need to maintain the `nameddest` hack in multiple files.	2017-05-20 11:29:29 +02:00
Mukul Mishra	c9f44f30e5	Adds streams-lib polyfill and exports ReadableStream from shared/util. Added test for ReadableStream. Adds ref-implementation license-header in streams-lib and change gulp task to copy external/streams/ in build/ external/streams/ and build/dist/external/streams folder. Adds README.md and LICENSE.md	2017-05-20 00:26:34 +05:30
Yury Delendik	5dc8dcdc0f	Merge pull request #8388 from Snuffleupagus/issue-8380 Cache JPEG images, just as we do for other image formats, in `evaluator.js` (issue 8380)	2017-05-17 17:25:51 -05:00
chris.greening	cfc2f36f5c	Adds additional parameter so background color of canvas can be set	2017-05-17 17:06:44 +01:00
Jonas Jenwald	bbe8c3d8ed	Enable running a subset of the API unit-tests on Travis Notably, this patch skips all canvas rendering tests in Node.js.	2017-05-12 11:48:27 +02:00
Jonas Jenwald	c5f73edcd2	Convert the `DOMCanvasFactory` to an ES6 class For consistency, also updates the `pdf2png.js` example to use the slightly less verbose `canvasAndContext` parameter name.	2017-05-11 20:15:22 +02:00
Jonas Jenwald	32baa6af7a	Convert the `DOMCMapReaderFactory` to an ES6 class Given that we only create one instance of this class per `getDocument` call, this shouldn't matter performance wise.	2017-05-11 20:15:19 +02:00
巴里切罗	8d5d97264e	fix(svg) adjust strategy for decoding JPEG images	2017-05-08 11:32:44 +08:00
Jonas Jenwald	0c2ebda31c	Cache JPEG images, just as we do for other image formats, in `evaluator.js` (issue 8380) For some reason, we're putting all kinds of images except JPEG into the `imageCache` in `evaluator.js`.[1] This means that in the PDF file in issue 8380, we'll keep sending the same two small images[2] to the main-thread and decoding them over and over. This is obviously hugely inefficient! As can be seen from the discussion in the issue, the performance becomes extremely bad if the user has the addon "Adblock Plus" installed. However, even in a clean Firefox profile, the performance isn't that great. This patch not only addresses the performance implications of the "Adblock Plus" addon together with that particular PDF file, but it also improves the rendering times considerably for all users. Locally, with a clean profile, the rendering times are reduced from `~2000 ms` to `~500 ms` for my setup! Obviously, the general structure of the PDF file and its operator sequence is still hugely inefficient, however I'd say that the performance with this patch is good enough to consider the issue (as it stands) resolved.[3] Fixes 8380. --- [1] Not technically true, since inline images are cached from `parser.js`, but whatever :-) [2] The two JPEG images have dimensions 1x2, respectively 4x2. [3] To make this even more efficient, a new state would have to be added to the `QueueOptimizer`. Given that PDF files this stupid fortunately aren't too common, I'm not convinced that it's worth doing.	2017-05-07 13:07:41 +02:00
Jonas Jenwald	50d026fbda	Merge pull request #8385 from tobytailor/master Fix typo in LoopbackPort export	2017-05-06 10:35:49 +02:00
Tobias Schneider	e1a3e46cba	Fix typo	2017-05-05 19:10:00 -07:00
Yury Delendik	c3cfcbe72f	Merge pull request #8340 from ydfzgyj/fix-svg-spacing Fix char spacing bug in SVG mode	2017-05-05 07:41:25 -05:00
巴里切罗	d58040aa29	fix(svg) char spacing bug	2017-05-05 12:11:20 +08:00
Jonas Jenwald	60b14f526e	Convert the files in the `/src/shared` folder to ES6 modules	2017-05-04 21:07:59 +02:00
Yury Delendik	3adda80f97	Merge pull request #8358 from Snuffleupagus/PartialEvaluator-method-signatures Change the signatures of the `PartialEvaluator` "constructor" and its `getOperatorList`/`getTextContent` methods to take parameter objects	2017-05-04 08:10:30 -05:00
Jonas Jenwald	6c81b8e6dd	Replace unnecessary `bind(this)` and `var self = this` statements with arrow functions in remaining `src/` files	2017-05-03 23:12:35 +02:00
Yury Delendik	74ba3033e8	Merge pull request #8359 from Snuffleupagus/Lexer-getNumber-ignore-line-breaks Ignore line-breaks between operator and digit in `Lexer.getNumber`	2017-05-03 09:43:59 -05:00
Jonas Jenwald	3e20d30afc	Change the signatures of the `PartialEvaluator` "constructor" and its `getOperatorList`/`getTextContent` methods to take parameter objects Currently these methods accept a large number of parameters, which creates quite unwieldy call-sites. When invoking them, you have to remember not only what arguments to supply, but also the correct order, to avoid runtime errors. Furthermore, since some of the parameters are optional, you also have to remember to pass e.g. `null` or `undefined` for those ones. Also, adding new parameters to these methods (which happens occasionally), often becomes unnecessarily tedious (based on personal experience). Please note that I do not think that we need/should convert every single method in `evaluator.js` (or elsewhere in `/core` files) to take parameter objects. However, in my opinion, once a method starts relying on approximately five parameters (or even more), passing them in individually becomes quite cumbersome. With these changes, I obviously needed to update the `evaluator_spec.js` unit-tests. The main change there, except the new method signatures[1], is that it's now re-using one `PartialEvaluator` instance, since I couldn't see any compelling reason for creating a new one in every single test. Note: If this patch is accepted, my intention is to (time permitting) see if it makes sense to convert additional methods in `evaluator.js` (and other `/core` files) in a similar fashion, but I figured that it'd be a good idea to limit the initial scope somewhat. --- [1] A fun fact here, note how the `PartialEvaluator` signature used in `evaluator_spec.js` wasn't even correct in the current `master`.	2017-05-03 12:10:20 +02:00
Yury Delendik	84f174bb2f	Merge pull request #8363 from yurydelendik/worker-fromport Adds initializeFromPort to the WorkerMessageHandler.	2017-05-02 19:22:20 -05:00
Yury Delendik	008aa56ac6	Adds initializeFromPort to the WorkerMessageHandler.	2017-05-02 16:11:54 -05:00
Jonas Jenwald	40feca12c1	Ignore line-breaks between operator and digit in `Lexer.getNumber` This is consistent with the behaviour in Adobe Reader (and PDFium), and it fixes the display of page 30 in https://bug1354114.bmoattachments.org/attachment.cgi?id=8855457 (taken from https://bugzilla.mozilla.org/show_bug.cgi?id=1354114). The patch also makes the `error` message for invalid numbers slightly more useful, by including the charCode as well. (Having that information available would have reduced the time spent on debugging the PDF file above.)	2017-05-02 20:59:42 +02:00
Tobias Schneider	a80c405941	Rename FakeWorkerPort to LoopbackPort and export it	2017-05-02 11:33:19 -07:00
Jonas Jenwald	ebaa22478c	Replace unnecessary `bind(this)` and `var self = this` statements with arrow functions in remaining `src/core/` files	2017-05-02 15:47:43 +02:00
Jonas Jenwald	95bbc8101c	Replace unnecessary `bind(this)` and `var self = this` statements with arrow functions in `src/core/evaluator.js` Note that by using `let` instead of `var` in `PartialEvaluator.setGState` and `TranslatedFont.loadType3Data`, we can get rid of further `bind` usages since `let` is block-scoped. Also, the fact that `bind` wasn't used in the `Font` case inside of `setGState` is actually a bug which has been present ever since PR 5205, where a closure was replaced by a standard loop.[1] --- [1] I'm not aware of any bugs caused by this, but that is probably more a happy accident than anything else, since e.g. just removing the `bind` from the `SMask` case without using block-scoped variables causes test failures.	2017-05-01 20:29:44 +02:00
Tim van der Meij	06c93d8fbd	Merge pull request #8342 from Snuffleupagus/eslint_object-shorthand-src-core Enable the `object-shorthand` ESLint rule in `src/core`	2017-04-29 23:59:20 +02:00
Jonas Jenwald	165294a05f	Merge pull request #8335 from Snuffleupagus/jbig2-decodeRefinement-subtract-offsets Subtract the X/Y offsets when decoding refinement regions of JBIG2 images (issue 7145, 7308, 7401, 7850, 8270)	2017-04-28 23:13:07 +02:00
Yury Delendik	71bbcfad8a	Merge pull request #8309 from vologab/feature/allow_query_string_for_pdfjs Allow use versions for pdf.js script (i.e. - pdf.js?2412313)	2017-04-27 19:00:43 -05:00
Jonas Jenwald	ee09336f32	Restore the `URL.createObjectURL` check to the `createObjectURL` utility function (issue 8344) This is a regression from commit `3888a993b1`. It turns out that even though we have a `URL` polyfill, it's still dependent on the existence of native `URL.{createObjectURL, revokeObjectURL}` functions. Since no such thing exists in Node.js, our `createObjectURL` utility function breaks there.	2017-04-27 20:57:15 +02:00
Jonas Jenwald	afc74b0178	Enable the `object-shorthand` ESLint rule in `src/shared` Please see http://eslint.org/docs/rules/object-shorthand. For the most part, these changes are of the search-and-replace kind, and the previously enabled `no-undef` rule should complement the tests in helping ensure that no stupid errors crept into the patch.	2017-04-27 17:29:40 +02:00
Jani Pehkonen	64deb6c700	Subtract the X/Y offsets when decoding refinement regions of JBIG2 images (issue 7145, 7308, 7401, 7850, 8270) Please refer to the JBIG2 standard, see https://www.itu.int/rec/dologin_pub.asp?lang=e&id=T-REC-T.88-200002-I!!PDF-E&type=items. In particular, section "6.3.5.3 Fixed templates and adaptive templates" mentions that the offsets should be subtracted; where the offsets are defined according to "Table 6" under section "6.3.2 Input parameters". Fixes 7145. Fixes 7308. Fixes 7401. Fixes 7850. Fixes 8270.	2017-04-26 16:06:15 +02:00
Jonas Jenwald	07b5574006	Enable the `object-shorthand` ESLint rule in `src/display` Please see http://eslint.org/docs/rules/object-shorthand. For the most part, these changes are of the search-and-replace kind, and the previously enabled `no-undef` rule should complement the tests in helping ensure that no stupid errors crept into the patch.	2017-04-25 16:17:18 +02:00
Jonas Jenwald	7bee0c2aa3	Enable the `object-shorthand` ESLint rule in `src/shared` Please see http://eslint.org/docs/rules/object-shorthand. For the most part, these changes are of the search-and-replace kind, and the previously enabled `no-undef` rule should complement the tests in helping ensure that no stupid errors crept into the patch.	2017-04-25 16:07:59 +02:00
Jonas Jenwald	3888a993b1	Remove the `URL` checks in the `createObjectURL` utility function, since the `URL` polyfill has made them redundant Also, this changes `createBlob` to throw when `Blob` isn't supported.	2017-04-20 10:16:06 +02:00
Vladimir Bloshchitsyn	319ea0f350	Allow use versions for pdf.js script (i.e. - pdf.js?2412313)	2017-04-18 15:07:09 +03:00
Jonas Jenwald	52e3de3c0a	Convert the files in the `/src/display` folder to ES6 modules Also disables ES2015 transpilation in development mode.	2017-04-16 12:19:10 +02:00
Jonas Jenwald	fd51a7cb8c	Merge pull request #8287 from yurydelendik/babel-es2015-preset Allow to convert (some of) ES6 code to ES5.	2017-04-14 21:47:45 +02:00
Yury Delendik	5855c0a8be	Allow to convert (some of) ES6 code to ES5.	2017-04-14 14:39:25 -05:00
Yury Delendik	30bee9fe0c	Moves Uint32ArrayView and hasCanvasTypedArrays into compatibility.js.	2017-04-14 10:04:52 -05:00
Yury Delendik	c4c44c1bbe	Merge pull request #8240 from Snuffleupagus/api-stopAtErrors [api-minor] Always allow e.g. rendering to continue even if there are errors, and add a `stopAtErrors` parameter to `getDocument` to opt-out of this behaviour (issue 6342, issue 3795, bug 1130815)	2017-04-13 10:58:49 -05:00
Yury Delendik	46646a9dd1	Merge pull request #8131 from timvandermeij/remove-umd-validation ES6 modules: remove UMD header validation	2017-04-13 10:49:41 -05:00
Yury Delendik	ad1023ff55	Merge pull request #8262 from tcorral/master Fix worker resolution on using minified version	2017-04-13 10:41:36 -05:00
Tim van der Meij	32e01cda96	Merge pull request #8228 from timvandermeij/line-annotations Implement support for line annotations	2017-04-13 00:18:31 +02:00
Tim van der Meij	e15a2ec523	Annotations: implement support for line annotations This patch implements support for line annotations. Other viewers only show the popup annotation when hovering over the line, which may have any orientation. To make this possible, we render an invisible line (SVG element) over the line on the canvas that acts as the trigger for the popup annotation. This invisible line has the same starting coordinates, ending coordinates and width of the line on the canvas.	2017-04-12 23:05:25 +02:00
Jonas Jenwald	3a302fdb53	Correctly detect if `requestAnimationFrame` is supported in `compatibility.js` (issue 8272) This is a regression from PR 8222. Fixes 8272.	2017-04-11 17:01:35 +02:00
Jonas Jenwald	fbe7b2eee7	Always ignore Type3 glyphs if their `OperatorList`s contain errors, regardless of the value of the `stopAtErrors` option Compared to the parsing of e.g. an entire page, it doesn't really make sense to only be able to render a Type3 glyph partially.	2017-04-11 08:59:22 +02:00
Jonas Jenwald	a39d636eb8	[api-minor] Always allow e.g. rendering to continue even if there are errors, and add a `stopAtErrors` parameter to `getDocument` to opt-out of this behaviour (issue 6342, issue 3795, bug 1130815) Other PDF readers, e.g. Adobe Reader and PDFium (in Chrome), will attempt to render as much of a page as possible even if there are errors present. Currently we just bail as soon as the first error is hit, which means that we'll usually not render anything in these cases and just display a blank page instead. NOTE: This patch changes the default behaviour of the PDF.js API to always attempt to recover as much data as possible, even when encountering errors during e.g. `getOperatorList`/`getTextContent`, which thus improves our handling of corrupt PDF files and allows the default viewer to handle errors slightly more gracefully. In the event that an API consumer wishes to use the old behaviour, where we stop parsing as soon as an error is encountered, the `stopAtErrors` parameter can be set at `getDocument`. Fixes, inasmuch as it's possible since the PDF files are corrupt, e.g. issue 6342, issue 3795, and [bug 1130815](https://bugzilla.mozilla.org/show_bug.cgi?id=1130815) (and probably others too).	2017-04-11 08:59:22 +02:00
Jonas Jenwald	10e5f766a2	Merge pull request #8266 from brendandahl/issue6652 Normalize blend mode names.	2017-04-11 08:54:42 +02:00
Brendan Dahl	4969b2ad97	Normalize blend mode names.	2017-04-10 16:18:08 -07:00
TCASAS	010d38a8c0	Fix worker resolution on using minified version - When the minified version is used the resolver of the worker cannot find it properly and throws a 404 error. - The problem was that: - It was getting the current name of the file. - It was replacing .js by .worker.js - When it was loading the unminified version it was working fine because: - pdf.js - .js + .worker.js = pdf.worker.js - When it was loading the minified version it didn't work because: - pdf.min.js - .js + .worker.js = pdf.min.worker.js - pdf.min.worker.js doesn't exist; the real file name is pdf.worker.min.js	2017-04-10 13:41:50 +02:00
Tim van der Meij	30d63b0c50	Annotations: move container border removal to the display layer The display layer is responsible for creating the HTML elements for the annotations from the core layer. If we need to ignore border styling for the containers of certain elements, the display layer should do so and not the core layer. I noticed this during the implementation of line annotations, for which we actually need the original border width in the display layer, even though we ignore it for the container. If we set the border style to zero in the core layer, this becomes impossible. To prevent this, this patch moves the container border removal code from the core layer to the display layer. This makes the core layer output the unchanged annotation data and lets the display layer remove any border styling if necessary.	2017-04-09 19:01:38 +02:00
Jonas Jenwald	f41d80bdd3	Enable the `prefer-promise-reject-errors` ESLint rule See http://eslint.org/docs/rules/prefer-promise-reject-errors, note that this is similar to the already used `no-throw-literal` rule.	2017-04-08 11:47:22 +02:00
Brendan Dahl	cdc79a4721	Don’t skip glyph 0 in cmap.	2017-04-05 15:17:38 -07:00
Yury Delendik	b665b0319a	Merge pull request #8222 from tjgrathwell/ios-fake-cancel-animation-frame ios: Patch cancelAnimationFrame whenever fakeRequestAnimationFrame is used	2017-04-04 10:29:04 -05:00
Yury Delendik	31f8875614	Merge pull request #8157 from Snuffleupagus/api-RenderTask-cancel-Error [api-minor] Reject the `RenderTask` with an actual `Error`, instead of just a `string`, when rendering is cancelled	2017-04-04 09:38:47 -05:00
Travis Grathwell	bd70a73d43	ios: Patch cancelAnimationFrame whenever fakeRequestAnimationFrame is used The existing implementation of fakeRequestAnimationFrame did not return a timer ID, so the frame could not be cancelled if you wanted to cancel it. But if you do want to cancel it, it needs to be cancelled through clearTimeout instead of cancelAnimationFrame, because the timer IDs are different. Signed-off-by: Jonathan Barnes <jbarnes@pivotal.io>	2017-03-31 15:31:04 -07:00
Tim van der Meij	8cee63df5d	Merge pull request #8205 from Snuffleupagus/built-in-CMap-errors Improve the error handling when loading of built-in CMap files fails (PR 8064 follow-up)	2017-03-30 23:01:13 +02:00
Jonas Jenwald	437104969d	Improve the error handling when loading of built-in CMap files fails (PR 8064 follow-up) I happened to notice that the error handling wasn't that great, which I missed previously since there were no unit-tests for failure to load built-in CMap files. Hence this patch, which improves the error handling and adds tests.	2017-03-29 22:38:29 +02:00
Jonas Jenwald	61ee0de29f	Use a simple `RefSetCache` to significantly improve the performance of `Catalog.getPageDict` for certain long documents (PR 8105 follow-up) I found that PR 8105 unfortunately causes a very serious performance regression in long PDF documents where the `Pages` tree only has one level; my apologies for this! Obviously we cannot revert that PR, since that would cause more issues than it solves. Hence it seems to me that the only viable solution here, is to add a simple `RefSetCache` to reduce the amount of redundant lookups. Previously in PR 8105 caching was thought to be unnecessary, but as it turns out I don't think that we really have a choice in the matter any more.	2017-03-28 21:39:55 +02:00
Jonas Jenwald	62eee8c782	Try harder to find the next valid JPEG marker when decoding Scan data (issue 8182, issue 8189) Tentatively fixes 8182 and fixes 8189.	2017-03-27 15:55:21 +02:00
Jonas Jenwald	e229c21ce1	Remove unnecessary `xref` parameters from various method signatures in `PartialEvaluator`, since `this.xref` is already available in the relevant scope For reasons I don't pretend to understand, we're passing around `xref` arguments to a bunch of methods despite `this.xref` being available in `PartialEvaluator`. This patch is a small first step towards cleaning up the, often unwieldy, signatures of methods in `PartialEvaluator`.	2017-03-26 14:12:53 +02:00
Jonas Jenwald	e40fd63bd3	In `src/core/evaluator.js`, convert a couple of `if (!someVariable) { error(...); }` instances to `assert(someVariable);` instead Rather than, in a number of places, basically duplicating the logic of `assert` we can simply utilize the function directly instead.	2017-03-26 13:53:13 +02:00
Jonas Jenwald	5c0c122a7d	Ensure that the `XMLHttpRequest` is `open`ed before attempting to set the `responseType` in the `DOMCMapReaderFactory`, since IE fails otherwise (issue 8193) I really cannot understand why this change is necessary, since modern browsers such as Firefox and Chrome work just fine with the old code. Hence this patch is yet another "hack" that's needed just because IE apparently cannot just work like you'd expect. For consistency, the Node factory used in the CMap unit-tests is changed as well. Fixes 8193.	2017-03-25 17:44:48 +01:00
Jonas Jenwald	3705e5e459	Use a proper `MessageHandler` for `PartialEvaluator.getTextContent` to avoid errors for fonts relying on built-in CMap files (PR 8064 follow-up) My apologies for inadvertently breaking this in PR 8064; apparently we don't have any tests that cover this use-case :( Without this patch `getTextContent` will fail if called before `getOperatorList`, since loading of fonts during text-extraction may require fetching of built-in CMap files. Please note: The `text` test added here, which uses an already existing PDF file, fails without this patch.	2017-03-24 17:39:33 +01:00
Rob Wu	49af56f730	Rethrow MissingDataException when needed In core/document.js: `PDFDocument.prototype.parse` accesses a dictionary property, which could throw if the underlying data is not yet available. In core/obj.js: `get Catalog.prototype.metadata` calls `stream.getBytes`, which can throw MissingDataException too when the stream is a ChunkedStream.	2017-03-22 14:55:59 +01:00
Jonas Jenwald	8527d27eae	Ensure that `PDFDocument.documentInfo` doesn't fail during document load, when the entire XRef table hasn't been fetched yet (issue 8180) Similar to other `try-catch` statements in `/core` code, we must re-throw `MissingDataException` to prevent issues with missing data during document loading. Note that I'm not sure if/how we can test this, which is why the patch doesn't include any test(s). Fixes 8180.	2017-03-22 14:14:38 +01:00
Jonas Jenwald	e2e13df4a5	Merge pull request #8164 from Snuffleupagus/issue-7828 Don't read past the EOI marker for JPEG images with non-default restart interval (issue 7828)	2017-03-20 22:17:28 +01:00
Jonas Jenwald	d6d0f778aa	Don't read past the EOI marker for JPEG images with non-default restart interval (issue 7828) After browsing through (a version of) the JPEG specification, see https://www.w3.org/Graphics/JPEG/itu-t81.pdf, I hope that this patch makes sense. Note that while issue 7828 became a problem after PR 7661, it isn't really a regression from that PR. The explanation is rather that we're now relying on `core/jpg.js` instead of the Native Image decoder in more situations than before, which thus exposed an existing issue in our JPEG decoder. Another factor also seems to be that in many JPEG images, the DRI (Define Restart Interval) marker isn't present, in which case this bug won't manifest either. According to https://www.w3.org/Graphics/JPEG/itu-t81.pdf#page=89 (at the bottom of the page): "NOTE – The final restart interval may be smaller than the size specified by the DRI marker segment, as it includes only the number of MCUs remaining in the scan." Furthermore, according to https://www.w3.org/Graphics/JPEG/itu-t81.pdf#page=39 (in the middle of the page): "[...] If restart is enabled and the restart interval is defined to be Ri, each entropy-coded segment except the last one shall contain Ri MCUs. The last one shall contain whatever number of MCUs completes the scan." Based on the above, it thus seems to me that we should simply ensure that we're not attempting to continue to parse Scan data once we've found all MCUs (Minimum Coded Unit) of the image. Fixes 7828.	2017-03-20 17:16:33 +01:00
Jonas Jenwald	be1a6f294f	Try to recover when encountering JPEG markers with too short marker lengths (issue 8169) The issue with the JPEG image in question, is that the COM (Comment) marker has an incorrect length entry. Fixes 8169.	2017-03-20 17:05:51 +01:00
Jonas Jenwald	a7c19d9cbb	Adjust the `yoda` ESLint rule to apply to inequalities as well I happened to notice that some inequalities had the wrong order, and was surprised since I thought that the `yoda` rule should have caught that. However, reading http://eslint.org/docs/rules/yoda#options a bit more closely than previously, it's quite obvious that the `onlyEquality` option does exactly what its name suggests. Hence I think that it makes sense to adjust the options such that only ranges are allowed instead.	2017-03-19 13:27:14 +01:00
Jonas Jenwald	098a56270d	Normalize the `BBox` entry in Tiling Pattern dictionaries (issue 8117) According to the PDF specification, see http://www.adobe.com/content/dam/Adobe/en/devnet/acrobat/pdfs/PDF32000_2008.pdf#G7.3982967, the `BBox` entry should have the form `[left, bottom, right, top]`. Since some PDF generators apparently violate the specification, we normalize the `BBox` to ensure that the pattern is (correctly) rendered. Fixes 8117.	2017-03-16 21:43:11 +01:00
Jonas Jenwald	d37d271afa	[api-minor] Reject the `RenderTask` with an actual `Error`, instead of just a `string`, when rendering is cancelled This patch gets rid of the only case in the code-base where we're throwing a plain `string`, rather than an `Error`, which besides better/more consistent error handling also allows us to enable the [`no-throw-literal`](http://eslint.org/docs/rules/no-throw-literal) ESLint rule.	2017-03-13 18:58:21 +01:00
Jonas Jenwald	6d672c4ba6	[api-minor] Add a `pdfjsNext` parameter, and `PDFJS_NEXT` build flag, to allow backwards incompatible API changes	2017-03-13 18:43:43 +01:00
Jonas Jenwald	224613a511	Merge pull request #8135 from jasonjensen/issue8097 Handle cff fonts with erroneous stackSize (issue 8097)	2017-03-11 09:55:00 +01:00
Tim van der Meij	fc5810c97a	Merge pull request #8144 from timvandermeij/issue-8143 Widget annotations: do not crash if `Parent` is not a dictionary during field name construction (issue 8143)	2017-03-10 00:40:13 +01:00
Tim van der Meij	936d3c0698	Widget annotations: do not crash if `Parent` is not a dictionary during field name construction (issue 8143)	2017-03-09 23:51:52 +01:00
Jason O. Jensen	d230784ac3	Handle cff fonts with erroneous stackSize	2017-03-06 19:28:46 -05:00
Tim van der Meij	5eb090f288	ES6 modules: remove UMD header validation This patch is another step towards enabling Babel. Since we're moving towards ES6 modules, we will not be using UMD headers anymore, so we can remove the validation.	2017-03-04 21:43:25 +01:00
Tim van der Meij	4e3e97be8e	Merge pull request #8129 from Snuffleupagus/getInheritedPageProp-undefined Return `undefined` instead of `Dict.empty` from `Page.getInheritedPageProp` for non-existent properties to prevent possible future bugs	2017-03-04 16:18:24 +01:00
Yury Delendik	c290561488	Merge pull request #8120 from yurydelendik/lib Publishes processed sources into pdfjs-dist/lib	2017-03-04 08:48:36 -06:00
Jonas Jenwald	9bed87f5dc	Return `undefined` instead of `Dict.empty` from `Page.getInheritedPageProp` for non-existent properties to prevent possible future bugs This is something that I noticed while working on PR 8126, which is (more) fallout from PR 6065. In general, it's actually not correct to return `Dict.empty` as the default value for non-existent properties. Please note that a prior PR, see https://github.com/mozilla/pdf.js/pull/5957#issuecomment-103112698, asked for that behaviour but I don't think that's right. Obviously for properties that are (or should be) `Dict`s it makes sense, however certain properties can be e.g. Strings or Arrays instead. In the latter case, returning `Dict.empty` is just plain wrong, and it's quite fascinating that this hasn't caused any errors in practice. (The existing validation in the various getters has actually saved us here.) Also, when looking at this code again, it seemed unnecessary to duplicate the `MAX_LOOP_COUNT` check since we could just return immediately instead.	2017-03-04 13:08:39 +01:00
Tim van der Meij	1eb96d7ca9	Merge pull request #8128 from timvandermeij/csp-headers Network: use the current location to prevent errors when using CSP headers	2017-03-04 00:01:50 +01:00
Yury Delendik	e7cc07cc11	Moves checkProblematicCharRanges to font_spec.js	2017-03-03 16:33:35 -06:00
Job van der Weiden	a05115d2ec	Network: use the current location to prevent errors when using CSP headers When using content security headers to restrict connections to the same origin, you may not make connections to `example.com`. This feature detection also works with a request to the current location.	2017-03-03 23:18:51 +01:00
Jonas Jenwald	4a0ff5dbf7	Ensure that we don't ignore `0` values in `Page.getInheritedPageProp` (issue 8125) It appears that I accidentally broke this in PR 6065, sorry about that! The issue in this particular PDF file is that there are `/Rotate` entries on different levels of the `/Pages` tree. We're supposed to use the `/Rotate` entry in the `/Page` dict (which is `0`), but because of an incorrect condition we instead ended up with the one from the `/Pages` dict (which is `180`). Fixes 8125.	2017-03-03 12:27:40 +01:00
Jonas Jenwald	9163a6fba4	Merge pull request #8112 from Snuffleupagus/JS-action-newWindow Support the `newWindow` flag in white-listed `app.launchURL` JavaScript actions (PR 7794 follow-up)	2017-03-01 21:24:34 +01:00
Tim van der Meij	25f772a255	Merge pull request #8050 from yurydelendik/systemjs Replaces RequireJS to SystemJS.	2017-02-27 23:31:41 +01:00
Tim van der Meij	4e201d3787	Merge pull request #8072 from timvandermeij/annotation-append-operator-list Annotations: move operator list addition logic to `src/core/document.js`	2017-02-27 22:50:57 +01:00
Tim van der Meij	0739f90707	Annotations: move operator list addition logic to `src/core/document.js` Ideally, the `Annotation` class should not have anything to do with the page's operator list. How annotations are added to the page's operator list is logic that belongs in `src/core/document.js` instead where the operator list is constructed. Moreover, some comments have been added to clarify the intent of the code.	2017-02-27 22:17:49 +01:00
Tim van der Meij	9db4240b85	Merge pull request #8110 from timvandermeij/interactive-forms-choice-inherit-options Interactive forms: make choice widget options inheritable (issue 8094)	2017-02-27 22:14:25 +01:00
Jonas Jenwald	2a7e5b8a54	Support the `newWindow` flag in white-listed `app.launchURL` JavaScript actions (PR 7794 follow-up) A simple follow-up to PR 7794, which lets us add support for the `newWindow` parameter; refer to https://www.adobe.com/content/dam/Adobe/en/devnet/acrobat/pdfs/js_api_reference.pdf#G5.1507380. The patch also fixes an embarrassing oversight regarding the placement of the case-insensitive flag, and also allows arbitrary white-space at the beginning of JS actions.	2017-02-27 15:58:28 +01:00
Yury Delendik	5b50e0d414	Replaces RequireJS to SystemJS.	2017-02-27 08:32:39 -06:00
Tim van der Meij	8990de8614	Interactive forms: make choice widget options inheritable (issue 8094) Even though the PDF specification does not state that `Opt` fields are inheritable, in practice there are PDF generators that let annotations inherit the options from a parent.	2017-02-25 23:34:26 +01:00
Jonas Jenwald	14cc6acb90	Ensure that `Dict`s found in Object Streams are assigned an `objId` in `XRef.fetch` This fixes something that I noticed while working with the code in `Catalog.getPageDict` when debugging issue 8088. Note that while I don't have an example where this patch really matters, given that e.g. `PartialEvaluator.hasBlendModes` depends on the `objId` to avoid cyclic references this patch could potentially help for some PDF files.	2017-02-25 10:20:19 +01:00
Tim van der Meij	752510ffa0	Merge pull request #8107 from yurydelendik/init-via-port Init PDFWorker via MessagePort.	2017-02-25 00:13:33 +01:00
Tim van der Meij	59392fd544	Merge pull request #8102 from yurydelendik/mv-compatibilty Move compatibility code to the shared/compatibility.js.	2017-02-24 22:47:49 +01:00
Yury Delendik	51767d63fe	Init PDFWorker via MessagePort.	2017-02-24 13:33:18 -06:00
Jonas Jenwald	1ce295541c	Always check all Kids nodes, in `Catalog.getPageDict`, to avoid getting stuck in an empty node further down in the Pages tree (issue 8088) As discussed on IRC, we need to check all nodes at the bottom of the tree to ensure that we find the correct `Page` dict. Furthermore, this patch also gets rid of the caching present in a previous version, since it's not clear if that really helps. Note that this patch purposely adds an `eq` test, using a reduced test-case, so that we can be sure that the algorithm actually finds the correct `Page` dict for each `pageIndex`. Fixes 8088.	2017-02-24 12:09:46 +01:00
Yury Delendik	facefb0c79	Move compatibility code to the shared/compatibility.js.	2017-02-23 19:18:44 -06:00
Jonas Jenwald	9082f08e37	Enable running the `cmap` unit-tests on Travis by utilizing a `NodeCMapReaderFactory`	2017-02-17 23:15:36 +01:00
Yury Delendik	cfaa621a05	Merge pull request #8064 from Snuffleupagus/fetchBuiltInCMap [api-minor] Refactor fetching of built-in CMaps to utilize a factory on the `display` side instead, to allow users of the API to provide a custom CMap loading factory (e.g. for use with Node.js)	2017-02-17 15:30:31 -06:00
Brendan Dahl	425ad30912	Merge pull request #8071 from Snuffleupagus/bug-1337429 Always choose a (3, 1) cmap table for TrueType fonts that have an encoding specified, regardless of the Symbolic font flag (bug 1337429)	2017-02-16 15:13:46 -08:00
Jonas Jenwald	111419a64a	Cache built-in binary CMap files in the worker (issue 4794)	2017-02-16 10:55:39 +01:00
Jonas Jenwald	769c1450b7	[api-minor] Refactor fetching of built-in CMaps to utilize a factory on the `display` side instead, to allow users of the API to provide a custom CMap loading factory (e.g. for use with Node.js) Currently the built-in CMap files are loaded in `src/core/cmap.js` using `XMLHttpRequest` directly. For some environments that might be a problem, hence this patch refactors that to instead use a factory to load built-in CMaps on the main thread and message the data to the worker thread. This is inspired by other recent work, e.g. the addition of the `CanvasFactory`, and to a large extent on the IRC discussion starting at http://logs.glob.uno/?c=mozilla%23pdfjs&s=12+Oct+2016&e=12+Oct+2016#c53010.	2017-02-16 10:55:35 +01:00
Tim van der Meij	8aad33e8a3	Merge pull request #8065 from timvandermeij/annotation-appearances Annotations: refactor setting the normal appearance stream	2017-02-15 23:27:40 +01:00
Tim van der Meij	26fc79d51d	Annotations: refactor setting the normal appearance stream Previously, we had a function called `getDefaultAppearance`. This name, however, is misleading as the method gets the normal appearance (in the `N` entry) and not the default appearance (in the `DA` entry). Moreover, it was not entirely clear how it works just from reading the code. It primarily lacks comments and explicit error case handling. This patch improves the situation by fixing the issues mentioned above and making this function a proper method of the `Annotation` class, just like e.g., `setColor` and `setBorderStyle`.	2017-02-15 22:42:17 +01:00
Jonas Jenwald	ce072022c1	Always choose a (3, 1) cmap table for TrueType fonts that have an encoding specified, regardless of the Symbolic font flag (bug 1337429) This patch basically reverts one aspect of TrueType (3, 1) cmap parsing to the state prior to PR 4259. After that PR, a number of regressions occurred in this particular code-path, which necessitated a number of follow-ups such as PRs 5703, 5743, and 6425. The empirical data suggests, at least to me, that we should always prefer a (3, 1) cmap for TrueType fonts when they have an encoding, regardless of the Symbolic font flag. Obviously this patch passes all unit/font/reference tests locally, and I made sure that all the PRs mentioned above landed with test-cases included. However, in my opinion, there's still a very real possibility that this patch could potentially cause new regressions. Given that the PDF file in bug 1337429 has been broken for almost three years before anyone noticed, and considering that the code-path in question has been the source of numerous regressions, I do not intend to request uplift of this patch to previous Firefox versions (assuming that it's even accepted). Fixes https://bugzilla.mozilla.org/show_bug.cgi?id=1337429.	2017-02-15 17:38:08 +01:00
Yury Delendik	fa0e559fe2	New node.js check to protect from webpack.	2017-02-14 15:00:52 -06:00
Jonas Jenwald	23c62cc321	Consume the current character when encountering illegal characters in `Lexer.getObject`, in order to prevent infinite loops during reading of streams (issue 8061) Please note: The rendering of the PDF file in issue 8061 first regressed in PR 7039, and then PR 7493 exacerbated the problem even further by causing an infinite loop. In this particular case, when errors were encountered inside of the `Lexer.getObject` method itself, we didn't advance the stream position. This thus caused an infinite loop in `parseCMap`, since the exact same character was then parsed over and over again. Fixes 8061.	2017-02-11 19:32:48 +01:00
pmysore1	af8292058f	Font ascent descent calculation fix	2017-02-11 01:25:05 -05:00
Tim van der Meij	466760efca	Merge pull request #8056 from Snuffleupagus/ChildNode.remove Use `ChildNode.remove` instead of `ChildNode.ParentNode.removeChild` in a couple of places (bug 1334831, issue 8008)	2017-02-10 23:17:17 +01:00
Yury Delendik	7d9941d870	Fixes pdf.combined.js for webpack.	2017-02-10 11:24:35 -06:00
Jonas Jenwald	63f13773e7	Use `ChildNode.remove` instead of `ChildNode.ParentNode.removeChild` in a couple of places (bug 1334831, issue 8008) Re: [bug 1334831](https://bugzilla.mozilla.org/show_bug.cgi?id=1334831) and issue 8008. Note that according to the specification, see https://dom.spec.whatwg.org/#interface-childnode, the `remove` method shouldn't throw. This is also consistent with e.g. the Firefox implementation, see http://searchfox.org/mozilla-central/rev/d3307f19d5dac31d7d36fc206b00b686de82eee4/dom/base/nsINode.cpp#1852. Obviously this isn't supported in IE (because that would be too easy), however we can easily polyfill it to avoid having to WONTFIX the bug/issue.	2017-02-10 14:39:50 +01:00
Yury Delendik	32856f0adb	Merge pull request #8046 from yurydelendik/webpack Replacing custom bundling with webpack 2	2017-02-09 16:04:54 -06:00
Yury Delendik	a048519fa1	Replace copyright headers; changes UMD to CommonJS.	2017-02-08 16:35:58 -06:00
Yury Delendik	eb4c88cd44	Replacing custom bundling with webpack2.	2017-02-08 16:32:15 -06:00
vkuryakov	4e181e59ef	Interactive forms: values for radio buttons (issue #6995 )	2017-02-07 23:42:40 +01:00
Yury Delendik	9b0e0954fb	Merge pull request #8036 from mukulmishra18/node-canvas [api-minor] Fixes behaviour of DOMCanvasFactory to return {canvas, context}.	2017-02-07 07:39:12 -06:00
Tim van der Meij	d3ae5b38ce	Merge pull request #8035 from Snuffleupagus/api-disableNativeImageDecoder [api-minor] Add a `getDocument` parameter that allows disabling of the `NativeImageDecoder` (e.g. for use with Node.js)	2017-02-06 23:37:02 +01:00
Mukul Mishra	41d092d04b	Fixes behaviour of DOMCanvasFactory to return {canvas, context}.	2017-02-07 03:47:13 +05:30
Jonas Jenwald	9c34d0aa8c	[api-minor] Add a `getDocument` parameter that allows disabling of the `NativeImageDecoder` (e.g. for use with Node.js) Note that I initially tried to add this as a parameter to the `PDFPageProxy.render` method, such that it could be passed to `PartialEvaluator.getOperatorList`. However, given all the different code-paths that call `getOperatorList` (there's a bunch only in `annotation.js`), this seemed to very quickly become unwieldy and thus difficult to maintain compared to simply using the existing `evaluatorOptions`.	2017-02-06 22:21:34 +01:00
Tim van der Meij	ec26a7e565	Merge pull request #8028 from Snuffleupagus/tests-prevent-console-errors Prevent browser console errors during testing	2017-02-06 22:04:54 +01:00
Yury Delendik	d842c9c6b0	Merge pull request #8002 from mukulmishra18/refactor-canvas [api-minor] Fix #7798: Refactor scratch canvas usage.	2017-02-06 07:45:41 -06:00
Mukul Mishra	32817633c9	Fix #7798 : Refactor scratch canvas usage. Fixes extra canvas create calls. Fixes unnecessary call of `new DOMCanvasFactory`. Fixes undefined error of DOMCanvasFactory. Fixes failures in some of the tests. Fixes expected behaviour. Remove unused vars.	2017-02-05 20:19:47 +05:30
Jonas Jenwald	e416032b38	Prevent browser console errors during testing The `Driver._cleanup` method is removing all stylesheets between test runs, which causes "TypeError: styleElement.parentNode is null" console errors in `FontLoader.clear`. As can also be seen during various tests, some of the changes I made in PR 7972 unfortunately cause console errors. It seems that I didn't test this properly, since it should have been obvious to me that while tests are triggered using Node.js, the files in question are run within the browser. My apologies for not testing this thoroughly, and for causing unnecessary churn in the code!	2017-02-05 13:23:42 +01:00
Jonas Jenwald	bc736fdc7d	Adjust the `brace-style` ESLint rule to disallow single lines (and also enable `no-iterator`) See http://eslint.org/docs/rules/brace-style. Having the opening/closing braces on the same line can often make the code slightly more difficult to read, in particular for `if`/`else if` statements, compared to using new lines. This patch also, for consistency with `mozilla-central`, enables the [`no-iterator`](http://eslint.org/docs/rules/no-iterator) rule. Note that this rule didn't require a single code change.	2017-02-04 15:53:08 +01:00
Tim van der Meij	6f0cf8c4cb	Merge pull request #7972 from Snuffleupagus/eslint_no-unused-vars Enable the `no-unused-vars` ESLint rule	2017-02-01 23:50:23 +01:00
Tim van der Meij	fe3b64d4ab	Merge pull request #8016 from Snuffleupagus/remove-isStream-property Remove the unused `isStream` property on various `Stream`s	2017-02-01 23:07:40 +01:00
Jonas Jenwald	f7d99ccc26	Remove the unused `isStream` property on various `Stream`s This property was added all the way back in PR 542, but hasn't actually been relied upon ever since PR 692. Note that there's a `isStream()` utility function which replaced the property years ago, hence the `isStream` property is now dead code.	2017-02-01 11:38:11 +01:00
Jonas Jenwald	c102232275	Append the contents of `FileAttachment` annotations to the attachments view of the sidebar, for easier access to the embedded files Other PDF viewers, e.g. Adobe Reader, seem to append `FileAttachment`s to their attachments views. One obvious difference in PDF.js is that we cannot append all the annotations on document load, since that would require parsing every page. Despite that, it still seems like a good idea to add `FileAttachment`s, since it's thus possible to access all the various types of attachments from a single place. Note: With the previous patch we display a notification when a `FileAttachment` is added to the sidebar, which thus makes appending the contents of these annotations to the sidebar slightly more visible/useful.	2017-01-31 22:26:16 +01:00
Tim van der Meij	95732279b6	Remove usage of `mozFillRule` The non-standard `mozFillRule` has been removed in Firefox 51 [1, 2]. Instead, a parameter of the standard methods should be used. Note that this has been supported in all major browsers for a long time now, so there should be no need to keep this Firefox-specific code around. [1] https://developer.mozilla.org/en-US/Firefox/Releases/51 [2] https://bugzilla.mozilla.org/show_bug.cgi?id=826619	2017-01-29 23:24:44 +01:00
Jonas Jenwald	52e0f51917	Enable the `no-unused-vars` ESLint rule Please see http://eslint.org/docs/rules/no-unused-vars; note that this patch purposely uses the same rule options as in `mozilla-central`, such that it fixes part of issue 7957. It wasn't, in my opinion, entirely straightforward to enable this rule compared to the already existing rules. In many cases a `var descriptiveName = ...` format was used (more or less) to document the code, and I chose to place the old variable name in a trailing comment to not lose that information. I welcome feedback on these changes, since it wasn't always entirely easy to know what changes made the most sense in every situation.	2017-01-29 23:23:17 +01:00
Jonas Jenwald	50c2856097	Move `EOF`/`isEOF` from core/parser.js to core/primitives.js Given the nature of `EOF` and `isEOF`, it seems to me that they really ought to be placed in `core/primitives.js` instead. In general, it doesn't seem great to have to depend on the entire `core/parser.js` file for such simple primitives/helper functions. In particular, while `core/ps_parser.js` is completely separate from `core/parser.js` with regards to its function, it still depends on the latter for just one primitive. Note that compared to e.g. PR 7389, this will not reduce the number of dependencies for `core/ps_parser`, however the new dependency IMHO makes more sense.	2017-01-27 13:37:48 +01:00
Jonas Jenwald	af42c266e7	Merge pull request #7987 from yurydelendik/noopener [api-minor] Adds noopener and nofollow to rel attribute of hyperlinks.	2017-01-26 20:54:09 +01:00
Jonas Jenwald	f000417ce0	[Firefox addon] Stop bundling `src/core/network.js` into the `FIREFOX`/`MOZCENTRAL` builds (PR 7322 follow-up) PR 7322 added the `PdfJsNetwork.jsm` file, instead of the general `src/core/network.js` file for the Firefox addon. However, `make.js` wasn't updated to actually stop including the now obsolete network file.	2017-01-23 22:23:17 +01:00
Jonas Jenwald	f77c52291e	Enable the `no-empty-pattern`/`no-floating-decimal`/`no-self-compare`/`no-delete-var`/`no-new-object` ESLint rules The following rules required no code changes: http://eslint.org/docs/rules/no-empty-pattern http://eslint.org/docs/rules/no-floating-decimal http://eslint.org/docs/rules/no-delete-var http://eslint.org/docs/rules/no-new-object There was just one change needed in order to enable: http://eslint.org/docs/rules/no-self-compare; which I think helps readability a lot, since that comparison makes no sense until you realize that we push `NaN` onto the `stack` in some cases and furthermore that `NaN !== NaN`.	2017-01-23 20:30:50 +01:00
Yury Delendik	fc2d8c15e8	Adds noopener and nofollow to rel attribute of hyperlinks.	2017-01-23 10:34:27 -06:00
Tim van der Meij	1948a53ebb	Merge pull request #7973 from Snuffleupagus/eslint_spaced-comment Enable the `spaced-comment` ESLint rule	2017-01-22 21:58:42 +01:00
Tim van der Meij	17dd2e6b20	Merge pull request #7979 from Snuffleupagus/eslint-more-rules Enable the `no-unsafe-finally`/`no-octal`/`no-useless-call` ESLint rules	2017-01-22 20:53:26 +01:00
Mukul Mishra	4e38200030	Fix #7978 : Fixes ESLint yoda rule for the URL polyfill.	2017-01-21 22:47:28 +05:30
Jonas Jenwald	82ea7e6e6e	Enable the `no-unsafe-finally`/`no-octal`/`no-useless-call` ESLint rules http://eslint.org/docs/rules/no-unsafe-finally, there's just one violation which in this case can actually be ignored since there's nothing `return`ed there. http://eslint.org/docs/rules/no-octal, there're no violations in the code-base. http://eslint.org/docs/rules/no-useless-call, there's just one violation that needs to be fixed.	2017-01-21 17:15:57 +01:00
Jonas Jenwald	4626fc8342	Enable the `spaced-comment` ESLint rule Please see http://eslint.org/docs/rules/spaced-comment. Note that the exceptions added for `line` comments are intended to still allow use of the old preprocessor without linting errors. Also, I took the opportunity to improve the grammar slightly (w.r.t. capitalization and punctuation) for comments touched in the patch.	2017-01-19 16:41:59 +01:00
Tim van der Meij	1fda987a4c	Merge pull request #7904 from Snuffleupagus/issue-7901 Further adjust the heuristics used to detect OpenType font files with CFF data, to ensure that all Type0 fonts are handled the same way regardless of font Subtype (issue 7901)	2017-01-12 21:55:57 +01:00
Syed Abdullah	857a5da8f1	Fix inverted calculation of RTL text percentage in bidi.	2017-01-12 23:54:06 +08:00
Yury Delendik	393740e2ae	Merge pull request #7869 from PedroPachecoInf/master Fixes issue #6071 - TIFF with 1 bit-depth	2017-01-10 12:37:26 -06:00
jazzchipc	493853031b	Fixes issue #6071 . Corrects readBlockTiff() case for 1-bit depth and 1 color TIFF images incorporated in the PDF. Adds reference test for PDF used to fix this issue.	2017-01-10 16:42:43 +00:00
Yury Delendik	77b7b84d1e	Removes rest of 'no-else-return' comments.	2017-01-09 19:13:36 -06:00
Jonas Jenwald	642d8621ef	Replace direct lookup of `uniquePrefix`/`idCounters`, in `Page` instances, with an `idFactory` containing a `createObjId` method instead We're currently making use of `uniquePrefix`/`idCounters` in multiple files, to create unique object id's, and adding a new occurrence of them requires some care to ensure that an object id isn't accidentally reused. Furthermore, having to pass around multiple parameters as we currently do seems like something you want to avoid. Instead, this patch adds a factory which means that there's only one thing that needs to be passed around. And since it's now only necessary to call a method in order to obtain a unique object id, the details are thus abstracted away at the call-sites which avoids accidental reuse of object id's. To test that this works as expected a very simple `Page` unit-test is added, and the existing `Annotation layer` tests are also adjusted slightly.	2017-01-09 23:16:25 +01:00
Jonas Jenwald	4046d67fde	Enable the `no-else-return` ESLint rule Using `else` after `return` is not necessary, and can often lead to unnecessarily cluttered code. By using the `no-else-return` rule in ESLint we can avoid this pattern, see http://eslint.org/docs/rules/no-else-return.	2017-01-09 20:27:39 +01:00
Jonas Jenwald	14b8523314	Refactor the `password` handling so that it's stored in the `PdfManager`s, instead of in the `XRef` We're already passing in a, currently unused, `PdfManager` instance when initializing the `XRef`. To avoid having to pass a single `password` parameter around, we could thus simply get the `password` through the `PdfManager` instance instead.	2017-01-03 20:29:52 +01:00
Jonas Jenwald	27513cd23b	[api-minor] Ensure that the `getDocument` Promise is rejected if the `loadingTask` is destroyed, or an `Error` is thrown, inside of the `onPassword` callback (issue 7806) This patch also removes the `UpdatePassword` message, in favour of using the `sendWithPromise` method of `MessageHandler`. Furthermore, the patch also refactors the `BasePdfManager_updatePassword`/`BasePdfManager_passwordChanged` methods (in pdf_manager.js), and the `pdfManagerReady` function (in worker.js).	2017-01-03 20:29:46 +01:00
Jonas Jenwald	ddea9a6b04	Improve the handling of `Encoding` dictionary, with `Differences` array, in `PartialEvaluator_preEvaluateFont` I recently happened to look at the code I wrote for PR 5964, which fixed [bug 1157493](https://bugzilla.mozilla.org/show_bug.cgi?id=1157493), and I quickly realized that the solution is way too simplistic. The fact that only using the `length` of a `Differences` array worked seems more like a happy accident for a particular set of font data, but could just as easily be incorrect for other PDF files. Note that in practice, the case where the `Encoding` entry is a regular `Dict` (and not a `Ref` or `Name`) is very rare, hence I don't think that we really need to worry about having to reparse this data. Also, the performance of this code-block is quite a bit better by updating the `hash` with the data from the entire `Differences` array, instead of at every loop iteration.	2016-12-28 21:32:54 +01:00
Jonas Jenwald	e963971244	Further adjust the heuristics used to detect OpenType font files with CFF data, to ensure that all Type0 fonts are handled the same way regardless of font Subtype (issue 7901) Changing this particular code makes me somewhat nervous about regressions, since PR 5770 necessitated the follow-up PR 6270. However, the patch passes all tests added in those PRs (and obviously all other tests). Furthermore, I've manually checked all the issues/bugs referenced in PRs 5770 and 6270 without finding any issues. Please note: This patch fixes only the font bug, not the SVG conversion, present on pages two and three of the PDF file in issue 7901.	2016-12-20 17:03:51 +01:00
Yury Delendik	3b3a179486	Merge pull request #7879 from rossj/highlight-fix Make use of textAdvanceScale consistent during combineTextItems. Fix for #7878.	2016-12-19 09:18:13 -06:00
Tim van der Meij	a428899b3c	Button widget annotations: improve unit tests, simplify code and remove labels Modern browsers support styling radio buttons and checkboxes with CSS. This makes the implementation much easier, and the fallback for older browsers is still decent.	2016-12-17 20:38:48 +01:00
Tim van der Meij	77148c7880	Button widget annotations: implement radio button value fetching according to the specification	2016-12-17 20:34:32 +01:00
Tim van der Meij	0c9a06c020	Button widget annotations: implement reference testing Moreover, ensure that the read-only state is respected and improve CSS names.	2016-12-17 20:33:35 +01:00
benweet	ba012c7a68	Button widget annotations: implement checkboxes and radio buttons	2016-12-17 20:31:30 +01:00
Jonas Jenwald	bd91f34513	Ensure that we handle indirect objects in all types of `Opt` entries in `ChoiceWidget` annotation dictionaries I haven't got an example where the current code breaks, but given all the previous cases we've seen where PDF generators use indirect objects in Arrays it makes sense to fix this pro-actively. I've modified the relevant unit-tests slightly, and they would not pass without the code changes in this patch. Note: `Dict_getArray` only dereferences Array elements on the "top-level", to avoid recursion issues. Furthermore if you have to loop through the Array at the call-site anyway, then using `Dict_get` in combination with `XRef_fetchIfRef` is a tiny bit more efficient.	2016-12-17 13:44:20 +01:00
Jonas Jenwald	c850968fa7	Remove `globals` that are now unnecessary thanks to the use of various ESLint environments (e.g. Node, ShellJS, Jasmine)	2016-12-16 21:09:55 +01:00
Jonas Jenwald	2f3805efbc	Switch to using ESLint, instead of JSHint, for linting Please note that most of the necessary code adjustments were made in PR 7890. ESLint has a number of advantageous properties, compared to JSHint. Among those are: - The ability to find subtle bugs, thanks to more rules (e.g. PR 7881). - Much more customizable in general, and many rules allow fine-tuned behaviour rather than just the on/off rules in JSHint. - Many more rules that can help developers avoid bugs, and a lot of rules that can be used to enforce a consistent coding style. The latter should be particularly useful for new contributors (and reduce the amount of stylistic review comments necessary). - The ability to easily specify exactly what rules to use/not to use, as opposed to JSHint which has a default set. Note: in future JSHint versions some of the rules we depend on will be removed, according to warnings in http://jshint.com/docs/options/, so we wouldn't be able to update without losing lint coverage. - More easily disable one, or more, rules temporarily. In JSHint this requires using a numeric code, which isn't very user friendly, whereas in ESLint the rule name is simply used instead. By default there are no rules enabled in ESLint, but there are some default rule sets available. However, to prevent linting failures if we update ESLint in the future, it seemed easier to just explicitly specify what rules we want. Obviously this makes the ESLint config file somewhat bigger than the old JSHint config file, but given how rarely that one has been updated over the years I don't think that matters too much. I've tried, to the best of my ability, to ensure that we enable the same rules for ESLint that we had for JSHint. Furthermore, I've also enabled a number of rules that seemed to make sense, both to catch possible errors and various style guide violations. 
Despite the ESLint README claiming that it's slower than JSHint, https://github.com/eslint/eslint#how-does-eslint-performance-compare-to-jshint, locally this patch actually reduces the runtime for `gulp` lint (by approximately 20-25%). A couple of stylistic rules that would have been nice to enable, but where our code currently differs too much to make it feasible: - `comma-dangle`, controls trailing commas in Objects and Arrays (among others). - `object-curly-spacing`, controls spacing inside of Objects. - `spaced-comment`, used to enforce spaces after `//` and `/*`. (This is made difficult by the fact that there's still some usage of the old preprocessor left.) Rules that I intend to look into possibly enabling in follow-ups, if it seems to make sense: `no-else-return`, `no-lonely-if`, `brace-style` with the `allowSingleLine` parameter removed. Useful links: - http://eslint.org/docs/user-guide/configuring - http://eslint.org/docs/rules/	2016-12-16 21:06:36 +01:00
Ross Johnson	4537590033	Consistently apply textAdvanceScale during building of textContentItems for improved highlighting. Fixes #7878.	2016-12-14 21:02:19 -06:00
Jonas Jenwald	28e50cfa21	Fix errors reported by the `space-infix-ops` ESLint rule http://eslint.org/docs/rules/space-infix-ops	2016-12-12 20:36:00 +01:00
Jonas Jenwald	68bf47d55d	Fix errors reported by the `space-before-function-paren` ESLint rule http://eslint.org/docs/rules/space-before-function-paren	2016-12-12 20:35:59 +01:00
Jonas Jenwald	551eb263e3	Fix errors reported by the `semi-spacing` ESLint rule http://eslint.org/docs/rules/semi-spacing	2016-12-12 20:35:58 +01:00
Jonas Jenwald	efbb1e9b1c	Fix errors reported by the `new-cap` ESLint rule http://eslint.org/docs/rules/new-cap	2016-12-12 20:35:57 +01:00
Jonas Jenwald	c36468cbce	Fix errors reported by the `keyword-spacing` ESLint rule http://eslint.org/docs/rules/keyword-spacing	2016-12-12 20:35:56 +01:00
Jonas Jenwald	86ba634c97	Fix errors reported by the `key-spacing` ESLint rule http://eslint.org/docs/rules/key-spacing	2016-12-12 20:35:55 +01:00
Jonas Jenwald	ad915f8af1	Fix errors reported by the `comma-spacing` ESLint rule http://eslint.org/docs/rules/comma-spacing	2016-12-12 20:35:53 +01:00
Jonas Jenwald	66d2637b3f	Fix errors reported by the `yoda` ESLint rule http://eslint.org/docs/rules/yoda	2016-12-12 20:35:52 +01:00
Jonas Jenwald	3820946301	Fix (most) errors reported by the `no-multi-spaces` ESLint rule http://eslint.org/docs/rules/no-multi-spaces	2016-12-12 20:35:51 +01:00
Jonas Jenwald	25bf5db47e	Fix errors reported by the `no-extra-boolean-cast` ESLint rule http://eslint.org/docs/rules/no-extra-boolean-cast	2016-12-12 20:26:18 +01:00
Jonas Jenwald	fb5e756683	Fix errors reported by the `no-cond-assign` ESLint rule http://eslint.org/docs/rules/no-cond-assign	2016-12-12 20:26:06 +01:00
Tim van der Meij	00a006e466	Merge pull request #7705 from Snuffleupagus/issue-2594 Move symbolic font glyphs to private use area if they don't have unicode mappings (issue 2594, bug 789074, bug 865644)	2016-12-10 21:30:28 +01:00
Tim van der Meij	47f03b619f	Merge pull request #7873 from timvandermeij/mediabox-cropbox-indirect Document: handle indirect objects in `MediaBox` and `CropBox` entries	2016-12-08 23:59:45 +01:00
Tim van der Meij	3800b5e463	Document: extract `CropBox` fetching and validation into a getter This patch refactors the `CropBox` code to combine fetching and validation code in a getter, like we already did for the `MediaBox` property. Combined with variable name changes, this improves readability of the code and makes the `view` getter simpler as well.	2016-12-08 22:44:53 +01:00
Jonas Jenwald	9be3aee9c9	Add a parameter to `Page_getInheritedPageProp` to make it possible to fetch (and dereference) Arrays, and use that for the `MediaBox`/`CropBox` getters (issue 7872)	2016-12-08 22:03:42 +01:00
Jonas Jenwald	b4ac6bd2f6	Ensure that we resolve indirect objects in `Filter` and `DecodeParms` arrays in `parser.js` I've not actually, thus far, come across a PDF file that this patch fixes. However, given the string of recent patches that has fixed issues with indirect objects in arrays, I think that it makes sense to proactively avoid any issues in this code.	2016-12-08 11:55:08 +01:00
Jonas Jenwald	77bcc9232e	Remove a misplaced `false` from a condition in `fixMetadata`, in `metadata.js`, since it currently short circuits the entire condition This looks to me like a simple oversight, which has existed ever since PR 1598 all the way back in 2012.	2016-12-07 22:51:46 +01:00
Jonas Jenwald	94ddd8f61d	Merge pull request #7863 from timvandermeij/colorspace Colorspace: refactoring to prevent unnecessary creation of intermediate arrays	2016-12-06 11:18:53 +01:00
Tim van der Meij	90d94815ad	Colorspace: miscellaneous improvements - Remove an unnecessary check and assignment. - Clean up code regarding mode setting (no need for a member variable). - Indent two methods correctly.	2016-12-02 16:47:39 +01:00
Tim van der Meij	c5c0a00dca	Colorspace: reduce duplication in `AlternateCS.getRgbBuffer`	2016-12-02 16:42:22 +01:00
Tim van der Meij	ef653d952b	Colorspace: optimize default color initialization This patch avoids the creation of extra arrays when initializing an array with default (zero) values. Doing this additionally makes the code more readable by allocating enough space for the number of color components.	2016-12-02 16:42:22 +01:00
Jonas Jenwald	c5b06cb40d	Ensure that `PartialEvaluator_extractWidths` is able to handle indirect objects in all kinds of "width" data (issue 7855) Fixes 7855.	2016-11-29 20:49:07 +01:00
Jonas Jenwald	451956c0b1	Merge pull request #7628 from Snuffleupagus/issue-7580 Fallback to the `StandardEncoding` for Nonsymbolic fonts without `/Encoding` entry (issue 7580)	2016-11-29 12:37:36 +01:00
Jonas Jenwald	013f69e65f	Merge pull request #7700 from Snuffleupagus/non-embedded-NuptialScript Improve rendering of non-embedded NuptialScript font	2016-11-29 11:00:21 +01:00
Jonas Jenwald	c6008b4d7c	Fix the JSDoc comment for `Catalog.parseDestDictionary`	2016-11-27 11:18:18 +01:00
Tim van der Meij	424fc2df4f	Merge pull request #7846 from timvandermeij/bidi-types Bidi: import Unicode types from the specification	2016-11-24 22:59:31 +01:00
Tim van der Meij	995be19378	Bidi: skip invalid Unicode character to make indexing work For Arabic characters, the Unicode character codes are mapped to Unicode character types using the character codes for indexing. However, the character code 0x061D is undefined (and therefore invalid) in the Unicode standard. The imported list does not contain this entry, but not having it in the list breaks indexing for items after it. Therefore, put an empty string on its position to make indexing work properly and issue a warning in the unlikely event that we encounter this character.	2016-11-24 22:13:12 +01:00
Tim van der Meij	11839f018f	Bidi: import Unicode types from the specification Mention the specification in the comments for future reference. These types have been imported from the CSV source.	2016-11-24 21:08:31 +01:00
Tim van der Meij	9ff19985c0	Merge pull request #7832 from seanburke-wf/expose-userunit-on-page Expose the optional UserUnit entry as a page property	2016-11-22 21:18:57 +01:00
Jonas Jenwald	3170a4c40a	Improve rendering of non-embedded NuptialScript font This patch fixes something that I noticed while debugging https://bugzilla.mozilla.org/show_bug.cgi?id=1308536. The PDF file contains a font called "NuptialScript", which unfortunately is not embedded. Since that is a non-standard font we will not be able to render it entirely correctly. However, by adding "NuptialScript" to the `getNonStdFontMap`, we can at least improve the rendering slightly by using an italic (serif) fallback font.	2016-11-22 17:56:17 +01:00
Sean Burke	f76cd2ce43	Expose the optional UserUnit entry as a page property	2016-11-22 09:18:19 -07:00
Jonas Jenwald	d3043167de	Correctly detect more cases of non-embedded Arial Black fonts (issue 7835) This patch adds support for non-embedded Arial Black fonts, that use an `Arial-Black...` format for the font names. Also, this patch changes `canvas.js` such that we always render Arial Black fonts with the maximum weight, which actually improves a number of existing test-cases. This should thus explain the test "failures", which are clear improvements compared with e.g. Adobe Reader. Fixes 7835.	2016-11-22 13:56:21 +01:00
Yury Delendik	f7d6f3a739	Adds SVG rendering capabilities to the PDFViewer.	2016-11-18 13:03:49 -06:00
Jonas Jenwald	a930f9af15	For commands with too few arguments, clear out `args` if it's an Array instead of replacing it with `null` in `EvaluatorPreprocessor_read` (issue 7804) For `PartialEvaluator_getTextContent`, the same `args` Array should be re-used for every `EvaluatorPreprocessor_read` call. Hence we want to ensure that it's not accidentally replaced with `null` in `EvaluatorPreprocessor_read`, since otherwise corrupt PDF files (with too few arguments for certain commands) will cause errors in `PartialEvaluator_getTextContent`. Perhaps a micro-optimization, but this patch also changes two `!args` comparisons to `args === null`, since that should be a tiny bit more efficient.	2016-11-16 10:20:29 +01:00
Mukul Mishra	6ce2be98b7	Fix #7701 : additional check for http/https protocols to fix unsafe header request. add missing ! and removed trailing whitespaces.	2016-11-14 11:39:10 +05:30
Jonas Jenwald	6d8a404a9c	[api-minor] Add support for a couple of white-listed `JavaScript` actions that contain valid URLs (issue 3897, bug 843699) By only allowing very specific types of `JavaScript` actions, and also utilizing the existing `URL` validation, this patch shouldn't pose too much risk. Fixes one of the points in issue 3897 (with the PDF file taken from issue 3438). Fixes https://bugzilla.mozilla.org/show_bug.cgi?id=843699 (probably, since that bug doesn't contain a test-case).	2016-11-08 16:48:27 +01:00
Jonas Jenwald	b4100ba651	Merge pull request #7698 from Snuffleupagus/bug-1308536 Ignore reserved commands when parsing operands in `CFFParser_parseDict`, instead of just rejecting the entire font (bug 1308536)	2016-11-03 23:53:14 +01:00
Jonas Jenwald	0844a72b4d	Add a bit more validation to `Catalog_readPageLabels`, to ensure that the Page Labels are well formed	2016-11-03 20:08:06 +01:00
Jonas Jenwald	2d8d8b5e53	Use `stringToPDFString` to sanitize bad "Prefix" entries in Page Label dictionaries It seems that certain bad PDF generators can create badly encoded "Prefix" entries for Page Labels, one example being http://ukjewishfilm.org/wp-content/uploads/2015/09/Jewish-Film-Festival-Programme-ONLINE.pdf. Unfortunately I didn't come across such a PDF file while adding the API support for Page Labels, but with them now being used in the viewer I just found this issue. With this patch, we now display the Page Labels in the same way as Adobe Reader.	2016-11-03 19:48:08 +01:00
Jonas Jenwald	9dc6463933	Ignore reserved commands when parsing operands in `CFFParser_parseDict`, instead of just rejecting the entire font (bug 1308536) According to the CFF specification, see http://partners.adobe.com/public/developer/en/font/5176.CFF.pdf#page=11, certain commands are currently reserved. Fixes https://bugzilla.mozilla.org/show_bug.cgi?id=1308536.	2016-11-03 12:50:40 +01:00
Tim van der Meij	9f8d67475e	Merge pull request #7775 from timvandermeij/widget-annotation-name Widget annotation: implement field name according to the specification	2016-11-02 22:43:17 +01:00
Tim van der Meij	1d96854019	Widget annotation: implement field name according to the specification The original code is difficult to read and, more importantly, performs actions that are not described in the specification. It replaces empty names with a backtick and an index, but this behavior is not described in the specification. While the specification is not entirely clear about what should happen in this case, it does specify that the `T` field is optional and that multiple field dictionaries may have the same fully qualified name, so to achieve this it makes the most sense to ignore missing `T` fields during construction of the field name. This is the most specification-compliant solution and, judging by opened issue #6623, also the required and expected behavior.	2016-11-02 21:44:44 +01:00
Tim van der Meij	6e22b32372	Merge pull request #7745 from Snuffleupagus/Launch-actions [api-minor] Add basic support for `Launch` actions (issue 1778, issue 3897, issue 6616)	2016-11-01 21:12:08 +01:00
Tim van der Meij	5194e68134	Lint: correct code style violations Manual observations and working with other linting tools found these.	2016-11-01 15:04:21 +01:00
Jonas Jenwald	81b9d553cf	Add TeX-specific glyph names to `glyphlist.js` to improve both glyph mapping and text selection for mathematic fonts (issue 2594)	2016-10-26 16:39:58 +02:00
Brendan Dahl	8d036faf40	Move symbolic font glyphs to private use area if they don't have unicode mappings.	2016-10-26 16:39:21 +02:00
Jonas Jenwald	d5e3b2fbf0	Update `PDFOutlineViewer_bindLink` to look more like `LinkAnnotationElement_bindLink`	2016-10-23 16:08:26 +02:00
Jonas Jenwald	2b79782377	[api-minor] Add basic support for `Launch` actions (issue 1778, issue 3897, issue 6616) In general we neither want to, nor can, support arbitrary `Launch` actions. But in practice, all the cases we've seen so far just contain relative URLs to other PDF files. Building on PR 7689, we can thus at least support basic `Launch` actions.	2016-10-21 13:40:32 +02:00
Jonas Jenwald	d284cfd5eb	[api-minor] Add support for relative URLs, in both annotations and the outline, by adding a `docBaseUrl` parameter to `PDFJS.getDocument` (bug 766086) Note that in `FIREFOX/MOZCENTRAL/CHROME` builds of the standard viewer the `docBaseUrl` parameter will be set by default, since in that case it makes sense to use the current URL as a base. For the `GENERIC` viewer, or the API itself, it doesn't make sense to try and set the `docBaseUrl` by default. However, custom deployments/implementations may still find the parameter useful.	2016-10-19 22:20:24 +02:00
Jonas Jenwald	71a781ee5c	Deprecate the `isValidUrl` utility function and replace it with `createValidAbsoluteUrl`/`isValidProtocal` functions instead, since the main URL validation is now done using the `new URL` constructor	2016-10-19 22:11:22 +02:00
Jonas Jenwald	42f07c6262	[api-minor] Use the `new URL` constructor when validating URLs in annotations and the outline, as a complement to only checking the protocol, and add a bit more validation to `Catalog_parseDestDictionary` Note that this will automatically reject any relative URL. To make the API more useful to consumers, URLs that are rejected will be available via the `unsafeUrl` property in the data object returned by `PDFPageProxy_getAnnotations`. The patch also adds a bit more validation of the data for `Named` actions.	2016-10-19 22:11:17 +02:00
Jonas Jenwald	e64bc1fd13	Move parsing of destination dictionaries to a helper function This not only reduces code duplication, but it also allows us to easily support the same kind of URLs we currently do for Link annotations in the Outline as well.	2016-10-18 16:14:07 +02:00
Yury Delendik	1236b27993	Removes SVG this.cgrp usages.	2016-10-17 16:09:24 -05:00
Yury Delendik	273d2de6ec	Merge pull request #7715 from timvandermeij/svg-groups SVG: optimize and refactor group creation code	2016-10-17 10:10:47 -05:00
Jonas Jenwald	2ce9da9b7a	Fix a couple of JSDoc `@typedef`s to use `@property` (instead of `@param`) to fix some missing documentation when running `gulp jsdoc`	2016-10-17 13:04:55 +02:00
Tim van der Meij	426fc454de	SVG: factor out initialization code into a private method Each well-formed SVG image has the following structure: SVG element - Definitions element - Root group - Other group 1 - ... - Other group n This patch factors out initialization code into a private method in such a way that the creation of this structure is clear from the code. The root group is the replacement for the parent group from before. We need this group as we cannot apply the viewport transform on the SVG element itself (this caused issues in Chrome). If other code appends groups to the SVG image, in reality it is appending those groups to the root group, but this detail is abstracted away by this patch.	2016-10-15 21:45:44 +02:00
Tim van der Meij	fa90573c4b	SVG: optimize transform group creation This patch ensures that we only create transformation groups when it is actually required and that we re-use transform groups as much as possible. It reduces the number of transform groups for the Tracemonkey paper from 2790 to 1271, thereby making the DOM much lighter and rendering/scrolling smoother. Moreover, it simplifies the code and prevents duplication. Finally, we issue a warning when an unimplemented graphic state is encountered. Before, this was ignored silently, making debugging harder.	2016-10-15 21:43:12 +02:00
Tim van der Meij	2e20000b71	Merge pull request #7727 from Snuffleupagus/parser-stream-decodeParms Let `Parser_makeFilter` pass in the `DecodeParms` data to various image `Stream`s, instead of re-fetching it in various `[...]Stream.prototype.ensureBuffer` methods	2016-10-15 20:04:17 +02:00
Yury Delendik	ea5949f1fd	Merge pull request #7668 from Snuffleupagus/issue-7665 Prevent an infinite loop in `XRef_fetchUncompressed` for encrypted PDF files with indirect objects in the /Encrypt dictionary (issue 7665)	2016-10-15 10:52:08 -05:00
Jonas Jenwald	c8f83d6487	Let `Parser_makeFilter` pass in the `DecodeParms` data to various image `Stream`s, instead of re-fetching it in various `[...]Stream.prototype.ensureBuffer` methods In `Parser_filter` the `DecodeParms` data is fetched and passed to `Parser_makeFilter`, where we also make sure that a `Ref` is resolved to a direct object. We can thus pass this along to the various image `Stream` constructors, to avoid the current situation where we look up/resolve data that is already available. Note also that we currently do not handle the case where `DecodeParms` is an Array entirely correctly in the various image `Stream`s, and this patch fixes that for free.	2016-10-15 12:09:51 +02:00
Jonas Jenwald	1da59bec9b	Remove a remaining old-style preprocessor from `src/core/fonts.js` (PR 7322 follow-up) Note that this code was added after PR 7322 was opened, which thus explains why it was missed during rebasing.	2016-10-15 11:33:09 +02:00
Yury Delendik	0576c9c6c6	Replaces all preprocessor directives with PDFJSDev calls.	2016-10-14 10:57:53 -05:00
Chas Emerick	85c52f1fd6	Fix getTextContent evaluation to only apply TJ horizontal offsets using numeric items/args While the array argument to TJ should only contain strings and numbers, other unfortunate items are found in PDFs in the wild, e.g.: [(Grandes) 0.0 Tc -250.0 (Client\350les,) 0.0 Tc -250.0 (Financements) 0.0 Tc -250.0 (et) 0.0 Tc -250.0 (March\351s) ] TJ getOperatorList already properly ignores any non-string, non-numeric values in TJ arrays; without this patch to getTextContent, returned text items can have NaN widths due to calculations being applied to those non-numeric values.	2016-10-13 08:08:31 -04:00
Yury Delendik	e336604ef1	Disables Font Loading API for Firefox.	2016-10-06 09:30:18 -05:00
Tim van der Meij	9b3a91f365	Merge pull request #7671 from timvandermeij/interactive-forms-choice-fields Interactive forms: render choice widget annotations	2016-10-05 23:27:45 +02:00
Tim van der Meij	d5d9f362aa	Choice widget annotations: core and display layer implementation	2016-10-05 21:25:29 +02:00
Yury Delendik	7b2a9ee4e0	Merge pull request #7670 from Snuffleupagus/Parser_makeFilter-maybeLength Only skip parsing a stream in `Parser_makeFilter` when we know for sure that it is empty (PR 6372 follow-up)	2016-10-05 10:38:12 -05:00
Jonas Jenwald	54ee83eb12	Attempt to skip zero bytes at the end of Scan blocks when decoding JPEG images (issue 4090)	2016-09-28 16:31:02 +02:00
Jonas Jenwald	116ba19dd9	Respect the 'ColorTransform' entry in the image dictionary when decoding JPEG images (bug 956965, issue 6574) Fixes https://bugzilla.mozilla.org/show_bug.cgi?id=956965. Fixes 6574.	2016-09-26 21:55:43 +02:00
Jonas Jenwald	a22f0ae820	Only skip parsing a stream in `Parser_makeFilter` when we know for sure that it is empty (PR 6372 follow-up) For PDF files with multiple `/Filter`s, where the `/Length` entry is zero, we fail to render the file correctly. The reason is that `maybeLength` is `null` for every filter except the first, and `!maybeLength` is thus truthy. Hence it seems that we should completely ignore the `/Length` entry and also explicitly check `maybeLength === 0`. Note that I've not (yet) come across a PDF file with this issue in the wild, but given all the stupid things PDF generators do I wouldn't be surprised if such a file actually exists. In order to prevent a possible future bug, I'm submitting this patch which includes a hand-edited PDF file that we currently cannot render correctly (but e.g. Adobe Reader can).	2016-09-25 12:40:15 +02:00
Jonas Jenwald	3e77cf6b32	Prevent an infinite loop in `XRef_fetchUncompressed` for encrypted PDF files with indirect objects in the /Encrypt dictionary (issue 7665)	2016-09-25 00:18:47 +02:00
Jonas Jenwald	6c263c1994	Merge pull request #7649 from timvandermeij/interactive-forms-tx-comb Text widget annotations: implement comb support	2016-09-22 11:36:30 +02:00
Tim van der Meij	375229d6b9	Widget annotations: simplify field flag handling Directly use the hexadecimal representation, just like the `AnnotationFlags`, to avoid calculations and to improve readability. This allows us to simplify the unit tests for text widget annotations as well.	2016-09-21 21:11:37 +02:00
Jonas Jenwald	5f16cbd2c0	When rendering forms, don't use `element.value` since it prevents the AnnotationLayer rasterizer (in `test/driver.js`) from parsing the elements correctly Without this, the reference test images will have empty fields despite the viewer working as intended.	2016-09-21 12:33:04 +02:00
Jonas Jenwald	ded01356c7	Pass in the `renderInteractiveForms` parameter to `Annotation_appendToOperatorList`, in `Page_getOperatorList`, instead of to the `Annotation` constructor (PR 7633 follow-up) When debugging issue 7643, I noticed that the `forms` tests currently don't look like the rendering in the viewer (with `renderInteractiveForms = true` set). After scratching my head for a little while, I realized that PR 7633 makes the implicit assumption that `Page_getOperatorList` (in `core/document.js`) is called before fetching the annotation with `PDFPageProxy_getAnnotations` (in `display/api.js`). Hence this patch, which changes it so that we instead pass in the `renderInteractiveForms` parameter to `Annotation_appendToOperatorList` to ensure that it's always correctly set.	2016-09-21 12:21:20 +02:00
Tim van der Meij	6100ab4b18	Text widget annotations: implement comb support	2016-09-20 22:31:10 +02:00
Brendan Dahl	15e1ae4e3f	Merge pull request #7639 from Snuffleupagus/bug-1252420 Replace empty CharStrings with '.notdef' in `Type1Font_wrap` to prevent OTS from rejecting the font (bug 1252420)	2016-09-20 11:56:47 -07:00
Tim van der Meij	ab1b4cec5d	Merge pull request #7640 from timvandermeij/interactive-forms-rm-global Interactive forms: remove global PDFJS usage	2016-09-19 01:02:44 +02:00
Tim van der Meij	2da2c45889	Interactive forms: remove global PDFJS usage	2016-09-19 00:12:42 +02:00
Jonas Jenwald	170871ab3d	Prevent rendering `TextWidgetAnnotation`s in both the `core`/`display` layer (issue 7643)	2016-09-18 15:42:22 +02:00
Tim van der Meij	f062695d62	Merge pull request #7633 from timvandermeij/interactive-forms-tx-flags Text widget annotations: support read-only/multiline fields and improve testing	2016-09-17 17:19:47 +02:00
Tim van der Meij	dbea302a6e	Text widget annotations: do not render on canvas as well If interactive forms are enabled, then the display layer takes care of rendering the form elements. There is no need to draw them on the canvas as well. This also leads to issues when values are prefilled, because the text fields are transparent, so the contents that have been rendered onto the canvas will be visible too. We address this issue by passing the `renderInteractiveForms` parameter to the render task and handling it when the page is rendered (i.e., when the canvas is rendered).	2016-09-17 15:24:48 +02:00
Tim van der Meij	adf0972ca5	Text widget annotations: improve unit and reference tests This patch improves the unit tests by testing the support for read-only and multiline fields. Moreover, we add a reference test to ensure that the text widgets are not only rendered, but also that their contents are styled properly. Finally, we perform minor improvements in `src/core/annotation.js`, for example adding missing comments.	2016-09-17 15:24:48 +02:00
Tim van der Meij	f6965fadc0	Text widget annotations: support multiline and read-only fields Moreover, this patch provides us with a framework for handling field flags in general for all types of widget annotations.	2016-09-17 15:24:47 +02:00
Jonas Jenwald	aadcbe98c8	Replace empty CharStrings with '.notdef' in `Type1Font_wrap` to prevent OTS from rejecting the font (bug 1252420) Fixes https://bugzilla.mozilla.org/show_bug.cgi?id=1252420.	2016-09-17 14:39:10 +02:00
Jonas Jenwald	4acd31f51e	Merge pull request #7550 from Snuffleupagus/Type1-toUnicode-builtInEncoding-fallback For embedded Type1 fonts without included `ToUnicode`/`Encoding` data, attempt to improve text selection by using the `builtInEncoding` to amend the `toUnicode` map (issue 6901, issue 7182, issue 7217, bug 917796, bug 1242142)	2016-09-16 17:51:55 +02:00
Tim van der Meij	26da2d57ce	Merge pull request #7632 from Snuffleupagus/more-efficient-expandTextDivs [EnhanceTextSelection] Make `expandTextDivs` more efficient by updating all styles at once instead of piecewise	2016-09-15 16:01:07 +02:00
Jonas Jenwald	8eaa2cbce3	Remove the deprecated `mozDash`/`mozDashOffset` canvas 2D context methods According to [MDN](https://developer.mozilla.org/en-US/docs/Web/API/CanvasRenderingContext2D/setLineDash#Browser_compatibility) the standard versions of these methods have been supported since Firefox 27, which was released over two and a half years ago. (See the dates in https://wiki.mozilla.org/RapidRelease/Calendar#Past_branch_dates) Furthermore the non-standard properties are now in the process of being removed, please see https://groups.google.com/forum/#!topic/mozilla.dev.platform/UIudMABegcY. Hence I don't think that we need to keep the old `moz` prefixed ones as fallback any more.	2016-09-15 10:05:40 +02:00
Jonas Jenwald	cb5f9df0c8	[EnhanceTextSelection] Make `expandTextDivs` more efficient by updating all styles at once instead of piecewise I intended to provide proper benchmarking results here, as outlined in https://github.com/mozilla/pdf.js/wiki/Benchmarking-your-changes, but after wasting a couple of hours over the weekend getting weird results I gave up. It appears that there's a lot of, i.e. way too much, variance between subsequent runs of `text` tests for the results to be meaningful. (Previously I've only benchmarked `eq` tests, so I don't know if the `text` tests have never worked well or if it's a newer problem. For reference, please see the results of back-to-back benchmark runs on the current `master` with a very simple manifest file: [link here].) Instead I used `console.time/timeEnd` in `appendText` and `expandTextDivs` to be able to compare the performance with/without the patch. The entire viewer was (skip-cache) reloaded between measurements, and the results are available here: [link here]. Given the troubles I've had with benchmarking, I've not yet computed any statistics on the results (e.g. mean, variance, confidence intervals, and so on). However, just by looking at the data I think it's safe to say that this patch first of all doesn't seem to regress the current performance. Secondly it certainly looks very likely that this patch actually improves the performance, especially for the one-glyph-per-text-div case (cf. issue 7224). Re: issue 7584.	2016-09-14 21:19:28 +02:00
Tim van der Meij	323e86c442	Text widget annotations: implement unit testing and sanitize data values	2016-09-13 14:57:11 +02:00
Jonas Jenwald	356b321f6d	Fallback to the `StandardEncoding` for Nonsymbolic fonts without `/Encoding` entry (issue 7580) Even though this patch passes all tests (unit/font/reference) locally, including the new ones that I added in PR 7621, I'm still a bit nervous about modifying the code that chooses the fallback encoding for fonts without an `/Encoding` entry. Note that over the years this code has been changed on a number of occasions, see a possibly incomplete [list here], to deal with various cases of incorrect font data. According to the PDF specification, see http://www.adobe.com/content/dam/Adobe/en/devnet/acrobat/pdfs/PDF32000_2008.pdf#G8.1904184, it seems that we should fall back to the `StandardEncoding` for Nonsymbolic fonts. There's obviously a risk that fixing this particular issue could break other PDF files for which we don't have tests. However I've tried to change the logic as little as possible in this patch, to hopefully reduce possible breakage. Based on debugging numerous font issues, it seems that a lot of fonts actually set the Symbolic flag, even when they are in fact not Symbolic. Fonts actually marked as Nonsymbolic seem to be somewhat less common, which I hope should reduce the risk of the patch somewhat. Fixes 7580.	2016-09-13 14:07:16 +02:00
Yash Srivastav	4e428c7675	Fix lint warnings in URL polyfill	2016-09-12 20:34:51 +05:30
Tim van der Meij	03588ccbf7	Merge pull request #7623 from Snuffleupagus/jpx-error Change `src/core/jpx.js` to use the `error` utility function instead of using `throw new Error`	2016-09-12 15:34:05 +02:00
Yury Delendik	160b176109	Adding "proper" message port for fake worker.	2016-09-12 11:17:10 +02:00
Jonas Jenwald	f620f61887	Change `src/core/jpx.js` to use the `error` utility function instead of using `throw new Error` Note that in `parseCodestream` I purposely left the `throw new Error` instances inside of the `try` block, since we don't want to throw any `Errors` while in recovery mode. Finally somewhat unrelated to the rest of the patch, but I moved the `doNotRecover` variable declaration outside of the `try` block to avoid variable hoisting given that it's accessed inside the `catch` block.	2016-09-12 11:05:43 +02:00
Jonas Jenwald	325f7afcca	For embedded Type1 fonts without included `ToUnicode`/`Encoding` data, attempt to improve text selection by using the `builtInEncoding` to amend the `toUnicode` map (issue 6901, issue 7182, issue 7217, bug 917796, bug 1242142) Note that in order to prevent any possible issues, this patch does not try to amend the `toUnicode` data for Type1 fonts that contain either `ToUnicode` or `Encoding` entries in the font dictionary. Fixes, or at least improves, issues/bugs such as e.g. 6658, 6901, 7182, 7217, bug 917796, bug 1242142.	2016-09-11 20:54:10 +02:00
Tim van der Meij	be485f59ab	Text widget annotations: implement maximum length and text alignment Moreover, we refactor the code a bit to extract code that is shared between the two branches and we only apply text alignment (and create the array) when it is actually defined, since it's optional and left is already the default.	2016-09-11 20:49:00 +02:00
Jonas Jenwald	0b75f63c03	Don't duplicate the first entry in the `charCodeToGlyphId` map for CIDFontType2 fonts with a `CIDToGIDMap` that already mapped the first entry to a non-zero `glyphId` (issue 7544) Fixes 7544.	2016-09-09 22:33:41 +02:00
Tim van der Meij	b112f9f9f4	Merge pull request #7600 from Snuffleupagus/issue-7598 Check that Type1C fonts does not actually contain OpenType font files (issue 7598)	2016-09-09 22:02:58 +02:00
Tim van der Meij	e686db250c	Render interactive form (AcroForm) text widget annotations This patch is the first step towards implementing support for interactive forms (AcroForms). It makes it possible to render text widget annotations exactly like Adobe Reader/Acrobat. Everything we implement for AcroForms is disabled by default using a preference, mainly because it is not ready to use yet, but has to be implemented in many steps to avoid complexity. The preference allows us to work with the code while not exposing the behavior by default. Mainly storing entered values and printing them is still absent, which would be minimal requirements for enabling this by default.	2016-09-07 15:37:28 +02:00
Jonas Jenwald	8dbb5a7c4a	Merge pull request #7596 from timvandermeij/widget-annotation-cleanup Improve the structure for widget annotations	2016-09-06 13:46:31 +02:00
Jonas Jenwald	44b75c01a1	Check that Type1C fonts does not actually contain OpenType font files (issue 7598) This patch is yet another instalment in the (never ending) series of patches for PDF files that specify completely incorrect Type/Subtype for their fonts. In this case Type1/Type1C, when in fact OpenType would have been correct. Fixes 7598.	2016-09-06 10:13:11 +02:00
Tim van der Meij	576f742047	Improve the structure for widget annotations Currently, we only support text widget annotations (field type 'Tx') partially. However, the current code does not make this entirely clear and does not provide a warning when an unsupported field type is encountered, making it harder to determine why rendering fails. Moreover, in the display layer we make no distinction between the various types of widget annotations, causing the code for text widget annotations to also be executed for other types of widget annotations in a fallback situation. This patch improves the structure of the widget annotation code. In the core layer, we use the same structure we use for non-widget annotations in the factory and provide a clear warning when an unsupported type is encountered. In the display layer, we do the same and split the `WidgetAnnotationElement` class into two classes, namely `TextWidgetAnnotationElement` for text widget annotations and `WidgetAnnotationElement` for other unsupported annotations as a fallback. From this it is clear that we only support text widget annotations and nothing else.	2016-09-06 00:26:05 +02:00
Jonas Jenwald	37998076c9	In `display/api.js` ensure that we always reject with an `Error` in `JpegDecode`, and adjust a couple of other rejection sites as well In the case where the document was destroyed, we were rejecting the `Promise` in `JpegDecode` with a string instead of an `Error`. The patch also brings the wording more inline with other such rejections. Use the `isInt` utility function when validating the `pageNumber` parameter in `WorkerTransport_getPage`, to make it more obvious what's actually happening. There's also a couple more unit-tests added, to ensure that we always fail in the expected way. Finally, we can simplify the rejection handling in `WorkerTransport_getPageIndexByRef` somewhat. (Note that the only reason for using `catch` here is that since the promise is rejected on the worker side, the `reason` becomes a string instead of an `Error` which is why we "re-reject" on the display side.)	2016-09-05 16:35:32 +02:00
Jonas Jenwald	38c85039d1	Merge pull request #7588 from timvandermeij/text-layer-weakmap Use a `WeakMap` in `src/display/text_layer.js`	2016-09-04 21:25:48 +02:00
Tim van der Meij	96593571eb	Optimize scale calculation in `text_layer.js` This patch avoids having to calculate the scale twice by saving it in the properties object. Moreover, we remove a temporary variable and place parentheses around a calculation inside a string concatenation.	2016-09-04 20:19:31 +02:00
Jonas Jenwald	a35773ec8c	Change `src/core/jpg.js` to use the `error` utility function instead of `throw`ing This allows us to remove the `try/catch` statements used in `src/core/stream.js` when parsing JPEG images. As far as I can tell, the only reason for the current usage of plain `throw` is that `jpg.js` originally was external code. Given that this code now lives in our repo, this patch brings the JPEG code more in line with e.g. `src/core/jpx.js` and `src/core/jbig2.js`.	2016-09-04 16:28:23 +02:00
Tim van der Meij	d03651efff	Merge pull request #7407 from Snuffleupagus/issue-7406 Assign the `quantizationTables` after parsing the entire JPEG image, to prevent issues when the DQT (Define Quantization Tables) marker is encountered after SOF{n} (Start of Frame) markers (issue 7406)	2016-09-04 14:49:01 +02:00
Tim van der Meij	b3818d5c36	Replace `div.dataset` with a `WeakMap` in `text_layer.js` This patch improves performance by avoiding unnecessary type conversions, which also helps the JIT for optimizations. Moreover, this patch fixes issues with the div expansion code where `textScale` would be undefined in a division. Because of the `dataset` usage, other comparisons evaluated to `true` while `false` would have been correct. This makes the expansion mode now work correctly for cases with, for example, each glyph in one div. The polyfill for `WeakMap` has been provided by @yurydelendik.	2016-09-03 20:06:42 +02:00
Tim van der Meij	b10add14f3	Refactor `text_layer.js` to pass the task as a parameter We pass many parameters to `appendText` while we might as well pass the `task` object that contains them. This saves a few lines of code and makes the signature of `appendText` more clear. We do the same for `expand`, which is useful for the next commit in which we replace `div.dataset` with a `WeakMap`. Furthermore, this patch adds a missing parameter to a comment block to make it clear which parameters remain.	2016-09-02 20:46:36 +02:00
Tim van der Meij	7c961b6b7a	Minor code style improvements after #7539	2016-09-01 18:07:12 +02:00
Tim van der Meij	6bb95e3129	Merge pull request #7539 from jeremypress/fairexpand [api-minor] Expanding divs to improve selection	2016-09-01 17:43:31 +02:00
Jeremy Press	6faa84abdb	Continuing fairexpand #6663 1. Expanding divs to improve text selection. (Yury) 2. Adding enhanceTextSelection as an option. 3. Moving feature functionality from text_layer_builder.js to text_layer.js. 4. Added expandTextDivs method to only load expanded divs on first click, and only show on subsequent clicks	2016-08-31 09:54:52 -07:00
Jonas Jenwald	1bbc694ac3	Assign the `quantizationTables` after parsing the entire JPEG image, to prevent issues when the DQT (Define Quantization Tables) marker is encountered after SOF{n} (Start of Frame) markers (issue 7406) This is a tentative patch that fixes 7406.	2016-08-31 18:42:05 +02:00
Yury Delendik	ffa99397ad	Merge pull request #7387 from Snuffleupagus/issue-5808 Attempt to ignore multiple identical Tf (setFont) commands in `PartialEvaluator_getTextContent` (issue 5808)	2016-08-30 15:21:41 -05:00
Tim van der Meij	f520616e00	Merge pull request #7570 from Snuffleupagus/issue-7569 Create a fallback annotation `id` for entries in `Annots` dictionaries that are not indirect objects (issue 7569)	2016-08-28 00:23:59 +02:00
Jonas Jenwald	088ce6c009	Add a unit-test to check that `ProblematicCharRanges` contains valid entries When adding new entries to `ProblematicCharRanges`, you have to be careful to not make any mistakes since that could cause glyph mapping issues. Currently the existing reference tests should probably help catch any errors, but based on experience I think that having a unit-test which specifically checks `ProblematicCharRanges` would be both helpful and timesaving when modifying/reviewing changes to this code. Hence this patch which adds a function (and unit-test) that is used to validate the entries in `ProblematicCharRanges`, and also checks that we don't accidentally add more character ranges than the Private Use Area can actually contain. The way that the validation code, and thus the unit-test, is implemented also means that we have an easy way to tell how much of the Private Use Area is potentially utilized by re-mapped characters.	2016-08-27 11:56:00 +02:00
Jonas Jenwald	78889646c8	Create a fallback annotation `id` for entries in `Annots` dictionaries that are not indirect objects (issue 7569) According to the PDF specification, see http://www.adobe.com/content/dam/Adobe/en/devnet/acrobat/pdfs/PDF32000_2008.pdf#page=86, entries in `Annots` dictionaries should be indirect objects, but obviously there're PDF generators that ignore this. Fixes 7569.	2016-08-27 10:56:16 +02:00
Jonas Jenwald	5379749d4b	Try to prevent `CanvasGraphics_getSinglePixelWidth` from intermittently returning incorrect values in Firefox (issue 7188) Fixes 7188.	2016-08-22 20:00:24 +02:00
Tim van der Meij	b4c8814fc9	Merge pull request #7534 from Snuffleupagus/isName-name-check Add a parameter to the `isName` function that enables checking not just that something is a `Name`, but also that the actual `name` properties matches	2016-08-17 15:48:42 +02:00
Jonas Jenwald	544d29f5cb	Add a `recoveryMode` that suppresses errors from the `Parser`, and utilize it when searching for the main trailer in `XRef_indexObjects` (bug 1250079) Instead of having `Parser_getObj` fail unconditionally for the referenced PDF file, this patch attempts to let searching for the main trailer continue even if there are errors. Fixes https://bugzilla.mozilla.org/show_bug.cgi?id=1250079.	2016-08-17 12:37:35 +02:00
Jonas Jenwald	83ce6f0b6d	Adjust the (applicable) existing `isName` callsites to use the new `isName(v, name)` version of the function	2016-08-10 11:15:08 +02:00
Jonas Jenwald	af636aae96	Add a parameter to the `isName` function that enables checking not just that something is a `Name`, but also that the actual `name` properties matches This is similar to the existing `isCmd` and `isDict` functions, which already support similar kind of checks. With the updated `isName` function, we'll be able to simplify many callsites from: `isName(someVariable) && someVariable.name === 'someName'` to: `isName(someVariable, 'someName')`.	2016-08-10 11:15:03 +02:00
Jonas Jenwald	77c6ed5389	Attempt to ignore multiple identical Tf (setFont) commands in `PartialEvaluator_getTextContent` (issue 5808) This patch improves the performance of issue 5808, but I'm not sure if it's enough to call it fixed. On average, this patch reduces the number of textLayer div's by a factor of 3, and it also reduces the time spent in `getTextContent` by a factor of ~2. The PDF file is generated by `Scribus PDF`, which for reasons I cannot understand is placing redundant `Tf` commands before every showText command. Note how the PDF file also contains lots of (basically) identical fonts, but with slightly different names, which causes unnecessary font-switching. This causes some unnecessary breaking of textLayer div's, but this issue cannot be easily worked around.	2016-07-27 21:37:52 +02:00
Yury Delendik	a02e2686b9	Merge pull request #7475 from Snuffleupagus/api-getTextContent-combineTextItems [api-minor] Add a parameter to `PDFPageProxy_getTextContent` that controls whether `PartialEvaluator_getTextContent` will attempt to combine same line text items	2016-07-27 08:34:24 -05:00
Jonas Jenwald	558a22cd02	Prevent errors when parsing Annotations with missing (or invalid) /Subtype entries (issue 7446) Note that I used a separate warning message for this case, instead of utilizing the same one as in the unsupported subtype case, to more clearly indicate that the PDF file itself is to blame rather than PDF.js. Fixes 7446.	2016-07-25 13:59:26 +02:00
Brendan Dahl	5678486802	Merge pull request #7347 from Snuffleupagus/evaluator-more-Ref_toString Slightly refactor the `fontRef` handling in `PartialEvaluator_loadFont` (issue 7403 and issue 7402)	2016-07-22 17:21:47 -07:00
Brendan Dahl	50d6e4f147	Merge pull request #7447 from Snuffleupagus/buildToUnicode-notdef Ignore .notdef in the `differences` array when building a fallback `toUnicode` map in `PartialEvaluator_buildToUnicode` (issue 5256)	2016-07-22 14:33:32 -07:00
Jonas Jenwald	390c02a3e9	Attempt to cache fonts that are direct objects (i.e. `Dict`s), as opposed to `Ref`s, to prevent re-rendering after `cleanup` from breaking (issue 7403 and issue 7402) Fonts that are not referenced by `Ref`s are very uncommon in practice, but it can unfortunately happen. In this case, we're currently not caching them in the usual way, i.e. by `Ref`, which leads to failures when a page is rendered after `cleanup` has run. The simplest solution would have been to remove the `font.translated` workaround, but since this would have meant loading these kind of fonts over and over, the patch attempts to be a bit clever about this situation. Note that if we instead loaded fonts per page, instead of per document, this issue wouldn't have existed.	2016-07-21 16:04:07 +02:00
Jonas Jenwald	2e9cd3ea64	Slightly refactor the `fontRef` handling in `PartialEvaluator_loadFont` (issue 7403 and issue 7402) Originally, I was just going to change this code to use `Ref_toString` in a couple more places. When I started reading the code, I figured that it wouldn't hurt to clean up a couple of comments. While doing this, I noticed that the logic for the (rare) `isDict(fontRef)` case could do with a few improvements. There should be no functional changes with this patch, but given the added reference checks, we will now avoid bogus `Ref`s when resolving font aliases. In practice, as issue 7403 shows, the current code can break certain PDF files even if it's very rare. Note that the only thing that this patch will change, is the `font.loadedName` in the case where a `fontRef` is a reference and the font doesn't have a descriptor. Previously for `fontRef = Ref(4, 0)` we'd get `font.loadedName = 'g_d0_f4_0'`, and with this patch `font.loadedName = g_d0_f4R`, which is actually one character shorter in most cases. (Given that `Ref_toString` contains an optimization for the `gen === 0` case, which is by far the most common `gen` value.) In the already existing fallback case, where the `fontName` is used when creating the `font.loadedName`, we allow any alphanumeric character. Hence I don't see how (as mentioned above) e.g. `font.loadedName = g_d0_f4R` would be an issue here.	2016-07-21 16:03:33 +02:00
Tim van der Meij	10f9f11ec4	Merge pull request #7490 from Snuffleupagus/issue-7426 Don't map glyphs to the Lepcha Unicode block (issue 7426)	2016-07-21 14:39:19 +02:00
Jonas Jenwald	f297e4d17c	[api-minor] Add a parameter to `PDFPageProxy_getTextContent` that controls whether `PartialEvaluator_getTextContent` will attempt to combine same line text items From the discussion in issue 7445, it seems that there may be cases where an API consumer would want to get the text content as is, without combined text items.	2016-07-19 13:38:57 +02:00
Jonas Jenwald	90d19de935	Catch errors and continue parsing in `parseCMap` (issue 7492) After PR 7039, the PDF file in issue 7492 no longer renders at all, but note that text selection wasn't working correctly previously. The problem with the PDF file in issue 7492 is that the `cMap`, in the `toUnicode` entry in the font, contains an invalid name: ``` /CMapName /-usr-share-fonts-truetype-Panton-Panton Family-Fontfabric - Panton.otf,000-UTF16 def ``` When we parse that line, things obviously break because there are spaces present in the wrong places. To avoid that issue, the patch simply lets `parseCMap` continue when errors are encountered, to try and recover usable data. Note that by not aborting immediately when an error is encountered, we are also able to fix the text selection. Obviously, it could be argued that we should just immediately reject a corrupt `cMap`. But given that they usually are correct, it seems that trying to recover as much data as possible from a corrupt one can only be a good thing for both glyph mapping and text selection. Fixes 7492.	2016-07-18 16:39:56 +02:00
Jonas Jenwald	64783c8b6e	Don't map glyphs to the Lepcha Unicode block (issue 7426) In the PDF file in the issue, some of the glyphs end up being mapped to the Lepcha Unicode block; see https://en.wikipedia.org/wiki/Lepcha_(Unicode_block). This didn't use to matter, but after HarfBuzz updates that improved support for Lepcha fonts, in particular https://bugzilla.mozilla.org/show_bug.cgi?id=1249861, some glyphs are now moved horizontally. To avoid that, this patch adds the Lepcha block to the list of Unicode ranges that we skip when building the glyph mapping. Fixes 7426.	2016-07-17 16:53:36 +02:00
klemens	6f03f62327	trivial spelling fixes	2016-07-17 14:33:41 +02:00
Jonas Jenwald	8f4ec669d0	Remove the obsolete `MozBlobBuilder` fallback from the `createBlob` utility function `MozBlobBuilder` has been obsolete since Firefox 14, so there's no reason to keep this code around anymore.	2016-07-09 16:37:05 +02:00
Jonas Jenwald	51e46fa1a7	Change the `warn` to `info` in `recoverGlyphName` to reduce the console spam After PR 7441, where `recoverGlyphName` is used a lot more than before, many PDF files will generate a lot of warnings in the console. For normal usage, compared to debugging/development, this is probably more annoying than helpful.	2016-07-09 12:08:41 +02:00
Tim van der Meij	b6826a46a8	Merge pull request #7453 from simoncpu/master Expose the text widget's maximum length.	2016-07-07 01:03:40 +02:00
Brendan Dahl	1f3f4a8dd7	Merge pull request #7441 from Snuffleupagus/issue-7439 Fallback to attempt to recover standard glyph names when amending the `charCodeToGlyphId` with entries from the `differences` array in `type1FontGlyphMapping` (issue 7439)	2016-07-06 13:02:21 -07:00
Brendan Dahl	e2e657e44f	Merge pull request #7390 from Snuffleupagus/issue-7180 Add upper-case `I` as a possible space replacement fallback in `Font.spaceWidth` to improve text-selection (issue 7180)	2016-06-29 15:11:19 -07:00
Simon Cornelius P. Umacob	d872fc90b9	Expose the text widget's maximum length.	2016-06-29 17:04:33 +08:00
Jonas Jenwald	bdd58ab1d2	Ignore .notdef in the `differences` array when building a fallback `toUnicode` map in `PartialEvaluator_buildToUnicode` (issue 5256) Fixes 5256.	2016-06-27 16:20:23 +02:00
Jonas Jenwald	7866109af9	Fallback to attempt to recover standard glyph names when amending the `charCodeToGlyphId` with entries from the `differences` array in `type1FontGlyphMapping` (issue 7439) Fixes 7439.	2016-06-25 14:54:34 +02:00
Jonas Jenwald	c1ca268ef3	Skip mapping of glyphs to Unicode "Ideographic space" (issue 7416) Fixes 7416, which is an IE specific issue.	2016-06-22 08:58:00 +02:00
Tim van der Meij	f97d52182a	Merge pull request #7341 from Snuffleupagus/getDestinationHash-Array [api-minor] Improve handling of links that are using explicit destination arrays	2016-06-09 00:29:10 +02:00
Tim van der Meij	70b3eea4a3	Merge pull request #7389 from Snuffleupagus/move-isSpace Move the `isSpace` utility function from core/parser.js to shared/util.js	2016-06-08 22:57:45 +02:00
Jonas Jenwald	6a0b047bfa	Add upper-case `I` as a possible space replacement fallback in `Font.spaceWidth` to improve text-selection (issue 7180) In fonts with only upper-case glyphs, that are also missing a space glyph, `get spaceWidth` won't be able to return anything useful. By adding upper-case `I` as a fallback, we can thus improve text-selection in some PDF files. Note that locally, the patch causes slight movement in a few existing `text` tests, but in my opinion this actually looks like slight improvements. Fixes 7180.	2016-06-07 22:55:25 +02:00
Jonas Jenwald	6260fc09a3	Attempt to recover valid `format 3` FDSelect data from broken CFF fonts (bug 1146106) According to the CFF specification, see http://partners.adobe.com/public/developer/en/font/5176.CFF.pdf#G3.46884, for `format 3` FDSelect data: "The first range must have a ‘first’ GID of 0". Since the PDF file (attached in the bug) violates that part of the specification, this patch tries to recover valid FDSelect data to prevent OTS from rejecting the font. Fixes https://bugzilla.mozilla.org/show_bug.cgi?id=1146106.	2016-06-06 18:20:52 +02:00
Jonas Jenwald	a36a946976	Move the `isSpace` utility function from core/parser.js to shared/util.js Currently the `isSpace` utility function is a member of `Lexer`, which seems suboptimal, given that it's placed in `core/parser.js`. In practice, this means that in a number of `core/*.js` files we thus have an otherwise completely unnecessary dependency on `core/parser.js` for a one-line function. Instead, this patch moves `isSpace` into `shared/util.js` which seems more appropriate for this kind of utility function. Not to mention that since all the affected `core/*.js` files already depend on `shared/util.js`, this doesn't incur any more file dependencies.	2016-06-06 09:11:33 +02:00
Jonas Jenwald	b02d560ae0	Fix errors in `setGState` in `PartialEvaluator_getTextContent` that prevents text-selection from working properly Currently `setGState` is completely broken, and looking through the history of that code, it seems to me that this may never have worked correctly. This patch fixes the text-selection in `extgstate.pdf` in the test-suite, which is also added as a `text` test.	2016-06-01 22:58:49 +02:00
Jonas Jenwald	98fe094d18	Let non-viewable Popup Annotations inherit the parent's Annotation Flags if the parent is viewable Fixes http://www.pdf-archive.com/2013/09/30/file2/file2.pdf. Note how it's not possible to show the various Popup Annotations in the above document. To fix that, this patch lets the Popup inherit the flags of the parent, in the special case where the parent is `viewable` and the Popup is not. In general, I don't think that a Popup must have the same flags set as the parent. However, it seems very strange to have a `viewable` parent annotation, and then not being able to view the Popup. Annoyingly the PDF specification doesn't, as far as I can find, mention anything about how this case should be handled, but this patch seems consistent with the actual behaviour in Adobe Reader.	2016-05-25 23:00:26 +02:00
Brendan Dahl	b86610ffdb	Merge pull request #7300 from Snuffleupagus/bug-1068432 Prevent adding invalid values in `CFFDict_setByKey` (bug 1068432)	2016-05-24 12:12:38 -07:00
Jonas Jenwald	b354682dd6	[api-minor] Let `LinkAnnotation`/`PDFLinkService_getDestinationHash` return a stringified version of the destination array for explicit destinations Currently for explicit destinations, compared to named destinations, we manually try to build a hash that often times is a quite poor representation of the actual destination. (Currently this only, kind of, works for `\XYZ` destinations.) For PDF files using explicit destinations, this can make it difficult/impossible to obtain a link to a specific section of the document through the URL. Note that in practice most PDF files, especially newer ones, use named destinations and these are thus unaffected by this patch. This patch also fixes an existing issue in `PDFLinkService_getDestinationHash`, where a named destination consisting of only a number would not be handled correctly. With the added, and already existing, type checks in place for destinations, I really don't think that this patch exposes any "sensitive" internal destination code not already accessible through normal hash parameters. Please note: Just trying to improve the algorithm that generates the hash is unfortunately not possible in general, since there are a number of cases where it will simply never work well. - First of all, note that `getDestinationHash` currently relies on the `_pagesRefCache`, hence it's possible that the hash returned is empty during e.g. ranged/streamed loading of a PDF file. - Second of all, the currently computed hash is actually dependent on the document rotation. With named destinations, the fetched internal destination array is rotationally invariant (as it should be), but this will not hold in general for the hash. We can easily avoid this issue by using a stringified destination array. - Third of all, note that according to the PDF specification[1], `GoToR` destinations may actually contain explicit destination arrays. Since we cannot really construct a hash in `annotation.js`, we currently have no good way to support those. Even though this case seems very rare in practice (I've not actually seen such a PDF file), it's in the specification, and this patch allows us to support that for "free". --- [1] http://www.adobe.com/content/dam/Adobe/en/devnet/acrobat/pdfs/PDF32000_2008.pdf#G11.1951685	2016-05-21 14:14:07 +02:00
Jonas Jenwald	01ab15a6f1	[api-minor] Let `Catalog_getPageIndex` check that the `Ref` actually points to a /Page dictionary Currently the `getPageIndex` method will happily return `0`, even if the `Ref` parameter doesn't actually point to a proper /Page dictionary. Having the API trust that the consumer is doing the right thing seems error-prone, hence this patch which adds a check for this case. Given that the `Catalog_getPageIndex` method isn't used in any hot part of the codebase, this extra check shouldn't be a problem. (Note: in the standard viewer, it is only ever used from `PDFLinkService_navigateTo` if a destination needs to be resolved during document loading, which isn't common enough to be an issue IMHO.)	2016-05-21 14:13:41 +02:00
Tim van der Meij	db46829ef7	Merge pull request #7316 from timvandermeij/remove-unused Remove unused variables	2016-05-21 14:07:33 +02:00
Jonas Jenwald	7ddb0bc718	Attempt to combine text runs positioned with `setTextMatrix`	2016-05-18 17:21:58 +02:00
Tim van der Meij	6a7012aaca	Remove unused variables These have been found using `gulp lint` in combination with the `unused: true` parameter for JSHint. Unfortunately there are too many false positives to enable this feature, but now that most globals have been removed because of the conversion to UMD the results are much more useful than before.	2016-05-11 16:11:13 +02:00
Tim van der Meij	c1c199d702	Merge pull request #7295 from Snuffleupagus/core-getArray Use `Dict_getArray` in more places in `src/core/` to avoid issues when Arrays contain indirect objects	2016-05-10 23:21:54 +02:00
Jonas Jenwald	182d33800a	Ignore 'endobj' commands inside of `ObjStm` streams (issue 5241, bug 898610, bug 1037816) According to an example in the PDF specification, see http://www.adobe.com/content/dam/Adobe/en/devnet/acrobat/pdfs/PDF32000_2008.pdf#page=56, an `ObjStm` stream should not contain 'endobj' commands. Fixes 5241. Fixes https://bugzilla.mozilla.org/show_bug.cgi?id=898610. Fixes https://bugzilla.mozilla.org/show_bug.cgi?id=1037816.	2016-05-09 09:50:45 +02:00
Jonas Jenwald	c9b6de3b16	Prevent adding invalid values in `CFFDict_setByKey` (bug 1068432) In the font in question, there are a couple of `topDict` entries that have invalid values (`0xF 0xF`, i.e. just eof markers without any actual numbers). This causes the `parseFloatOperand` function, inside `CFFParser_parseDict`, to return `NaN`. Currently we pass this broken font onto the browser, which OTS unsurprisingly rejects. Fixes https://bugzilla.mozilla.org/show_bug.cgi?id=1068432.	2016-05-07 21:09:58 +02:00
Jonas Jenwald	6111c17c8a	Use `Dict_getArray` in more places in `src/core/` to avoid issues when Arrays contain indirect objects As evident from e.g. PRs 6485 and 7118, some bad PDF generators unfortunately create Arrays where some elements are indirect objects (i.e. `Ref`s). This seems to mostly affect Arrays that contain numbers, such as e.g. `Matrix/FontMatrix/BBox/FontBBox/Rect/Color/...`, and has manifested itself in PDF files that fail to render correctly (some elements are missing). The problem in both the cases above, besides broken rendering, was that there were no errors/warnings that indicated what the problem was, making it difficult to pinpoint the issue. Hence this patch, where I've audited all usages of `Dict_get` in `src/core/` files, and replaced it with `Dict_getArray` where appropriate to try and prevent unnecessary future bugs.	2016-05-05 19:42:57 +02:00
Tim van der Meij	9c95d089de	Merge pull request #7281 from yurydelendik/static-warnings Fixes some static analysis warnings and recommendations	2016-05-03 01:26:17 +02:00
Yury Delendik	32ce369d88	Fixes some static analysis warnings and recommendations * Useless conditional * Superfluous trailing arguments * Useless assignment to local variable * Misspelled identifier * JSDoc tag for non-existent parameter	2016-05-02 17:34:58 -05:00
Yury Delendik	4cde9984f9	Fixes unneeded conversion to array in CFF encodeInteger.	2016-05-02 15:24:16 -05:00
Yury Delendik	4016f9fd43	Fixes weird loop in the bidi.js. Closes #7231.	2016-04-27 16:14:33 -05:00
Yury Delendik	3d49879211	Merge pull request #7130 from nschloe/patch-1 Add element to text layer even if width === 0	2016-04-22 16:10:07 -05:00
Jonas Jenwald	9ceeb21741	Prevent accidentally overriding the `error` function in the `commonobj` messageHandler in api.js (issue 7232) This naming issue has been present since PR 3529, but at least I cannot find any issues/bugs that seem to have been caused by it, which is good. The patch also removes an unnecessary `else` branch, since an already existing `break` means that it's redundant. Fixes 7232.	2016-04-22 10:12:12 +02:00
Jonas Jenwald	e281ef15db	Adjust incorrect first obj number of "free" xref entry in `XRef_readXRefTable` (issue 7229) Fixes 7229.	2016-04-21 16:36:32 +02:00
Jonas Jenwald	19e0599f74	Split the two paths in `PDFImage.resize` into separate helper functions, placed in colorspace.js and image.js Re: issue 6777.	2016-04-17 10:24:36 +02:00
Jonas Jenwald	f3f825cc71	Various improvements for `GoToR` actions - Add support for the 'NewWindow' property. - Ensure that destinations are applied to the remote document, instead of the current one. - Handle the `F` entry being a standard string, instead of a dictionary.	2016-04-15 22:55:05 +02:00
Jonas Jenwald	b63ef7a8b6	Refactor `LinkAnnotation` slightly to add `data.url`/`data.dest` at the end This patch also makes sure that all URLs are converted to the correct encoding.	2016-04-15 22:55:05 +02:00
Tim van der Meij	4a601ffc28	Merge pull request #7197 from prakashpalanisamy/remove-combineurl-test Remove `combineUrl` and replace it with `new URL`. Issue #7183, for reference.	2016-04-15 22:44:07 +02:00
Prakash Palanisamy	a25c29d98d	Remove `combineUrl` and replace it with `new URL`.	2016-04-15 21:33:10 +05:30
Jonas Jenwald	079b563e2d	Ensure that the `params` parameter of the `PredictorStream` is a dictionary (issue 7200) Fixes 7200.	2016-04-15 16:30:18 +02:00
Yury Delendik	6282ec24d1	Merge pull request #7172 from yurydelendik/umd-web Introduces UMD headers to the web/ folder.	2016-04-13 10:23:23 -05:00
Yury Delendik	006e8fb59d	Introduces UMD headers to the web/ folder.	2016-04-13 10:09:48 -05:00
Yury Delendik	fa2f80d0fd	Merge pull request #7189 from yurydelendik/webpack-browserify-love Removing "entry-loader" dependency from webpack.	2016-04-13 08:42:11 -05:00
Yury Delendik	ae415f9e80	Removing "entry-loader" dependency from webpack.	2016-04-13 08:24:25 -05:00
Yury Delendik	b834b6899c	Merge pull request #7185 from iloire/issue-7177-support-almondjs Support almond.js #7177	2016-04-12 17:21:03 -05:00
Ivan Loire	1dfc49152a	Support almond.js #7177	2016-04-12 09:32:07 +10:00
Yury Delendik	398e6acbc5	Stops bleeding of pattern edges for mesh.	2016-04-11 18:21:44 -05:00
Jonas Jenwald	be6754a1a0	Merge pull request #7176 from yurydelendik/smask-resume Allow SMask be resumed after restore() and better transform after SMask	2016-04-11 15:57:40 +02:00
Yury Delendik	63f62a0e53	Finishing SMask at the end of operators list.	2016-04-11 08:02:06 -05:00
Yury Delendik	1485c1d1da	Suspending/resuming SMask operation during setGState/restore.	2016-04-11 08:02:06 -05:00
Jonas Jenwald	f59c3a0644	Remove the remaining usages of `new {Name,Cmd}` in favor of `{Name,Cmd}.get` Using `new {Name,Cmd}` should be avoided, since it creates a new object on every call, whereas `{Name,Cmd}.get` uses caches to only create one object regardless of how many times they are called. Most of these are found in the unit-tests, where increased memory usage probably doesn't matter very much. But it still seems good to get rid of those cases, since no part of the codebase ought to advertise that usage. Given the small size of the patch, I'm also tweaking a few comments and class names.	2016-04-08 12:14:05 +02:00
Yury Delendik	1e4886a15a	Remove global window and navigator usages from the core code.	2016-04-07 13:46:07 -05:00
Yury Delendik	1e3e14e6b2	Exposes all functional members via lib exports and use them in viewer.	2016-04-07 13:46:07 -05:00
Yury Delendik	1d12aed5ca	Move all PDFJS.xxx settings into display/global.	2016-04-07 13:46:07 -05:00
Yury Delendik	118b71925c	Forces UMD header to have relative path and extension for CommonJS.	2016-04-02 11:10:36 -05:00
Yury Delendik	34aa915441	Merge pull request #7146 from Snuffleupagus/extract-CFFParser Extract CFFParser and Type1Parser from fonts.js	2016-04-02 10:50:38 -05:00
Yury Delendik	055d642bf2	Merge pull request #7107 from Rob--W/worker-loading Detect premature worker load error	2016-04-02 10:40:26 -05:00
Rob Wu	c8996f654f	Detect and handle premature worker load error Fall back to a fake worker if the worker fails to load or initialize, e.g. due to a network error, a security error or simply a script error.	2016-04-02 11:06:15 +02:00
Jonas Jenwald	ef551e8266	Extract `Type1Parser` from fonts.js	2016-04-01 23:38:53 +02:00
Jonas Jenwald	b961e1d21b	Extract `CFFParser` from fonts.js (issue 6777)	2016-04-01 22:32:39 +02:00
Yury Delendik	a250c150ab	Merge pull request #7134 from yurydelendik/circ-stream-colorspace Refactors to remove stream.js dependency on colorspace.js	2016-04-01 08:23:24 -05:00
Yury Delendik	ff3ce973b8	Merge pull request #7106 from Snuffleupagus/issue-7101 Keep track of the character to glyph mapping in font_renderer.js, to prevent errors when different characters point to the same glyph (issue 7101)	2016-04-01 08:09:21 -05:00
Yury Delendik	35cbf74b12	Refactors to remove stream.js dependency on colorspace.js	2016-04-01 07:36:16 -05:00
Brendan Dahl	13d440df61	Merge pull request #7078 from Snuffleupagus/refactor-toFontChar-without-file Refactor the building of `toFontChar` for non-embedded fonts	2016-03-31 10:43:11 -07:00
Jonas Jenwald	05cf709f8e	Parse Type1 font files to determine the various `Length{n}` properties, instead of trusting the PDF file (issue 5686, issue 3928) Fixes 5686. Fixes 3928.	2016-03-31 11:08:12 +02:00
Jonas Jenwald	c40df8a393	Make `Type1Font` more class-like, by adding closure Note: Ignoring whitespace should simplify reviewing a great deal.	2016-03-31 11:00:27 +02:00
Jonas Jenwald	17aaa125df	Keep track of the character to glyph mapping in font_renderer.js, to prevent errors when different characters point to the same glyph (issue 7101) Fixes 7101.	2016-03-30 11:33:04 +02:00
Nico Schlömer	7cb055307d	Add element to text layer even if width === 0 Some browsers render certain special characters with width 0, others with strictly positive width. (For example, the Greek Delta, Δ, has width 0 in Google Chrome, and a positive width in Firefox.) The `if` clause in operation so far results in different text layer DOM trees for different browsers. This commit fixes that by adding the elements independently of their width.	2016-03-29 19:32:51 +02:00
Brendan Dahl	4e2f70440f	Merge pull request #6711 from yurydelendik/errors Better errors capturing at the core and stop rendering on error.	2016-03-29 09:19:28 -07:00
Jonas Jenwald	13d7a5070e	Prevent failures in the Annotation code if the `Rect` array contains indirect objects (issue 7115) Note that in the PDF files provided by the reporter, this issue was limited to `Rect` arrays in AcroForm entries (which we currently don't support). However, since a bad PDF generator could create this problem in any kind of annotation, the reduced test-case included here uses a simple LinkAnnotation instead. Fixes 7115.	2016-03-26 20:55:16 +01:00
Brendan Dahl	df7afcf004	Merge pull request #7053 from yurydelendik/rm-pdfjs-core Removes global PDFJS usage from the src/core/.	2016-03-25 13:19:43 -07:00
Yury Delendik	2fa4dd6f40	Proxy global PDFJS.verbosity to properly configure shared/util.	2016-03-23 19:24:37 -05:00
Yury Delendik	a8e5912cb1	Moves shared/global to display/global	2016-03-23 19:24:37 -05:00
Yury Delendik	e372f3608b	Makes WorkerMessageHandler non-global.	2016-03-23 19:24:37 -05:00
Yury Delendik	bda5e6235e	Removes global PDFJS usage from the src/core/.	2016-03-23 19:24:37 -05:00
Yury Delendik	54ee15d866	Merge pull request #7100 from yurydelendik/stream-wo-parser Removes core/stream circular dependency on core/parser.	2016-03-22 15:08:12 -05:00
Yury Delendik	6038c236b2	Removes core/stream circular dependency on core/parser.	2016-03-22 14:06:01 -05:00
Jonas Jenwald	d78fae0181	Ensure that TrueType font tables have `uint32` checksums According to "The table directory" under https://developer.apple.com/fonts/TrueType-Reference-Manual/RM06/Chap6.html#Directory, TrueType font tables should have `uint32` checksums. This is something that I noticed, and was initially confused about, while debugging a TrueType issue. As far as I can tell, the current (`int32`) checksums we use don't cause any issues in practice. However, I do think that this should be addressed to agree with the specification, and to reduce possible confusion when reading the font code.	2016-03-22 13:40:50 +01:00
Yury Delendik	21ed8ff71d	Merge pull request #7039 from prometheansacrifice/async-cmap-factory Refactors CMapFactory.create to make it async	2016-03-21 13:57:36 -05:00
Manas	f6d28ca323	Refactors CMapFactory.create to make it async	2016-03-21 23:08:19 +05:30
Jonas Jenwald	91756f6e86	Pass the `PDFJS.postMessageTransfer` parameter to the worker, so that the `MessageHandler` can be setup correctly in `createDocumentHandler` (issue 6957) This regressed in commit `acdd49f480`, i.e. PR 6571. Fixes 6957.	2016-03-16 18:34:26 +01:00
Yury Delendik	c6d2b7f9d9	Merge pull request #6906 from KamiHQ/fix-printing avoid apply transform twice for composite context	2016-03-11 08:26:59 -06:00
Yury Delendik	8ba413e761	Better errors capturing at the core and stop rendering on error.	2016-03-11 07:59:09 -06:00
Jonas Jenwald	cd2bd057ab	Refactor the building of `toFontChar` for non-embedded fonts Currently there's a lot of duplicate code for non-embedded `toFontChar`, which this patch simplifies by extracting the code into a helper function instead.	2016-03-10 21:25:39 +01:00
Jonas Jenwald	dfe9015a43	Convert `uniXXXX` glyph names to proper ones when building the `charCodeToGlyphId` map for TrueType fonts (bug 1132849, issue 6893, issue 6894) This patch adds a `getUnicodeForGlyph` helper function, which is used to recover Unicode values for non-standard glyph names. Some PDF generators, e.g. Scribus PDF, use improper `uniXXXX` glyph names which breaks the glyph mapping. We can avoid this by converting them to "standard" glyph names instead. Fixes https://bugzilla.mozilla.org/show_bug.cgi?id=1132849. Fixes 6893. Fixes 6894.	2016-03-09 19:37:15 +01:00
Preetham Mysore	be1e12dbcb	Fix for descent calculation while reading font hhea headers	2016-03-03 08:51:41 -05:00
Yury Delendik	a022f6f069	Reverts back unneeded change made at #6879.	2016-03-02 09:57:33 -06:00
Jonas Jenwald	8402c79171	Merge pull request #7050 from brendandahl/issue4402 For CIDFontType2 use CID as glyph ID when missing CID to GID map.	2016-03-02 10:11:42 +01:00
Brendan Dahl	a6acf74b54	Merge pull request #7023 from brendandahl/issue6721 Only draw glyphs on canvas if they are in the font or the font file is missing.	2016-03-01 18:03:37 -08:00
Brendan Dahl	6e1d131384	For CIDFontType2 use CID as glyph ID when missing CID to GID map.	2016-03-01 17:05:33 -08:00
Brendan Dahl	ff87f3fb86	Only draw glyphs on canvas if they are in the font or the font file is missing.	2016-03-01 13:24:58 -08:00
Jonas Jenwald	505f15f221	Avoid accidentally getting the entire font file in `readNameTable` (issue 7020) In the PDF file in question, some of the 'name' table entries have `record.length === 0`. This becomes problematic in the non-unicode case, since `font.getBytes(0)` will fetch the entire stream. Given that OTS rejects 'name' entries larger than `2^16`, this thus explains the sanitizer errors. Fixes 7020.	2016-03-01 21:59:49 +01:00

3495 Commits