pdf.js

Author	SHA1	Message	Date
Jonas Jenwald	5181172498	Slightly improve the `isSourcePDF` parameter handling in `JpegImage` (PR 10031 follow-up) Currently there's only a single spot in the code-base where `JpegImage.getData` is called, however it nonetheless seems like a good idea to ensure during tests that the `isSourcePDF` parameter is correctly set. (Especially considering that the PDF use-cases will break without it.) Additionally, in `JpegImage._getLinearizedBlockData`, the code can be made a tiny bit more efficient by checking the value of `isSourcePDF` first to avoid useless checks (for the default PDF use-cases).	2018-09-12 11:30:59 +02:00
Tim van der Meij	bf13c8a50b	Use the `const` keyword for constants in `src/shared/util.js` Moreover, move general constants to the top of the file, i.e., those that are not closely tied to a function in the file.	2018-09-11 16:17:45 +02:00
Tim van der Meij	99de25d6cc	Implement unit tests for the `isSameOrigin` and `createValidAbsoluteUrl` utility functions Moreover, mark the `isValidProtocol` function as private since it's only used in the utilities file and is not (meant to be) exported.	2018-09-11 16:17:45 +02:00
Tim van der Meij	9a115b41de	Merge pull request #10034 from timvandermeij/canvas-workaround Remove `getSinglePixelWidth` workaround	2018-09-09 17:36:04 +02:00
Tim van der Meij	66422eb83e	Merge pull request #9340 from brendandahl/private-use Map all glyphs to the private use area and duplicate the first glyph.	2018-09-08 17:51:04 +02:00
Romain Petit	8671081001	Fix font-string variable name typo The font-string rebuild condition is always satisfied because the concerned variables are never set.	2018-09-07 09:55:45 +02:00
Brendan Dahl	b76cf665ec	Map all glyphs to the private use area and duplicate the first glyph. There have been lots of problems with trying to map glyphs to their unicode values. It's more reliable to just use the private use areas so the browser's font renderer doesn't mess with the glyphs. Using the private use area for all glyphs did highlight other issues that this patch also had to fix: * small private use area - Previously, only the BMP private use area was used which can't map many glyphs. Now, the (much bigger) PUP 16 area can also be used. * glyph zero not shown - Browsers will not use the glyph from a font if it is glyph id = 0. This issue was less prevalent when we mapped to unicode values since the fallback font would be used. However, when using the private use area, the glyph would not be drawn at all. This is illustrated in one of the current test cases (issue #8234) where there's an "ä" glyph at position zero. The PDF looked like it rendered correctly, but it was actually not using the glyph from the font. To properly show the first glyph it is always duplicated and appended to the glyphs and the maps are adjusted. * supplementary characters - The private use area PUP 16 is 4 bytes, so String.fromCodePoint must be used where we previously used String.fromCharCode. This is actually an issue that should have been fixed regardless of this patch. * charset - Freetype fails to load fonts when the charset size doesn't match number of glyphs in the font. We now write out a fake charset with the correct length. This also brought up the issue that glyphs with seac/endchar should only ever write a standard charset, but we now write a custom one. To get around this the seac analysis is permanently enabled so those glyphs are instead always drawn as two glyphs.	2018-09-05 14:04:54 -07:00
Jonas Jenwald	e5a6d892b4	Revert "Attempt to combine separate beginText/endText sequences in `getTextContent` (issue 9984)"	2018-09-05 18:01:33 +02:00
Tim van der Meij	959ed3705b	Implement a permissions API	2018-09-02 21:23:09 +02:00
Tim van der Meij	4874e9ace0	Convert the `WorkerTransport` class, in `src/display/api.js`, to ES6 syntax	2018-09-02 21:06:57 +02:00
Tim van der Meij	9c37599fd3	Convert the `PDFDocumentProxy` class, in `src/display/api.js`, to ES6 syntax Moreover, indicate which members are private and improve the comments to be more consistent.	2018-09-02 21:06:57 +02:00
Tim van der Meij	1a3e842dc4	Remove `getSinglePixelWidth` workaround It's no longer necessary since https://bugzilla.mozilla.org/show_bug.cgi?id=1305963 was fixed quite some time ago. While we're here, mark the `cachedGetSinglePixelWidth` member as being private and use ES6 syntax in the `getSinglePixelWidth` method.	2018-09-02 20:36:06 +02:00
Jonas Jenwald	663922f93f	Add a new parameter to `JpegImage.getData` to indicate the source of the image data (issue 9513) The purpose of this patch is to provide a better default behaviour when `JpegImage` is used to parse standalone JPEG images with CMYK colour spaces. Since the issue that the patch concerns is somewhat of a special-case, the implementation utilizes the already existing decode support in an attempt to minimize the impact w.r.t. code size. Please note: It's always possible for the user of `JpegImage` to control image inversion, and thus override the new behaviour, by simply passing a custom `decodeTransform` array upon initialization.	2018-09-02 14:15:22 +02:00
Jonas Jenwald	47bf12cbac	Change `JpegImage._isColorConversionNeeded` into a getter, rather than a regular function Given how `_isColorConversionNeeded` is used, and that it always returns a boolean value, having it be a getter seems more appropriate.	2018-09-02 13:06:28 +02:00
Tim van der Meij	c94df0fef3	Merge pull request #9986 from Snuffleupagus/issue-9984 Attempt to combine separate beginText/endText sequences in `getTextContent` (issue 9984)	2018-09-01 21:21:29 +02:00
Tim van der Meij	66bd088948	Merge pull request #10010 from Snuffleupagus/issue-10004 Attempt to find truncated endstream commands, in the fallback code-path, in `Parser.makeStream` (issue 10004)	2018-09-01 18:44:08 +02:00
Tim van der Meij	283f2dfcc3	Merge pull request #10022 from janpe2/svg-Tr Implement text rendering modes in SVG backend	2018-08-29 23:51:07 +02:00
Tim van der Meij	27ebb41b8f	Merge pull request #10020 from Snuffleupagus/addon-prefs-no-eslint Ensure that the built `PdfJsDefaultPreferences.jsm` file won't be affected/touched during tree-wide ESLint rule changes in `mozilla-central` (PR 9571 follow-up)	2018-08-29 22:40:56 +02:00
Jonas Jenwald	d8aaa2f978	Update to the current year, i.e. 2018, in the bundle license headers	2018-08-28 23:46:56 +02:00
Jani Pehkonen	c426ea376c	Implement text rendering modes in SVG backend	2018-08-29 00:42:07 +03:00
cheryly279	29c0ea159d	Adding chunkname to async loaded code Better name	2018-08-27 17:17:32 -04:00
Jonas Jenwald	95e5bad4c4	Attempt to find truncated endstream commands, in the fallback code-path, in `Parser.makeStream` (issue 10004) Apparently there are some PDF generators, in this case the culprit is "Nooog Pdf Library / Nooog PStoPDF v1.5", that manage to mess up PDF creation enough that endstream[1] commands actually become truncated. Please note: The solution implemented here isn't perfect, since it won't be able to cope with PDF files that contain a mixture of correct and truncated endstream commands. However, considering that this particular mode of corruption fortunately doesn't seem very common[2], a slightly less complex solution ought to suffice for now. Fixes 10004. --- [1] Scanning through the PDF data to find endstream commands becomes necessary, in order to determine the stream length in cases where the `Length` entry of the (stream) dictionary is missing/incorrect. [2] I cannot recall having seen any (previous) issues/bugs with "Missing endstream" errors.	2018-08-26 11:51:11 +02:00
Jonas Jenwald	c81cbe113c	Extract the "scanning for endstream command" part of `Parser.makeStream` into a helper method With this code now living in a separate method, it can be simplified slightly (e.g. by using early returns).	2018-08-26 11:51:09 +02:00
Tim van der Meij	436d2efa8a	Merge pull request #10007 from Snuffleupagus/ColorSpace-class Convert the code in `src/core/colorspace.js` to use ES6 classes	2018-08-25 18:45:40 +02:00
Tim van der Meij	4a0d15aa0e	Slightly simplify the catalog code	2018-08-25 16:40:59 +02:00
Tim van der Meij	aec236f6d8	Convert the `Catalog` class, in `src/core/obj.js`, to ES6 syntax	2018-08-25 16:38:22 +02:00
Jonas Jenwald	a182907592	Replace all occurrences of `var` with `let`/`const` in `src/core/colorspace.js`	2018-08-25 03:20:21 +02:00
Jonas Jenwald	ce9a38c536	Convert the code in `src/core/colorspace.js` to use ES6 classes Reduces the amount of boilerplate code when defining the sub-classes. Please note that a couple of the closures were kept, since it's not (yet) possible to include helper functions inside of `class`es.	2018-08-25 03:20:19 +02:00
Jonas Jenwald	45b7b861b8	Remove the unused `defaultColor` property on `ColorSpace` instances This property is not only completely unused now, it never actually appears to have been used. Even though the memory savings, from not initializing these extra typed arrays, won't be significant in the grand scheme of things it still seems completely unnecessary to keep allocating this data. As far as I can tell, the main reason for the existence of `defaultColor` seems to be for documentation purposes. Hence the code is changed into comments instead, to keep the information around (but without the unnecessary allocations).	2018-08-23 11:16:52 +02:00
Jonas Jenwald	099ed08852	Add support for `async`/`await` using Babel For proof-of-concept, this patch converts a couple of `Promise` returning methods to use `async` instead. Please note that the `generic` build, based on this patch, has been successfully tested in IE11 (i.e. the viewer loads and nothing is obviously broken). Being able to use modern JavaScript features like `async`/`await` is a huge plus, but there's one (obvious) side-effect: The size of the built files will increase slightly (unless `SKIP_BABEL == true`). That's unavoidable, but seems like a small price to pay in the grand scheme of things. Finally, note that the `chromium` build target was changed to no longer skip Babel translation, since the Chrome extension still supports version `49` of the browser (where native `async` support isn't available).	2018-08-19 16:54:11 +02:00
Tim van der Meij	4ea663aa8a	Merge pull request #9987 from Snuffleupagus/rm-createBlob [api-minor] Remove the obsolete `createBlob` helper function	2018-08-19 16:43:36 +02:00
Jonas Jenwald	75923ea515	Remove the unused `PDFDocument.mainXRefEntriesOffset` method Not only is this method completely unused now, looking through the history of the code it never appears to have been used for anything either. Years ago `mainXRefEntriesOffset` was included when creating `XRef` instances, however it wasn't actually used for anything (the parameter was never checked, nor assigned to a property on `XRef`). If this method ever becomes useful (again) it's easy enough to restore it thanks to version control, but including dead code in the builds just seems wasteful.	2018-08-19 14:08:39 +02:00
Jonas Jenwald	50a47be190	[api-minor] Remove the obsolete `createBlob` helper function At this point in time, all supported browsers have native support for `Blob`; please see https://developer.mozilla.org/en-US/docs/Web/API/Blob/Blob#Browser_compatibility. Furthermore, note how the helper function was throwing an error if `Blob` isn't available anyway.	2018-08-19 13:37:19 +02:00
Jonas Jenwald	497b765ede	Attempt to combine separate beginText/endText sequences in `getTextContent` (issue 9984) Please note that while this improves issue 9984 slightly (and likely others too), it's not a complete solution. The remaining issues are related to the, more general, problems with the existing heuristics related to attempting to combine separate text items.	2018-08-18 13:45:32 +02:00
Jonas Jenwald	bc89edb8f0	Ensure that `Uint8ClampedArray` is used for image data transferred by `getTransfers` (PR 9802 follow-up) One of the `QueueOptimizer` cases wasn't updated to use `Uint8ClampedArray`s, which leads to inconsistent image data on the API side (but no actual rendering bugs, as far as I can tell). To prevent future errors, a non-production/test-only `assert` was added to ensure that the relevant image data only uses `Uint8ClampedArray`s.	2018-08-16 10:29:44 +02:00
Tim van der Meij	1268aea2b6	Merge pull request #9975 from Snuffleupagus/getDestination-refactor Re-factor `destinations`/`getDestination` to reduce unnecessary duplication, and reject non-string inputs	2018-08-12 15:51:58 +02:00
Tim van der Meij	af19ed6ee9	Merge pull request #9822 from timvandermeij/annotations [api-minor] Refactor the annotation code to be asynchronous	2018-08-11 20:39:50 +02:00
dmitryskey	3741becb9b	[api-minor] Refactor the annotation code to be asynchronous This commit is the first step towards implementing parsing for the appearance streams of annotations. Co-authored-by: Jonas Jenwald <jonas.jenwald@gmail.com> Co-authored-by: Tim van der Meij <timvandermeij@gmail.com>	2018-08-11 19:00:29 +02:00
Jonas Jenwald	1179584fd6	Reject `getDestination`, in the API, for non-string inputs Note how e.g. the `getPage` method does basic validation of the input.	2018-08-11 16:06:35 +02:00
Jonas Jenwald	b74c813353	Re-factor `destinations`/`getDestination`, in the `Catalog`, to reduce unnecessary duplication Currently, these two methods contain the same boilerplate code for getting the /Dests data.	2018-08-11 16:04:58 +02:00
Jonas Jenwald	06d1ff5af4	Tweak the MMType1 font detection in `getFontFileType` to improve font telemetry (PR 9961 follow-up) Please note that this patch does not affect rendering in any way, however it's relevant for font telemetry[1]. According to the specification, see https://www.adobe.com/content/dam/acom/en/devnet/acrobat/pdfs/PDF32000_2008.pdf#G8.1904956, Type1C is a valid subtype for both Type1 and MMType1 fonts. --- [1] Refer to the font telemetry results in https://telemetry.mozilla.org/new-pipeline/dist.html#!cumulative=0&end_date=2018-06-25&keys=__none__!__none__!__none__&max_channel_version=nightly%252F62&measure=PDF_VIEWER_FONT_TYPES&min_channel_version=nightly%252F59&processType=*&product=Firefox&sanitize=1&sort_keys=submissions&start_date=2018-05-07&table=0&trim=1&use_submission_date=0 See also https://github.com/mozilla/pdf.js/wiki/Enumeration-Assignments-for-the-Telemetry-Histograms#pdf_viewer_font_types for help with interpreting the data.	2018-08-08 12:18:37 +02:00
Jonas Jenwald	f78efd883e	Attempt to throw `MissingPDFException` when applicable in `node_stream.js` (issue 9791)	2018-08-06 10:00:03 +02:00
Tim van der Meij	4111871ac5	Merge pull request #9958 from brendandahl/always-fallback Always fallback to system font on font failure.	2018-08-05 19:58:48 +02:00
Jonas Jenwald	3177f6aa55	Parse the font file to determine the correct type/subtype, rather than relying on the (often incorrect) data in the font dictionary The current font type/subtype detection code is quite inconsistent/unwieldy. In some cases it will simply assume that the font dictionary is correct, in others it will somewhat "arbitrarily" check the actual font file (more of these cases have been added over the years to fix specific bugs). As is evident from e.g. issue 9949, the font type/subtype detection code is continuing to cause issues. In an attempt to get rid of these hacks once and for all, this patch instead re-factors the type/subtype detection to always parse the font file. Please note that, as far as I can tell, we still appear to need to rely on the composite font detection based on the font dictionary. However, even if the composite/non-composite detection would get it wrong, that shouldn't really matter too much given that there are basically only two different code-paths (for "TrueType-like" vs "Type1-like" fonts).	2018-08-05 11:13:16 +02:00
Jonas Jenwald	9bbca04579	Add a (basic) `isCFFFile` helper function to detect CFF font files Compared to most other font formats, the CFF doesn't have a constant header which makes it slightly more difficult to detect such font files. Please refer to the Compact Font Format specification: https://www.adobe.com/content/dam/acom/en/devnet/font/pdfs/5176.CFF.pdf#G3.32094	2018-08-05 11:13:14 +02:00
Jonas Jenwald	f4db38aadf	Update the TrueType font file detection to also recognize the Mac specific header 'true' Please refer to the TrueType specification: https://developer.apple.com/fonts/TrueType-Reference-Manual/RM06/Chap6.html#ScalerTypeNote	2018-08-05 10:33:56 +02:00
Brendan Dahl	5f67a6a237	Always fallback to system font on font failure. The font in the PDF is marked as a CIDFontType0, but the font file is actually a true type font. To fully address this issue we should really peek into the font file and try to determine what it is. However, this is the first case of this issue, so I think this solution is acceptable for now.	2018-08-03 16:49:22 -07:00
Tim van der Meij	f19ee127a3	Merge pull request #9874 from boundlesshq/master [api-minor] Include export value for checkboxes	2018-08-03 23:43:23 +02:00
Jonas Jenwald	a504befc76	Stop warning for non-Name /Filter entries in the `PDFImage` constructor (PR 9897 follow-up) Fixes a stupid oversight on my part, since /Filter may (obviously) contain an Array, which resulted in unnecessary console warning spam in perfectly valid PDF files. Note that it still makes sense to check that /Filter is actually a Name, before attempting to access its `name` property, but the warning should definitely be removed.	2018-08-03 10:23:08 +02:00
Brian	2a665ebad4	Removed Extraneous Matrix Check in CalRGB Conversion	2018-08-02 10:16:42 -07:00


3714 Commits