pdf.js

Author	SHA1	Message	Date
Jonas Jenwald	4829f567c1	Move the `interface` definitions out of `src/core/worker.js` and into their own file These interfaces are already used in different files, in both the `src/core/` and `src/display/` folders, and having them reside in their own file seems a lot clearer and is also similar to the existing viewer interfaces. As part of moving the `interface` definitions, they're also converted to ES6 classes.	2018-11-08 13:21:37 +01:00
Jonas Jenwald	60da2d882b	[api-minor] Refactor/simplify the `PDFObjects` class First of all, note how there are currently two methods for checking if a certain object exists, which seems completely unwarranted. Furthermore, the rarely used `getData` method was removed and its only callsite changed to use a combination of `PDFObjects.{has, get}` instead. Finally, the methods were rearranged slightly, to bring the most important ones (for an API user) to the top of the class.	2018-11-08 10:13:39 +01:00
Jonas Jenwald	d32321d84f	Convert `PDFObjects`, in `src/display/api.js`, to an ES6 class Also changes all occurrences of `var` to `const`, and marks internal properties/methods as "private".	2018-11-08 10:11:40 +01:00
Tim van der Meij	3e342554d1	Merge pull request #10228 from morille/patch-2 Don't detect nw.js as node.js	2018-11-07 23:51:01 +01:00
Romain Petit	13b0ca6b2a	Don't detect nw.js as node.js nw.js is chrome plus nodejs. It will succeed everywhere chrome succeeds, but fail in many cases where nodejs succeeds (see issue 9071). So it's safer to consider it as a browser context rather than a nodejs context. Make travis happy again CS Readability + Explanation The relevant portion of the NW.js documentation: http://docs.nwjs.io/en/latest/For%20Users/Advanced/JavaScript%20Contexts%20in%20NW.js/#access-nodejs-and-nwjs-api-in-browser-context Added full link to relevant doc.	2018-11-07 11:14:22 +01:00
Jonas Jenwald	a963d139dc	Convert `src/core/ps_parser.js` to use ES6 classes Besides being a fairly small and self-contained file, this code also shows a possible way of defining static constants on classes.	2018-11-03 17:43:06 +01:00
Tim van der Meij	ec76aa531e	Merge pull request #10202 from Snuffleupagus/issue-10200 Attempt to clean-up/restore pending rendering operations on `RenderTask.cancel` (issue 10200)	2018-11-02 23:11:47 +01:00
Jonas Jenwald	f23dba1c10	Change `canvasInRendering` to a `WeakSet` instead of a `WeakMap` Note how nowhere in the code `canvasInRendering.get()` is ever called, and that this structure is really only used to store references to `<canvas>` DOM elements. The reason for this being a `WeakMap` is probably because at the time we weren't using `core-js` polyfills yet, and since there already existed a manually implemented `WeakMap` polyfill it was probably simpler to use that.	2018-10-31 18:15:23 +01:00
Jonas Jenwald	f77b463339	Attempt to clean-up/restore pending rendering operations on `RenderTask.cancel` (issue 10200) Please note that, given the lack of a runnable example, I'm not totally sure if this first of all is enough to completely address the issue as filed and second of all if we actually want this new behaviour.	2018-10-31 16:22:17 +01:00
Tim van der Meij	ed4ac1bc67	Merge pull request #10162 from janpe2/svg-normalize-bbox Normalize BBox of form XObjects in SVG back-end	2018-10-28 13:18:48 +01:00
Jani Pehkonen	9cd5f94f03	Normalize the BBox of form XObjects on the /core side	2018-10-22 14:17:05 +03:00
Jonas Jenwald	5bb7f4b615	Convert `PDFDataRangeTransport` to an ES6 class	2018-10-20 17:15:27 +02:00
Tim van der Meij	d21892933d	Merge pull request #10161 from Snuffleupagus/DataLoaded-onProgress Ensure that `onProgress` is always called when the entire PDF file has been loaded, regardless of how it was fetched (issue 10160)	2018-10-20 15:22:05 +02:00
Jonas Jenwald	54f9883c51	Export `CMapCompressionType` and `PermissionFlag` on the `pdfjsLib` object (issue 10148, PR 10033 follow-up) `CMapCompressionType` makes a lot of sense to export, for anyone attempting to implement a custom `CMapReaderFactory`; fixes 10148. `PermissionFlag` likewise needs to be exported, since otherwise the result of the `getPermissions` API method becomes difficult to interpret; follow-up to 10033.	2018-10-20 11:38:00 +02:00
Jonas Jenwald	327f2eb588	Ensure that `onProgress` is always called when the entire PDF file has been loaded, regardless of how it was fetched (issue 10160) Please note: I'm totally fine with this patch being rejected, and the issue closed as WONTFIX; however these changes should address the issue if that's desired. From a conceptual point of view, reporting loading progress doesn't really make a lot of sense for PDF files opened by passing raw binary data directly to `getDocument` (since obviously all data was loaded). This is compared to PDF files loaded via e.g. `XMLHttpRequest` or the Fetch API, where the entire PDF file isn't available from the start and knowing the loading progress makes total sense. However I can certainly see why the current API could be considered inconsistent, which isn't great, since a registered `onProgress` callback will never be called for certain `getDocument` calls. The simplest solution to this inconsistency thus seems to be to ensure that `onProgress` is always called when handling the `DataLoaded` message, since that will always be dispatched[1] from the worker-thread. --- [1] Note that this isn't guaranteed to happen, since setting `disableAutoFetch = true` often prevents the entire file from ever loading. However, this isn't relevant for the issue at hand, and is a well-known consequence of using `disableAutoFetch = true`; note how the default viewer even has a specialized code-path for hiding the loadingBar.	2018-10-16 13:51:12 +02:00
Jonas Jenwald	4cde844ffe	Add a `DOMTokenList.toggle` polyfill for the second, optional, "force" parameter This is based on the polyfill available at https://developer.mozilla.org/en-US/docs/Web/API/Element/classList#Polyfill	2018-10-12 15:41:09 +02:00
Tim van der Meij	9e9426c354	Merge pull request #10143 from Snuffleupagus/getMainThreadWorkerMessageHandler-catch-errors Ensure that `getMainThreadWorkerMessageHandler` won't accidentally break `getDocument` (PR 10139 follow-up)	2018-10-11 00:05:01 +02:00
Jonas Jenwald	0e2c6047e4	Ensure that `getMainThreadWorkerMessageHandler` won't accidentally break `getDocument` (PR 10139 follow-up) This should have been part of PR 10139. In the event that a user has attempted to manually load the worker file on the main-thread, but somehow failed to do that correctly, there's a possibility that `getMainThreadWorkerMessageHandler` could throw. Considering how/where that helper function is being called, an error could still prevent `PDFDocumentLoadingTask` from completing (regardless if it's being resolved/rejected).	2018-10-09 15:44:31 +02:00
Jonas Jenwald	21c8dd4842	Combine the `pdfjsFilePath` and fallback `workerSrc` handling in `src/display/api.js` With the way that the `getWorkerSrc()` helper function is implemented now, there's no longer a particularly strong reason for keeping the global `pdfjsFilePath` variable around. With this patch the fallback `workerSrc` will thus, assuming it wasn't already set, be set to the "pdfjsFilePath" which simplifies the `getWorkerSrc()` function and reduces the amount of global state. Finally, the global `workerSrc` variable was renamed to prevent shadowing.	2018-10-09 13:47:48 +02:00
Tim van der Meij	f45e46d7ad	Merge pull request #10133 from kevinleedrum/fix-content-length Set returnValues.suggestedLength to Content-Length if integer	2018-10-09 00:05:57 +02:00
Kevin Lee Drum	4cf10ac79d	set returnValues.suggestedLength to Content-Length if integer	2018-10-07 13:26:29 -04:00
Jonas Jenwald	755c6edc5e	Ensure that the `PDFDocumentLoadingTask` is rejected when "setting up fake worker" failed (issue 10135) This should, hopefully, cover all the possible ways[1] in which "fake workers" are loaded. Given the different code-paths, adding unit-tests might not be that simple. Note that in order to make this work, the various `fakeWorkerFilesLoader` functions were converted to return `Promises`. --- [1] Unfortunately there's lots of them, for various build targets and configurations.	2018-10-06 13:18:51 +02:00
Simon Leblanc	b5806735d8	Add support of Ink annotation	2018-10-03 00:28:49 +02:00
Tim van der Meij	138324502c	Merge pull request #10119 from Snuffleupagus/rm-onFileAttachmentAnnotation Attempt to simplify the `fileattachmentannotation` event dispatching	2018-10-02 23:25:22 +02:00
Jonas Jenwald	d60ce998f1	Attempt to simplify the `fileattachmentannotation` event dispatching This attempts to reduce the level of indirection, and the amount of code, when dispatching `fileattachmentannotation` events, by removing the `PDFLinkService.onFileAttachmentAnnotation` method and just accessing `PDFLinkService.eventBus` directly in the `FileAttachmentAnnotationElement` constructor. Given that other properties, such as `externalLinkTarget`/`externalLinkRel`, are already being accessed directly this pattern seems fine here as well.	2018-10-01 15:09:08 +02:00
Jonas Jenwald	d6f4d2ff33	Add a `Symbol` polyfill, using core-js, to allow using `for...of` loops https://github.com/zloirock/core-js#ecmascript-symbol	2018-09-29 16:05:00 +02:00
Jonas Jenwald	435ec6a0d5	Use the Font Loading API in `MOZCENTRAL` builds, and `GENERIC` builds for Firefox version 63 and above (issue 9945)	2018-09-29 16:05:00 +02:00
Jonas Jenwald	05b021bcce	Refactor the `FontLoader` into proper, build-specific, ES6 classes Also changes `var` to `let`/`const` in code already touched in the patch, and makes use of template strings in a few spots.	2018-09-29 16:05:00 +02:00
Jonas Jenwald	45d6651976	Refactor unused `Date.now()` calls in `FontLoader.queueLoadingCallback` The `started` timestamp is completely unused, and the `end` timestamp is currently[1] being used essentially like a boolean value. Hence this code can be simplified to use an actual boolean value instead, which avoids potentially hundreds (or even thousands) of unnecessary `Date.now()` calls. --- [1] Looking briefly at the history of this code, I cannot tell if the timestamps themselves were ever used for anything (except for tracking "boolean" state).	2018-09-29 15:57:04 +02:00
Jonas Jenwald	ad3e937816	Replace the `Font.loading` property with, the already existing, `Font.missingFile` property The `Font.loading` property is only ever used once in the code, whereas `Font.missingFile` is more widely used. Furthermore the name `loading` feels, at least to me, slightly less clear than `missingFile`. Finally, note that these two properties are the inverse of each other.	2018-09-29 15:57:04 +02:00
Jonas Jenwald	caf90ff6ee	Convert `FontFaceObject` to an ES6 class Also changes `var` to `let`/`const` in code already touched in the patch, and makes use of template strings in a few spots.	2018-09-29 15:57:04 +02:00
Jonas Jenwald	842e9206c0	Replace `String.prototype.substr()` occurrences with `String.prototype.substring()` As outlined in https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Global_Objects/String/substr, which refers to the ECMA-262 specification, using the `substr` function is advised against. Hence this PR, which replaces all remaining `substr` occurrences with `substring` instead. Please refer to https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Global_Objects/String/substr#Syntax respectively https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Global_Objects/String/substring#Syntax for the differences between the two functions. Note that in most cases in the code-base there's only one argument passed to `substr`, and those require no other changes except replacing "substr" with "substring". For the other cases, the `substr(start, length)` calls are changed to `substring(start, start + length)` instead.	2018-09-28 11:41:07 +02:00
Brendan Dahl	ae7dcae27e	Fix abbreviation.	2018-09-13 13:10:38 -07:00
Brendan Dahl	6adeabbb66	Add Glyph & Cog's XPDF copyright/license information.	2018-09-12 13:59:56 -07:00
Jonas Jenwald	5181172498	Slightly improve the `isSourcePDF` parameter handling in `JpegImage` (PR 10031 follow-up) Currently there's only a single spot in the code-base where `JpegImage.getData` is called, however it nonetheless seems like a good idea to ensure during tests that the `isSourcePDF` parameter is correctly set. (Especially considering that the PDF use-cases will break without it.) Additionally, in `JpegImage._getLinearizedBlockData`, the code can be made a tiny bit more efficient by checking the value of `isSourcePDF` first to avoid useless checks (for the default PDF use-cases).	2018-09-12 11:30:59 +02:00
Tim van der Meij	bf13c8a50b	Use the `const` keyword for constants in `src/shared/util.js` Moreover, move general constants to the top of the file, i.e., those that are not closely tied to a function in the file.	2018-09-11 16:17:45 +02:00
Tim van der Meij	99de25d6cc	Implement unit tests for the `isSameOrigin` and `createValidAbsoluteUrl` utility functions Moreover, mark the `isValidProtocol` function as private since it's only used in the utilities file and is not (meant to be) exported.	2018-09-11 16:17:45 +02:00
Tim van der Meij	9a115b41de	Merge pull request #10034 from timvandermeij/canvas-workaround Remove `getSinglePixelWidth` workaround	2018-09-09 17:36:04 +02:00
Tim van der Meij	66422eb83e	Merge pull request #9340 from brendandahl/private-use Map all glyphs to the private use area and duplicate the first glyph.	2018-09-08 17:51:04 +02:00
Romain Petit	8671081001	Fix font-string variable name typo The font-string rebuild condition is always satisfied because the concerned variables are never set.	2018-09-07 09:55:45 +02:00
Brendan Dahl	b76cf665ec	Map all glyphs to the private use area and duplicate the first glyph. There have been lots of problems with trying to map glyphs to their unicode values. It's more reliable to just use the private use areas so the browser's font renderer doesn't mess with the glyphs. Using the private use area for all glyphs did highlight other issues that this patch also had to fix: * small private use area - Previously, only the BMP private use area was used which can't map many glyphs. Now, the (much bigger) PUP 16 area can also be used. * glyph zero not shown - Browsers will not use the glyph from a font if it is glyph id = 0. This issue was less prevalent when we mapped to unicode values since the fallback font would be used. However, when using the private use area, the glyph would not be drawn at all. This is illustrated in one of the current test cases (issue #8234) where there's an "ä" glyph at position zero. The PDF looked like it rendered correctly, but it was actually not using the glyph from the font. To properly show the first glyph it is always duplicated and appended to the glyphs and the maps are adjusted. * supplementary characters - The private use area PUP 16 is 4 bytes, so String.fromCodePoint must be used where we previously used String.fromCharCode. This is actually an issue that should have been fixed regardless of this patch. * charset - Freetype fails to load fonts when the charset size doesn't match number of glyphs in the font. We now write out a fake charset with the correct length. This also brought up the issue that glyphs with seac/endchar should only ever write a standard charset, but we now write a custom one. To get around this the seac analysis is permanently enabled so those glyphs are instead always drawn as two glyphs.	2018-09-05 14:04:54 -07:00
Jonas Jenwald	e5a6d892b4	Revert "Attempt to combine separate beginText/endText sequences in `getTextContent` (issue 9984)"	2018-09-05 18:01:33 +02:00
Tim van der Meij	959ed3705b	Implement a permissions API	2018-09-02 21:23:09 +02:00
Tim van der Meij	4874e9ace0	Convert the `WorkerTransport` class, in `src/display/api.js`, to ES6 syntax	2018-09-02 21:06:57 +02:00
Tim van der Meij	9c37599fd3	Convert the `PDFDocumentProxy` class, in `src/display/api.js`, to ES6 syntax Moreover, indicate that a member is private and improve the comments to be more consistent.	2018-09-02 21:06:57 +02:00
Tim van der Meij	1a3e842dc4	Remove `getSinglePixelWidth` workaround It's no longer necessary since https://bugzilla.mozilla.org/show_bug.cgi?id=1305963 was fixed quite some time ago. While we're here, mark the `cachedGetSinglePixelWidth` member as being private and use ES6 syntax in the `getSinglePixelWidth` method.	2018-09-02 20:36:06 +02:00
Jonas Jenwald	663922f93f	Add a new parameter to `JpegImage.getData` to indicate the source of the image data (issue 9513) The purpose of this patch is to provide a better default behaviour when `JpegImage` is used to parse standalone JPEG images with CMYK colour spaces. Since the issue that the patch concerns is somewhat of a special-case, the implementation utilizes the already existing decode support in an attempt to minimize the impact w.r.t. code size. Please note: It's always possible for the user of `JpegImage` to control image inversion, and thus override the new behaviour, by simply passing a custom `decodeTransform` array upon initialization.	2018-09-02 14:15:22 +02:00
Jonas Jenwald	47bf12cbac	Change `JpegImage._isColorConversionNeeded` into a getter, rather than a regular function Given how `_isColorConversionNeeded` is used, and that it always returns a boolean value, having it be a getter seems more appropriate.	2018-09-02 13:06:28 +02:00
Tim van der Meij	c94df0fef3	Merge pull request #9986 from Snuffleupagus/issue-9984 Attempt to combine separate beginText/endText sequences in `getTextContent` (issue 9984)	2018-09-01 21:21:29 +02:00
Tim van der Meij	66bd088948	Merge pull request #10010 from Snuffleupagus/issue-10004 Attempt to find truncated endstream commands, in the fallback code-path, in `Parser.makeStream` (issue 10004)	2018-09-01 18:44:08 +02:00
Tim van der Meij	283f2dfcc3	Merge pull request #10022 from janpe2/svg-Tr Implement text rendering modes in SVG backend	2018-08-29 23:51:07 +02:00
Tim van der Meij	27ebb41b8f	Merge pull request #10020 from Snuffleupagus/addon-prefs-no-eslint Ensure that the built `PdfJsDefaultPreferences.jsm` file won't be affected/touched during tree-wide ESLint rule changes in `mozilla-central` (PR 9571 follow-up)	2018-08-29 22:40:56 +02:00
Jonas Jenwald	d8aaa2f978	Update to the current year, i.e. 2018, in the bundle license headers	2018-08-28 23:46:56 +02:00
Jani Pehkonen	c426ea376c	Implement text rendering modes in SVG backend	2018-08-29 00:42:07 +03:00
cheryly279	29c0ea159d	Adding chunkname to async loaded code Better name	2018-08-27 17:17:32 -04:00
Jonas Jenwald	95e5bad4c4	Attempt to find truncated endstream commands, in the fallback code-path, in `Parser.makeStream` (issue 10004) Apparently there are some PDF generators, in this case the culprit is "Nooog Pdf Library / Nooog PStoPDF v1.5", that manage to mess up PDF creation enough that endstream[1] commands actually become truncated. Please note: The solution implemented here isn't perfect, since it won't be able to cope with PDF files that contain a mixture of correct and truncated endstream commands. However, considering that this particular mode of corruption fortunately doesn't seem very common[2], a slightly less complex solution ought to suffice for now. Fixes 10004. --- [1] Scanning through the PDF data to find endstream commands becomes necessary, in order to determine the stream length in cases where the `Length` entry of the (stream) dictionary is missing/incorrect. [2] I cannot recall having seen any (previous) issues/bugs with "Missing endstream" errors.	2018-08-26 11:51:11 +02:00
Jonas Jenwald	c81cbe113c	Extract the "scanning for endstream command" part of `Parser.makeStream` into a helper method With this code now living in a separate method, it can be simplified slightly (e.g. by using early returns).	2018-08-26 11:51:09 +02:00
Tim van der Meij	436d2efa8a	Merge pull request #10007 from Snuffleupagus/ColorSpace-class Convert the code in `src/core/colorspace.js` to use ES6 classes	2018-08-25 18:45:40 +02:00
Tim van der Meij	4a0d15aa0e	Slightly simplify the catalog code	2018-08-25 16:40:59 +02:00
Tim van der Meij	aec236f6d8	Convert the `Catalog` class, in `src/core/obj.js`, to ES6 syntax	2018-08-25 16:38:22 +02:00
Jonas Jenwald	a182907592	Replace all occurrences of `var` with `let`/`const` in `src/core/colorspace.js`	2018-08-25 03:20:21 +02:00
Jonas Jenwald	ce9a38c536	Convert the code in `src/core/colorspace.js` to use ES6 classes Reduces the amount of boilerplate code when defining the sub-classes. Please note that a couple of the closures were kept, since it's not (yet) possible to include helper functions inside of `class`es.	2018-08-25 03:20:19 +02:00
Jonas Jenwald	45b7b861b8	Remove the unused `defaultColor` property on `ColorSpace` instances This property is not only completely unused now, it never actually appears to have been used. Even though the memory savings, from not initializing these extra typed arrays, won't be significant in the grand scheme of things it still seems completely unnecessary to keep allocating this data. As far as I can tell, the main reason for the existence of `defaultColor` seems to be for documentation purposes. Hence the code is changed into comments instead, to keep the information around (but without the unnecessary allocations).	2018-08-23 11:16:52 +02:00
Jonas Jenwald	099ed08852	Add support for `async`/`await` using Babel For proof-of-concept, this patch converts a couple of `Promise` returning methods to use `async` instead. Please note that the `generic` build, based on this patch, has been successfully tested in IE11 (i.e. the viewer loads and nothing is obviously broken). Being able to use modern JavaScript features like `async`/`await` is a huge plus, but there's one (obvious) side-effect: The size of the built files will increase slightly (unless `SKIP_BABEL == true`). That's unavoidable, but seems like a small price to pay in the grand scheme of things. Finally, note that the `chromium` build target was changed to no longer skip Babel translation, since the Chrome extension still supports version `49` of the browser (where native `async` support isn't available).	2018-08-19 16:54:11 +02:00
Tim van der Meij	4ea663aa8a	Merge pull request #9987 from Snuffleupagus/rm-createBlob [api-minor] Remove the obsolete `createBlob` helper function	2018-08-19 16:43:36 +02:00
Jonas Jenwald	75923ea515	Remove the unused `PDFDocument.mainXRefEntriesOffset` method Not only is this method completely unused now, looking through the history of the code it never appears to have been used for anything either. Years ago `mainXRefEntriesOffset` was included when creating `XRef` instances, however it wasn't actually used for anything (the parameter was never checked, nor assigned to a property on `XRef`). If this method ever becomes useful (again) it's easy enough to restore it thanks to version control, but including dead code in the builds just seems wasteful.	2018-08-19 14:08:39 +02:00
Jonas Jenwald	50a47be190	[api-minor] Remove the obsolete `createBlob` helper function At this point in time, all supported browsers have native support for `Blob`; please see https://developer.mozilla.org/en-US/docs/Web/API/Blob/Blob#Browser_compatibility. Furthermore, note how the helper function was throwing an error if `Blob` isn't available anyway.	2018-08-19 13:37:19 +02:00
Jonas Jenwald	497b765ede	Attempt to combine separate beginText/endText sequences in `getTextContent` (issue 9984) Please note that while this improves issue 9984 slightly (and likely others too), it's not a complete solution. The remaining issues are related to the, more general, problems with the existing heuristics related to attempting to combine separate text items.	2018-08-18 13:45:32 +02:00
Jonas Jenwald	bc89edb8f0	Ensure that `Uint8ClampedArray` is used for image data transferred by `getTransfers` (PR 9802 follow-up) One of the `QueueOptimizer` cases wasn't updated to use `Uint8ClampedArray`s, which leads to inconsistent image data on the API side (but no actual rendering bugs, as far as I can tell). To prevent future errors, a non-production/test-only `assert` was added to ensure that the relevant image data only uses `Uint8ClampedArray`s.	2018-08-16 10:29:44 +02:00
Tim van der Meij	1268aea2b6	Merge pull request #9975 from Snuffleupagus/getDestination-refactor Re-factor `destinations`/`getDestination` to reduce unnecessary duplication, and reject non-string inputs	2018-08-12 15:51:58 +02:00
Tim van der Meij	af19ed6ee9	Merge pull request #9822 from timvandermeij/annotations [api-minor] Refactor the annotation code to be asynchronous	2018-08-11 20:39:50 +02:00
dmitryskey	3741becb9b	[api-minor] Refactor the annotation code to be asynchronous This commit is the first step towards implementing parsing for the appearance streams of annotations. Co-authored-by: Jonas Jenwald <jonas.jenwald@gmail.com> Co-authored-by: Tim van der Meij <timvandermeij@gmail.com>	2018-08-11 19:00:29 +02:00
Jonas Jenwald	1179584fd6	Reject `getDestination`, in the API, for non-string inputs Note how e.g. the `getPage` method does basic validation of the input.	2018-08-11 16:06:35 +02:00
Jonas Jenwald	b74c813353	Re-factor `destinations`/`getDestination`, in the `Catalog`, to reduce unnecessary duplication Currently, these two methods contain the same boilerplate code for getting the /Dests data.	2018-08-11 16:04:58 +02:00
Jonas Jenwald	06d1ff5af4	Tweak the MMType1 font detection in `getFontFileType` to improve font telemetry (PR 9961 follow-up) Please note that this patch does not affect rendering in any way, however it's relevant for font telemetry[1]. According to the specification, see https://www.adobe.com/content/dam/acom/en/devnet/acrobat/pdfs/PDF32000_2008.pdf#G8.1904956, Type1C is a valid subtype for both Type1 and MMType1 fonts. --- [1] Refer to the font telemetry results in https://telemetry.mozilla.org/new-pipeline/dist.html#!cumulative=0&end_date=2018-06-25&keys=__none__!__none__!__none__&max_channel_version=nightly%252F62&measure=PDF_VIEWER_FONT_TYPES&min_channel_version=nightly%252F59&processType=*&product=Firefox&sanitize=1&sort_keys=submissions&start_date=2018-05-07&table=0&trim=1&use_submission_date=0 See also https://github.com/mozilla/pdf.js/wiki/Enumeration-Assignments-for-the-Telemetry-Histograms#pdf_viewer_font_types for help with interpreting the data.	2018-08-08 12:18:37 +02:00
Jonas Jenwald	f78efd883e	Attempt to throw `MissingPDFException` when applicable in `node_stream.js` (issue 9791)	2018-08-06 10:00:03 +02:00
Tim van der Meij	4111871ac5	Merge pull request #9958 from brendandahl/always-fallback Always fallback to system font on font failure.	2018-08-05 19:58:48 +02:00
Jonas Jenwald	3177f6aa55	Parse the font file to determine the correct type/subtype, rather than relying on the (often incorrect) data in the font dictionary The current font type/subtype detection code is quite inconsistent/unwieldy. In some cases it will simply assume that the font dictionary is correct, in others it will somewhat "arbitrarily" check the actual font file (more of these cases have been added over the years to fix specific bugs). As is evident from e.g. issue 9949, the font type/subtype detection code is continuing to cause issues. In an attempt to get rid of these hacks once and for all, this patch instead re-factors the type/subtype detection to always parse the font file. Please note that, as far as I can tell, we still appear to need to rely on the composite font detection based on the font dictionary. However, even if the composite/non-composite detection would get it wrong, that shouldn't really matter too much given that there are basically only two different code-paths (for "TrueType-like" vs "Type1-like" fonts).	2018-08-05 11:13:16 +02:00
Jonas Jenwald	9bbca04579	Add a (basic) `isCFFFile` helper function to detect CFF font files Compared to most other font formats, the CFF doesn't have a constant header which makes it slightly more difficult to detect such font files. Please refer to the Compact Font Format specification: https://www.adobe.com/content/dam/acom/en/devnet/font/pdfs/5176.CFF.pdf#G3.32094	2018-08-05 11:13:14 +02:00
Jonas Jenwald	f4db38aadf	Update the TrueType font file detection to also recognize the Mac specific header 'true' Please refer to the TrueType specification: https://developer.apple.com/fonts/TrueType-Reference-Manual/RM06/Chap6.html#ScalerTypeNote	2018-08-05 10:33:56 +02:00
Brendan Dahl	5f67a6a237	Always fallback to system font on font failure. The font in the PDF is marked as a CIDFontType0, but the font file is actually a true type font. To fully address this issue we should really peek into the font file and try to determine what it is. However, this is the first case of this issue, so I think this solution is acceptable for now.	2018-08-03 16:49:22 -07:00
Tim van der Meij	f19ee127a3	Merge pull request #9874 from boundlesshq/master [api-minor] Include export value for checkboxes	2018-08-03 23:43:23 +02:00
Jonas Jenwald	a504befc76	Stop warning for non-Name /Filter entries in the `PDFImage` constructor (PR 9897 follow-up) Fixes a stupid oversight on my part, since /Filter may (obviously) contain an Array, which resulted in unnecessary console warning spam in perfectly valid PDF files. Note that it still makes sense to check that /Filter is actually a Name, before attempting to access its `name` property, but the warning should definitely be removed.	2018-08-03 10:23:08 +02:00
Brian	2a665ebad4	Removed Extraneous Matrix Check in CalRGB Conversion	2018-08-02 10:16:42 -07:00
Tim van der Meij	716acf63d4	Merge pull request #9938 from Snuffleupagus/issue-9915 Ensure that Type0, i.e. composite, OpenType fonts with `CFF ` tables are not treated as CFF fonts if their glyph mapping is non-default (issue 9915)	2018-08-02 00:11:18 +02:00
Jonas Jenwald	3ce420131f	Prefer the Width/Height of the image data, rather than the image dictionary, for JPEG 2000 images (issue 9650) According to the PDF specification, see https://www.adobe.com/content/dam/acom/en/devnet/acrobat/pdfs/PDF32000_2008.pdf#page=45 > When using the JPXDecode filter with image XObjects, the following changes to and constraints on some entries in the image dictionary shall apply (see 8.9.5, "Image Dictionaries" for details on these entries): > > - Width and Height shall match the corresponding width and height values in the JPEG2000 data. > > - . . . Hence it seems reasonable to use the Width/Height of the image data itself, rather than the image dictionary when there's a mismatch. Given that JPEG 2000 images are already being parsed, in order to obtain basic parameters, the actual Width/Height is readily available in the `PDFImage` constructor.	2018-08-01 16:42:26 +02:00
Jonas Jenwald	17f65908ae	Add more validation of the /Filter entry, in image dictionaries, to the `PDFImage` constructor Given that the code is currently assuming that the /Filter entry is a `Name`, it cannot hurt to actually ensure that's the case. Also fixes an error message, for JPEG 2000 images with unsupported ColorSpaces, since `this.numComps` hasn't been initialized when it's accessed during the `throw new Error()` invocation.	2018-08-01 16:41:15 +02:00
Jonas Jenwald	17eac2d48a	Ensure that Type0, i.e. composite, OpenType fonts with `CFF` tables are not treated as CFF fonts if their glyph mapping is non-default (issue 9915) This particular code-path has been the source of numerous regressions to date, so hopefully this patch won't cause any more of those. Fixes 9915.	2018-07-29 23:06:15 +02:00
Jonas Jenwald	cfdb597e4a	Ensure that the `CIDSystemInfo` strings, in Type0 fonts, are correctly decoded This isn't directly related to the subsequent patch, but just something that I happened to notice while poking around in the font code.	2018-07-29 23:06:15 +02:00
Tim van der Meij	3521424576	Merge pull request #9920 from Snuffleupagus/getMetadata-linearization [api-minor] Add an `IsLinearized` property to the `PDFDocument.documentInfo` getter, to allow accessing the linearization status through the API (via `PDFDocumentProxy.getMetadata`)	2018-07-29 20:23:22 +02:00
Tim van der Meij	f45450bd78	Merge pull request #9931 from Snuffleupagus/refactor-getPage Refactor `getPage` (in the worker), and attempt to use the `Linearization` dictionary to lookup the first Page	2018-07-29 19:33:46 +02:00
Tim van der Meij	a2c317f12b	Merge pull request #9925 from Snuffleupagus/StreamsSequenceStream-maybeLength Attempt to estimate the minimum required `buffer` length when initializing `StreamsSequenceStream` instances	2018-07-29 16:52:34 +02:00
Jonas Jenwald	ec3728b540	Use the `Linearization` dictionary, if it exists, when fetching the first Page Since PDF.js already supports range requests and streaming, not to mention chunked rendering, attempting to use the `Linearization` dictionary in `PDFDocument.getPage` probably isn't going to improve performance in any noticeable way. Nonetheless, when `Linearization` data is available, it will allow looking up the first Page directly without having to descend into the `Pages` tree to find the correct object.	2018-07-28 22:23:36 +02:00
Jonas Jenwald	fbb25ff4e2	Move `getPage`, on the worker side, from `Catalog` and into `PDFDocument` instead Addresses an existing TODO, and avoids having to pass in a `pageFactory` when creating `Catalog` instances.	2018-07-28 22:23:36 +02:00
Jonas Jenwald	81b471c781	[Regression] Convert `Catalog.builtInCMapCache` into a `Map`, instead of an Object, to ensure that it's correctly reset (PR 8064 follow-up) With the `builtInCMapCache` being a simple Object, it unfortunately means that the `Catalog.cleanup` method isn't resetting it as intended. By just replacing the `builtInCMapCache` with an empty Object, existing references to it will not actually be updated. The result is that e.g. `Page` instances still keep references to, what should have been removed, CMap data. To fix these problems, the `builtInCMapCache` is converted into a `Map` instead (since it can be easily reset).	2018-07-28 22:20:43 +02:00
bion	c31ddf7edc	[api-minor] Include export value for checkboxes	2018-07-28 00:30:41 -07:00
Jonas Jenwald	928b89382e	[api-minor] Add an `IsLinearized` property to the `PDFDocument.documentInfo` getter, to allow accessing the linearization status through the API (via `PDFDocumentProxy.getMetadata`) There was a (somewhat) recent question on IRC about accessing the linearization status of a PDF document, and this patch contains a simple way to expose that through already existing API methods. Please note that during setup/parsing in `PDFDocument` the linearization data is already being fetched and parsed, provided of course that it exists. Hence this patch will not cause any additional data to be loaded.	2018-07-26 15:54:19 +02:00
Jonas Jenwald	8a4466139b	Simplify the `DocumentInfoValidators` definition With this file now being a proper (ES6) module, it's no longer (technically) necessary for this structure to be lazily initialized. Considering its size, and simplicity, I therefore cannot see the harm in letting `DocumentInfoValidators` just be simple Object instead. While I'm not aware of any bugs caused by the current code, it cannot hurt to add an `isDict` check in `PDFDocument.documentInfo` (since the current code assumes that `infoDict` being defined implies it also being a Dictionary). Finally, the patch also converts a couple of `var` to `let`/`const`.	2018-07-26 15:54:01 +02:00
Jonas Jenwald	2d51bce941	Remove unnecessary `stream.length` check from `PDFDocument.linearization` Note first of all that `PDFDocument` will be initialized with either a `Stream` or a `ChunkedStream`, and that both of these have `length` getters. Secondly, the `PDFDocument` constructor will assert that the `stream` has a non-zero (and positive) length. Hence there's no point in checking `stream.length` in the `linearization` getter.	2018-07-26 15:54:01 +02:00
Jonas Jenwald	32bfa55d98	Attempt to estimate the minimum required `buffer` length when initializing `StreamsSequenceStream` instances For most other `DecodeStream` based streams, we'll attempt to estimate the minimum `buffer` length based on the raw stream data. The purpose of this is to avoid having to unnecessarily re-size the `buffer`, thus reducing the number of intermediate allocations necessary when decoding the stream data. However, currently no such optimization is attempted for `StreamsSequenceStream`, and given that they can often be quite large that seems unfortunate. To improve this, at least somewhat, this patch utilizes the raw sizes of the `StreamsSequenceStream` sub-streams to estimate the minimum required `buffer` length. Most likely this patch won't have a huge effect on memory consumption, however for pathological cases it should help reduce peak memory usage slightly. One example is the PDF file in issue 2813, where currently the `StreamsSequenceStream` instances would grow their `buffer`s as `2 MiB -> 4 MiB -> 8 MiB -> 16 MiB -> 32 MiB`. With this patch, the same stream `buffer`s grow as `8 MiB -> 16 MiB -> 32 MiB`, thus avoiding a total of `12 MiB` of intermediate allocations (since there are two `StreamsSequenceStream` instances used, for rendering/text-extraction).	2018-07-26 13:42:59 +02:00
Jonas Jenwald	36b683ca55	Provide custom messages for the `no-restricted-globals` ESLint rule, and refactor the `.eslintrc` files (PR 9868 follow-up) Without providing useful (custom) error messages for the `no-restricted-globals` rule, see https://eslint.org/docs/rules/no-restricted-globals, it's quite likely that the rule will be incorrectly disabled rather than the required globals being imported as intended. To reduce duplication of the `no-restricted-globals` rule in multiple `.eslintrc` files, it's instead moved to the top-level `.eslintrc` file and disabled as needed on a folder/file basis outside of `/src` and `/web`.	2018-07-23 14:10:13 +02:00
Jonas Jenwald	8ec99b200c	Prevent Metadata/XML parsing from breaking `PDFDocumentProxy.getMetadata` when no XML root document is found (issue 8884) With the new XML parser, see PR 9573, the referenced PDF file now causes `getMetadata` to fail when incomplete XML tags are encountered. This provides a simple, and hopefully generally useful, work-around that may also help prevent future bugs. (Without being able to reproduce nor even understand the other (non XML) errors mentioned in issue 8884, I'd say that this patch is enough to close that one as fixed.)	2018-07-18 11:37:40 +02:00
Tim van der Meij	61db85ab64	Merge pull request #9886 from Snuffleupagus/bug-1473809 Prevent errors in `sanitizeTTProgram`, during parsing of CALL functions, when encountering invalid functions stack deltas (bug 1473809)	2018-07-15 17:23:52 +02:00
Jonas Jenwald	8e76d26e5b	Move the `toRoman` helper function out of the `Util` scope Compared to all the other (static) methods in `Util`, the `toRoman` one looks slightly out of place. Even more so considering that `Util` is being exposed through `pdfjsLib`, where access to a Roman numerals conversion method doesn't make much sense.	2018-07-10 10:45:25 +02:00
Jonas Jenwald	c1c49badff	Remove the, now unused, `Util.inherit` helper function	2018-07-10 10:29:47 +02:00
Jonas Jenwald	2b25deb84c	Prevent errors in `sanitizeTTProgram`, during parsing of CALL functions, when encountering invalid functions stack deltas (bug 1473809) I was feeling bored; so this is a very quick, and somewhat naive, attempt at fixing the bug. The breaking error, i.e. `Error during font loading: invalid array length`, was thrown when attempting to re-size the `stack` to a negative length when parsing the CALL functions. Fixes https://bugzilla.mozilla.org/show_bug.cgi?id=1473809.	2018-07-10 09:45:55 +02:00
Jonas Jenwald	bf6d45f85a	Convert `CMap` and `IdentityCMap` to ES6 classes Also changes `var` to `let`/`const` in code already touched in the patch.	2018-07-09 21:12:01 +02:00
Jonas Jenwald	b773b356af	Convert `NameOrNumberTree`, `NameTree`, and `NumberTree` to ES6 classes Also changes `var` to `let`/`const` in code already touched in the patch.	2018-07-09 21:12:01 +02:00
Jonas Jenwald	ba1af46709	Convert `CompiledFont`, `TrueTypeCompiled`, and `Type2Compiled` to ES6 classes Also changes `var` to `let`/`const` in code already touched in the patch.	2018-07-09 21:12:01 +02:00
Jonas Jenwald	775763a091	Ensure that `CompiledFont.compileGlyph` always returns an Array (PR 6141 follow-up) PR 6141 changed `CompiledFont.compileGlyph` to, in the general case, return an Array. However, that PR apparently forgot to update the no-glyph, empty-glyph, and endchar-glyph code-paths and a String was still being (incorrectly) returned. Given the way that `FontFaceObject.getPathGenerator` (on the API side) is implemented, this shouldn't have caused any bugs despite the Worker possibly returning unexpected data.	2018-07-09 21:12:01 +02:00
Tim van der Meij	646d81cd09	Merge pull request #9837 from timvandermeij/unreachable Replace `NotImplementedException` with `unreachable`	2018-07-09 21:10:36 +02:00
Tim van der Meij	907c7f190b	Convert `src/core/pdf_manager.js` to ES6 classes/syntax	2018-07-08 16:43:46 +02:00
Jonas Jenwald	a9ce4e8417	Stop exposing the `URL` polyfill in the global scope This moves/exposes the `URL` polyfill similarly to the existing `ReadableStream` polyfill, rather than exposing it globally, to avoid interfering with any "outside" code. Both the `URL` and `ReadableStream` polyfills are now exposed on the `pdfjsLib` object, such that they are accessible to the viewer components. Furthermore, the `no-restricted-globals` ESLint rule is also enabled to prevent accidental usage of the native `URL`/`ReadableStream` implementations directly in the `src/` and `web/` folders; see also https://eslint.org/docs/rules/no-restricted-globals Addresses the remaining TODO in https://github.com/mozilla/pdf.js/projects/6	2018-07-04 09:16:28 +02:00
Tim van der Meij	99f8f2c275	Merge pull request #9853 from Snuffleupagus/re-render-after-cancel Fix re-rendering, using the same canvas, when rendering was previously cancelled (PR 8519 follow-up)	2018-06-29 23:25:43 +02:00
Tim van der Meij	6fa2c779b5	Merge pull request #9838 from Snuffleupagus/invalid-path-OPS Error, rather than warn, once a number of invalid path operators are encountered in `EvaluatorPreprocessor.read` (bug 1443140)	2018-06-28 23:15:25 +02:00
Jonas Jenwald	bf0aca86d7	Fix re-rendering, using the same canvas, when rendering was previously cancelled (PR 8519 follow-up) Currently if `RenderTask.cancel` is called immediately after rendering was started, then by the time that `InternalRenderTask.initializeGraphics` is called rendering will already have been cancelled. However, we're still inserting the canvas into the `canvasInRendering` map, thus breaking any future attempts at re-rendering using the same canvas. Considering that `InternalRenderTask.cancel` always removes the canvas from the map, I cannot imagine that we'd ever want to re-add it after rendering was cancelled (it was likely just a simple oversight in PR 8519). Fixes 9456.	2018-06-28 22:56:37 +02:00
Tim van der Meij	14b69a4c1c	Merge pull request #9729 from Snuffleupagus/gulp-image_decoders Add a `gulp image_decoders` command to package the image decoders (i.e. jpg.js, jpx.js, jbig2.js) separately, and publish them in pdfjs-dist	2018-06-26 23:27:32 +02:00
Jonas Jenwald	74e9999044	Add unit-tests for `PDFPageProxy.stats` (PR 9245 follow-up) This wasn't included in PR 9245, since all the API options were still global at that time. Writing the unit-tests also uncovered an issue with `getOperatorList` not starting the "Page Request" timer.	2018-06-25 14:20:49 +02:00
Jonas Jenwald	7f21e38787	Error, rather than warn, once a number of invalid path operators are encountered in `EvaluatorPreprocessor.read` (bug 1443140) Incomplete path operators, in particular, can result in fairly chaotic rendering artifacts, as can be observed on page four of the referenced PDF file. The initial (naive) solution that was attempted, was to simply throw a `FormatError` as soon as any invalid (i.e. too short) operator was found and rely on the existing `ignoreErrors` code-paths. However, doing so would have caused regressions in some files; see the existing `issue2391-1` test-case, which was promoted to an `eq` test to help prevent future bugs. Hence this patch, which adds special handling for invalid path operators since those may cause quite bad rendering artifacts. You could, in all fairness, argue that the patch is a handwavy solution and I wouldn't object. However, given that this only concerns corrupt PDF files, the way that PDF viewers (PDF.js included) try to gracefully deal with those could probably be described as a best-effort solution anyway. This patch also adjusts the existing `warn`/`info` messages to print the command name according to the PDF specification, rather than an internal PDF.js enumeration value. The former should be much more useful for debugging purposes. Fixes https://bugzilla.mozilla.org/show_bug.cgi?id=1443140.	2018-06-24 16:05:08 +02:00
Tim van der Meij	2907827d31	Replace `NotImplementedException` with `unreachable`	2018-06-23 21:20:53 +02:00
Jonas Jenwald	275834ae66	Clean-up, and add JSDocs to, the `PDFDocumentProxy.loadingParams` method (PR 9830 follow-up)	2018-06-23 13:33:22 +02:00
Tim van der Meij	34594a5b02	Merge pull request #9830 from EugeneSqr/9824 Removed safari compatibility check (issue #9824)	2018-06-23 02:21:06 +02:00
Tim van der Meij	98ea39f9d0	Merge pull request #9827 from Snuffleupagus/misc-corrupt-pdf-fixes Fix various corrupt PDF files (issue 9252, issue 9418)	2018-06-21 22:35:00 +02:00
eugenesqr	331ac8ae74	removed safari compatibility check	2018-06-21 12:57:56 +03:00
Brendan Dahl	a278c5a8dc	Merge pull request #9795 from timvandermeij/object-assign Replace `Util.extendObj` by `Object.assign`	2018-06-20 10:50:40 -07:00
Jonas Jenwald	56e3648b65	Add basic validation of the 'trailer' dictionary candidates in `XRef.indexObjects` (issue 9418) This patch avoids choosing a (possible) 'trailer' dictionary that `XRef.parse` and/or the `Catalog` constructor/methods will reject anyway. Since `XRef.indexObjects` is already parsing the entire PDF file, the extra dictionary look-ups added here shouldn't matter much. Besides, this is a fallback code-path that only applies to corrupt PDF files anyway.	2018-06-20 13:41:22 +02:00
Jonas Jenwald	346810e02a	Add basic validation of the 'Root' dictionary in `XRef.parse` and try to recover when possible Note that the `Catalog` constructor, and some of its methods, are already enforcing that the 'Root' dictionary is valid/well-formed. However, by doing additional validation already in `XRef.parse` there's a slightly larger chance that corrupt PDF files could be successfully parsed/rendered.	2018-06-20 13:41:22 +02:00
Jonas Jenwald	e84813e7cc	Prevent hard errors if fetching the `Encrypt` dictionary fails in `XRef.parse`	2018-06-20 13:41:22 +02:00
Jonas Jenwald	30ad62a86a	Use the correct `startPos` when repeating the search for 'endobj' operators in `XRef.indexObjects` (PR 9288 follow-up)	2018-06-20 13:41:22 +02:00
Jonas Jenwald	6bbcafcd26	Let `Lexer.getNumber` treat a single decimal point as zero (issue 9252) This is consistent with the behaviour in Adobe Reader.	2018-06-20 13:41:21 +02:00
Jonas Jenwald	df4799a12a	Ensure that line-breaks are only skipped after operators in `Lexer.getNumber` (PR 8359 follow-up) With the current code line-breaks are accepted not just after an operator, but after a decimal point as well. When looking at this again, the latter case seems prone to cause false positives and might also interfere with subsequent patches. Hence this code is adjusted to actually do what the original commit message says, and nothing more.	2018-06-20 13:41:15 +02:00
Jonas Jenwald	303537bcb1	Add a `gulp image_decoders` command to allow packaging/distributing the image decoders (i.e. jpg.js, jpx.js, jbig2.js) separately from the main PDF.js library Please note that the standalone `pdf.image_decoders.js` file will be including the complete `src/shared/util.js` file, despite only using parts of it.[1] This was done purposely, to not negatively impact the readability/maintainability of the core PDF.js code. Furthermore, to ensure that the compatibility is the same in the regular PDF.js library and in the standalone image decoders, `src/shared/compatibility.js` was included as well. To (hopefully) prevent future complaints about the size of the built `pdf.image_decoders.js` file, a few existing async-related polyfills are being skipped (since all of the image decoders are completely synchronous). Obviously this required adding a couple of pre-processor statements, but given that these are all limited to "compatibility" code, I think this might be OK!? --- [1] However, please note that previous commits moved `PageViewport` and `MessageHandler` out of `src/shared/util.js` which reduced its size.	2018-06-16 17:56:54 +02:00
Jonas Jenwald	bfc88ead66	Expose a `Jbig2Image.parse` method, by re-instating the `parseJbig2` function The purpose of this patch is to hopefully provide slightly better user ergonomics, if/when the PDF.js image decoders are used standalone. This implementation is (basically) reverting the changes in PR 9386, in conjunction with code from the `parse` method found at https://github.com/notmasteryet/jpgjs/blob/master/src/pdfjs.js	2018-06-16 17:56:54 +02:00
Jonas Jenwald	682672db8e	Change the signature of the `JpegImage` constructor, to allow passing in various options directly	2018-06-16 17:56:54 +02:00
Tim van der Meij	620da6f4df	Merge pull request #9802 from Snuffleupagus/ColorSpace-PDFImage-Uint8ClampedArray Update `ColorSpace` and `PDFImage` to use `Uint8ClampedArray`s and remove manual clamping/rounding	2018-06-16 17:55:10 +02:00
Jonas Jenwald	0958006713	Send `UnsupportedFeature` notification when errors are ignored in `FontFaceObject.getPathGenerator`	2018-06-13 11:02:10 +02:00
Jonas Jenwald	bf0db0fb72	Pass the `ignoreErrors` API option to the `FontFaceObject` constructor, and utilize it in `getPathGenerator` to ignore missing glyphs Obviously it's still not possible to render non-embedded fonts as paths, but in this way the rest of the page will at least be allowed to continue rendering. Please note: Including the 14 standard fonts in PDF.js probably wouldn't be that difficult to implement. (I'm not a lawyer, but the fonts from PDFium could probably be used given their BSD license.) However, the main blocker ought to be the total size of the necessary font data, since I cannot imagine people being OK with shipping ~5 MB of (additional) font data with Firefox. (Based on the reactions when the CMap files were added, and those are only ~1 MB in size.)	2018-06-13 11:02:06 +02:00
Jonas Jenwald	fe288bb872	Refactor the `FontFaceObject.getPathGenerator` method - Reduce the overall indentation level, by making use of early returns. - Replace `var` with `let`.	2018-06-13 11:02:02 +02:00
Jonas Jenwald	778981ec89	Catch, and propagate, errors in the `requestAnimationFrame` branch of `InternalRenderTask._scheduleNext` To support these changes, `InternalRenderTask._next` now returns a Promise.	2018-06-13 11:01:58 +02:00
Jonas Jenwald	d4ff541b78	Enforce the use, in non-production/test-only mode, of `Uint8ClampedArray` in all relevant methods in `ColorSpace` and `PDFImage` Since `ColorSpace` now depends on the native clamping of `Uint8ClampedArray`, this patch adds non-production/test-only `assert`s to enforce that the expected TypedArray is used for the output. These `assert`s are purposely not included in PRODUCTION builds since that would break rendering completely, as opposed to "only" displaying some weird colours, when a `Uint8Array` was used. Furthermore, these are mostly added to help catch explicit developer errors when working with the `ColorSpace` and `PDFImage` code.	2018-06-12 11:01:32 +02:00
Jonas Jenwald	4b69bb7fe9	Add a TESTING build option, to enable using non-production/test-only code-paths Since the tests (currently) run with the `pdf.worker.js` file built, i.e. with `PRODUCTION = true` set, there's no simple way to add e.g. `assert` calls for both non-production and test-only builds without also affecting PRODUCTION builds.	2018-06-12 11:01:32 +02:00
Jonas Jenwald	f01e54eae1	Improve the warning messages printed by `PartialEvaluator.{getOperatorList, getTextContent}` when errors are being ignored Currently the actual errors aren't printed, which can make debugging harder than necessary.	2018-06-12 11:01:32 +02:00
Jonas Jenwald	731f2e6dfc	Remove manual clamping/rounding from `ColorSpace` and `PDFImage`, by having their methods use `Uint8ClampedArray`s The built-in image decoders are already using `Uint8ClampedArray` when returning data, and this patch simply extends that to the rest of the image/colorspace code. As far as I can tell, the only reason for using manual clamping/rounding in the first place was because TypedArrays used to be polyfilled (using regular arrays). And trying to polyfill the native clamping/rounding would probably have had too much overhead, but given that TypedArray support is required in PDF.js version `2.0` that's no longer a concern. Please note: Because of different rounding behaviour, basically `Math.round` in `Uint8ClampedArray` respectively `Math.floor` in the old code, there will be very slight movement in quite a few existing test-cases. However, the changes should be imperceptible to the naked eye, given that the absolute difference is at most `1` for each RGB component when comparing `master` and this patch (see also the updated expectation values in the unit-tests).	2018-06-12 11:01:32 +02:00
Jonas Jenwald	55199aa281	Remove the unused `bpc` parameter from, and update the signature of, the `resizeRgbImage` function in `src/core/colorspace.js`	2018-06-12 11:01:32 +02:00
Jonas Jenwald	d1637056b3	Use shorthand method signatures in `src/core/colorspace.js`	2018-06-12 11:01:32 +02:00
Jonas Jenwald	32367c5968	Make the `getBytes`/`peekBytes` methods of `Stream`/`DecodeStream`/`ChunkedStream` able to return `Uint8ClampedArray`s The built-in image decoders are already returning data as `Uint8ClampedArray`, and subsequently the JPEG/JBIG2/JPX streams are as well. However, for general streams we obviously don't want to force the use of `Uint8ClampedArray` unless an "Image" is actually being decoded. Hence this patch, which adds a parameter that allows the caller of the `getBytes`/`peekBytes` methods to force a `Uint8ClampedArray` (rather than a `Uint8Array`) to be returned.	2018-06-12 11:01:32 +02:00
Brendan Dahl	3ac638fad3	Merge pull request #9689 from RafaPolit/master Fixed critical unhandled promise that prevented error catching using API	2018-06-11 15:40:30 -06:00
Tim van der Meij	af8e88d00b	Replace `Util.extendObj` by `Object.assign`	2018-06-10 20:11:03 +02:00
Tim van der Meij	903bad1906	Remove `Util.appendToArray` and `Util.prependToArray` The former may be replaced by regular JavaScript array concatenation and the latter is unused. This avoids unnecessary function calls/imports.	2018-06-10 15:24:09 +02:00
Jonas Jenwald	07d610615c	Move, and modernize, `Util.loadScript` from `src/shared/util.js` to `src/display/dom_utils.js` Not only is the `Util.loadScript` helper function unused on the Worker side, even trying to use it there would throw an Error (since `document` isn't defined/available in Workers). Hence this helper function is moved, and its code modernized slightly by having it return a Promise rather than needing a callback function. Finally, to reduce code duplication, the "new" loadScript function is exported and used in the viewer.	2018-06-07 13:52:40 +02:00
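
The `Catalog.builtInCMapCache` entry in the log above (81b471c781) describes a general JavaScript pitfall: rebinding a property to a fresh Object does not affect references that other code already holds, whereas clearing a `Map` in place does. The following is a minimal sketch of that difference only. The class names `ObjectCacheOwner` and `MapCacheOwner`, the `cache` property, and the `"ExampleCMap"` key are hypothetical stand-ins and are not taken from the PDF.js source.

```javascript
// Hypothetical stand-in for an owner whose cache is a plain Object.
class ObjectCacheOwner {
  constructor() {
    this.cache = Object.create(null);
  }
  cleanup() {
    // Rebinds the property; the previous Object is untouched, so any
    // outside reference to it still sees the old entries.
    this.cache = Object.create(null);
  }
}

// Hypothetical stand-in for an owner whose cache is a Map.
class MapCacheOwner {
  constructor() {
    this.cache = new Map();
  }
  cleanup() {
    // Empties the Map in place; every holder of the reference sees it.
    this.cache.clear();
  }
}

// A second party (think of a page-like object) keeps its own reference.
const objOwner = new ObjectCacheOwner();
const objRef = objOwner.cache;
objRef["ExampleCMap"] = "cmap data";
objOwner.cleanup();
const staleKeys = Object.keys(objRef).length;

const mapOwner = new MapCacheOwner();
const mapRef = mapOwner.cache;
mapRef.set("ExampleCMap", "cmap data");
mapOwner.cleanup();
const remaining = mapRef.size;

console.log({ staleKeys, remaining });
```

In this sketch the Object-based reference still holds its entry after `cleanup`, while the Map-based one is empty, which is the behaviour the commit message relies on.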

3498 Commits