pdf.js

Author	SHA1	Message	Date
Jonas Jenwald	78c32c2697	Improve the handling of errors, in `PartialEvaluator.loadFont`, occuring in `PartialEvaluator.preEvaluateFont` (issue 12823) Currently any errors thrown in `preEvaluateFont`, which is a synchronous method, will not be handled at all in the `loadFont` method and we were thus failing to return an `ErrorFont`-instance as intended here. Also, add an explicit check in `PartialEvaluator.preEvaluateFont` to ensure that Type0-fonts always have a valid dictionary.	2021-01-07 11:38:38 +01:00
Tim van der Meij	ca18af6af3	Merge pull request #12774 from calixteman/doc_action_test JS -- Add tests for print/save actions	2021-01-03 18:46:37 +01:00
Tim van der Meij	50303fc8f4	Merge pull request #12766 from Snuffleupagus/issue-11004 Ignore, rather than throwing on, unsupported Coding style default (COD) options in JPEG 2000 images (issue 11004)	2020-12-28 20:26:10 +01:00
Calixte Denizet	ffd4bc790c	JS -- Add tests for print/save actions * change PDFDocument::hasJSActions to return true when there are JS actions in catalog.	2020-12-24 18:51:00 +01:00
Calixte Denizet	7c3facb174	JS -- Add support for buttons * radio buttons * checkboxes	2020-12-22 16:41:51 +01:00
Jonas Jenwald	cffb7af3b0	Ignore, rather than throwing on, unsupported Coding style default (COD) options in JPEG 2000 images (issue 11004) Similar to other markers that we currently skip, by ignoring unsupported Coding style default (COD) options we'll at least render something here (although some JPEG 2000 images may look slightly wrong). Note that if the unsupported COD options lead to additional errors, during parsing, we'll still abort parsing of the JPEG 2000 image.	2020-12-21 20:35:52 +01:00
Brendan Dahl	3ea1c43b15	Merge pull request #12751 from calixteman/da_not_a_string Add a default DA for textfield to avoid issues when printing or saving	2020-12-21 09:44:08 -08:00
Calixte Denizet	a7c682c600	Add a default DA for textfield to avoid issues when printing or saving * it aims to fix issue #12750	2020-12-19 23:38:45 +01:00
calixteman	e6e2809825	Merge pull request #12702 from calixteman/doc_actions JS - Collect and execute actions at doc level	2020-12-18 21:33:32 +01:00
Calixte Denizet	1e2173f038	JS - Collect and execute actions at doc and pages level * the goal is to execute actions like Open or OpenAction * can be tested with issue6106.pdf (auto-print) * once #12701 is merged, we can add page actions	2020-12-18 20:03:59 +01:00
Jonas Jenwald	48a76aea2b	Ignore, rather than throwing on, Coding style component (COC) markers in JPEG 2000 images (issue 12752) Similar to other markers that we currently skip, by ignoring the Coding style component (COC) marker we'll at least prevent outright errors (although some JPEG 2000 images may look slightly wrong).	2020-12-18 18:18:32 +01:00
Calixte Denizet	03814bd6a2	Don't use 'in' operator to check if key is in a Map	2020-12-16 16:00:12 +01:00
Tim van der Meij	d1848f5022	Merge pull request #12725 from brendandahl/remeasure-std Use widths defined by font for standard fonts.	2020-12-11 20:36:19 +01:00
Jonas Jenwald	67e5db75d8	Ignore color-operators in Type3 glyphs beginning with a `d1` operator (issue 12705) Please refer to the PDF specification at https://www.adobe.com/content/dam/acom/en/devnet/acrobat/pdfs/PDF32000_2008.pdf#G8.1977497 and https://www.adobe.com/content/dam/acom/en/devnet/acrobat/pdfs/PDF32000_2008.pdf#G7.3998470 This patch removes the color-operators in the evaluator, since that should be more efficient than doing it repeatedly in the main-thread when rendering the Type3 glyphs.	2020-12-11 15:49:13 +01:00
Brendan Dahl	45d9ab6e45	Use widths defined by font for standard fonts. There doesn't seem to be anything definitive about this in the spec, but from experimenting, it seems acrobat lets PDFs override the widths of the standard fonts.	2020-12-10 15:30:39 -08:00
Tim van der Meij	00b4f86db3	Merge pull request #12717 from Snuffleupagus/issue-12714 Ensure that the /Annots-entry, on /Page-instances, is actually an Array (issue 12714)	2020-12-10 23:06:59 +01:00
Calixte Denizet	25bf504ff5	Be sure that CalculationOrder is either null or a non-empty array	2020-12-10 16:02:11 +01:00
Jonas Jenwald	796a0d3155	Ensure that the /Annots-entry, on /Page-instances, is actually an Array (issue 12714) In the referenced PDF document, the second and third page has corrupt /Annots-entries which contain /Dict-data rather than the intended Arrays.	2020-12-10 11:42:00 +01:00
Tim van der Meij	012e15f7a3	Fix non-standard quadpoints orders for annotations This change requires us to use valid quadpoints arrays in the existing unit tests too due to the normalization.	2020-12-06 16:02:41 +01:00
Jonas Jenwald	c42029489e	Run `gulp lint --fix`, to account for changes in Prettier version `2.2.1` Please refer to https://github.com/prettier/prettier/blob/master/CHANGELOG.md#221 for additional details.	2020-11-29 10:01:46 +01:00
Tim van der Meij	256068556d	Merge pull request #12662 from Snuffleupagus/issue-12402 Check the top-level /Pages dictionary when finding the trailer in `XRef.indexObjects` (issue 12402)	2020-11-25 21:54:41 +01:00
Jonas Jenwald	8a132f584d	Check the top-level /Pages dictionary when finding the trailer in `XRef.indexObjects` (issue 12402) In addition to the existing /Root and /Pages validation, also check that the /Pages-entry actually is a dictionary and that it has a valid /Count-entry. This way we can avoid picking a trailer candidate which e.g. the `Catalog.numPages` getter will just end up rejecting, thus breaking PDF document loading completely.	2020-11-25 15:14:53 +01:00
Calixte Denizet	18b525de2e	Parenthesis in names are not escaped when saving	2020-11-25 12:28:12 +01:00
Calixte Denizet	b11592a756	JS -- hidden annotations must be built in case a script show them * in some pdf, there are actions with "event.source.hidden = ..." * in order to handle visibility when printing, annotationStorage is extended to store multiple properties (value, hidden, editable, ...)	2020-11-10 12:48:34 +01:00
Calixte Denizet	a5279897a7	JS -- Add listener for sandbox events only if there are some actions * When no actions then set it to null instead of empty object * Even if a field has no actions, it needs to listen to events from the sandbox in order to be updated if an action changes something in it.	2020-11-09 18:37:59 +01:00
Jonas Jenwald	a03b383edb	Fail early, in modern `GENERIC` builds, if `globalThis` isn't available (PR 11799 follow-up, issue 12596) It probably doesn't hurt to explicitly check for `globalThis` as well, in addition to the existing checks.	2020-11-07 19:00:33 +01:00
Tim van der Meij	99ac2d1036	Merge pull request #12583 from Snuffleupagus/nonBlendModesSet Add global caching, for /Resources without blend modes, and use it to reduce repeated fetching/parsing in `PartialEvaluator.hasBlendModes`	2020-11-05 23:53:39 +01:00
Tim van der Meij	646f895d35	Merge pull request #12568 from calixteman/defaultvalue [api-minor] JS -- Add default value in annotation data	2020-11-05 22:53:21 +01:00
Jonas Jenwald	082cd8fc6c	Add global caching, for /Resources without blend modes, and use it to reduce repeated fetching/parsing in `PartialEvaluator.hasBlendModes` The `PartialEvaluator.hasBlendModes` method is necessary to determine if there's any blend modes on a page, which unfortunately requires synchronous parsing of the /Resources of each page before its rendering can start (see the "StartRenderPage"-message). In practice it's not uncommon for certain /Resources-entries to be found on more than one page (referenced via the XRef-table), which thus leads to unnecessary re-fetching/re-parsing of data in `PartialEvaluator.hasBlendModes`. To improve performance, especially in pathological cases, we can cache /Resources-entries when it's absolutely clear that they do not contain any blend modes at all[1]. This way, subsequent `PartialEvaluator.hasBlendModes` calls can be made significantly more efficient. This patch was tested using the PDF file from issue 6961, i.e. https://github.com/mozilla/pdf.js/files/121712/test.pdf: ``` [ { "id": "issue6961", "file": "../web/pdfs/issue6961.pdf", "md5": "a80e4357a8fda758d96c2c76f2980b03", "rounds": 100, "type": "eq" } ] ``` which gave the following results when comparing this patch against the `master` branch: ``` -- Grouped By browser, page, stat -- browser \| page \| stat \| Count \| Baseline(ms) \| Current(ms) \| +/- \| % \| Result(P<.05) ------- \| ---- \| ------------ \| ----- \| ------------ \| ----------- \| ---- \| ------ \| ------------- firefox \| 0 \| Overall \| 100 \| 1034 \| 555 \| -480 \| -46.39 \| faster firefox \| 0 \| Page Request \| 100 \| 489 \| 7 \| -482 \| -98.67 \| faster firefox \| 0 \| Rendering \| 100 \| 545 \| 548 \| 2 \| 0.45 \| firefox \| 1 \| Overall \| 100 \| 912 \| 428 \| -484 \| -53.06 \| faster firefox \| 1 \| Page Request \| 100 \| 487 \| 1 \| -486 \| -99.77 \| faster firefox \| 1 \| Rendering \| 100 \| 425 \| 427 \| 2 \| 0.51 \| ``` --- [1] In the case where blend modes are found, it becomes a lot more difficult to know if it's generally safe to skip /Resources-entries. Hence we don't cache anything in that case, however note that most document/pages do not utilize blend modes anyway.	2020-11-05 16:59:08 +01:00
Calixte Denizet	39f5954729	JS -- Add default value in annotation data * these values are used when a form is resetted	2020-11-05 13:44:23 +01:00
Brendan Dahl	1de2bc4816	Merge pull request #12505 from calixteman/12504 Split highlight annotation div into multiple divs	2020-11-04 10:41:28 -08:00
Tim van der Meij	3e52098e29	Merge pull request #12555 from calixteman/color Replace css color rgb(...) by #...	2020-11-02 23:55:39 +01:00
Calixte Denizet	9d11b51a3e	Replace css color rgb(...) by #... * it's faster to generate the color code in using a table for components * it's very likely a way faster to parse (when setting the color in the canvas)	2020-11-02 10:25:04 +01:00
Tim van der Meij	46e60a266c	Merge pull request #12552 from Snuffleupagus/annotation-fixes Miscellaneous (small) improvements in `src/core/annotation.js`	2020-10-31 00:41:39 +01:00
Tim van der Meij	e341e6e542	Merge pull request #12525 from brendandahl/mark-info [api-minor] Implement API to get MarkInfo from the catalog.	2020-10-31 00:05:19 +01:00
Brendan Dahl	f5c821e9c3	[api-minor] Implement API to get MarkInfo from the catalog.	2020-10-30 10:59:45 -07:00
Jonas Jenwald	fdb6520012	Change the `Catalog.openAction` getter back to using an Object internally (PR 12543 follow-up) Given that the `Map`-pattern apparently has undesirable performance characteristics, change this getter back to using an Object instead and check its size before returning it.	2020-10-30 13:27:05 +01:00
Jonas Jenwald	a1e5581a0b	Let `Annotation._collectActions` return `null` when no actions are present Rather than returning an empty Object[1] we should be returning `null` instead, since that's consistent with existing API-functionality. To avoid having to manually track if the Object is empty, this patch also introduces a small helper function to check its size.	2020-10-30 13:23:05 +01:00
Jonas Jenwald	8540b4cc76	Stop calling `Font.charsToGlyphs`, in `src/core/annotation.js`, with unused arguments As can be seen in `src/core/fonts.js`, this method only accepts one parameter, hence it's somewhat difficult to understand what the Annotation-code is actually attempting to do here. The only possible explanation that I can imagine, is that the intention was initially to call `Font.charToGlyph` directly instead. However, note that that'd would not actually have been correct, since that'd ignore one level of font-caching (see `this.charsCache`). Hence the unused arguments are removed, in `src/core/annotation.js`, and the `Font.charToGlyph` method is now marked as "private" as intended.	2020-10-30 13:17:52 +01:00
Jonas Jenwald	46e94cad17	Fix some errors reported by the ESLint `no-useless-escape` rule This patch removes unnecessary escape-sequence in (mostly) strings, as a first step, since the ones in regular expressions probably requires more careful testing (just in case). The only exception is a regular expression in `src/core/annotation.js`, since we should have both unit- and reference-tests for this code and given [this information on MDN](https://developer.mozilla.org/en-US/docs/Web/JavaScript/Guide/Regular_Expressions/Character_Classes#Types): > Inside a character set, the dot loses its special meaning and matches a literal dot. Please find additional details about the ESLint rule at https://eslint.org/docs/rules/no-useless-escape	2020-10-29 15:40:40 +01:00
Jonas Jenwald	9fc7cdcc9d	Use a `Map`, rather than an `Object`, internally in the `Catalog.openAction` getter (PR 11644 follow-up) This provides a work-around to avoid having to conditionally try to initialize the `openAction`-object in multiple places. Given that `Object.fromEntries` doesn't seem to guarantee that a `null` prototype is used, we thus hack around that by using `Object.assign` with `Object.create(null)`.	2020-10-28 14:43:28 +01:00
Tim van der Meij	ea4d88a330	Merge pull request #12395 from calixteman/checks Render not displayed annotations in using normal appearance when printing	2020-10-28 00:11:10 +01:00
Calixte Denizet	6be2f84b4e	Render not displayed annotations in using normal appearance when printing	2020-10-27 19:00:31 +01:00
Tim van der Meij	71a14be8e7	Merge pull request #12534 from Snuffleupagus/murmurhash-slice Ensure that `MurmurHash3_64.update` handles `ArrayBuffer` input correctly, to avoid hash-collisions (issue 12533)	2020-10-26 23:34:03 +01:00
Jonas Jenwald	f2fa053c51	Ensure that `MurmurHash3_64.update` handles `ArrayBuffer` input correctly, to avoid hash-collisions (issue 12533) Different fonts incorrectly end up with identical hashes, despite having different /ToUnicode data. The issue, and it's very interesting that we've apparently not seen it before, appears to be caused by the fact that different /ToUnicode entries share the same underlying `ArrayBuffer`, which thus becomes problematic at the `const dataUint32 = new Uint32Array(data.buffer, 0, blockCounts);` line. The simplest solution thus seem to be to just copy the input, when it's an `ArrayBuffer`, rather than using it as-is. (Note that if we'd stringified the input, when calling `MurmurHash3_64.update`, the issue would also have been fixed. In this case, we're already creating an unique TypedArray.)	2020-10-26 16:27:33 +01:00
Jonas Jenwald	56fa6d414c	Add a `getArrayLookupTableFactory` helper function and use it to re-format `src/core/{glyphlist, unicode}.js` Please note: Once https://bugzilla.mozilla.org/show_bug.cgi?id=1247687 is implemented, and we've removed SystemJS completely, this entire patch can (and even should) be reverted. This is similar to the existing `getLookupTableFactory` helper function, but is implemented as outlined in issue 6774. The re-formatting of the tables were done automatically, by using find-and-replace with regular expressions. For reasons that I don't even pretend to understand, using this particular structure for these very long lookup tables allow SystemJS to process the files correctly/quickly and the development viewer thus works as intended.	2020-10-26 11:08:00 +01:00
Jonas Jenwald	441d9c8cc0	Change `src/core/{glyphlist, unicode}.js` to use standard `import`/`export` statements While the built `pdf.worker.js` file still works correctly with these changes, despite these two files being excluded by Babel[1], the development viewer does not because of issues with SystemJS[2] and/or its Babel-plugin (both of which are old). Furthermore, note also that excluding these two files from Babel-processing isn't generally necessary since e.g. the `gulp mozcentral` command works anyway. The explanation is rather that it's actually the source-map generation which fails for these huge sequences when building the `pdf.worker.js` file. However, not using standard `import`/`export` statements in all files means we also need to use SystemJS when e.e. running the unit-tests. This is very unfortunate, since SystemJS (or its old Babel-version) doesn't support modern ECMAScript features such as e.g. optional chaining and nullish coalescing. Unfortunately it also seems that https://bugzilla.mozilla.org/show_bug.cgi?id=1247687, which tracks the implementation of worker-modules in Firefox, has stalled since there hasn't been any updates for six months now. To hopefully address all of the above, this patch is the first in a series that attempts to further reduce our reliance on SystemJS. --- [1] The only difference being how the dependencies are handled, in the Webpack-bundled file. [2] Parsing takes way too long and consumes too much memory, thus rendering the development viewer essentially unusable.	2020-10-26 11:08:00 +01:00
Tim van der Meij	b4ca3d55b8	Merge pull request #12508 from calixteman/button_fallback_font Fallback font for buttons must be ZapfDingbats.	2020-10-24 18:56:12 +02:00
Tim van der Meij	180f35ee91	Merge pull request #12526 from Snuffleupagus/TilingPattern-args Improve argument/name handling when parsing TilingPatterns (PR 12458 follow-up)	2020-10-24 15:47:57 +02:00
Tim van der Meij	c493dc96fa	Merge pull request #12516 from Snuffleupagus/fieldObjects-annotation-undefined Prevent issues, in `PDFDocument.fieldObjects`, for invalid Annotations	2020-10-24 15:42:33 +02:00

1 2 3 4 5 ...

1859 Commits