pdf.js

Author	SHA1	Message	Date
Jonas Jenwald	d742daf4b7	Merge pull request #17647 from Snuffleupagus/babel-rm-empty-nodes Remove empty, top-level, nodes in the Babel plugin	2024-02-12 09:25:59 +01:00
Jonas Jenwald	14ef0b4211	Remove empty, top-level, nodes in the Babel plugin Looking at the built files you'll notice some lines containing nothing more than a semicolon. This is the result of (mostly top-level) `if`-statements, which include `PDFJSDev`-checks, that evalute to `false` during Babel parsing. This has always annoyed me a bit, and looking at Babel plugin it seems that we can fix this simply by removing the relevant nodes.	2024-02-09 13:58:24 +01:00
Calixte Denizet	275b6748b6	Update quickjs to 3f81070e573e3592728dbbbd04c84c498b20d6dc According to: `3f81070e57` this is a new release of quickjs.	2024-02-09 11:51:24 +01:00
Jonas Jenwald	6a78cf0d93	Remove support for `require` statements from the build system This part of the (modern) preprocessor is now dead code, since we no longer use `require` statements anywhere in the main code-base. Note that as part of the changes leading up to PDF.js version `4` we removed all[1] the remaining `require` statements, and we also have an ESLint rule to ensure that no new ones are accidentally added. --- [1] With two small exceptions, in benchmarking-code and in the Webpack-example.	2024-02-07 13:34:46 +01:00
Nicolò Ribaudo	a352f28785	Fix transform of unary expression in Babel plugin All of our static evaluation & dead-code elimination transforms need to happen in post-order, transforming inner nodes first. This is so that in complex nested cases all transforms see the simplified version of their inner nodes. For example: async getNimbusExperimentData() { if (!PDFJSDev.test("GECKOVIEW")) { return null; } // other code } -> [evaluation of PDFJSDev.*] async getNimbusExperimentData() { if (!false) { return null; } // other code } -> [!false -> true] async getNimbusExperimentData() { if (true) { return null; } // other code } -> [if (true) -> replace with the if branch] async getNimbusExperimentData() { return null; // other code } -> [early return -> remove dead code] async getNimbusExperimentData() { return null; // other code } This was done correctly in all cases except for our `UnaryExpression` transform, which was happening in pre-order.	2024-01-29 11:53:17 +01:00
Nicolò Ribaudo	f724ae98a1	Replace the webpack+acorn transform with a Babel plugin This commit converts the pdfjsdev-loader transform into a Babel plugin, to skip a AST->string->AST round-trip. Before this commit, the webpack build process was: 1. Babel parses the code 2. Babel transforms the AST 3. Babel generates the code 4. Acorn parses the code 5. pdfjsdev-loader transforms the AST 6. @javascript-obfuscator/escodegen generates the code 7. Webpack parses the file 8. Webpack concatenates the files After this commit, it is reduced to: 1. Babel parses the code 2. Babel transforms the AST 3. babel-plugin-pdfjs-preprocessor transforms the AST 4. Babel generates the code 5. Webpack parses the file 6. Webpack concatenates the files This change improves the build time by ~25% (tested on MacBook Air M2): - `gulp lib`: 3.4s to 2.6s - `gulp dist`: 36s to 29s - `gulp generic`: 5.5s to 4.0s - `gulp mozcentral`: 4.7s to 3.2s The new Babel plugin doesn't support the `saveComments` option of pdfjsdev-loader, and it just always discards comments. Even though pdfjsdev-loader supported multiple values for that option, it was effectively ignored due to `acorn` dropping comments by default.	2024-01-23 16:00:59 +01:00
Nicolò Ribaudo	f5bb9bc21b	Rename preprocessor2.mjs to babel-plugin-pdfjs-preprocessor.mjs This is in preparation for the next commit, which will convert preprocessor2.mjs to a Babel plugin. The purpose of this commit is to help git track the rename regardless of the large amount of changes.	2024-01-23 13:13:43 +01:00
Takashi Tamura	61ed77cfb4	Rename .d.ts to .d.mts. Close #17241 Add a type test for legacy. - https://www.typescriptlang.org/docs/handbook/modules/reference.html#file-extension-substitution	2023-11-12 07:30:36 +09:00
Jonas Jenwald	13ca668be0	Update `external/dist/webpack.js` to account for outputting of JavaScript modules (PR 17055 follow-up) Hopefully this makes sense, since I don't know enough about Webpack to tell exactly how this file is being used in practice.	2023-11-04 13:04:26 +01:00
Calixte Denizet	2967eca605	Update the path to get all locales and update locales	2023-10-25 12:47:06 +02:00
Jonas Jenwald	c60401a765	Merge pull request #17133 from Snuffleupagus/rm-builder-merge Use object destructuring, rather than the `merge` helper, in the gulpfile	2023-10-19 15:41:40 +02:00
Calixte Denizet	66982a2a11	[api-minor] Move to Fluent for the localization (bug 1858715) - For the generic viewer we use @fluent/dom and @fluent/bundle - For the builtin pdf viewer in Firefox, we set a localization url and then we rely on document.l10n which is a DOMLocalization object.	2023-10-19 11:20:41 +02:00
Jonas Jenwald	ae664ea8e0	Use object destructuring, rather than the `merge` helper, in the gulpfile This helper function was originally added in PR 1953, eleven years ago, at which point object destructuring didn't exist.	2023-10-18 13:49:26 +02:00
Jonas Jenwald	d53093045a	Enable the `import/no-commonjs` ESLint plugin rule Given the amount of work put into removing `require`-calls from the code-base, let's ensure that new ones aren't accidentally added in the future. Note that we still have a couple of files where `require` is being used, in particular: - The Node.js examples, however those will be updated to use `import` in PR 17081. - The Webpack examples, and related support files, however I unfortunately don't know enough about Webpack to be able to update those. (Hopefully users of that code will help out here, once version `4` is released.) - The `statcmp`-tool, since some of those `require`-calls cannot be converted to `import` without other code changes (and that file is only used during benchmarking). Please find additional details at https://github.com/import-js/eslint-plugin-import/blob/main/docs/rules/no-commonjs.md	2023-10-14 12:49:17 +02:00
Jonas Jenwald	927e50f5d4	[api-major] Output JavaScript modules in the builds (issue 10317) At this point in time all browsers, and also Node.js, support standard `import`/`export` statements and we can now finally consider outputting modern JavaScript modules in the builds.[1] In order for this to work we can only use proper `import`/`export` statements throughout the main code-base, and (as expected) our Node.js support made this much more complicated since both the official builds and the GitHub Actions-based tests must keep working.[2] One remaining issue is that the `pdf.scripting.js` file cannot be built as a JavaScript module, since doing so breaks PDF scripting. Note that my initial goal was to try and split these changes into a couple of commits, however that unfortunately didn't really work since it turned out to be difficult for smaller patches to work correctly and pass (all) tests that way.[3] This is a classic case of every change requiring a couple of other changes, with each of those changes requiring further changes in turn and the size/scope quickly increasing as a result. One possible "issue" with these changes is that we'll now only output JavaScript modules in the builds, which could perhaps be a problem with older tools. However it unfortunately seems far too complicated/time-consuming for us to attempt to support both the old and modern module formats, hence the alternative would be to do "nothing" here and just keep our "old" builds.[4] --- [1] The final blocker was module support in workers in Firefox, which was implemented in Firefox 114; please see https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Statements/import#browser_compatibility [2] It's probably possible to further improve/simplify especially the Node.js-specific code, but it does appear to work as-is. [3] Having partially "broken" patches, that fail tests, as part of the commit history is really not a good idea in general. [4] Outputting JavaScript modules was first requested almost five years ago, see issue 10317, and nowadays there should be much better support for JavaScript modules in various tools.	2023-10-07 09:31:08 +02:00
Jonas Jenwald	5711d0f95d	[GeckoView] Exclude `annotation_editor_layer_builder.css` in the build (issue 16994) Given the limitations of the old pre-processor that's used for CSS/HTML files, this unfortunately isn't as "easy" to implement as it is for JavaScript code. Since this is the first case where we've wanted to do conditional CSS imports, rather than trying to completely re-write the pre-processor, this patch settles for handling it explicitly in the `expandCssImports` function.	2023-09-21 15:51:33 +02:00
Jonas Jenwald	c018070e80	Enable the `no-lonely-if` ESLint rule These changes were mostly done automatically, using `gulp lint --fix`, and only a few spots with comments needed manual tweaking; please see https://eslint.org/docs/latest/rules/no-lonely-if	2023-07-21 20:10:44 +02:00
Jonas Jenwald	5174232326	[ESM] Convert the `external/builder/`-folder to use standard modules	2023-07-11 11:38:59 +02:00
Tim van der Meij	1972b7311b	Merge pull request #16667 from Snuffleupagus/esm-cmaps [ESM] Convert the "cmaps"-task to use `import()` syntax	2023-07-09 15:33:57 +02:00
Jonas Jenwald	f012fc5e70	[ESM] Convert the "cmaps"-task to use `import()` syntax	2023-07-08 18:52:58 +02:00
Jonas Jenwald	a650fcd634	[ESM] Convert the `external/importL10n`-folder to use standard modules	2023-07-08 09:36:32 +02:00
Jonas Jenwald	5f5db4b160	Run the PDF.js-viewer API unit-test in Node.js environments (PR 16592 follow-up) It occurred to me that we can actually run this unit-test in Node.js environments by making use of the preprocessor to stub out the browser globals there.	2023-06-26 09:37:34 +02:00
calixteman	2d2f7b315e	Merge pull request #16363 from calixteman/use_local_font [api-minor] Use a local font or fallback on an embedded one (if it exists) for non-embedded fonts (bug 1766039)	2023-05-10 14:19:05 +02:00
Calixte Denizet	53134c0c0b	[api-minor] Use a local font or fallback on an embedded one (if it exists) for non-embedded fonts (bug 1766039) - Replace FoxitSans with LiberationSans: LiberationSans is already there (for XFA) and we can use it as a good replacement of FoxitSans. - For now we just try to substitue standard fonts, the strategy is the following: * we try to find a font locally from a hardcoded list; * if it fails then we use Liberation as fallback (only for Helvetica for the moment); * else we just fallback on the system serif/sansserif/monospace font.	2023-05-10 14:10:23 +02:00
Jonas Jenwald	dcd55a7164	Enable `unicorn/prefer-at` unconditionally (PR 15014 follow-up) Now that Node.js version 18 is required, we should be able to use `Array.prototype.at()` everywhere in the code-base.	2023-05-07 13:43:19 +02:00
Jonas Jenwald	95bf9fc17f	Remove SystemJS usage, in development mode, from the worker Now that https://bugzilla.mozilla.org/show_bug.cgi?id=1247687 has landed in Firefox, we're able to use worker-modules during development :-) This removes the final piece of SystemJS usage from the PDF.js library, thus allowing a fair bit of clean-up, and we now use only native `import`/`export` statements everywhere in development mode.	2023-04-29 13:43:24 +02:00
Jonas Jenwald	62a9435190	[api-minor] Stop including the "lib"-build in the `pdfjs-dist` repository The `pdfjs-dist/lib/` directory contains a README file that explicitly advises against using those files, however based on a fairly large number of issues filed over the years users seem to be (mostly) overlooking that warning. In particular it unfortunately seems to be somewhat common for users to attempt to "combine" proper builds from `pdfjs-dist/build/` together with individual components from the `pdfjs-dist/lib/web/` directory, which more often than not leads to subtle bugs and general problems. When we receive bug reports about this it's often not immediately obvious what the problem is, given that many issues lack enough details (such as runnable test-cases), but after some back-and-forth it usually turns out that usage of `pdfjs-dist/lib/` is the culprit. Considering that keeping the general PDF.js library working is challenging and time-consuming enough nowadays, this patch thus proposes that we stop including the "lib"-build in the `pdfjs-dist` repository to both reduce user confusion and the support burden.	2023-04-26 12:11:13 +02:00
Jonas Jenwald	1fc09f0235	Enable the `unicorn/prefer-string-replace-all` ESLint plugin rule Note that the `replaceAll` method still requires that a global regular expression is used, however by using this method it's immediately obvious when looking at the code that all occurrences will be replaced; please see https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Global_Objects/String/replaceAll#parameters Please find additional details at https://github.com/sindresorhus/eslint-plugin-unicorn/blob/main/docs/rules/prefer-string-replace-all.md	2023-03-23 12:57:10 +01:00
Jonas Jenwald	5f64621d46	Use `String.prototype.replaceAll()` where appropriate This fairly new method allows replacing multiple occurrences within a string without having to use regular expressions. Please refer to: - https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Global_Objects/String/replaceAll - https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Global_Objects/String/replaceAll#browser_compatibility	2023-03-22 15:31:10 +01:00
Jonas Jenwald	c0a023eaf9	Update various README files to be less specific about the supported JavaScript features By being less specific about which exact JavaScript features are required for the default vs `legacy` build, we don't need to worry about keeping multiple README files up-to-date. These README files will now refer back to the FAQ for current browser/environment support information.	2023-01-25 15:46:53 +01:00
Sam Magura	1c2d200918	[api-minor] Use new Worker() syntax in webpack entrypoint This requires Webpack 5 and will break for anyone using Webpack 4. worker-loader no longer needs to be installed.	2022-09-13 11:12:00 -04:00
Jonas Jenwald	6e31799948	[api-minor] Add the Babel `targets`-option to avoid transpiling code for unsupported browsers Currently we simply use the Babel `preset-env` in the `legacy`-builds of the PDF.js library. This has the side-effect of transpiling the code for very old browsers/environments, including ones that we (since many years) no longer support which unnecessarily bloats the size of the `legacy`-builds. For the CSS files we're only targeting the supported browsers, and it's thus possible to extend that to also apply to Babel. One of the most significant changes, with this patch, is that we'll no longer polyfill `async`/`await` in the `legacy`-builds. However, this shouldn't be an issue given the browsers that we currently support in PDF.js; please refer to: - https://github.com/mozilla/pdf.js/wiki/Frequently-Asked-Questions#faq-support - https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Statements/async_function#browser_compatibility	2022-08-19 22:19:43 +02:00
Jonas Jenwald	37ebc28756	Use more `for...of` loops in the code-base Note that these cases, which are all in older code, were found using the [`unicorn/no-for-loop`](https://github.com/sindresorhus/eslint-plugin-unicorn/blob/main/docs/rules/no-for-loop.md) ESLint plugin rule. However, note that I've opted not to enable this rule by default since there's still some cases where I do think that it makes sense to allow "regular" for-loops.	2022-07-17 16:18:54 +02:00
Jonas Jenwald	c21f4faaf8	Reduce unnecessary usage of `Array.prototype.concat()` There are obviously cases where using `concat` makes perfect sense, since that method doesn't change any of the existing Arrays; see https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Global_Objects/Array/concat However, in a few cases throughout the code-base that's not an issue and using `concat` only leads to unnecessary intermediate allocations. With modern JavaScript we can thus replace those with a combination of `push` and spread-syntax, which wasn't originally possible when the code was written.	2022-06-19 13:40:52 +02:00
Jonas Jenwald	4902ad8923	Use modern DOM methods a bit more (PR 15031 follow-up) Apparently the ESLint rule added in PR 15031 wasn't able to catch all cases that can be converted, which is probably not all that surprising given how some of these call-sites look. - Use `Element.prepend()` to insert nodes before all other ones in the element, rather than using `firstChild` with `insertBefore`-calls; see https://developer.mozilla.org/en-US/docs/Web/API/Element/prepend - Fix one incorrect `insertBefore` call, in the AnnotationLayer-code. Initially the patch simply changed that to an `Element.before()`-call, however that broke one of the integration-tests. It turns out that the `index` may try to access a non-existent select-child, which triggers undefined behaviour; note the warning in https://developer.mozilla.org/en-US/docs/Web/API/Node/insertBefore#parameters	2022-06-13 10:47:37 +02:00
Jonas Jenwald	9ac4536693	Enable the `unicorn/prefer-at` ESLint plugin rule (PR 15008 follow-up) Please find additional information here: - https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Global_Objects/Array/at - https://github.com/sindresorhus/eslint-plugin-unicorn/blob/main/docs/rules/prefer-at.md	2022-06-09 21:21:19 +02:00
Jonas Jenwald	397f2e63d0	Handle CSS-comments better in the preprocess-function (PR 14963 follow-up) This fixes another oversight, please see the updated tests.	2022-06-02 16:06:47 +02:00
Jonas Jenwald	65fe0130f4	Handle CSS-comments correctly in the `preprocess`-function (PR 14886 follow-up) I overlooked this in PR 14886, sorry about that!	2022-05-28 08:41:25 +02:00
Jonas Jenwald	ec6575db00	Avoid the `preprocess`-function adding consecutive blank lines When pre-processor blocks are being removed, since they don't apply to the current build target, we may currently end up with consecutive blank lines. While this is obviously not a big issue, it's nonetheless undesirable and we can adjust the `writeLine` function to prevent that.	2022-05-11 14:21:16 +02:00
Jonas Jenwald	527251d62b	Update the `preprocess`-function to avoid adding trailing new-lines (issue 14902) This is a follow-up to PR 14886, which "broke" this. In addition to fixing the issue, using an Array and `join`-ing it at the end may also be a tiny bit more efficient than using a growing string.	2022-05-11 12:36:00 +02:00
Jonas Jenwald	d1f13a6af3	Use the regular `preprocess`-function for the CSS files as well An old shortcoming of the `preprocessCSS`-function is its complete lack of support for our "normal" defines, which makes it very difficult to have build-specific CSS rules. Recently we've started using specially crafted comments to remove CSS rules from the MOZCENTRAL build, but (ab)using the `preprocessCSS`-function in this way really doesn't feel great. However, it turns out to be surprisingly simple to instead use the "regular" `preprocess`-function for the CSS files as well. The only special-handling that's still necessary is the helper-function for dealing with CSS-imports, but apart from that everything seems to just work. One reason, as far as I can tell, for having a separate `preprocessCSS`-function was likely that we originally used lots of vendor-prefixed CSS rules in our CSS files. With improvements over the years, especially thanks to Autoprefixer and PostCSS, we've been able to remove almost all non-standard CSS rules and the need for special-casing the CSS parsing has mostly vanished. Please note: As part of testing this patch I've diffed the output of `gulp generic`, `gulp mozcentral`, and `gulp chromium` against the `master`-branch to check that there was no obvious breakage.	2022-05-07 22:45:52 +02:00
Calixte Denizet	2dba91e0ed	Update quickjs to revision 2788d71e823b522b178db3b3660ce93689534e6d - The date parser in quickjs is not optimal so use the navigator one.	2022-05-01 15:53:50 +02:00
Jonas Jenwald	5180bafb8f	[gulpfile.js] Use the regular `defines` in the `preprocessCSS` function Rather than manually specifying a "mode", we can simply use the regular `defines` directly instead. To improve consistency, in the `external/builder/builder.js` file, a couple of parameters are also re-named.	2022-03-16 22:39:48 +01:00
Jonas Jenwald	b89595fd20	[api-minor] Remove the, in `legacy` builds, bundled `ReadableStream` polyfill According to the MDN compatibility data, see https://developer.mozilla.org/en-US/docs/Web/API/ReadableStream#browser_compatibility, all browsers that we support have native `ReadableStream` implementations (since quite some time too). Hence only Node.js is now lagging behind w.r.t. `ReadableStream` support, and its experimental implementation doesn't really help us given the life-span of the LTS releases (see https://en.wikipedia.org/wiki/Node.js#Releases). It seems quite unfortunate to bundle a `ReadableStream` polyfill in the `legacy` builds when it's unnecessary in browsers, given its overall size, but fortunately we can avoid that by simply listing `web-streams-polyfill` as a dependency for the `pdfjs-dist` library.	2022-02-13 10:15:58 +01:00
Calixte Denizet	d9921a2bd5	Update quickjs sandbox - compiled with the latest emscripten: - Digest:sha256:a28bd5ddf32c2b145b51503ddddb3c02804ab624d661abd54173b1f2dc6cbf06	2022-01-29 21:30:39 +01:00
Jonas Jenwald	00bd549e82	Update the year in the `license_header` files This also includes a couple of files that are included as-is in the `pdfjs-dist` library.	2022-01-27 19:24:31 +01:00
Jonas Jenwald	00f8fab8a5	Add support for modern ECMAScript `class` features With ESLint 8 we should now finally be able to start using modern `class` features, see https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Classes/Public_class_fields and https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Classes/Private_class_fields However, while both ESLint and Acorn now support this, it unfortunately turns out that Escodegen (which we use during building) still lack the necessary support. Looking at https://github.com/estools/escodegen there's not been any updates since last year, and there's also open PRs adding support for these new `class` features. To avoid blocking usage of these `class` features in the PDF.js code-base, in particular private fields/methods, this patch thus proposes that we (hopefully temporarily) switch to an `escodegen` fork that has the necessary support; please see https://www.npmjs.com/package/@javascript-obfuscator/escodegen While I have no reason to doubt the security of the `escodegen` fork, this patch nonetheless pins the version number. Furthermore, I've also diffed the output of the two `.js`-files in this forked package against the original files without finding anything that looks immediately "dangerous".	2021-10-22 22:01:17 +02:00
Jonas Jenwald	aae8a21286	Revert "For mozcentral use Firefox color theme instead of system theme." since -moz-toolbar-prefers-color-scheme was removed Reverts mozilla/pdf.js#13314, see https://groups.google.com/g/firefox-dev/c/vajhbYKDpPM Given that `-moz-toolbar-prefers-color-scheme` was removed in https://bugzilla.mozilla.org/show_bug.cgi?id=1736038, unless we fix this before the next PDF.js update in mozilla-central we'll thus break dark mode in the Firefox built-in PDF Viewer.	2021-10-17 12:29:25 +02:00
Michael Wu	c08b4ea30d	Fix Viewer API definitions and include in CI The Viewer API definitions do not compile because of missing imports and anonymous objects are typed as `Object`. These issues were not caught during CI because the test project was not compiling anything from the Viewer API. As an example of the first problem: ``` /** * @implements MyInterface / export class MyClass { ... } ``` will generate a broken definition that doesn’t import MyInterface: ``` /* * @implements MyInterface / export class MyClass implements MyInterface { ... } ``` This can be fixed by adding a typedef jsdoc to specify the import: ``` /* @typedef {import("./otherFile").MyInterface} MyInterface / ``` See https://github.com/jsdoc/jsdoc/issues/1537 and https://github.com/microsoft/TypeScript/issues/22160 for more details. As an example of the second problem: ``` /* * Gets the size of the specified page, converted from PDF units to inches. * @param {Object} An Object containing the properties: {Array} `view`, * {number} `userUnit`, and {number} `rotate`. / function getPageSizeInches({ view, userUnit, rotate }) { ... } ``` generates the broken definition: ``` function getPageSizeInches({ view, userUnit, rotate }: Object) { ... } ``` The jsdoc should specify the type of each nested property: ``` /* * Gets the size of the specified page, converted from PDF units to inches. * @param {Object} options An object containing the properties: {Array} `view`, * {number} `userUnit`, and {number} `rotate`. * @param {number[]} options.view * @param {number} options.userUnit * @param {number} options.rotate */ ```	2021-08-25 18:45:46 -04:00
Michael Wu	acfb54a836	Fix pdf_viewer definitions Current pdf_viewer definitions result in errors like the following when trying to use them in a ts project: [error] TypeScript error node_modules/.pnpm/pdfjs-dist@2.10.377/node_modules/pdfjs-dist/web/pdf_viewer.d.ts:1:15 - error TS2691: An import path cannot end with a '.d.ts' extension. Consider importing 'pdfjs-dist/types/web/pdf_viewer.component.js' instead. 1 export * from "pdfjs-dist/types/web/pdf_viewer.component.d.ts"; Import/export statements in typescript should not include file extensions.	2021-08-20 12:23:43 -04:00

1 2 3 4 5 ...

261 Commits