pdf.js

Author	SHA1	Message	Date
Jonas Jenwald	90b2664622	Add better validation for the "PREFERENCE" kind `AppOptions` Given that the "PREFERENCE" kind is used e.g. to generate the preference-list for the Firefox PDF Viewer, those options need to be carefully validated. With this patch we'll now check this unconditionally in development mode, during testing, and when creating the preferences in the gulpfile.	2024-02-20 18:38:15 +01:00
Calixte Denizet	2133da166e	When updating, write the xref table in the same format as the previous one (bug 1878916) The specs are unclear about which xref table format must be used. After checking the validity of some PDFs in Acrobat's preflight tool, we can guess that keeping the same format is the correct approach. The PDF in the mentioned bug, after having been changed, wasn't correctly displayed in either Chrome or Acrobat: it's now fixed.	2024-02-13 14:14:37 +01:00
Jonas Jenwald	37e98e39f6	Skip any whitespace after the first object in linearized PDFs (issue 17665) This way the code is now consistent with the non-linearized branch in the `PDFDocument.startXRef` getter.	2024-02-12 22:05:36 +01:00
Jonas Jenwald	19ef3e367b	Tweak the issue 11878 unit-test parsing time check (PR 17428 follow-up) This unit-test has been failing occasionally in Chrome and Node.js, hence we tweak the parsing time check to reduce the likelihood of that happening.	2024-02-12 12:31:55 +01:00
Jonas Jenwald	5732faee1e	Prevent duplicate names in unit/integration tests Having identical names for different test-cases may result in less helpful output, which we can avoid with the use of the ESLint Jasmine plugin. This patch enables the rules at the `branch` level, to limit the amount/scope of the changes slightly. (We could thus make this rule more strict in the future, if that's deemed useful.) Please refer to: - https://github.com/tlvince/eslint-plugin-jasmine/blob/master/docs/rules/no-spec-dupes.md - https://github.com/tlvince/eslint-plugin-jasmine/blob/master/docs/rules/no-suite-dupes.md	2024-02-11 11:45:09 +01:00
Jonas Jenwald	6da9448f6c	Remove the `web-com` import map (PR 17588 follow-up) With the changes in PR 17588 we're already importing the relevant code via the `web/app.js` file.	2024-02-07 16:33:27 +01:00
Jonas Jenwald	97c2ce9da0	Ensure that `GenericL10n` works if the locale files cannot be loaded - Ensure that localization works in the GENERIC viewer, even if the necessary locale files cannot be loaded. This was the behaviour prior to the introduction of Fluent, and it seems worthwhile to keep that (especially since we already bundle the en-US strings anyway). - Let the `GenericL10n`-implementation use the bundled en-US strings directly when no language is provided. - Remove the `NullL10n`-implementation, and simply fallback to `GenericL10n`, to reduce the maintenance burden of viewer-components localization. - Indirectly, given the previous point, stop exporting `NullL10n` in the viewer-components since it's now removed. Note that it was never really intended to be used directly and only existed as a fallback. Please note: This doesn't affect the Firefox PDF Viewer, thanks to the use of import maps.	2024-01-31 14:07:11 +01:00
Jonas Jenwald	384291234d	Re-enable the `should compress and save text` unit-test (issue 17399) This unit-test is now failing in up-to-date versions of both Node.js and Chromium-based browsers, since `CompressionStream` no longer produces consistent data across all environments/browsers. However, logging the compressed TypedArray produced by `writeStream` in Firefox and in Chrome, and then feeding both of those TypedArrays as input to `DecompressionStream`, produced the same (correct) result in both browsers. Hence the exact output of `CompressionStream` shouldn't matter, as long as we're able to successfully decompress it when the resulting PDF document is opened with the PDF.js library, and the unit-test is thus extended to check this.	2024-01-28 14:31:07 +01:00
Tim van der Meij	94309edc9a	Disable the "should compress and save text" unit test in Chrome too Starting with Chrome 120.0.6099.109 (shipped with Puppeteer 21.8.0+) the unit test fails in Chrome as well. The issue is tracked in #17399, but for now we'll only run the unit test in Firefox so we can continue to update Puppeteer while also still having a browser in which it runs, until we figure out why the behavior of `CompressionStream` changed.	2024-01-27 20:34:30 +01:00
Jonas Jenwald	5dd25b6e80	Re-factor `DefaultExternalServices` into a regular class, without static methods The `DefaultExternalServices` code, which is used to provide build-specific functionality, is very old. This results in a pattern where we first initialize `PDFViewerApplication.externalServices` and then override it for the different builds. By converting `DefaultExternalServices` into a "regular" class, and leveraging import maps, we can directly initialize the correct instance depending on the build.	2024-01-27 12:07:15 +01:00
Jonas Jenwald	d1080e785a	Remove the `createPreferences` method from `DefaultExternalServices` Given the simplicity of the `createPreferences` method, we can leverage import maps to directly initialize the correct `Preferences`-instance depending on the build.	2024-01-27 11:38:42 +01:00
Jonas Jenwald	1698991ae2	Remove the `createDownloadManager` method from `DefaultExternalServices` Given the simplicity of the `createDownloadManager` method, we can leverage import maps to directly initialize the correct `DownloadManager`-instance depending on the build.	2024-01-27 11:38:36 +01:00
Calixte Denizet	7f2428a77e	Reduce memory use and improve perfs when computing the bounding box of a bezier curve (bug 1875547) It isn't really a fix for the mentioned bug but it slightly improves things. By reducing the memory use, the time spent in the GC is reduced as well. The algorithm to compute the bounding box is the same as before; it has just been rewritten to be more efficient.	2024-01-24 23:41:14 +01:00
Jonas Jenwald	f9a384d711	Enable the `arrow-body-style` ESLint rule This manually ignores some cases where the resulting auto-formatting would not, as far as I'm concerned, constitute a readability improvement or where we'd just end up with more overall indentation. Please see https://eslint.org/docs/latest/rules/arrow-body-style	2024-01-21 16:20:55 +01:00
Jonas Jenwald	9dfe9c552c	Use shorter arrow functions where possible For arrow functions that are both simple and short, we can avoid using explicit `return` to shorten them even further without hurting readability. For the `gulp mozcentral` build-target this reduces the overall size of the output by just under 1 kilo-byte (which isn't a lot but still can't hurt).	2024-01-21 10:13:12 +01:00
Calixte Denizet	5732c0c54a	Use the original value of a field when propagating event (fixes #17540) Also make sure that a field is still formatted when its value is 0.	2024-01-19 22:13:51 +01:00
Jonas Jenwald	f8e3c79cb5	Merge pull request #17537 from Snuffleupagus/rm-isArrayBuffer Remove the `isArrayBuffer` helper function	2024-01-19 15:37:02 +01:00
Jonas Jenwald	b37536c38c	Remove the `isArrayBuffer` helper function This old helper function can now be replaced with `ArrayBuffer.isView()` and/or `instanceof ArrayBuffer` checks, as needed depending on the situation.	2024-01-19 14:10:52 +01:00
Calixte Denizet	d64f334f98	[Editor] Add support for printing/saving free highlight annotations	2024-01-19 12:58:46 +01:00
calixteman	bf6cc83e7a	Merge pull request #17525 from calixteman/prettier_json Format json files in using prettier	2024-01-16 20:16:56 +01:00
Calixte Denizet	29de9bdce6	Format json files in using prettier	2024-01-16 19:40:25 +01:00
Calixte Denizet	10389c5017	Add the font Linux Libertine as a possible substitution for Times New Roman and try to load the font family (guessed from the font name) before trying the local substitution. The local(...) command expects a real font name and not a predefined substitution, which is why we try the font family.	2024-01-16 12:31:23 +01:00
Calixte Denizet	405f573d70	Take into account empty lines when extracting text content from the appearance Fixes #17492.	2024-01-14 20:23:29 +01:00
Jonas Jenwald	9f02cc36d4	Attempt to further reduce re-parsing for globally cached images (PR 11912, 16108 follow-up) In PR 11912 we started caching images that occur on multiple pages globally, which improved performance a lot in many PDF documents. However, one slightly annoying limitation of the implementation is the need to re-parse the image once the global-caching threshold has been reached. Previously this was difficult to avoid, since large image-resources will cause cleanup to run on the main-thread after rendering has finished. In PR 16108 we started delaying this cleanup a little bit, to improve performance if a user e.g. zooms and/or rotates the document immediately after rendering completes. Taking those two PRs together, we now have a situation where it's much more likely that the main-thread has "globally used" images cached at the page-level. Hence we can instead attempt to copy a locally cached image into the global object-cache on the main-thread and thus reduce unnecessary re-parsing of large/complex global images, which significantly reduces the rendering time in many cases. For the PDF document in issue 11878, the rendering time of the second page changes as follows (on my computer): - With the `master`-branch it takes >600 ms to render. - With this patch that goes down to ~50 ms, which is one order of magnitude faster. (Note that all other pages are, as expected, completely unaffected by these changes.) This new main-thread copying is limited to "large" global images, since: - Re-parsing of small images, on the worker-thread, is usually fast enough to not be an issue. - With the delayed cleanup after rendering, it's still not guaranteed that an image is available in a page-level cache on the main-thread. - This forces the worker-thread to wait for the main-thread, which is a pattern that you always want to avoid unless absolutely necessary.	2023-12-21 21:26:21 +01:00
Jonas Jenwald	988d3a188f	Merge pull request #17395 from Snuffleupagus/pypdf-2332 Support Annotations with corrupt /BS-entries	2023-12-09 14:18:29 +01:00
Jonas Jenwald	a1d859c082	Disable the "should compress and save text" unit-test in Node.js (PR 17202 follow-up) It seems this unit-test now fails consistently in "all" up-to-date Node.js versions. We should probably try and understand why, but for now just disable it to get passing CI tests.	2023-12-09 14:13:11 +01:00
Tim van der Meij	c908f2d55c	Merge pull request #17372 from Snuffleupagus/fuzzing-VerbosityLevel-ERRORS Limit the amount of console "spam" during fuzz tests (PR 17337 follow-up)	2023-12-09 13:57:23 +01:00
calixteman	8702e1bbb2	Merge pull request #17359 from calixteman/editor_highlight_color_picker [Editor] Add a color picker with predefined colors for highlighting text (bug 1866434)	2023-12-06 11:06:55 +01:00
Calixte Denizet	098cc16c46	Set text field value as a string when it's for a date or a time (bug 1868503)	2023-12-06 09:44:30 +01:00
Calixte Denizet	ff23d37fa2	[Editor] Add a color picker with predefined colors for highlighting text (bug 1866434) The doorhanger for highlighting has a basic color picker composed of 5 predefined colors to set the default color to use. These colors can be changed thanks to a preference for now but it's something which could be changed in the Firefox settings in the future. Each highlight has in its own toolbar a color picker to just change its color. The different color pickers are so similar (modulo a few differences in their styles) that this patch introduces a new class ColorPicker which provides a color picker component which could be reused in future editors. All in all, a large part of this patch is dedicated to the color picker itself and its style, and the rest is almost a matter of wiring the component.	2023-12-05 23:27:22 +01:00
Jonas Jenwald	d7bec1b527	Limit the amount of console "spam" during fuzz tests (PR 17337 follow-up) Having just tested PR 17337 locally I noticed that especially the `JpxImage`-test causes a "ridiculous" amount of warning messages to be printed, which doesn't seem helpful. Given that only actual `Error`s should be relevant here, we can easily disable this logging during the tests.	2023-12-04 16:39:45 +01:00
Jonas Jenwald	fe3bc575de	Disable the "should compress and save text" unit-test in additional Node.js versions (PR 17202 follow-up) It seems this unit-test started failing in Node.js version 20.10 as well. We should probably try and understand why, but for now just disable it to get passing CI tests.	2023-11-30 20:47:15 +01:00
Calixte Denizet	eb5f610d18	Remove language codes from text strings. And take care to have an even number of bytes with utf16 strings.	2023-11-25 15:09:31 +01:00
Calixte Denizet	f8f4432961	[Editor] Add support for saving/printing a newly added Highlight annotation (bug 1865708)	2023-11-22 10:41:55 +01:00
Calixte Denizet	31d9b9f574	[Editor] Add a way to extract the outlines of a union of rectangles The goal is to be able to get these outlines to fill the shape corresponding to a text selection in order to highlight some text contents. The outlines will also be used to show selected/hovered highlights.	2023-11-20 18:45:19 +01:00
Jonas Jenwald	709d89420e	Re-factor how the `GenericL10n` class fetches localization-data - Re-factor the existing `fetchData` helper function such that it can fetch more types of data, and it now supports "arraybuffer", "json", and "text". This only needed minor adjustments in the `DOMCMapReaderFactory` and `DOMStandardFontDataFactory` classes.[1] - Expose the `fetchData` helper function in the API, such that the viewer is able to access it. - Use the `fetchData` helper function in the `GenericL10n` class, since this should allow fetching of localization-data even if the default viewer is run in an environment without support for the Fetch API. --- [1] While testing this I also noticed a minor inconsistency when handling standard font-data on the worker-thread.	2023-11-14 13:45:14 +01:00
Tim van der Meij	71a6c749d0	Merge pull request #17202 from Snuffleupagus/node-ci-latest Also test the latest Node.js version in GitHub Actions	2023-11-04 12:45:03 +01:00
Jonas Jenwald	99522c3201	Also test the latest Node.js version in GitHub Actions Hopefully this will allow us to catch bugs in new Node.js versions earlier, rather than having to wait for bug reports. Given that `CompressionStream` is (currently) only potentially used when saving a modified PDF document, which is unlikely to be a common use-case in Node.js environments, let's just disable the affected unit-test for now.	2023-11-02 16:58:03 +01:00
Jonas Jenwald	155a302e74	Use even more optional chaining in the code-base	2023-11-02 16:47:33 +01:00
Calixte Denizet	133ed96f8f	Don't take into account the INVISIBLE flag for well-known annotations	2023-10-25 10:16:14 +02:00
Jonas Jenwald	e2af77fd6c	Add a unit-test to ensure that `NullL10n` won't diverge from the `L10n`-class To prevent the standalone viewer-components from breaking, we need to ensure that the `NullL10n`-interface won't accidentally diverge from the actual `L10n`-implementations.	2023-10-24 13:13:14 +02:00
Jonas Jenwald	f07675a6a8	[api-minor] Re-factor `NullL10n` and remove the hard-coded l10n strings (PR 17115 follow-up) Please note: These changes only affect the GENERIC build, since `NullL10n` is only a stub elsewhere (see PR 17135). After the changes in PR 17115, which modernized and improved l10n-handling, the `NullL10n`-implementation is no longer a good fallback for the "proper" `L10n`-classes. To improve this situation, especially for the standalone viewer-components, this patch makes the following changes: - Let the `NullL10n`-implementation extend an actual `L10n`-class, which is constant and lazily initialized, to ensure that it works exactly like the "proper" ones. - Automatically bundle the "en-US" l10n-strings in the build, via the pre-processor, such that we don't need to remember to manually update them. - Ensure that the standalone viewer-components register their DOM-elements for translation, similar to the default viewer, since this will allow future code improvements by using "data-l10n-id"/"data-l10n-args" in most (if not all) parts of the viewer. - Remove the `NullL10n` from the `AnnotationLayer`, to avoid affecting bundle size too much. For third-party users that access the `AnnotationLayer`, as exposed in the main PDF.js library, they'll now need to manually register it for translation. (However, the standalone viewer-components still work given the point above.)	2023-10-20 21:49:33 +02:00
Jonas Jenwald	69ad0d9861	Only bundle `NullL10n` in GENERIC builds (bug 1859818)	2023-10-19 13:51:00 +02:00
calixteman	5d8be99782	Merge pull request #17115 from calixteman/mv_to_fluent [api-minor] Move to Fluent for the localization (bug 1858715)	2023-10-19 13:40:50 +02:00
Calixte Denizet	66982a2a11	[api-minor] Move to Fluent for the localization (bug 1858715) - For the generic viewer we use @fluent/dom and @fluent/bundle - For the builtin pdf viewer in Firefox, we set a localization url and then we rely on document.l10n which is a DOMLocalization object.	2023-10-19 11:20:41 +02:00
Jonas Jenwald	674052d3fc	Re-factor the blob-URL caching in `DownloadManager.openOrDownloadData` Cache blob-URLs on the actual data, rather than DOM elements, to reduce potential duplicates (note the updated unit-test).	2023-10-17 10:18:34 +02:00
Jonas Jenwald	d5acbbccd3	Update the ESLint `globals` list (PR 17055 follow-up) Given that we only use standard `import`/`export` statements now, after recent PRs, the "exports" global is unused. Instead we add "__non_webpack_import__" to the `globals` to avoid having to sprinkle disable statements throughout the code. Finally, the way that `globals` are defined has changed in ESLint and we should thus explicitly specify them as "readonly"; please find additional details at https://eslint.org/docs/latest/use/configure/language-options#specifying-globals	2023-10-15 11:38:10 +02:00
Jonas Jenwald	5e986cb514	Use native `import maps` in development mode This patch seems to work fine locally now, and `mozregression` points to it being fixed in bug https://bugzilla.mozilla.org/show_bug.cgi?id=1803984 which landed in Firefox 116. By using the native `import maps` functionality, we can remove a development dependency. Please find the specification at https://wicg.github.io/import-maps/	2023-10-13 20:35:34 +02:00
Jonas Jenwald	38245500fd	Output JavaScript modules for the `LIB` build-target (PR 17055 follow-up) This finally allows us to mark the entire PDF.js library as a "module", which should thus conclude the (multi-year) effort to re-factor and improve how we import files/resources in the code-base. This also means that the `gulp ci-test` target, which is what's run in GitHub Actions, now uses JavaScript modules since that's supported in modern Node.js versions.	2023-10-13 18:54:33 +02:00
Jonas Jenwald	927e50f5d4	[api-major] Output JavaScript modules in the builds (issue 10317) At this point in time all browsers, and also Node.js, support standard `import`/`export` statements and we can now finally consider outputting modern JavaScript modules in the builds.[1] In order for this to work we can only use proper `import`/`export` statements throughout the main code-base, and (as expected) our Node.js support made this much more complicated since both the official builds and the GitHub Actions-based tests must keep working.[2] One remaining issue is that the `pdf.scripting.js` file cannot be built as a JavaScript module, since doing so breaks PDF scripting. Note that my initial goal was to try and split these changes into a couple of commits, however that unfortunately didn't really work since it turned out to be difficult for smaller patches to work correctly and pass (all) tests that way.[3] This is a classic case of every change requiring a couple of other changes, with each of those changes requiring further changes in turn and the size/scope quickly increasing as a result. One possible "issue" with these changes is that we'll now only output JavaScript modules in the builds, which could perhaps be a problem with older tools. However it unfortunately seems far too complicated/time-consuming for us to attempt to support both the old and modern module formats, hence the alternative would be to do "nothing" here and just keep our "old" builds.[4] --- [1] The final blocker was module support in workers in Firefox, which was implemented in Firefox 114; please see https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Statements/import#browser_compatibility [2] It's probably possible to further improve/simplify especially the Node.js-specific code, but it does appear to work as-is. [3] Having partially "broken" patches, that fail tests, as part of the commit history is really not a good idea in general. [4] Outputting JavaScript modules was first requested almost five years ago, see issue 10317, and nowadays there should be much better support for JavaScript modules in various tools.	2023-10-07 09:31:08 +02:00
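
Commit b37536c38c in the log above says that the `isArrayBuffer` helper can be replaced with `ArrayBuffer.isView()` and/or `instanceof ArrayBuffer` checks. The snippet below is only a minimal sketch of what such checks might look like, not the actual PDF.js code; the function names `isBufferLike` and `toUint8Array` are hypothetical and chosen purely for illustration.

```javascript
// Hypothetical helpers for illustration only; these are NOT part of PDF.js.

// True for an ArrayBuffer itself or for any view onto one
// (TypedArray or DataView).
function isBufferLike(value) {
  return value instanceof ArrayBuffer || ArrayBuffer.isView(value);
}

// Normalize buffer-like input to a Uint8Array without copying the bytes.
function toUint8Array(value) {
  if (value instanceof Uint8Array) {
    return value;
  }
  if (value instanceof ArrayBuffer) {
    return new Uint8Array(value);
  }
  if (ArrayBuffer.isView(value)) {
    // Respect the view's offset and length within its underlying buffer.
    return new Uint8Array(value.buffer, value.byteOffset, value.byteLength);
  }
  throw new TypeError("Expected an ArrayBuffer or a view onto one.");
}
```

Which of the two checks fits depends on the call site: `instanceof ArrayBuffer` matches only the raw buffer, while `ArrayBuffer.isView()` matches TypedArrays and `DataView` but not the buffer itself.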

1186 Commits
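
Commit 7f2428a77e in the log above concerns computing the bounding box of a bezier curve. For readers unfamiliar with the problem, here is a textbook sketch for a cubic curve: find where the derivative vanishes on each axis and take the min/max over those points and the two endpoints. This is an assumed, generic formulation and not the PDF.js implementation; the names `cubicAt`, `extremaOnAxis`, and `bezierBoundingBox` are hypothetical.

```javascript
// Generic illustration only; NOT the PDF.js implementation.

// Evaluate one coordinate of a cubic bezier at parameter t.
function cubicAt(p0, p1, p2, p3, t) {
  const mt = 1 - t;
  return (
    mt * mt * mt * p0 +
    3 * mt * mt * t * p1 +
    3 * mt * t * t * p2 +
    t * t * t * p3
  );
}

// Parameters t in (0, 1) where the derivative of one coordinate is zero.
// The derivative (divided by 3) is the quadratic a*t^2 + b*t + c.
function extremaOnAxis(p0, p1, p2, p3) {
  const a = p3 - 3 * p2 + 3 * p1 - p0;
  const b = 2 * (p0 - 2 * p1 + p2);
  const c = p1 - p0;
  const EPS = 1e-12;
  const ts = [];
  if (Math.abs(a) < EPS) {
    // Degenerates to a linear equation b*t + c = 0.
    if (Math.abs(b) >= EPS) {
      ts.push(-c / b);
    }
  } else {
    const disc = b * b - 4 * a * c;
    if (disc >= 0) {
      const s = Math.sqrt(disc);
      ts.push((-b + s) / (2 * a), (-b - s) / (2 * a));
    }
  }
  return ts.filter(t => t > 0 && t < 1);
}

// Returns [minX, minY, maxX, maxY] for the curve with the given control points.
function bezierBoundingBox(x0, y0, x1, y1, x2, y2, x3, y3) {
  let minX = Math.min(x0, x3);
  let maxX = Math.max(x0, x3);
  let minY = Math.min(y0, y3);
  let maxY = Math.max(y0, y3);
  for (const t of extremaOnAxis(x0, x1, x2, x3)) {
    const x = cubicAt(x0, x1, x2, x3, t);
    minX = Math.min(minX, x);
    maxX = Math.max(maxX, x);
  }
  for (const t of extremaOnAxis(y0, y1, y2, y3)) {
    const y = cubicAt(y0, y1, y2, y3, t);
    minY = Math.min(minY, y);
    maxY = Math.max(maxY, y);
  }
  return [minX, minY, maxX, maxY];
}
```

A sketch like this allocates small temporary arrays for every curve; the commit message mentions reducing memory use, and avoiding such per-call allocations is one plausible way to do that, though the log does not say how the actual patch achieves it.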