Go to file

Jonas Jenwald 9f02cc36d4 Attempt to further reduce re-parsing for globally cached images (PR 11912, 16108 follow-up)

In PR 11912 we started caching images that occur on multiple pages globally, which improved performance a lot in many PDF documents.
However, one slightly annoying limitation of the implementation is the need to re-parse the image once the global-caching threshold has been reached. Previously this was difficult to avoid, since large image-resources will cause cleanup to run on the main-thread after rendering has finished. In PR 16108 we started delaying this cleanup a little bit, to improve performance if a user e.g. zooms and/or rotates the document immediately after rendering completes.

Taking those two PRs together, we now have a situation where it's much more likely that the main-thread has "globally used" images cached at the page-level. Hence we can instead attempt to *copy* a locally cached image into the global object-cache on the main-thread and thus reduce unnecessary re-parsing of large/complex global images, which significantly reduces the rendering time in many cases.

For the PDF document in issue 11878, the rendering time of *the second page* changes as follows (on my computer):
- With the `master`-branch it takes >600 ms to render.
- With this patch that goes down to ~50 ms, which is one order of magnitude faster.

(Note that all other pages are, as expected, completely unaffected by these changes.)

This new main-thread copying is limited to "large" global images, since:
- Re-parsing of small images, on the worker-thread, is usually fast enough to not be an issue.
- With the delayed cleanup after rendering, it's still not guaranteed that an image is available in a page-level cache on the main-thread.
- This forces the worker-thread to wait for the main-thread, which is a pattern that you always want to avoid unless absolutely necessary.

2023-12-21 21:26:21 +01:00

.github

Revert "Bump actions/upload-artifact from 3 to 4"

2023-12-18 15:01:19 +01:00

docs

Update the "Interactive examples" links (PR 17055 follow-up)

2023-10-10 09:41:01 +02:00

examples

Fix examples/webpack/README.md. The .mjs extension is necessary. Close #17319

2023-11-23 09:25:20 +09:00

extensions

[Editor] Add a color picker with predefined colors for highlighting text (bug 1866434)

2023-12-05 23:27:22 +01:00

external

Rename *.d.ts to *.d.mts. Close #17241

2023-11-12 07:30:36 +09:00

l10n

[Editor] Add some missing strings to localize for highlighting

2023-12-12 19:57:38 +01:00

src

Attempt to further reduce re-parsing for globally cached images (PR 11912, 16108 follow-up)

2023-12-21 21:26:21 +01:00

test

Attempt to further reduce re-parsing for globally cached images (PR 11912, 16108 follow-up)

2023-12-21 21:26:21 +01:00

web

Toggle the visibility of the outlineOptionsContainer, in the sidebar, using only CSS

2023-12-19 10:01:16 +01:00

.editorconfig

Add the .mjs file-extension to the EditorConfig

2023-08-23 11:22:25 +02:00

.eslintignore

Remove obsolete entries in the lint-ignore files

2023-10-25 13:38:51 +02:00

.eslintrc

[api-minor] Move to Fluent for the localization (bug 1858715)

2023-10-19 11:20:41 +02:00

.gitattributes

[api-minor] Move to Fluent for the localization (bug 1858715)

2023-10-19 11:20:41 +02:00

.gitignore

Include package-lock.json for reproducible builds

2018-06-02 20:29:47 +02:00

.gitpod.Dockerfile

Simplifies code contributions by automating the dev setup with gitpod.io

2019-11-06 04:12:19 +00:00

.gitpod.yml

Simplifies code contributions by automating the dev setup with gitpod.io

2019-11-06 04:12:19 +00:00

.mailmap

Add mgol's name to AUTHORS, add .mailmap

2017-11-22 10:46:11 +01:00

.prettierrc

Update Prettier to version 2.0

2020-04-14 12:28:14 +02:00

.stylelintignore

Remove obsolete entries in the lint-ignore files

2023-10-25 13:38:51 +02:00

.stylelintrc

Enable some Stylelint color-related rules to slightly reduce file sizes

2023-10-05 17:51:21 +02:00

AUTHORS

Add SehyunPark to AUTHORS

2017-11-29 22:24:08 +09:00

CODE_OF_CONDUCT.md

Add Mozilla Code of Conduct file

2019-03-27 21:00:01 -07:00

EXPORT

Adds ECCN response statement

2017-10-23 13:31:36 -05:00

gulpfile.mjs

[Editor] Add a color picker with predefined colors for highlighting text (bug 1866434)

2023-12-05 23:27:22 +01:00

LICENSE

cleaned whitespace

2015-02-17 11:07:37 -05:00

package-lock.json

Update Puppeteer to version 21.6.0 and force "CDP" protocol

2023-12-08 12:27:44 +01:00

package.json

Update Puppeteer to version 21.6.0 and force "CDP" protocol

2023-12-08 12:27:44 +01:00

pdfjs.config

Bump the stable version in pdfjs.config

2023-11-26 13:46:23 +01:00

README.md

Tweak the README slightly

2023-07-04 11:32:25 +02:00

tsconfig.json

[api-minor] Re-factor NullL10n and remove the hard-coded l10n strings (PR 17115 follow-up)

2023-10-20 21:49:33 +02:00

README.md

PDF.js

PDF.js is a Portable Document Format (PDF) viewer that is built with HTML5.

PDF.js is community-driven and supported by Mozilla. Our goal is to create a general-purpose, web standards-based platform for parsing and rendering PDFs.

Contributing

PDF.js is an open source project and always looking for more contributors. To get involved, visit:

Feel free to stop by our Matrix room for questions or guidance.

Getting Started

Online demo

Please note that the "Modern browsers" version assumes native support for the latest JavaScript features; please also see this wiki page.

Modern browsers: https://mozilla.github.io/pdf.js/web/viewer.html
Older browsers: https://mozilla.github.io/pdf.js/legacy/web/viewer.html

Browser Extensions

Firefox

PDF.js is built into version 19+ of Firefox.

Chrome

The official extension for Chrome can be installed from the Chrome Web Store. This extension is maintained by @Rob--W.
Build Your Own - Get the code as explained below and issue gulp chromium. Then open Chrome, go to Tools > Extension and load the (unpackaged) extension from the directory build/chromium.

Getting the Code

To get a local copy of the current code, clone it using git:

$ git clone https://github.com/mozilla/pdf.js.git
$ cd pdf.js

Next, install Node.js via the official package or via nvm. You need to install the gulp package globally (see also gulp's getting started):

$ npm install -g gulp-cli

If everything worked out, install all dependencies for PDF.js:

$ npm install

Finally, you need to start a local web server as some browsers do not allow opening PDF files using a file:// URL. Run:

$ gulp server

and then you can open:

http://localhost:8888/web/viewer.html

Please keep in mind that this assumes the latest version of Mozilla Firefox; refer to Building PDF.js for non-development usage of the PDF.js library.

It is also possible to view all test PDF files on the right side by opening:

http://localhost:8888/test/pdfs/?frame

Building PDF.js

In order to bundle all src/ files into two production scripts and build the generic viewer, run:

$ gulp generic

If you need to support older browsers, run:

$ gulp generic-legacy

This will generate pdf.js and pdf.worker.js in the build/generic/build/ directory (respectively build/generic-legacy/build/). Both scripts are needed but only pdf.js needs to be included since pdf.worker.js will be loaded by pdf.js. The PDF.js files are large and should be minified for production.

Using PDF.js in a web application

To use PDF.js in a web application you can choose to use a pre-built version of the library or to build it from source. We supply pre-built versions for usage with NPM and Bower under the pdfjs-dist name. For more information and examples please refer to the wiki page on this subject.

Including via a CDN

PDF.js is hosted on several free CDNs:

Learning

You can play with the PDF.js API directly from your browser using the live demos below:

Interactive examples

More examples can be found in the examples folder. Some of them are using the pdfjs-dist package, which can be built and installed in this repo directory via gulp dist-install command.

For an introduction to the PDF.js code, check out the presentation by our contributor Julian Viereck:

https://www.youtube.com/watch?v=Iv15UY-4Fg8

More learning resources can be found at:

https://github.com/mozilla/pdf.js/wiki/Additional-Learning-Resources

The API documentation can be found at:

https://mozilla.github.io/pdf.js/api/

Questions

Check out our FAQs and get answers to common questions:

https://github.com/mozilla/pdf.js/wiki/Frequently-Asked-Questions

Talk to us on Matrix:

https://chat.mozilla.org/#/room/#pdfjs:mozilla.org

File an issue:

https://github.com/mozilla/pdf.js/issues/new