Go to file

Yury Delendik 66e0dd1b06 Use streams for OperatorList chunking (issue 10023)

*Please note:* The majority of this patch was written by Yury, and it's simply been rebased and slightly extended to prevent issues when dealing with `RenderingCancelledException`.

By leveraging streams this (finally) provides a simple way in which parsing can be aborted on the worker-thread, which will ultimately help save resources.
With this patch worker-thread parsing will *only* be aborted when the document is destroyed, and not when rendering is cancelled. There's a couple of reasons for this:

- The API currently expects the *entire* OperatorList to be extracted, or an Error to occur, once it's been started. Hence additional re-factoring/re-writing of the API code will be necessary to properly support cancelling and re-starting of OperatorList parsing in cases where the `lastChunk` hasn't yet been seen.
- Even with the above addressed, immediately cancelling when encountering a `RenderingCancelledException` will lead to worse performance in e.g. the default viewer. When zooming and/or rotation of the document occurs it's very likely that `cancel` will be (almost) immediately followed by a new `render` call. In that case you'd obviously *not* want to abort parsing on the worker-thread, since then you'd risk throwing away a partially parsed Page and thus be forced to re-parse it again which will regress perceived performance.
- This patch is already *somewhat* risky, given that it touches fundamentally important/critical code, and trying to keep it somewhat small should hopefully reduce the risk of regressions (and simplify reviewing as well).

Time permitting, once this has landed and been in Nightly for awhile, I'll try to work on the remaining points outlined above.

Co-Authored-By: Yury Delendik <ydelendik@mozilla.com>
Co-Authored-By: Jonas Jenwald <jonas.jenwald@gmail.com>

2019-08-24 15:56:40 +02:00

.github

Attempt to clarify the l10n section of CONTRIBUTING.md

2019-04-10 11:33:25 +02:00

docs

Switch to HTTPS for the license link on the website

2019-01-05 15:35:17 +01:00

examples

added in information about pdfjs/webpack

2019-08-20 10:20:32 -04:00

extensions

[CRX] Preserve referrer in Chrome 72+

2019-05-29 11:28:38 +02:00

external

Enable the consistent-return ESLint rule

2019-05-11 14:27:21 +02:00

l10n

Update translations

2019-06-29 12:33:23 +02:00

src

Use streams for OperatorList chunking (issue 10023)

2019-08-24 15:56:40 +02:00

test

Use streams for OperatorList chunking (issue 10023)

2019-08-24 15:56:40 +02:00

web

Use shorthand method signature for afterPrint in web/app.js

2019-08-24 18:26:25 +08:00

.editorconfig

Uses editorconfig to maintain consistent coding styles

2015-11-14 07:32:18 +05:30

.eslintignore

Turn on ESLint in examples directory, apply examples-specific exceptions

2018-12-11 15:23:26 +01:00

.eslintrc

Enable the eslint-plugin-no-unsanitized ESLint plugin to disallow unsafe usage of e.g. innerHTML

2019-06-23 13:50:30 +02:00

.gitattributes

Fixing C++,PHP and Pascal presence in the repo

2015-10-29 13:03:51 -05:00

.gitignore

Include package-lock.json for reproducible builds

2018-06-02 20:29:47 +02:00

.gitmodules

Update fonttools location and version (issue 6223)

2015-07-17 12:51:09 +02:00

.mailmap

Add mgol's name to AUTHORS, add .mailmap

2017-11-22 10:46:11 +01:00

.travis.yml

Upgrade to Gulp 4

2018-12-17 16:20:13 +01:00

AUTHORS

Add SehyunPark to AUTHORS

2017-11-29 22:24:08 +09:00

CODE_OF_CONDUCT.md

Add Mozilla Code of Conduct file

2019-03-27 21:00:01 -07:00

EXPORT

Adds ECCN response statement

2017-10-23 13:31:36 -05:00

gulpfile.js

Restore the header size limit of 80 KB

2019-06-29 13:23:43 +02:00

LICENSE

cleaned whitespace

2015-02-17 11:07:37 -05:00

package-lock.json

Bump js-yaml from 3.12.0 to 3.13.1

2019-07-19 00:04:03 +00:00

package.json

Update packages

2019-06-29 12:35:45 +02:00

pdfjs.config

Bump versions in pdfjs.config

2019-07-10 22:25:24 +02:00

README.md

Add links to PDF.js homepage and API reference in README.md

2019-04-17 23:37:37 +02:00

systemjs.config.js

Provide custom messages for the no-restricted-globals ESLint rule, and refactor the .eslintrc files (PR 9868 follow-up)

2018-07-23 14:10:13 +02:00

README.md

PDF.js

PDF.js is a Portable Document Format (PDF) viewer that is built with HTML5.

PDF.js is community-driven and supported by Mozilla Labs. Our goal is to create a general-purpose, web standards-based platform for parsing and rendering PDFs.

Contributing

PDF.js is an open source project and always looking for more contributors. To get involved, visit:

Feel free to stop by #pdfjs on irc.mozilla.org for questions or guidance.

Getting Started

Online demo

https://mozilla.github.io/pdf.js/web/viewer.html

Browser Extensions

Firefox

PDF.js is built into version 19+ of Firefox.

Chrome

The official extension for Chrome can be installed from the Chrome Web Store. This extension is maintained by @Rob--W.
Build Your Own - Get the code as explained below and issue gulp chromium. Then open Chrome, go to Tools > Extension and load the (unpackaged) extension from the directory build/chromium.

Getting the Code

To get a local copy of the current code, clone it using git:

$ git clone https://github.com/mozilla/pdf.js.git
$ cd pdf.js

Next, install Node.js via the official package or via nvm. You need to install the gulp package globally (see also gulp's getting started):

$ npm install -g gulp-cli

If everything worked out, install all dependencies for PDF.js:

$ npm install

Finally, you need to start a local web server as some browsers do not allow opening PDF files using a file:// URL. Run:

$ gulp server

and then you can open:

http://localhost:8888/web/viewer.html

Please keep in mind that this requires an ES6 compatible browser; refer to Building PDF.js for usage with older browsers.

It is also possible to view all test PDF files on the right side by opening:

http://localhost:8888/test/pdfs/?frame

Building PDF.js

In order to bundle all src/ files into two production scripts and build the generic viewer, run:

$ gulp generic

This will generate pdf.js and pdf.worker.js in the build/generic/build/ directory. Both scripts are needed but only pdf.js needs to be included since pdf.worker.js will be loaded by pdf.js. The PDF.js files are large and should be minified for production.

Using PDF.js in a web application

To use PDF.js in a web application you can choose to use a pre-built version of the library or to build it from source. We supply pre-built versions for usage with NPM and Bower under the pdfjs-dist name. For more information and examples please refer to the wiki page on this subject.

Including via a CDN

PDF.js is hosted on several free CDNs:

Learning

You can play with the PDF.js API directly from your browser using the live demos below:

Interactive examples

More examples can be found in the examples folder. Some of them are using the pdfjs-dist package, which can be built and installed in this repo directory via gulp dist-install command.

For an introduction to the PDF.js code, check out the presentation by our contributor Julian Viereck:

https://www.youtube.com/watch?v=Iv15UY-4Fg8

More learning resources can be found at:

https://github.com/mozilla/pdf.js/wiki/Additional-Learning-Resources

The API documentation can be found at:

https://mozilla.github.io/pdf.js/api/

Questions

Check out our FAQs and get answers to common questions:

https://github.com/mozilla/pdf.js/wiki/Frequently-Asked-Questions

Talk to us on IRC (Internet Relay Chat):

#pdfjs on irc.mozilla.org

File an issue:

https://github.com/mozilla/pdf.js/issues/new

https://twitter.com/pdfjs