Given that the different types of `Stream`s will never be cached, the `XRef.cache` Array will *always* be more-or-less sparse.
Generally speaking, the longer the document, the more sparse the `XRef.cache` becomes. For example, in the `pdf.pdf` file from the test-suite, the `XRef.cache` Array grows to a few hundred thousand elements, with approximately 95% of them empty.
Hence an Array clearly isn't the best data-structure for this kind of cache, and this patch thus changes it to a Map instead.
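As a rough illustration of the difference, consider the following sketch; `XRefCache` and its methods are illustrative stand-ins, not the actual pdf.js code (the real logic lives on `XRef` itself):
```
// Hypothetical stand-in for the cache described above.
class XRefCache {
  constructor() {
    // A Map keyed by object number avoids allocating a huge, mostly-empty
    // Array when object numbers are large and only few objects get cached.
    this._cache = new Map();
  }
  has(num) {
    return this._cache.has(num);
  }
  get(num) {
    return this._cache.get(num);
  }
  set(num, obj) {
    this._cache.set(num, obj);
  }
}

// Object numbers in a long document can run into the hundreds of thousands,
// but only a fraction of them are ever cached (and `Stream`s never are):
const cache = new XRefCache();
cache.set(250000, { name: "SomeDict" });
console.log(cache.has(1)); // false -- no empty slots were allocated
```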
This patch-series was tested using the PDF file from issue 2618, i.e. http://bugzilla-attachments.gnome.org/attachment.cgi?id=226471, with the following manifest file:
```
[
{ "id": "issue2618",
"file": "../web/pdfs/issue2618.pdf",
"md5": "",
"rounds": 200,
"type": "eq"
}
]
```
which gave the following results when comparing this patch-series against the `master` branch:
```
-- Grouped By browser, stat --
browser | stat | Count | Baseline(ms) | Current(ms) | +/- | % | Result(P<.05)
------- | ------------ | ----- | ------------ | ----------- | --- | ----- | -------------
Firefox | Overall | 200 | 2736 | 2736 | 1 | 0.02 |
Firefox | Page Request | 200 | 2 | 2 | 0 | -8.26 | faster
Firefox | Rendering | 200 | 2733 | 2734 | 1 | 0.03 |
```
The relevant methods are usually not hot enough for these changes to have an easily measurable effect; however, there have been a lot of other cases where similar inlining has helped performance. (And these changes may help offset the changes made in the next patch.)
For very large and complex PDF files this will help performance *slightly*, since `Parser.getObj` is called *a lot* during parsing in the worker.
This patch was tested using the PDF file from issue 2618, i.e. http://bugzilla-attachments.gnome.org/attachment.cgi?id=226471, with the following manifest file:
```
[
{ "id": "issue2618",
"file": "../web/pdfs/issue2618.pdf",
"md5": "",
"rounds": 200,
"type": "eq"
}
]
```
which gave the following results when comparing this patch against the `master` branch:
```
-- Grouped By browser, stat --
browser | stat | Count | Baseline(ms) | Current(ms) | +/- | % | Result(P<.05)
------- | ------------ | ----- | ------------ | ----------- | --- | ----- | -------------
Firefox | Overall | 200 | 2847 | 2830 | -17 | -0.60 | faster
Firefox | Page Request | 200 | 2 | 2 | 0 | -7.14 |
Firefox | Rendering | 200 | 2844 | 2827 | -17 | -0.60 | faster
```
Looking at this again, it struck me that the added functionality in `Util.intersect` is probably more confusing than helpful in general; sorry about the churn in this code!
Based on the parameter name, you'd probably expect it to only match when the intersection is `[0, 0, 0, 0]` and not when just one component is zero; the `skipEmpty` parameter thus feels too tightly coupled to the `Page.view` getter.
This is based on a real-world PDF file I encountered very recently[1], although I'm currently unable to recall where I saw it.
Note that different PDF viewers handle these sorts of errors differently; Adobe Reader outright fails to render the attached PDF file, whereas PDFium mostly handles it "correctly".
The patch makes the following notable changes:
- Refactor the `cropBox` and `mediaBox` getters, on the `Page`, to reduce unnecessary duplication. (This will also help in the future, if support for extracting additional page bounding boxes is added to the API.)
- Ensure that the page bounding boxes, i.e. `cropBox` and `mediaBox`, are never empty to prevent issues/weirdness in the viewer.
- Ensure that the `view` getter on the `Page` will never return an empty intersection of the `cropBox` and `mediaBox`.
- Add an *optional* parameter to `Util.intersect`, to allow checking that the computed intersection isn't actually empty.
- Change `Util.intersect` to have consistent return types, since Arrays are of type `Object` and falling back to returning a `Boolean` thus seems strange. (See the sketch after this list.)
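A minimal sketch of the consistent-return-type part, assuming `[x1, y1, x2, y2]` rectangles; the actual pdf.js implementation differs in its details (and, as noted above, the `skipEmpty` parameter was later deemed too confusing):
```
function intersect(rect1, rect2) {
  const xLow = Math.max(Math.min(rect1[0], rect1[2]), Math.min(rect2[0], rect2[2]));
  const xHigh = Math.min(Math.max(rect1[0], rect1[2]), Math.max(rect2[0], rect2[2]));
  if (xLow > xHigh) {
    return null; // Consistent `Object`-typed return value, rather than `false`.
  }
  const yLow = Math.max(Math.min(rect1[1], rect1[3]), Math.min(rect2[1], rect2[3]));
  const yHigh = Math.min(Math.max(rect1[1], rect1[3]), Math.max(rect2[1], rect2[3]));
  if (yLow > yHigh) {
    return null;
  }
  return [xLow, yLow, xHigh, yHigh];
}

console.log(intersect([0, 0, 10, 10], [5, 5, 20, 20]));   // [5, 5, 10, 10]
console.log(intersect([0, 0, 10, 10], [20, 20, 30, 30])); // null (was: false)
```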
---
[1] In that case I believe that only the `cropBox` was empty, but it seemed like a good idea to attempt to fix a bunch of related cases all at once.
The current code will only consider the `cropBox` and `mediaBox` as equal when they both point to the *same* underlying Array. In the case where a PDF file actually specifies both boxes independently, with the exact same values in each, the comparison will currently fail and lead to an unneeded intersection computation.
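In other words, the comparison needs to be done by value rather than by reference. A sketch of such a helper (pdf.js has an `isArrayEqual` utility for this purpose; the version below is illustrative):
```
function isArrayEqual(arr1, arr2) {
  if (arr1.length !== arr2.length) {
    return false;
  }
  return arr1.every((value, index) => value === arr2[index]);
}

const cropBox = [0, 0, 612, 792];
const mediaBox = [0, 0, 612, 792];
console.log(cropBox === mediaBox);            // false -- different Array objects
console.log(isArrayEqual(cropBox, mediaBox)); // true  -- same values, so no
                                              // intersection computation needed
```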
With the changes to the `StreamType`/`FontType` "enums" in PR 11029, one unfortunate result is that `getStats` now *always* returns empty Arrays. Something that everyone, myself included, apparently missed is that you obviously cannot index an Array with Strings :-)
I wrongly assumed that the unit-tests would catch any bugs, but they apparently suffered from the same issue as the code in `src/core/`.
Another option could be to use `Set`s rather than objects, but that would require larger changes, since `LoopbackPort` (in `src/display/api.js`) doesn't support them.
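The underlying JavaScript gotcha, in miniature: string keys on an Array become plain properties rather than elements, so the "Array" stays empty when serialized or cloned.
```
const streamTypes = [];
streamTypes["FLATE"] = true;
console.log(streamTypes.length);          // 0
console.log(JSON.stringify(streamTypes)); // "[]" -- looks empty when cloned

// A plain object keyed by the string labels keeps the data (and, unlike a
// `Set`, survives `LoopbackPort` without larger changes):
const streamTypesFixed = Object.create(null);
streamTypesFixed["FLATE"] = true;
console.log(JSON.stringify(streamTypesFixed)); // {"FLATE":true}
```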
Firefox telemetry now supports string labels. Convert the integers that we previously used for categories into strings.
The upstream work will happen in:
https://bugzilla.mozilla.org/show_bug.cgi?id=1566882
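The shape of the change, sketched with a few made-up entries (the real labels are the `StreamType`/`FontType` names; the telemetry probe definitions themselves live in mozilla-central):
```
// Before: integer categories, which required keeping a lookup table in sync
// on both the pdf.js and the Firefox side.
const StreamTypeBefore = { UNKNOWN: 0, FLATE: 1, LZW: 2 };

// After: the string labels are the categories themselves.
const StreamType = { UNKNOWN: "UNKNOWN", FLATE: "FLATE", LZW: "LZW" };
```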
There are a number of spots in the current code, and tests, where `cancel` methods are not called with appropriate arguments (leading to Promises not being rejected with Errors as intended).
In some cases the cancel `reason` is implicitly set to `undefined`, and in others the cancel `reason` is just a plain String. To address this inconsistency, the patch changes things such that cancelling is done with `AbortException`s everywhere instead.
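A sketch of the enforced pattern; `AbortException` is the real pdf.js class (from `src/shared/util.js`), re-created here so the snippet is self-contained, and `cancelAllReaders` is a hypothetical caller:
```
class AbortException extends Error {
  constructor(msg) {
    super(msg);
    this.name = "AbortException";
  }
}

// Cancel every reader with a proper Error-derived reason, so that the
// corresponding Promises reject with an actual Error rather than with
// `undefined` or a plain String.
function cancelAllReaders(readers) {
  const reason = new AbortException("Rendering was cancelled.");
  for (const reader of readers) {
    reader.cancel(reason);
  }
}
```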
Add a work-around, in `glyphlist.js`, for bad PDF generators which use a non-standard `/f_f` string in the `Encoding` dictionary when referring to the ff ligature (issue 11016)
This patch will not incur any (measurable) overhead, since the glyphlist is already quite long and one more entry won't really matter, which is important given that this sort of PDF corruption ought to be very rare.
Furthermore, this patch purposely does *not* add a bunch of similarly modified ligature names on pure speculation. Any similar additions, for other ligatures, should only be made if there's real-world examples of PDF files where that's actually necessary.
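For illustration, the work-around amounts to one extra name-to-codepoint entry along these lines (the real mapping lives in the generated glyphlist in `src/core/glyphlist.js`; treat the exact shape here as a sketch):
```
const glyphsUnicode = {
  // ...the standard Adobe Glyph List entries, including the ligature...
  ff: 0xfb00,
  // Work-around for bad PDF generators that emit the non-standard /f_f name:
  f_f: 0xfb00,
};

console.log(String.fromCharCode(glyphsUnicode["f_f"])); // "ﬀ"
```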
For very large and complex PDF files this will help performance slightly, since `EvaluatorPreprocessor.read` is called a lot during parsing in the worker.
This patch was tested using the PDF file from issue 2618, i.e. http://bugzilla-attachments.gnome.org/attachment.cgi?id=226471, using the following manifest file:
```
[
{ "id": "issue2618",
"file": "../web/pdfs/issue2618.pdf",
"md5": "",
"rounds": 200,
"type": "eq"
}
]
```
This gave the following results when comparing this patch against the `master` branch:
```
-- Grouped By browser, stat --
browser | stat | Count | Baseline(ms) | Current(ms) | +/- | % | Result(P<.05)
------- | ------------ | ----- | ------------ | ----------- | --- | ----- | -------------
Firefox | Overall | 200 | 3402 | 3358 | -43 | -1.28 | faster
Firefox | Page Request | 200 | 1 | 1 | 0 | 26.71 |
Firefox | Rendering | 200 | 3401 | 3357 | -44 | -1.28 | faster
```
For very large and complex PDF files this will help performance slightly, since `Parser.shift` is called *a lot* during parsing.
This patch was tested using the PDF file from issue 2618, i.e. http://bugzilla-attachments.gnome.org/attachment.cgi?id=226471 (with well over *four million* `Parser.shift` calls for just the one page), using the following manifest file:
```
[
{ "id": "issue2618",
"file": "../web/pdfs/issue2618.pdf",
"md5": "",
"rounds": 100,
"type": "eq"
}
]
```
This gave the following results when comparing this patch against the `master` branch:
```
-- Grouped By browser, stat --
browser | stat | Count | Baseline(ms) | Current(ms) | +/- | % | Result(P<.05)
------- | ------------ | ----- | ------------ | ----------- | --- | ----- | -------------
Firefox | Overall | 100 | 3386 | 3322 | -65 | -1.92 | faster
Firefox | Page Request | 100 | 1 | 1 | 0 | -8.08 |
Firefox | Rendering | 100 | 3385 | 3321 | -65 | -1.92 | faster
```
The way that this method handles documents without an `ID` entry in the Trailer dictionary feels overly complicated to me. Hence this patch adds `getByteRange` methods to the various Stream implementations[1], and utilizes those rather than manually calling `ensureRange` when computing a fallback `fingerprint`.
---
[1] Note that `PDFDocument` is only ever initialized with either a `Stream` or a `ChunkedStream`, hence why the `DecodeStream.getByteRange` method isn't implemented.
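A minimal sketch of such a `getByteRange` method, assuming a simple byte-backed `Stream` (the real implementations additionally deal with chunked loading):
```
class Stream {
  constructor(arrayBuffer, start = 0, length) {
    this.bytes = new Uint8Array(arrayBuffer);
    this.start = start;
    this.end = length !== undefined ? start + length : this.bytes.length;
  }

  getByteRange(begin, end) {
    // Clamp to the stream boundaries, so callers need not worry about
    // over-reading (the fingerprint fallback reads a fixed-size prefix).
    if (begin < 0) {
      begin = 0;
    }
    if (end > this.end) {
      end = this.end;
    }
    return this.bytes.subarray(begin, end);
  }
}

const stream = new Stream(new Uint8Array([1, 2, 3, 4, 5]).buffer);
console.log(stream.getByteRange(1, 99)); // Uint8Array [2, 3, 4, 5] -- end clamped
```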
The `finalize` helper function has only a *single* call-site, and it's just a one-liner too. Furthermore, it's only ever called with a `Promise` as its argument, meaning that it's unnecessarily convoluted as well (i.e. the `Promise.resolve()` part shouldn't be necessary).
Hence this code can be both simplified *and* inlined at its only call-site instead.
Currently `wrapReason` is manually called at *every* `resolveOrReject` call-site, despite it being completely unnecessary unless there's an actual error being handled. This is obviously inefficient, and it's easy enough to avoid by having `resolveOrReject` handle this only when actually needed.
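Sketched, the idea is simply to move the call inside the error branch; `wrapReason` below is a minimal stand-in for the real helper in `src/shared/message_handler.js`:
```
function wrapReason(reason) {
  // (Re-)create a proper Error from the structured-cloned reason data.
  return reason instanceof Error
    ? reason
    : new Error(reason && reason.message ? reason.message : String(reason));
}

function resolveOrReject(capability, data) {
  if (data.success) {
    capability.resolve();
  } else {
    // Only pay for `wrapReason` on the error path, instead of calling it
    // eagerly at every call-site.
    capability.reject(wrapReason(data.reason));
  }
}
```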
Note that, in the old code, there was a code-path which could prevent this from happening, thus affecting future cleanup.
Furthermore, ensure that we'll always attempt to cleanup when handling the 'PageError' message, similar to the code in e.g. the `PDFPageProxy._renderPageChunk` method.
The `receivingOperatorList` property is currently tracked *twice* in the rendering code, both directly and inversely through the `intentState.operatorList.lastChunk` boolean. This type of double bookkeeping is never a good idea, since it's just too easy for the properties to accidentally fall out of sync.
In this case there's even a `cleanup`-related bug caused by this, which means that `PDFPageProxy._tryCleanup` will never be able to discard any data if there's an error on the worker-thread (as handled through the 'PageError' message).
Hence the simplest solution seems, at least to me, to update `PDFPageProxy._tryCleanup` to replace the `intentState.receivingOperatorList` check with a `!intentState.operatorList.lastChunk` check and completely remove the former property.
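A hedged sketch of the resulting check, with the surrounding `PDFPageProxy` class omitted; `intentStates` mirrors the shape used in `src/display/api.js`:
```
function canCleanup(pendingCleanup, intentStates) {
  if (!pendingCleanup) {
    return false;
  }
  for (const intentState of Object.values(intentStates)) {
    // A page is only safe to clean up once no render tasks remain and the
    // final operator-list chunk has arrived; `lastChunk` replaces the old,
    // separately tracked `receivingOperatorList` boolean.
    if (intentState.renderTasks.length !== 0 ||
        !intentState.operatorList.lastChunk) {
      return false;
    }
  }
  return true;
}

console.log(canCleanup(true, {
  display: { renderTasks: [], operatorList: { lastChunk: true } },
})); // true -- safe to discard the page's data
```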
*Please note:* A similar change was attempted in PR 5005, but it was subsequently backed out in PR 5069.
Unfortunately I don't think anyone ever tried to debug *exactly* why it didn't work, since it ought to have worked, and having re-tested this now I'm not able to reproduce the problem any more. However, given just how inefficient the current code is, with thousands of strictly unnecessary function calls for each `find` invocation, I'd really like to try fixing this again.
This reduces the total number of function calls when reading the XRef table and when fetching uncompressed XRef entries.
Note in particular the `XRef.readXRefTable` method, where there are *two* back-to-back `isCmd` checks rather than just one, as illustrated below.
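To illustrate the kind of call-reduction involved (the names mirror the pdf.js primitives, but the snippet is simplified):
```
class Cmd {
  constructor(cmd) {
    this.cmd = cmd;
  }
}
function isCmd(v, cmd) {
  return v instanceof Cmd && (cmd === undefined || v.cmd === cmd);
}

function classifyEntry(type) {
  // Before: two helper calls, each repeating the `instanceof` check.
  //   if (isCmd(type, "f")) { ... } else if (isCmd(type, "n")) { ... }
  // After: one type check, then plain string comparisons.
  if (type instanceof Cmd) {
    if (type.cmd === "f") {
      return "free";
    } else if (type.cmd === "n") {
      return "in-use";
    }
  }
  return "invalid";
}

console.log(classifyEntry(new Cmd("n"))); // "in-use"
```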
A lot of the `new Parser()` call-sites look quite unwieldy/ugly as-is, with a bunch of somewhat randomly ordered arguments, which we can avoid by changing the constructor to accept an object instead. As an added bonus, this provides better documentation without having to add inline argument comments in the code.
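A sketch of the resulting pattern (the parameter names follow the description above; the full list of options is in `src/core/parser.js`):
```
class Parser {
  constructor({ lexer, xref, allowStreams = false, recoveryMode = false }) {
    this.lexer = lexer;
    this.xref = xref;
    this.allowStreams = allowStreams;
    this.recoveryMode = recoveryMode;
  }
}

// Placeholder stand-ins for real Lexer/XRef instances:
const myLexer = { getObj: () => null };
const myXref = null;

// Before: new Parser(lexer, true, xref) -- what does `true` mean here?
// After: the named properties document themselves at the call-site.
const parser = new Parser({ lexer: myLexer, xref: myXref, allowStreams: true });
console.log(parser.allowStreams, parser.recoveryMode); // true false
```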
See https://github.com/mozilla/eslint-plugin-no-unsanitized
Since we've generally never allowed e.g. `innerHTML`, which is enforced during review, there's only one linting failure with this patch. (That one is white-listed, in line with the existing comment, since it's test-only code.)
Since all other `IPDFStream` implementations live in their own files, it seems reasonable for these to do so as well.
Furthermore, this patch converts all of the relevant code to ES6 classes and updates the interface definitions to mark a couple of methods as `async`.