pdf.js

Author	SHA1	Message	Date
Tim van der Meij	71477dc5b1	Merge pull request #11111 from Snuffleupagus/MessageHandler-resolveCall Inline the `resolveCall` helper function at its call-sites in `MessageHandler`	2019-09-02 22:19:31 +02:00
Jonas Jenwald	cd82b81bc7	Inline the `resolveCall` helper function at its call-sites in `MessageHandler` There are only three call-sites and one of them doesn't even need the complete functionality of `resolveCall`, hence it seems reasonable to just inline this code. An additional benefit of this is that the `Function.prototype.apply()` instance can also be converted into "normal" function calls, which should be a tiny bit more efficient. The patch also replaces a number of unnecessary arrow functions, in relevant parts of the `MessageHandler` code, with "normal" functions instead. Finally, all `Promise.resolve().then(...)` calls are replaced with `new Promise(...)` instead since the latter is a tiny bit more efficient. This also explains the test failures on the Linux bot, with a prior version of the patch, since the `Promise.resolve().then(...)` format essentially creates two Promises thus causing additional delay.	2019-09-01 13:40:19 +02:00
Tim van der Meij	10165c070e	Merge pull request #11110 from Snuffleupagus/MessageHandler-scope Remove support for the `scope` parameter in the `MessageHandler.on` method	2019-09-01 12:27:08 +02:00
Jonas Jenwald	055f03938b	Remove support for the `scope` parameter in the `MessageHandler.on` method At this point in time it's easy to convert the `MessageHandler.on` call-sites to use arrow functions, and thus let the JavaScript engine handle scopes for us, rather than having to manually keep references to the relevant scopes in `MessageHandler`.[1] An additional benefit of this is that a couple of `Function.prototype.call()` instances can now be converted into "normal" function calls, which should be a tiny bit more efficient. All in all, I don't see any compelling reason why it'd be necessary to keep supporting custom `scope`s in the `MessageHandler` implementation. --- [1] In the event that a custom scope is ever needed, simply using `bind` on the handler function when calling `MessageHandler.on` ought to work as well.	2019-09-01 09:24:15 +02:00
Tim van der Meij	d1e6d427cd	Merge pull request #11107 from Snuffleupagus/MessageHandler-postMessage Various `MessageHandler` improvements when using Streams	2019-08-31 00:06:17 +02:00
Jonas Jenwald	f71ea2de0e	Remove the `makeReasonSerializable` helper function, and use `wrapReason` instead, in `src/shared/message_handler.js` Since `wrapReason` and `makeReasonSerializable` are essentially functionally equivalent it doesn't seem necessary to keep both of them around, especially when `makeReasonSerializable` only has a single call-site.	2019-08-30 19:36:10 +02:00
Jonas Jenwald	4e6a9b54c7	Change the internal `stream` property, as sent when Streams are used, from a String to a Number Given that the `stream` property is an internal implementation detail, changing its type shouldn't be a problem. By using Numbers instead, we can avoid unnecessary String allocations when creating/processing Streams.	2019-08-30 13:27:18 +02:00
Jonas Jenwald	252a3e35fb	Reduce the amount of unnecessary function calls and object allocations, in `MessageHandler`, when using Streams With PR 11069 we're now using Streams for OperatorList parsing (in addition to just TextContent parsing), which brings the nice benefit of being able to easily abort parsing on the worker-thread thus saving resources. However, since we're now creating many more `ReadableStream`s there appears to be a tiny bit more overhead because of it (giving ~1% slower runtime of `browsertest` on the bots). In this case we're just going to have to accept such a small regression, since the benefits of using Streams clearly outweigh it. What we can do here, is to try and make the Streams part of the `MessageHandler` implementation slightly more efficient by e.g. removing unnecessary function calls (which has been helpful in other parts of the code-base). To that end, this patch makes the following changes: - Actually support `transfers` in `MessageHandler.sendWithStream`, since the parameter was being ignored. - Inline the `sendStreamRequest`/`sendStreamResponse` helper functions at their respective call-sites. Obviously this causes some amount of code duplication, however I still think this change seems reasonable since for each call-site: - It avoids making one unnecessary function call. - It avoids allocating one temporary object. - It avoids sending, and thus structure cloning, various undefined object properties. - Inline objects in the `MessageHandler.{send, sendWithPromise}` methods. - Finally, directly call `comObj.postMessage` in various methods when `transfers` are not present, rather than calling `MessageHandler.postMessage`, to further reduce the amount of function calls.	2019-08-30 12:32:20 +02:00
Jonas Jenwald	ae0d9e8c2a	Replace some instances of implicit `function.bind(this)` usage, in `src/display/api.js`, with arrow functions instead	2019-08-30 11:35:05 +02:00
Tim van der Meij	3dfce2d4ef	Merge pull request #11104 from Snuffleupagus/textLayer-style [TextLayer] Avoid unnecessary font updates in `_layoutText` and remove `setAttribute` usage in `appendText`	2019-08-28 23:25:58 +02:00
Tim van der Meij	9f592ebf25	Merge pull request #11102 from mozilla/dependabot/npm_and_yarn/mixin-deep-1.3.2 Bump mixin-deep from 1.3.1 to 1.3.2	2019-08-28 23:11:31 +02:00
Jonas Jenwald	667e548e5f	[TextLayer] Remove `setAttribute` usage in `appendText` (issue 8066) One of the motivations for using `setAttribute` in the first place was to support more efficient DOM updates in the `expandTextDivs` method, since performance of the `enhanceTextSelection` mode can be somewhat bad when there's a lot of `textDivs` on the page. With recent `TextLayer` changes/optimizations it's no longer necessary to store a complete `style`-string for every `textDiv`, and we can thus re-visit the `setAttribute` usage. Note that with the current code, in `appendText`, there's only one string per `textDiv` which avoids a bunch of temporary strings. While the changes in this patch mean that there are now three strings per `textDiv` instead, the total length of these strings is now quite a bit shorter (42 characters to be exact).	2019-08-28 16:52:09 +02:00
Jonas Jenwald	106b239c5d	[TextLayer] Avoid unnecessary font updates in `_layoutText` (PR 11097 follow-up) This should obviously have been done in PR 11097, but for some reason I completely overlooked it; sorry about that. There's no good reason to update the font unless you're actually going to measure the width of the textContent. This can reduce unnecessary font switching a fair bit, even for documents which are somewhat simple/short (in e.g. the `tracemonkey.pdf` file this cuts the amount of font switches almost in half).	2019-08-28 16:08:06 +02:00
dependabot[bot]	594c49c571	Bump mixin-deep from 1.3.1 to 1.3.2 Bumps [mixin-deep](https://github.com/jonschlinkert/mixin-deep) from 1.3.1 to 1.3.2. - [Release notes](https://github.com/jonschlinkert/mixin-deep/releases) - [Commits](https://github.com/jonschlinkert/mixin-deep/compare/1.3.1...1.3.2) Signed-off-by: dependabot[bot] <support@github.com>	2019-08-28 00:33:12 +00:00
Tim van der Meij	184d416639	Merge pull request #11097 from Snuffleupagus/textLayer-measure-width [TextLayer] Only measure the width of the text, in `_layoutText`, for multi-char text divs	2019-08-25 16:08:51 +02:00
Tim van der Meij	d64b49831d	Merge pull request #11095 from timvandermeij/api-attachments-unit-test Include a reduced, non-linked PDF file for the attachments API unit test	2019-08-25 15:22:51 +02:00
Tim van der Meij	09df1ee0ce	Include a reduced, non-linked PDF file for the attachments API unit test	2019-08-25 15:14:57 +02:00
Tim van der Meij	97d3294d3d	Merge pull request #11096 from timvandermeij/updates Update translations/packages and upgrade to `eslint` version 6	2019-08-25 15:05:52 +02:00
Jonas Jenwald	a1398048e5	[TextLayer] Simplify building of the expanded transform in `expandTextDivs` Rather than essentially re-computing the `originalTransform` every time, we can simply use it directly instead.	2019-08-25 13:09:04 +02:00
Jonas Jenwald	b68f7bb404	[TextLayer] Only measure the width of the text, in `_layoutText`, for multi-char text divs For performance reasons single-char text divs aren't being scaled, as outlined in a comment in `appendText`. Hence it doesn't seem necessary, or even a good idea, to unconditionally measure the width of the text in `_layoutText`.	2019-08-25 12:32:49 +02:00
Tim van der Meij	215c546fd5	Upgrade to `eslint` version 6 This major version bump required two changes: - The global line in the mobile viewer example should be removed because the `.eslintrc` file already defines these globals and with the new `eslint` version we otherwise get an error saying "'pdfjsLib' is already defined as a built-in global variable". - The ECMA version for the examples must be set to 6 since we're using modules, otherwise we get an error saying "sourceType 'module' is not supported when ecmaVersion < 2015". It turns out that the previous version of `eslint` already used ECMA version 6 silently even though we set 5, see https://github.com/eslint/eslint/issues/9687#issuecomment-432413384, so in terms of our code nothing really changes.	2019-08-24 20:21:10 +02:00
Tim van der Meij	d9cd890228	Update packages	2019-08-24 20:08:09 +02:00
Tim van der Meij	ce1acff5f0	Update translations	2019-08-24 20:05:47 +02:00
Tim van der Meij	56ae7a6690	Merge pull request #11069 from Snuffleupagus/getoplist-stream Use streams for OperatorList chunking (issue 10023)	2019-08-24 19:31:00 +02:00
Jonas Jenwald	711040ecc5	Stop re-throwing errors in the 'GetOperatorList' and 'GetTextContent' handlers, in `src/core/worker.js` These functions aren't returning anything, now that they're using `ReadableStream`s, and it thus doesn't seem necessary to re-throw errors (also given the console message that's caused by it).	2019-08-24 15:56:41 +02:00
Yury Delendik	66e0dd1b06	Use streams for OperatorList chunking (issue 10023) Please note: The majority of this patch was written by Yury, and it's simply been rebased and slightly extended to prevent issues when dealing with `RenderingCancelledException`. By leveraging streams this (finally) provides a simple way in which parsing can be aborted on the worker-thread, which will ultimately help save resources. With this patch worker-thread parsing will only be aborted when the document is destroyed, and not when rendering is cancelled. There are a couple of reasons for this: - The API currently expects the entire OperatorList to be extracted, or an Error to occur, once it's been started. Hence additional re-factoring/re-writing of the API code will be necessary to properly support cancelling and re-starting of OperatorList parsing in cases where the `lastChunk` hasn't yet been seen. - Even with the above addressed, immediately cancelling when encountering a `RenderingCancelledException` will lead to worse performance in e.g. the default viewer. When zooming and/or rotation of the document occurs it's very likely that `cancel` will be (almost) immediately followed by a new `render` call. In that case you'd obviously not want to abort parsing on the worker-thread, since then you'd risk throwing away a partially parsed Page and thus be forced to re-parse it, which will regress perceived performance. - This patch is already somewhat risky, given that it touches fundamentally important/critical code, and trying to keep it somewhat small should hopefully reduce the risk of regressions (and simplify reviewing as well). Time permitting, once this has landed and been in Nightly for a while, I'll try to work on the remaining points outlined above. Co-Authored-By: Yury Delendik <ydelendik@mozilla.com> Co-Authored-By: Jonas Jenwald <jonas.jenwald@gmail.com>	2019-08-24 15:56:40 +02:00
Tim van der Meij	ee75fc1298	Merge pull request #11092 from Snuffleupagus/textLayer-expandTextDivs-padding [TextLayer] Use an Array to build the total `padding`, rather than concatenating Strings, in `expandTextDivs`	2019-08-24 14:45:21 +02:00
Tim van der Meij	42213f6a2c	Merge pull request #11093 from Priestch/shorthand_after_print Shorthand afterPrint signature in app.js	2019-08-24 14:36:53 +02:00
Priestch	000780d27e	Use shorthand method signature for `afterPrint` in `web/app.js`	2019-08-24 18:26:25 +08:00
Jonas Jenwald	29a2516e4c	[TextLayer] Use an Array to build the total `padding`, rather than concatenating Strings, in `expandTextDivs` Furthermore, it's possible to re-use the same Array for all `textDiv`s on the page and the resulting padding string also becomes a lot more compact. Please note that the `paddingLeft` branch was moved, since the padding values need to be ordered as `top, right, bottom, left`. Finally, with this re-factoring it's no longer necessary to cache the original `style` string for every `textDiv` when `enhanceTextSelection` is enabled.	2019-08-24 01:13:59 +02:00
Tim van der Meij	edbebb8bf7	Merge pull request #11090 from Snuffleupagus/textLayer-expandTextDivs-transform [TextLayer] Use an Array to build the total `transform`, rather than concatenating Strings, in `expandTextDivs`	2019-08-23 23:12:42 +02:00
Tim van der Meij	d1ef08e147	Merge pull request #11091 from Snuffleupagus/textLayer-expandTextDivs-valid-padding [TextLayer] Only handle positive padding values in `expandTextDivs`	2019-08-23 23:08:39 +02:00
Jonas Jenwald	932fcacff8	[TextLayer] Only handle positive padding values in `expandTextDivs` Given that browsers will reject padding values smaller than zero (which may be caused by limited numerical precision during calculations in the `expand` code), it makes no sense to include those when expanding the `textDiv`s.	2019-08-23 13:16:20 +02:00
Jonas Jenwald	37e8a8189b	[TextLayer] Use an Array to build the total `transform`, rather than concatenating Strings, in `expandTextDivs` Furthermore, it's possible to re-use the same Array for all `textDiv`s on the page.	2019-08-23 12:17:12 +02:00
Tim van der Meij	490deb1b65	Merge pull request #11086 from Snuffleupagus/textLayer-originalTransform [TextLayer] Only cache the `originalTransform` when `enhanceTextSelection` is enabled	2019-08-22 23:09:07 +02:00
Brendan Dahl	31f319301d	Merge pull request #11087 from brendandahl/disable-links Add a way to disable external links.	2019-08-22 11:13:11 -07:00
Jonas Jenwald	a519ceffee	[TextLayer] Use template strings when updating the font property in the `_layoutText` method	2019-08-22 14:47:44 +02:00
Jonas Jenwald	6afe3221b7	[TextLayer] Only cache the `originalTransform` when `enhanceTextSelection` is enabled Given that this is completely unused in "regular" text-selection mode, there's no reason to unconditionally store one string for every `textDiv`.	2019-08-22 14:47:18 +02:00
Brendan Dahl	98e989116c	Add a way to disable external links.	2019-08-21 11:20:41 -07:00
Tim van der Meij	52c6b3c138	Merge pull request #11079 from Snuffleupagus/textLayer-memory [TextLayer] Only cache the current `textDiv` style when `enhanceTextSelection` is enabled and use template strings in `expandTextDivs`	2019-08-20 22:48:10 +02:00
Tim van der Meij	78f9ab53fc	Merge pull request #11081 from dhuang612/document-pdfjs-dist/webpack added in information about pdfjs/webpack	2019-08-20 22:38:58 +02:00
dhuang612	d52d1e2d09	added in information about pdfjs/webpack updated readme with corrections	2019-08-20 10:20:32 -04:00
Jonas Jenwald	431a264126	[TextLayer] Reduce the amount of intermediary strings in `expandTextDivs` By using template strings, we can avoid some unnecessary string allocations (which is also helped by shortening a variable name).	2019-08-19 12:09:18 +02:00
Jonas Jenwald	45dfad8640	[TextLayer] Only cache the current `textDiv` style when `enhanceTextSelection` is enabled This will help save a little bit of memory, by not storing one unused string for each `textDiv` in regular text-selection mode.	2019-08-19 11:02:56 +02:00
Tim van der Meij	852fc955bd	Merge pull request #11076 from Snuffleupagus/XRef-fetch-isRef/cache Replace the `XRef.cache` Array with a Map instead	2019-08-18 14:44:31 +02:00
Jonas Jenwald	1cd9a28c81	Replace the `XRef.cache` Array with a Map instead Given that the different types of `Stream`s will never be cached, this thus implies that the `XRef.cache` Array will always be more-or-less sparse. Generally speaking, the longer the document the more sparse the `XRef.cache` will thus become. For example, looking at the `pdf.pdf` file from the test-suite: The length of the `XRef.cache` Array will be a few hundred thousand elements, with approximately 95% of them being empty. Hence it seems pretty clear that an Array isn't really the best data-structure for this kind of cache, and this patch thus changes it to a Map instead. This patch-series was tested using the PDF file from issue 2618, i.e. http://bugzilla-attachments.gnome.org/attachment.cgi?id=226471, with the following manifest file: ``` [ { "id": "issue2618", "file": "../web/pdfs/issue2618.pdf", "md5": "", "rounds": 200, "type": "eq" } ] ``` which gave the following results when comparing this patch-series against the `master` branch: ``` -- Grouped By browser, stat -- browser \| stat \| Count \| Baseline(ms) \| Current(ms) \| +/- \| % \| Result(P<.05) ------- \| ------------ \| ----- \| ------------ \| ----------- \| --- \| ----- \| ------------- Firefox \| Overall \| 200 \| 2736 \| 2736 \| 1 \| 0.02 \| Firefox \| Page Request \| 200 \| 2 \| 2 \| 0 \| -8.26 \| faster Firefox \| Rendering \| 200 \| 2733 \| 2734 \| 1 \| 0.03 \| ```	2019-08-18 12:07:18 +02:00
Jonas Jenwald	34a53b9f5d	Inline the `isRef` checks in the various `XRef.fetch` related methods The relevant methods are usually not hot enough for these changes to have an easily measurable effect, however there have been a lot of other cases where similar inlining has helped performance. (And these changes may help offset the changes made in the next patch.)	2019-08-18 11:57:48 +02:00
Tim van der Meij	1565d1849d	Merge pull request #11073 from brendandahl/code-point Move polyfill for codePointAt to String prototype.	2019-08-17 13:26:35 +02:00
Brendan Dahl	c8129b8787	Move polyfill for codePointAt to String prototype. This method belongs on the prototype not the String object.	2019-08-16 14:32:43 -07:00
Tim van der Meij	20181b65d4	Merge pull request #11070 from Snuffleupagus/Parser-getObj-rm-isString Inline the `isString` check in the `Parser.getObj` method	2019-08-16 22:54:55 +02:00
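
The reasoning behind commit 1cd9a28c81 (replacing the `XRef.cache` Array with a Map) can be illustrated with a minimal sketch. This is not the actual pdf.js `XRef` code: the function names, object numbers, and cached values below are hypothetical and only show why a sparsely populated, number-keyed cache fits a Map better than an Array.

```javascript
// Hypothetical illustration only; none of these names come from pdf.js.

// Array-backed cache: indexing by object number leaves a hole for every
// object number that is never cached, so `length` tracks the largest key.
function buildArrayCache(entries) {
  const cache = [];
  for (const [num, value] of entries) {
    cache[num] = value;
  }
  return cache;
}

// Map-backed cache: only the object numbers actually stored take up an entry.
function buildMapCache(entries) {
  const cache = new Map();
  for (const [num, value] of entries) {
    cache.set(num, value);
  }
  return cache;
}

// Assumed sample data: three cached objects with widely spaced numbers.
const entries = [
  [3, "catalog"],
  [1000, "page"],
  [250000, "font"],
];

const arrayCache = buildArrayCache(entries);
const mapCache = buildMapCache(entries);

// Count the slots of the Array that actually hold a value.
// (`forEach` skips holes in sparse Arrays.)
let filledSlots = 0;
arrayCache.forEach(() => {
  filledSlots++;
});

console.log(arrayCache.length, filledSlots, mapCache.size);
```

In this sketch the Array reports a length driven by the largest object number while holding only three values, whereas the Map's `size` matches the number of cached objects. Lookups change from `cache[num]` to `cache.get(num)`, and presence checks can use `cache.has(num)`.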

11799 Commits