pdf.js

Author	SHA1	Message	Date
Jonas Jenwald	f15eb63ed5	Remove the `PDFSinglePageViewer`-specific code from `web/secondary_toolbar.js` (PR 9877 follow-up) This was added on the assumption that the viewer would (eventually) start using the `PDFSinglePageViewer` for e.g. PAGE-scrolling mode and PresentationMode. However, having both a `PDFViewer` and a `PDFSinglePageViewer` side-by-side in the viewer would've been tricky to implement well, which is why PR 14112 implemented PAGE-scrolling for the general `BaseViewer` instead. Given that the default viewer is no longer (potentially) going to use `PDFSinglePageViewer`, there's code in the `SecondaryToolbar` (and related CSS rules) which is now unnecessary.	2021-11-29 13:13:17 +01:00
Jonas Jenwald	8fa5fcfe72	[Regression] Prevent errors, during loading, in the viewer for XFA-documents (PR 14295 follow-up) In the second commit in PR 14295, I forgot that the pages in XFA-documents don't have references (like in regular PDF documents); sorry about that!	2021-11-26 20:21:12 +01:00
Jonas Jenwald	f7b1da418f	Center pages vertically in PresentationMode (issue 10906) This patch can be tested e.g. with the `sizes.pdf` document in the test-suite. While this patch isn't necessarily the best solution, e.g. it might be possible to solve this with only CSS, it's what I was able to come up with to address an old issue. The solution here re-uses the `spread`-class in PresentationMode, since that one already takes care of centering pages vertically, together with a dummy-page that takes up the entire height of the window. Finally, some PresentationMode-related CSS-rules are also simplified slightly, since the changes in PR 14112 (using Page-scrolling) allows some clean-up here.	2021-11-24 14:09:34 +01:00
Jonas Jenwald	58a2728647	Ensure that `BaseViewer.#ensurePdfPageLoaded` updates the `PDFLinkService`-pagesRefCache if necessary The issue that this patch fixes has existed ever since the viewer was first re-factored into components, however it only really affects the `disableAutoFetch = true` mode. By default we're fetching all pages in `BaseViewer.setDocument`, and as part of the parsing/initialization we're also populating the `PDFLinkService`-pagesRefCache. The purpose of that cache is to make navigating to any internal destinations faster, by not having to (asynchronously) lookup the pageNumber via the API when handling the destination. In comparison, when the `disableAutoFetch = true` mode is being used we're instead lazily initializing the pages in the `BaseViewer.#ensurePdfPageLoaded`-method. For some reason, that I can only assume is a simple oversight, we're not attempting to update the `PDFLinkService`-pagesRefCache in that case.	2021-11-21 11:53:19 +01:00
Jonas Jenwald	0ebac67a9f	Remove the `{BaseViewer, PDFThumbnailViewer}._pagesRequests` caches In the `BaseViewer` this cache is mostly relevant in the `disableAutoFetch = true` mode, since the pages are being initialized lazily in that case. In the `PDFThumbnailViewer` this cache is mostly used for thumbnails that are actually being rendered, as opposed to those created directly from the "regular" pages. Please note that I'm not suggesting that we remove these caches because they're only used in some situations, but rather because they're for all intents and purposes actually redundant. In the API itself, we're already caching both the page-promises and the actual pages themselves on the `WorkerTransport`-instance. Hence these viewer-caches aren't really necessary in practice, and adds what to me mostly seems like an unnecessary level of indirection.[1] Given that the viewer now relies on caching in the API itself, this patch also adds a new unit-test to ensure that page-caching works (and keep working) as expected. --- [1] In the `WorkerTransport.getPage`-method the parameter is being validated on every call, but that's hardly enough code to warrant keeping the "duplicate" caches in the viewer in my opinion.	2021-11-21 11:40:45 +01:00
Jonas Jenwald	6da0944fc7	[api-minor] Replace `PDFDocumentProxy.getStats` with a synchronous `PDFDocumentProxy.stats` getter Please note: These changes will primarily benefit longer documents, somewhat at the expense of e.g. one-page documents. The existing `PDFDocumentProxy.getStats` function, which in the default viewer is called for each rendered page, requires a round-trip to the worker-thread in order to obtain the current document stats. In the default viewer, we currently make one such API-call for every rendered page. This patch proposes replacing that method with a synchronous `PDFDocumentProxy.stats` getter instead, combined with re-factoring the worker-thread code by adding a `DocStats`-class to track Stream/Font-types and only send them to the main-thread the first time that a type is encountered. Note that in practice most PDF documents only use a fairly limited number of Stream/Font-types, which means that in longer documents most of the `PDFDocumentProxy.getStats`-calls will return the same data.[1] This re-factoring will obviously benefit longer document the most[2], and could actually be seen as a regression for one-page documents, since in practice there'll usually be a couple of "DocStats" messages sent during the parsing of the first page. However, if the user zooms/rotates the document (which causes re-rendering), note that even a one-page document would start to benefit from these changes. Another benefit of having the data available/cached in the API is that unless the document stats change during parsing, repeated `PDFDocumentProxy.stats`-calls will return the same identical object. This is something that we can easily take advantage of in the default viewer, by now only reporting "documentStats" telemetry[3] when the data actually have changed rather than once per rendered page (again beneficial in longer documents). --- [1] Furthermore, the maximium number of `StreamType`/`FontType` are `10` respectively `12`, which means that regardless of the complexity and page count in a PDF document there'll never be more than twenty-two "DocStats" messages sent; see `41ac3f0c07/src/shared/util.js (L206-L232)` [2] One example is the `pdf.pdf` document in the test-suite, where rendering all of its 1310 pages only result in a total of seven "DocStats" messages being sent from the worker-thread. [3] Reporting telemetry, in Firefox, includes using `JSON.stringify` on the data and then sending an event to the `PdfStreamConverter.jsm`-code. In that code the event is handled and `JSON.parse` is used to retrieve the data, and in the "documentStats"-case we'll then iterate through the data to avoid double-reporting telemetry; see https://searchfox.org/mozilla-central/rev/8f4c180b87e52f3345ef8a3432d6e54bd1eb18dc/toolkit/components/pdfjs/content/PdfStreamConverter.jsm#515-549	2021-11-20 12:20:55 +01:00
Brendan Dahl	c6cb39ef30	Merge pull request #14262 from Snuffleupagus/issue-14261 Include the /Lang-property, when it exists, in the StructTree-data (issue 14261)	2021-11-19 07:51:21 -08:00
Brendan Dahl	9f4a2cf5ce	Merge pull request #14276 from Snuffleupagus/issue-14242-2 Only show the `loadingIcon`-spinner on visible pages (issue 14242)	2021-11-18 13:43:58 -08:00
Tim van der Meij	3dccaccbb4	Merge pull request #14278 from Snuffleupagus/rm-removeChild Replace the remaining `Node.removeChild()` instances with `Element.remove()`	2021-11-17 20:17:55 +01:00
Tim van der Meij	f90eebd282	Merge pull request #14280 from Snuffleupagus/scrollMode-PAGE-spread-loop Slightly optimize `spreadMode` toggling with `ScrollMode.PAGE` set (PR 14112 follow-up)	2021-11-17 19:46:30 +01:00
Jonas Jenwald	4ef1a129fa	Replace the remaining `Node.removeChild()` instances with `Element.remove()` Using `Element.remove()` is a slightly more compact way of removing an element, since you no longer need to explicitly find/use its parent element. Furthermore, the patch also replaces a couple of loops that're used to delete all elements under a node with simply overwriting the contents directly (a pattern already used throughout the viewer). See also: - https://developer.mozilla.org/en-US/docs/Web/API/Node/removeChild - https://developer.mozilla.org/en-US/docs/Web/API/Element/remove	2021-11-16 17:52:50 +01:00
Brendan Dahl	3209c013c4	Merge pull request #14247 from calixteman/button [api-minor] Render pushbuttons on their own canvas (bug 1737260)	2021-11-16 08:10:40 -08:00
Jonas Jenwald	1214c056e9	Slightly optimize `spreadMode` toggling with `ScrollMode.PAGE` set (PR 14112 follow-up) It shouldn't be necessary to iterate through all pages when using a non-default `spreadMode`, since we already know which page(s) should become visible. This code is a left-over from the initial (local) implementation that resulted in PR 14112, however I forgot to clean-up some things such as e.g. this loop. Also fixes an outdated comment, see PR 14204 which removed the mentioned data-structure.	2021-11-16 15:37:58 +01:00
Jonas Jenwald	7d4c37e988	Use the new iterator in the `PDFPageViewBuffer` unit-tests The previous patch introduced an iterator in the `PDFPageViewBuffer`-class, hence the test-only `_buffer`-getter is no longer necessary.	2021-11-15 14:06:17 +01:00
Jonas Jenwald	e909fcdba8	Only show the `loadingIcon`-spinner on visible pages (issue 14242) This patch preserves the old behaviour of appending a `loadingIcon`-div to all pages that are not yet loaded/rendered. However, the actual `loadingIcon`-spinner (i.e. the `loading-icon.gif` image) will only be displayed on visible pages to improve performance. To avoid having to iterate through all pages in the document, which doesn't seem like a good idea for a PDF document with thousands of pages, we use a combination of the currently visible and cached pages to toggle the `loadingIcon`-spinner.	2021-11-15 14:06:14 +01:00
Jonas Jenwald	971ac8e993	Include the /Lang-property, when it exists, in the StructTree-data (issue 14261) Please note: This is a tentative patch, since I don't have the necessary a11y-software to actually test it.	2021-11-14 12:37:41 +01:00
Jonas Jenwald	08d56c67ae	Convert `GrabToPan` to a standard `class` This code is the last piece[1] of the viewer that's not using standard `class`es, and by converting this code we get rid of some now unneeded boilerplate code (slightly reducing the size of the built `web/viewer.js` file). Note that while this code was originally imported from a separate repository, it was last sync-ed with upstream five years ago which is why this re-factoring should be OK as far as I'm concerned (and we've done some other clean-up since then as well). --- [1] Technically the `web/debugger.js` file is left as well, however that code is first of all not bundled in the built `web/viewer.js` file and secondly it's not even loaded by default either.	2021-11-13 23:07:36 +01:00
Jonas Jenwald	ed6af0f844	[web/grab_to_pan.js] Inline the `isLeftMouseReleased` helper function Given the support information listed in the function itself, the [MDN compatibility data](https://developer.mozilla.org/en-US/docs/Web/API/MouseEvent/buttons#browser_compatibility), and the [currently supported browsers](`4bb9de4b00/gulpfile.js (L79-L87)`) in the PDF.js project we should be able to simplify the code by inlining the function instead.	2021-11-13 23:00:15 +01:00
Jonas Jenwald	7a428345db	Merge pull request #14271 from calixteman/params Parse query string in using URLSearchParams	2021-11-13 22:59:34 +01:00
Calixte Denizet	fe95e100e4	Parse query string in using URLSearchParams - I just noticed in reading the code that we parse that stuff when something exists in the web api; - see https://developer.mozilla.org/en-US/docs/Web/API/URLSearchParams/URLSearchParams.	2021-11-13 21:10:54 +01:00
Tim van der Meij	de7cfed9e3	Merge pull request #14260 from Snuffleupagus/telemetry-pageInfo-once Report "pageInfo" telemetry once, rather than for each rendered page	2021-11-13 20:22:58 +01:00
Calixte Denizet	7041c62ccf	Remove non-displayable chars from outline title (#14267 ) - it aims to fix #14267; - there is nothing about chars in range [0-1F] in the specs but acrobat doesn't display them in any way.	2021-11-13 16:56:08 +01:00
Calixte Denizet	33ea817b20	[api-minor] Render pushbuttons on their own canvas (bug 1737260) - First step to fix https://bugzilla.mozilla.org/show_bug.cgi?id=1737260; - several interactive pdfs use the possibility to hide/show buttons to show different icons; - render pushbuttons on their own canvas and then insert it the annotation_layer; - update test/driver.js in order to convert canvases for pushbuttons into images.	2021-11-12 15:37:33 +01:00
Jonas Jenwald	8eed0b9145	Report "pageInfo" telemetry once, rather than for each rendered page Reporting telemetry, in Firefox, includes using `JSON.stringify` on the data and then sending an event to the `PdfStreamConverter.jsm`-code. In that code the event is handled and `JSON.parse` is used to retrieve the data, and in the "pageInfo"-case we'll then proceed to ignore everything except the first such event; see https://searchfox.org/mozilla-central/rev/24fac1ad31fb9c6e9c4c767c6a7ff45d226078f3/toolkit/components/pdfjs/content/PdfStreamConverter.jsm#509-514 All-in-all, sending the "pageInfo" telemetry for each rendered page is thus unnecessary and this patch makes the viewer send it only once instead.	2021-11-11 12:36:06 +01:00
Brendan Dahl	4ee906adf4	Merge pull request #14209 from Snuffleupagus/issue-14205 [Google Chrome] Ensure that `markedContent` spans are placed in the top-left corner (issue 14205)	2021-11-09 07:59:14 -08:00
Tim van der Meij	891f21fba6	Merge pull request #14245 from Snuffleupagus/PDFPageViewBuffer-class Convert `PDFPageViewBuffer` to a standard class, and use a `Set` internally	2021-11-07 14:37:33 +01:00
Jonas Jenwald	13ef763222	[Google Chrome] Ensure that `markedContent` spans are placed in the top-left corner (issue 14205) This is a tentative patch, since we unfortunately cannot easily test it (as far as I can tell). In Firefox this (obviously) works as-is, but in Google Chrome the `markedContent` spans are inserted within the regular text-content (in the DOM) and with non-zero heights.	2021-11-07 11:01:35 +01:00
Jonas Jenwald	12d41bcba4	Prevent mobile devices from interfering with the textLayer-elements (issue 14243) This is a tentative patch, since I don't have the necessary hardware to test it. See https://developer.mozilla.org/en-US/docs/Web/CSS/text-size-adjust, which is currently ignored in Firefox. It seems overall safer, and more future-proof, to simply add this to the entire `textLayer` rather than its individual elements.	2021-11-06 11:39:43 +01:00
Jonas Jenwald	a774707e31	Remove the `moveToEndOfArray` helper function, since it's unused With the previous patch, this helper function is no longer used and keeping it around will simply increase the size of the builds. This removal is purposely done separately, to make it easy to revert the patch in the future if this helper function would become useful again.	2021-11-06 10:19:17 +01:00
Jonas Jenwald	f55bf42398	Convert `PDFPageViewBuffer` to use a `Set` internally This relies on the fact that `Set`s preserve the insertion order[1], which means that we can utilize an iterator to access the first stored view. Note that in the `resize`-method, we can now move the visible pages to the back of the buffer using a single loop (hence we don't need to use the `moveToEndOfArray` helper function any more). --- [1] This applies to `Map`s as well, although that's not entirely relevant here.	2021-11-06 10:19:17 +01:00
Jonas Jenwald	0eba15b43a	Convert `PDFPageViewBuffer` to a standard class This patch makes use of private `class` fields, to ensure that the previously "private" properties remain as such.	2021-11-06 10:19:17 +01:00
Jonas Jenwald	fe205efd8d	Add a couple of basic unit-tests for `PDFPageViewBuffer` The `PDFPageViewBuffer`-code is very important for the correct function of the viewer, but it's currently not tested at all. While the `PDFPageViewBuffer` is obviously intended to be used with `PDFPageView`-instances, it only accesses a couple of `PDFPageView` properties/methods and consequently it's fairly easy to unit-test this code with dummy-data. These unit-tests should help improve our confidence in this code, and will also come in handy with other changes that I'm working on (regarding modernizing and re-factoring the `PDFPageViewBuffer`-code).	2021-11-05 19:43:20 +01:00
Jonas Jenwald	e78e4e72bf	Further modernize `PDFThumbnailViewer.scrollThumbnailIntoView` The way that we're currently handling the last-`id` is very old, and there's no longer any good reason to special-case things when only one thumbnail is visible. Furthermore, we can also modernize the loop slightly by using `for...of` instead of `Array.prototype.some()` when checking for fully visible thumbnails.	2021-11-03 21:13:47 +01:00
Jonas Jenwald	6323f8532a	Let `getVisibleElements` return a Set containing the visible element `id`s Note how in `PDFPageViewBuffer.resize` we're manually iterating through the visible pages in order to build a Set of the visible page `id`s. By instead moving the building of this Set into the `getVisibleElements` helper function, as part of the existing parsing, this code becomes ever so slightly more efficient. Furthermore, more direct access to the visible page `id`s also come in handy in other parts of the viewer as well. In the `BaseViewer.isPageVisible` method we no longer need to loop through the visible pages, but can instead directly check if the pageNumber is visible. In the `PDFRenderingQueue.getHighestPriority` method, when checking for "holes" in the page layout, we can also avoid some unnecessary look-ups this way.	2021-11-03 21:13:44 +01:00
Tim van der Meij	2ac6c939a5	Merge pull request #14225 from Snuffleupagus/render-better-holes-check Avoid doing unnecessary checks, when pre-rendering page layouts with "holes" (PR 14131 follow-up)	2021-11-03 19:48:37 +01:00
Tim van der Meij	c68dc03be6	Merge pull request #14221 from Snuffleupagus/pr-12870-followup Use `BaseViewer.previousPage` more in the default viewer (PR 12870 follow-up)	2021-11-03 19:44:12 +01:00
Jonas Jenwald	ab5f4a3e5e	Avoid doing unnecessary checks, when pre-rendering page layouts with "holes" (PR 14131 follow-up) Sometimes I'll hopefully learn to optimize my code directly when writing it, rather than having to do multiple clean-up passes; sorry about the churn here! For most page layouts there won't be any "holes" in the visible pages (or thumbnails), and in those cases it'd obviously be preferable not having to repeat any checks of already rendered pages. Rather than only checking the "distance" between the first/last pages, we can instead compare the theoretical number of pages (between first/last) with the actually visible number of pages instead. This way, we're able to better detect the "holes"-case and can skip unnecessary parsing in the common case.	2021-11-03 09:40:39 +01:00
Jonas Jenwald	292a715c1c	Use optional chaining to simplify `PDFViewerApplication.store` accesses in various event handlers This way we no longer need the intermediate variables.	2021-11-02 12:00:57 +01:00
Jonas Jenwald	d6e8b8fbc1	Use `BaseViewer.previousPage` more in the default viewer (PR 12870 follow-up) I missed this one spot in PR 12870, when converting the other cases in the "keydown" event handler. However, given that it only matters in PresentationMode and/or when "page-fit" zooming is enabled, this oversight shouldn't have had any user-observable impact (but we should fix it nonetheless).	2021-11-02 11:48:18 +01:00
Jonas Jenwald	f4e88b0a57	[Firefox] Handle errors if loading failed before the "supportsRangedLoading" message was sent (bug 1732141) This is a follow-up to PR 10675, since there I completely overlooked that we also need to handle the case where a PDF document has failed to load when the "supportsRangedLoading" message is sent to the viewer.	2021-11-01 17:50:49 +01:00
Jonas Jenwald	8c70258065	Merge pull request #14182 from calixteman/richtext Support rich content in markup annotation	2021-10-31 14:41:56 +01:00
Calixte Denizet	cf8dc750d6	Support rich content in markup annotation - use the xfa parser but in the xhtml namespace.	2021-10-31 13:44:51 +01:00
Tim van der Meij	317e4dd146	Merge pull request #14204 from Snuffleupagus/rm-shadowViewer Remove the `shadowViewer` used with Page scrolling	2021-10-30 12:37:30 +02:00
calixteman	2c0bbaf208	Merge pull request #14153 from catherinemds/xfa-link Fix XFA links (bug 1735738)	2021-10-29 11:06:00 -07:00
Catherine	db0b3cda8b	XFA - Fix xfaLink class to make links work (bug 1735738) There were some links not working in some XFA files,I realized that the anchor tag that contains the link has an inline display and couldn't receive any height, solved this by adding a "position: absolute". Tested with two different files in Firefox Nightly and Chrome and now all links are working perfectly fine. Added reftest to avoid future regressions	2021-10-29 11:39:33 -04:00
Jonas Jenwald	c18df2c61f	Remove the `shadowViewer` used with Page scrolling The only reason for using a `DocumentFragment` in the first place, originally added in PR 8724, was to prevent errors in the `PDFPageView`-constructor. However, we should be able to simply make its `container`-option optional instead, since it's not being used for anything else in the class. Note that pre-rendering still works correctly in my testing, and given that the `BaseViewer` keeps references to all `PDFPageView`-instances (via its `_pages` Array) it also shouldn't be possible to "lose" any pages/canvases this way.	2021-10-28 13:48:15 +02:00
Tim van der Meij	6f3700b393	Merge pull request #14191 from Snuffleupagus/viewer-empty-pageLabels Ignore pageLabels, in the viewer, when they're all empty	2021-10-27 20:01:45 +02:00
Jonas Jenwald	66c26d70d4	Ignore pageLabels, in the viewer, when they're all empty Unfortunately there exist PDF documents where all pageLabels are empty strings, see e.g. http://www.cs.cornell.edu/~ragarwal/pubs/blk-switch.pdf (taken from an old issue), which result in the pageNumber-input being completely blank. That doesn't seem very helpful, and this patch simply extends the approach used to ignore pageLabels that are identical to standard page numbering.	2021-10-25 16:11:04 +02:00
Jonas Jenwald	2c779a8fbe	[Regression] Prevent breaking errors when opening a new document in the GENERIC viewer (PR 14158 follow-up) In the GENERIC viewer, e.g. when dragging-and-dropping a new PDF document which automatically opens the outline, there can now be breaking errors in the `{BaseViewer, PDFThumbnailViewer}.#getScrollAhead` methods since there's no visible pages/thumbs during loading; sorry about the breakage!	2021-10-25 14:27:24 +02:00
Jonas Jenwald	c85cd80b1b	Remove unnecessary pageLabel length-check in the viewer Given how the pageLabel array is defined, see `1ab9a6e36e/src/core/catalog.js (L627)`, it shouldn't be necessary to check the length in the viewer.	2021-10-25 13:25:34 +02:00

1 2 3 4 5 ...

3211 Commits