pdf.js

Author	SHA1	Message	Date
Jonas Jenwald	19c2dfbb96	Move rotation normalization from `PDFViewerApplication` and into `BaseViewer` The rotation handling that's currently living in `PDFViewerApplication` is very old, and pre-dates the introduction of the viewer components by years. As can be seen in the `BaseViewer.pagesRotation` setter, we're not actually normalizing the rotation as intended and instead rely on the caller to handle that correctly. This is first of all inconsistent, given how other setters are implemented, and secondly it could also lead to the rotation being set to a value outside of the `[0, 360)`-range. Finally, for improved consistency the rotation handling in `PageViewport` is updated similarly. Please note that this case, it's not changing the pre-existing logic.	2021-03-28 14:19:58 +02:00
calixteman	63471bcbbe	XFA - Convert some template properties into CSS ones (#13082 ) - implement few positioning properties: position, width, height, anchor; - implement font element; - implement fill element (used by font) and its children (linear, radial, ...); - font property is inherited from ancestor container (see https://www.pdfa.org/wp-content/uploads/2020/07/XFA-3_3.pdf#page=43) so let CSS handles that stuff; - in order to reduce the number of properties to set, only set non default properties and put the default in CSS; - set a background to some containers to be able to see them (will be removed in a future commit).	2021-03-25 13:02:39 +01:00
Tim van der Meij	8269ddbd16	Merge pull request #13105 from Snuffleupagus/BasePdfManager-parseDocBaseUrl Improve memory usage around the `BasePdfManager.docBaseUrl` parameter (PR 7689 follow-up)	2021-03-19 23:03:20 +01:00
Jonas Jenwald	57e7557235	Actually reset the `PDFPageProxy._xfaPromise` property as intended (PR 13069 follow-up) (#13119 ) Similar to the existing `annotationsPromise` and `_jsActionsPromise` properties, the new `_xfaPromise` should obviously also be reset, since otherwise you might end up holding onto a lot of data for pages that are no longer active. (That caching wasn't present in the original version of PR 13069, which is why I didn't spot it until now.)	2021-03-19 11:31:54 +01:00
calixteman	24e598a895	XFA - Add a layer to display XFA forms (#13069 ) - add an option to enable XFA rendering if any; - for now, let the canvas layer: it could be useful to implement XFAF forms (embedded pdf in xml stream for the background and xfa form for the foreground); - ui elements in template DOM are pretty close to their html counterpart so we generate a fake html DOM from template one: - it makes easier to translate template properties to html ones; - it makes faster the creation of the html element in the main thread.	2021-03-19 10:11:40 +01:00
Jonas Jenwald	c4c7216171	Improve memory usage around the `BasePdfManager.docBaseUrl` parameter (PR 7689 follow-up) While there is nothing outright wrong with the existing implementation, it can however lead to increased memory usage in one particular case (that I completely overlooked when implementing this): For "data:"-URLs, which by definition contains the entire PDF document and can thus be arbitrarily large, we obviously want to avoid sending, storing, and/or logging the "raw" docBaseUrl in that case. To address this, this patch makes the following changes: - Ignore any non-string in the `docBaseUrl` option passed to `getDocument`, since those are unsupported anyway, already on the main-thread. - Ignore "data:"-URLs in the `docBaseUrl` option passed to `getDocument`, to avoid having to send what could potentially be a very long string to the worker-thread. - Parse the `docBaseUrl` option directly in the `BasePdfManager`-constructors, on the worker-thread, to avoid having to store the "raw" docBaseUrl in the first place.	2021-03-17 15:48:24 +01:00
Jonas Jenwald	bd9dee1544	Move the `getPdfFilenameFromUrl` helper function from `web/ui_utils.js` and into `src/display/display_utils.js` It seems reasonable to place this alongside the similar `getFilenameFromUrl` helper function. This way, with the changes in the next patch, we also avoid having to expose the `isDataScheme` function in the API itself and we instead expose `getPdfFilenameFromUrl` in the API (which feels overall more appropriate).	2021-03-17 15:48:24 +01:00
Jonas Jenwald	50681d71c8	Ensure that `getDocument` handles Node.js `Buffer`s more gracefully (issue 13075) While the JSDocs have never advertised `getDocument` as supporting Node.js `Buffer`s, that apparently doesn't stop users from passing such data structures to `getDocument`. In theory the existing `instanceof Uint8Array` check ought to have caught Node.js `Buffer`s, however for reasons that I don't even pretend to understand that check actually passes. Hence this patch which, only in Node.js environments, will special-case `Buffer`s to hopefully provide a slightly better out-of-the-box behaviour in Node.js environments[1]. --- [1] Although I'm not sure that we necessarily want to advertise this in the JSDocs, given the specialized use-case.	2021-03-13 10:52:38 +01:00
Jonas Jenwald	b326432895	Simplify the data lookup in the `AnnotationStorage.getValue` method Rather than first checking if data exists before fetching it from storage, we can simply do the lookup directly and then check its value. Note that this follows the same pattern as utilized in the `AnnotationStorage.setValue` method.	2021-03-11 16:37:38 +01:00
Jonas Jenwald	a0e584eeb2	Replace the `objectFromEntries` helper function with an `objectFromMap` one instead Given that it's only used with `Map`s, and that it's currently implemented in such a way that we (indirectly) must iterate through the data twice, some simplification cannot hurt here. Note that the only reason that we're not using `Object.fromEntries(...)` directly, at each call-site, is that that one won't guarantee that a `null` prototype is being used.	2021-03-11 16:37:34 +01:00
Calixte Denizet	c01ef24541	JS - reset correctly radio buttons	2021-03-07 11:04:40 +01:00
Jonas Jenwald	6fd899dc44	[api-minor] Support the Content-Disposition filename in the Firefox PDF Viewer (bug 1694556, PR 9379 follow-up) As can be seen [in the mozilla-central code](https://searchfox.org/mozilla-central/rev/a6db3bd67367aa9ddd9505690cab09b47e65a762/toolkit/components/pdfjs/content/PdfStreamConverter.jsm#1222-1225), we're already getting the Content-Disposition filename. However, that data isn't passed through to the viewer nor to the `PDFDataTransportStream`-implementation, which explains why it's currently being ignored. Please note: This will also require a small mozilla-central patch, see https://bugzilla.mozilla.org/show_bug.cgi?id=1694556, to forward the necessary data to the viewer.	2021-02-26 10:50:29 +01:00
Jonas Jenwald	df931ef685	Move the opening of PDF file attachments into the `DownloadManager`-implementations Note how the `PDFAttachmentViewer` handles PDF file attachments specially, by opening them in a new window/tab, rather than forcing them to be downloaded. This is done to improve the overall UX, since browsers in general are able to handle PDF files internally. However, for file annotations we're currently not attempting to do the same thing and are instead just downloading them directly. In order to unify the behaviour, without having to duplicate a lot of code, the opening of PDF file attachments is thus moved into a new `DownloadManager.openOrDownloadData` method.	2021-02-23 13:44:23 +01:00
Tim van der Meij	f3aa4408a5	Merge pull request #13005 from calixteman/colors JS - Fix setting a color on an annotation	2021-02-21 14:50:03 +01:00
Calixte Denizet	4a5f1d1b7a	JS - Fix setting a color on an annotation - strokeColor corresponds to borderColor; - support fillColor and textColor; - support colors on the different annotations; - fix typo in aforms (+test).	2021-02-20 15:24:37 +01:00
Jonas Jenwald	d69cf702f3	Add a `this`-bound method for `InternalRenderTask.cancel` This is similar to the other methods, and the only reason for this not having been done originally is that the `cancel` functionality is a later addition.	2021-02-20 14:47:57 +01:00
Jonas Jenwald	e9038cc3d1	Send the `AnnotationStorage`-data to the worker-thread as a `Map` Rather than converting the `AnnotationStorage`-data to an Object, before sending it to the worker-thread, we should be able to simply send the internal `Map` directly. The "structured clone algorithm" doesn't have a problem with `Map`s, however the `LoopbackPort` used when workers are disabled (e.g. in Node.js environments) didn't use to support them. With PR 12997 having lifted that restriction, we should now be able to simply send the `AnnotationStorage`-data as-is rather than having to iterate through it to first create an Object. Please note: The changes in `src/core/annotation.js` could have been a lot more compact if we were able to use optional chaining in the `src/core` folder. Unfortunately that's still not possible, since SystemJS is being used in the development viewer (i.g. `gulp server`) and fixing that is still blocked by [bug 1247687](https://bugzilla.mozilla.org/show_bug.cgi?id=1247687).	2021-02-18 17:13:43 +01:00
Tim van der Meij	4619b1b568	Merge pull request #12997 from Snuffleupagus/metadata-worker Move the Metadata parsing to the worker-thread	2021-02-17 20:57:46 +01:00
Tim van der Meij	77862bdb8e	Merge pull request #12999 from Snuffleupagus/LoopbackPort-rm-sync [api-minor] Remove support for synchronous event dispatching in `LoopbackPort`	2021-02-17 20:39:54 +01:00
Jonas Jenwald	3398070e26	[api-minor] Remove support for synchronous event dispatching in `LoopbackPort` Please note: The `defer` parameter has been enabled by default ever since PR 9777 (in 2018), which first shipped in PDF.js release `2.0.943`. With workers disabled, e.g. in Node.js environments, this has been used ever since without any problems reported[1]. The impetus for this change was that I happened to notice that if the `LoopbackPort` was used with synchronous event dispatching, we'd simply send that data as-is to the listeners. This created an inconsistency in the data returned from the `pdf.worker.js` file, since `postMessage` used with actual workers (or the `LoopbackPort` with `defer = true`) will ignore/throw when encountering unclonable data. Originally my intention was simply to just call `cloneValue` regardless of the event dispatching used in `LoopbackPort`, however looking at the use-cases (or lack thereof) of the `LoopbackPort` it seemed reasonable to simply remove the `defer` parameter instead. This patch is tagged "[api-minor]" since the `LoopbackPort` is still exposed in the API, although I really hope that no third-party is using this (since disabling workers leads to bad performance). Finally, this patch changes a `forEach` loop to `for...of` and makes uses of optional changing in existing code. --- [1] As evident by the `npm test` command run by Github Actions, and previously by Travis.	2021-02-17 16:12:29 +01:00
Jonas Jenwald	cc3a6563ee	Move the Metadata parsing to the worker-thread The only reason, as far as I can tell, for parsing the Metadata on the main-thread is how it was originally implemented. When Metadata support was first implemented, it utilized the [`DOMParser`](https://developer.mozilla.org/en-US/docs/Web/API/DOMParser) which isn't available in workers. Today, with the custom XML-parser being used, that's no longer an issue and it seems reasonable to move the Metadata parsing to the worker-thread[1], since that's where all parsing should happen (for performance reasons). Based on these changes, we'll be able to reduce the now unnecessary duplication of the XML-parser (and related code) in both of the built `pdf.js`/`pdf.worker.js` files. Finally, this patch changes the `_repair` method to use "Array + join" rather than string concatenation. --- [1] This needed the previous patch, to enable sending of `Map`s between threads with workers disabled.	2021-02-17 13:12:01 +01:00
Jonas Jenwald	73bf45e64b	Support `Map` and `Set`, with `postMessage`, when workers are disabled The `LoopbackPort` currently doesn't support `Map` and `Set`, which it should since the "structured clone algorithm" used in browsers does support both of them; please see https://developer.mozilla.org/en-US/docs/Web/API/Web_Workers_API/Structured_clone_algorithm#supported_types	2021-02-17 13:11:59 +01:00
Jonas Jenwald	0a28e51e40	Simplify the default value handling of `renderInteractiveForms` in the viewer components I happened to look at this code, and I can't for the life of me figure out why I didn't just implement it like this patch in the first place (since the current format feels overly verbose).	2021-02-17 10:47:55 +01:00
Jonas Jenwald	b26c7974fe	[api-minor] Change the `dc:subject` Metadata field to an Array This patch simply extends the existing handling of the `dc:creator` field, which should hopefully suffice here; please refer to https://wwwimages2.adobe.com/content/dam/acom/en/devnet/xmp/pdfs/XMP%20SDK%20Release%20cc-2016-08/XMPSpecificationPart1.pdf#page=34	2021-02-14 17:16:40 +01:00
Tim van der Meij	c79fd71457	Merge pull request #12896 from calixteman/text_layer Modifiy the way to compute baseline to have a better match between canvas and text layer	2021-02-13 15:12:58 +01:00
Calixte Denizet	ea06bb0e36	[api-minor] Annotation -- Don't compute appearance when nothing has changed * don't set a value in annotationStorage by default: - having an undefined when the annotation is rendered for saving/printing means nothing has changed so use normal appearance - aims to fix https://bugzilla.mozilla.org/show_bug.cgi?id=1681687 * change the way to compute font size when this one is null in DA: - make fontSize proportional to line height - in multiline case, take into account the number of lines for text entered to adapt the font size	2021-02-12 19:27:21 +01:00
Calixte Denizet	b4421b076a	Modifiy the way to compute baseline to have a better match between canvas and text layer - use ascent of the fallback font instead of the one from pdf to position spans - use TextMetrics.fontBoundingBoxAscent if available or - use a basic heuristic to guess ascent in drawing char on a canvas - compute ascent as a ratio of font height	2021-02-12 11:28:02 +01:00
dhufnagel	fc925827b2	fix initial state of checkboxes in display layer (#12904 ) consider the export value when multiple checkboxes have the same name	2021-02-12 11:22:54 +01:00
Jonas Jenwald	31098c404d	Use `Math.hypot`, instead of `Math.sqrt` with manual squaring (#12973 ) When the PDF.js project started `Math.hypot` didn't exist yet, and until recently we still supported browsers (IE 11) without a native `Math.hypot` implementation; please see this compatibility information: https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Global_Objects/Math/hypot#browser_compatibility Furthermore, somewhat recently there were performance improvements of `Math.hypot` in Firefox; see https://bugzilla.mozilla.org/show_bug.cgi?id=1648820 Finally, this patch also replaces a couple of multiplications with the exponentiation operator.	2021-02-10 12:28:49 +01:00
Tim Nguyen	2ca886baee	Use DOM hidden property instead of attribute methods	2021-02-08 00:21:49 +01:00
Jonas Jenwald	fec8c4c43f	Access `this._onUnsupportedFeature` directly in `FontFaceObject.getPathGenerator` Given that `FontFaceObject` is not exposed in the public API, but only accessed internally, there's no need to assume that a `FontFaceObject`-instance is ever initialized without `onUnsupportedFeature` being provided. This is also consistent with the `BaseFontLoader` implementation.	2021-01-29 16:48:55 +01:00
Tim van der Meij	286271152f	Merge pull request #12910 from calixteman/bidi Add back dir property in spans in text layer	2021-01-27 22:09:00 +01:00
Calixte Denizet	539256c351	Add back dir property in spans in text layer - aims to fix #12909	2021-01-26 12:00:05 +01:00
calixteman	a3f6882b06	JS -- add support for choice widget (#12826 )	2021-01-25 23:40:57 +01:00
Calixte Denizet	34d2e72df2	JS - Fix mouse event names - fix issue #12895	2021-01-23 20:26:22 +01:00
Dominik Hufnagel	c5083cda02	set font size and color on annotation layer use the default appearance to set the font size and color of a text annotation widget	2021-01-22 23:12:14 +01:00
Brendan Dahl	2cba290361	Merge pull request #12836 from calixteman/update_buttons JS -- update radio/checkbox values even if there are no actions	2021-01-21 14:00:26 -08:00
Brendan Dahl	4142001fc2	Merge pull request #12869 from calixteman/lw Fix zoom issue with too thin lines	2021-01-21 08:31:59 -08:00
Jonas Jenwald	298ee5cfbb	Replace some ternary operators with optional chaining, and nullish coalescing, in the `src/display/`-folder This way, we can further reduce unnecessary code-repetition in some cases.	2021-01-19 17:20:02 +01:00
Calixte Denizet	9754216c60	Fix zoom issue with too thin lines - aims to fix issue #12868: apply zoom factor to linewidth after setting it to 1. - only apply 1px-width when required - the sign of getSinglePixelWidth is used to know if 1px-width is required	2021-01-16 15:52:27 +01:00
Calixte Denizet	0d1b19632d	Enforce linewidth to 1px when at least one of scale factor is lower than 1	2021-01-15 13:18:24 +01:00
Jonas Jenwald	cf7eb87934	Remove a duplicated reference test (PR 12812 follow-up) - Remove a duplicated reference test, see "issue12810", from the manifest. - Improve the spelling in a couple of comments in `src/core/canvas.js`, most notable of the word "parallelogram". - Update a comment, also in `src/core/canvas.js`, to actually agree with the value used to reduce confusion when reading the code.	2021-01-15 10:57:15 +01:00
Brendan Dahl	6619f1f3f2	Merge pull request #12812 from calixteman/too_thin Enforce line width to be at least 1px after applied transform	2021-01-14 15:21:44 -08:00
Jonas Jenwald	13742eb82d	Inlude the JS `actions` for the page when dispatching the "pageopen"-event in the `BaseViewer` Note first of all how the `PDFDocumentProxy.getJSActions` method in the API caches the result, which makes repeated lookups cheap enough to not really be an issue. Secondly, with the previous patch, we're now only dispatching "pageopen"/"pageclose"-events when there's actually a sandbox that listens for them. All-in-all, with these changes we can thus simplify the default-viewer "pageopen"-event handler a fair bit.	2021-01-12 20:28:50 +01:00
calixteman	1de1ae0be6	Merge pull request #12838 from calixteman/authors [api-minor] Change the "dc:creator" Metadata field to an Array	2021-01-12 02:44:58 -08:00
Calixte Denizet	43d5512f5c	[api-minor] Change the "dc:creator" Metadata field to an Array - add scripting support for doc.info.authors - doc.info.metadata is the raw string with xml code	2021-01-11 21:34:07 +01:00
Calixte Denizet	b3dccd66ab	Enforce line width to be at least 1px after applied transform * add a comment to explain how minimal linewidth is computed. * when context.linewidth < 1 after transform, firefox and chrome don't render in the same way (issue #12810). * set lineWidth to 1 after transform and before stroking - aims fix issue #12295 - a pixel can be transformed into a rectangle with both heights < 1. A single rescale leads to a rectangle with dim equals to 1 and the other to something greater than 1. * change the way to render rectangle with null dimensions: - right now we rely on the lineWidth set before "re" but it can be set after "re" and before "S" and in this case the rendering will be wrong. - render such rectangles as a single line.	2021-01-10 18:02:12 +01:00
Jonas Jenwald	81525fd446	Use ESLint to ensure that `export`s are sorted alphabetically There's built-in ESLint rule, see `sort-imports`, to ensure that all `import`-statements are sorted alphabetically, since that often helps with readability. Unfortunately there's no corresponding rule to sort `export`-statements alphabetically, however there's an ESLint plugin which does this; please see https://www.npmjs.com/package/eslint-plugin-sort-exports The only downside here is that it's not automatically fixable, but the re-ordering is a one-time "cost" and the plugin will help maintain a consistent ordering of `export`-statements in the future. Note: To reduce the possibility of introducing any errors here, the re-ordering was done by simply selecting the relevant lines and then using the built-in sort-functionality of my editor.	2021-01-09 20:37:51 +01:00
Jonas Jenwald	941b65f683	Remove unncessary `CanvasFactory`/`CMapReaderFactory`/`FileReaderFactory` duplication in unit-tests Given that the API will now, after PR 12039, automatically pick the correct factories to use depending on the environment (browser vs. Node.js), we can utilize that in the unit-tests as well. This way we don't have to manually repeat the same initialization code in multiple unit-tests. Note: The official PDF.js API is defined in `src/pdf.js`, hence the new exports in `src/display/api.js` will not affect that. Also, updates the unit-test `FileReaderFactory` helpers similarily. Drive-by change: Fix the `CMapReaderFactory` usage in the annotation unit-tests, since the cache should only contain raw data and not a Promise. While this obviously works as-is, having unit-tests that "abuse" the intended data format can easily lead to unnecessary failures if changes are made to the relevant `src/core/` code.	2021-01-08 17:33:59 +01:00
Calixte Denizet	7172f0a928	JS -- update radio/checkbox values even if there are no actions	2021-01-08 16:43:16 +01:00

1 2 3 4 5 ...

1151 Commits