pdf.js

Author	SHA1	Message	Date
Jonas Jenwald	0024165f1f	Move `binarySearchFirstItem` back to the `web/`-folder (PR 15237 follow-up) This was moved into the `src/display/`-folder in PR 15110, for the initial editor-a11y patch. However, with the changes in PR 15237 we're again only using `binarySearchFirstItem` in the `web/`-folder and it thus seem reasonable to move it back there. The primary reason for moving it back is that `binarySearchFirstItem` is currently exposed in the public API, and we always want to avoid that unless it's either PDF-related functionality or code that simply must be shared between the `src/`- and `web/`-folders. In this case, `binarySearchFirstItem` is a general helper function that doesn't really satisfy either of those alternatives.	2022-08-14 11:38:17 +02:00
Jonas Jenwald	dd95e4f851	Add official support for passing `ArrayBuffer`-data to `getDocument` (issue 15269) While this has always worked, as a consequence of the implementation, it's never been officially supported. In addition to adding basic unit-tests, this patch also introduces a couple of new JSDoc `@typedef`s in the API to avoid overly long lines.	2022-08-10 14:13:01 +02:00
Jonas Jenwald	f6db7975c5	Enable the ESLint `prefer-spread` rule Note that in a couple of spots the argument could be `undefined` and there we simply disable the rule instead. Please refer to https://eslint.org/docs/latest/rules/prefer-spread	2022-08-06 10:17:00 +02:00
calixteman	b985eaa98c	Merge pull request #15267 from calixteman/freetext_a11y [Annotation] Add a div containing the text of a FreeText annotation (bug 1780375)	2022-08-04 11:49:29 +02:00
Calixte Denizet	31155740c3	[Annotation] Add a div containing the text of a FreeText annotation (bug 1780375) An annotation doesn't have to be in the text flow, hence it's likely a bad idea to insert its text in the text layer. But the text must be visible from a screen reader point of view so it must somewhere in the DOM. So with this patch, the text from a FreeText annotation is extracted and added in a div in its HTML counterpart, and with the patch #15237 the text should be visible and positioned relatively to the text flow.	2022-08-04 11:14:05 +02:00
Calixte Denizet	6916fabd51	Skip unknown fields when calculating a value in using AFSimple_Calculate	2022-08-03 23:40:09 +02:00
Jonas Jenwald	0c31320c12	[api-minor] Improve `thumbnail` handling in documents that contain interactive forms To improve performance of the sidebar we use the page-canvases to generate the thumbnails whenever possible, since that avoids unnecessary re-rendering when the sidebar is open. This works generally well, however there's an old problem in PDF documents that contain interactive forms (when those are enabled): Note how the thumbnails become partially (or fully) blank, since those Annotations are not included in the OperatorList.[1] We obviously want to keep using the `PDFThumbnailView.setImage`-method for most documents, however we need a way to skip it only for those pages that contain interactive forms. As it turns out it's unfortunately not all that simple to tell, after the fact, from looking only at the OperatorList that some Annotations were skipped. While it might have been possible to try and infer that in the viewer, it'd not have been pretty considering that at the time when rendering finishes the annotationLayer has not yet been built. The overall simplest solution that I could come up with, was instead to include a summary of the interactive form-state when doing the final "flushing" of the OperatorList and expose that information in the API. --- [1] Some examples from our test-suite: `annotation-tx2.pdf` where the thumbnail is completely blank, and `bug1737260.pdf` where the thumbnail is missing the "buttons" found on the page.	2022-07-30 16:53:32 +02:00
Jonas Jenwald	2fb083f3e2	Ensure that the `isUsingOwnCanvas`-parameter is consistently included in operatorLists (PR 14247 follow-up) Currently some `OPS.beginAnnotation` arguments will contain a `Number` value for the `isUsingOwnCanvas`-parameter, or in some cases an `undefined` value, which is inconsistent from an API perspective.	2022-07-28 13:37:37 +02:00
Calixte Denizet	7831a100b3	[Editor] Add the possibility to change line opacity in Ink editor	2022-07-27 18:46:25 +02:00
Calixte Denizet	af41a5cb49	[Editor] Simplify the command manager The previous version was maybe functional but definitely painful to maintain (maybe more efficient... I don't know) so this patch aims to simplify it and it adds some basic unit tests.	2022-07-21 18:44:41 +02:00
Jonas Jenwald	f46895d750	Merge pull request #15110 from calixteman/editing_a11y [Editor] Improve a11y for newly added element (#15109)	2022-07-19 20:02:53 +02:00
Calixte Denizet	624b26e1de	[Editor] Improve a11y for newly added element (#15109 ) - In the annotationEditorLayer, reorder the editors in the DOM according the position of the elements on the screen; - add an aria-owns attribute on the "nearest" element in the text layer which points to the added editor.	2022-07-19 18:52:17 +02:00
Jonas Jenwald	37ebc28756	Use more `for...of` loops in the code-base Note that these cases, which are all in older code, were found using the [`unicorn/no-for-loop`](https://github.com/sindresorhus/eslint-plugin-unicorn/blob/main/docs/rules/no-for-loop.md) ESLint plugin rule. However, note that I've opted not to enable this rule by default since there's still some cases where I do think that it makes sense to allow "regular" for-loops.	2022-07-17 16:18:54 +02:00
Jonas Jenwald	c2f7942aea	Ensure that the /Resources-entry is actually a dictionary (issue 15150) Prevent issues in corrupt PDF documents, if the /Resources-entry is not of the correct and expected type.	2022-07-08 12:43:43 +02:00
Jonas Jenwald	345bb18575	[editor] Use the `fit-curve` package (issue 15004) Rather than including all of this external code in the PDF.js repository, we should be using the npm package instead. Unfortunately this is slightly more complicated than you'd hope, since the `fit-curve` package (which is older) isn't directly compatible with modern JavaScript modules. In particular, the following cases needed to be considered: - For the development viewer (i.e. `gulp server`) and the unit-tests, we thus need to build a fitCurve-bundle that can be directly `import`ed. - For the actual PDF.js build-targets, we can slightly reduce the sizes by depending on the "raw" `fit-curve` source-code. - For the Node.js unit-tests, the `fit-curve` package can be used as-is.	2022-07-07 10:43:43 +02:00
Calixte Denizet	1a3ef2a0aa	[editor] Add some UI elements in order to set font size & color, and ink thickness & color	2022-06-28 12:05:04 +02:00
Calixte Denizet	3789dab307	Always flush the current item with MarkedContent stuff when getting text (#15094 )	2022-06-25 17:19:57 +02:00
calixteman	23fcdabb37	Merge pull request #15088 from calixteman/editor_rotation Support rotating editor layer	2022-06-25 16:18:07 +02:00
Calixte Denizet	0c420f5135	Support rotating editor layer - As in the annotation layer, use percent instead of pixels as unit; - handle the rotation of the editor layer in allowing editing when rotation angle is not zero; - the different editors are rotated counterclockwise in order to be usable when the main page is itself rotated; - add support for saving/printing rotated editors.	2022-06-24 20:02:32 +02:00
Calixte Denizet	6e46226cd7	Fix unit test (#15093 follow-up)	2022-06-24 18:55:35 +02:00
Jonas Jenwald	1cc7cecc7b	[api-minor] Introduce a `PrintAnnotationStorage` with frozen serializable data Given that printing is triggered synchronously in browsers, it's thus possible for scripting (in PDF documents) to modify the Annotation-data while printing is currently ongoing. To work-around that we add a new printing-specific `AnnotationStorage`, where the serializable data is frozen upon initialization, which the viewer can thus create/utilize during printing.	2022-06-23 17:06:46 +02:00
Calixte Denizet	30c63eb0ec	[Editor] Add support for printing newly added FreeText annotations	2022-06-22 13:26:09 +02:00
Calixte Denizet	f27c8c4471	[Editor] Add support for printing newly added Ink annotations	2022-06-21 18:21:49 +02:00
Calixte Denizet	cdc58b7a52	Rotate annotations based on the MK::R value (bug 1675139) - it aims to fix: https://bugzilla.mozilla.org/show_bug.cgi?id=1675139; - An annotation can be rotated (counterclockwise); - the rotation can be set in using JS.	2022-06-21 17:57:26 +02:00
Jonas Jenwald	8129815538	Enable the `unicorn/prefer-dom-node-append` ESLint plugin rule This rule will help enforce slightly shorter code, especially since you can insert multiple elements at once, and according to MDN `Element.append()` is available in all browsers that we currently support. Please find additional information here: - https://developer.mozilla.org/en-US/docs/Web/API/Element/append - https://github.com/sindresorhus/eslint-plugin-unicorn/blob/main/docs/rules/prefer-dom-node-append.md	2022-06-12 13:07:03 +02:00
Jonas Jenwald	bbf857d635	[api-minor] Stop using the `beginAnnotations`/`endAnnotations` operators (PR 14998 follow-up) After the changes in PR 14998, these operators are now no-ops in the `src/display/canvas.js` code and should no longer be necessary. Given that `beginAnnotations`/`endAnnotations` are not in the PDF specification, but are rather custom PDF.js operators, it seems reasonable to stop using them now that they've become no-ops.	2022-06-11 14:21:26 +02:00
Tim van der Meij	a57a4bc6c2	Merge pull request #15018 from Snuffleupagus/issue-15016 Expose `TextLayerRenderTask` in the TypeScript definitions (issue 15016, PR 14013 follow-up)	2022-06-10 22:18:35 +02:00
Jonas Jenwald	e046b811b7	Expose `TextLayerRenderTask` in the TypeScript definitions (issue 15016, PR 14013 follow-up) While `TextLayerRenderTask` apparently makes sense in TypeScript environments, given that it's being returned by the `renderTextLayer`-function in the API, we really don't want to extend the public API by simply exporting the class directly in `src/pdf.js` since it should never be called/initialized manually. Hence we follow the same pattern as in PR 14013, and add some very basic unit-tests to ensure that `renderTextLayer` always returns a `TextLayerRenderTask`-instance as expected.	2022-06-10 22:12:32 +02:00
Jonas Jenwald	9ac4536693	Enable the `unicorn/prefer-at` ESLint plugin rule (PR 15008 follow-up) Please find additional information here: - https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Global_Objects/Array/at - https://github.com/sindresorhus/eslint-plugin-unicorn/blob/main/docs/rules/prefer-at.md	2022-06-09 21:21:19 +02:00
Calixte Denizet	36aae436bf	[editor] Add support for saving newly added Ink	2022-06-08 22:16:01 +02:00
Calixte Denizet	7773b3f5be	[edition] Add support for saving a newly added FreeText	2022-06-08 14:34:09 +02:00
Calixte Denizet	c7afce4210	Support Hangul syllables when searching some text (bug 1771477) - it aims to fix https://bugzilla.mozilla.org/show_bug.cgi?id=1771477; - hangul contains some syllables which are decomposed when using NFD, hence the text must be correctly shifted in case it contains some of them.	2022-05-28 16:50:03 +02:00
Calixte Denizet	60498c67e4	Display background when printing or saving a text widget (issue #14928 )	2022-05-19 16:41:54 +02:00
Jonas Jenwald	6bcc5b615d	[api-minor] Include line endings in Line/Polyline Annotation-data (issue 14896) Please refer to: - https://web.archive.org/web/20220309040754if_/https://www.adobe.com/content/dam/acom/en/devnet/pdf/pdfs/PDF32000_2008.pdf#G11.2109792 - https://web.archive.org/web/20220309040754if_/https://www.adobe.com/content/dam/acom/en/devnet/pdf/pdfs/PDF32000_2008.pdf#G11.2096489 - https://web.archive.org/web/20220309040754if_/https://www.adobe.com/content/dam/acom/en/devnet/pdf/pdfs/PDF32000_2008.pdf#G11.2096447 Note that we still won't attempt to use the /LE-data when creating fallback appearance streams, as mentioned in PR 13448, since custom line endings aren't common enough to warrant the added complexity. Finally, note that according to the PDF specification we should potentially also take the line endings into account for FreeText Annotations. However, in that case their use is conditional on other parameters that we currently don't support.	2022-05-12 11:08:30 +02:00
Jonas Jenwald	8267fd8a52	Replace the `AnnotationStorage.lastModified`-getter with a proper hash-method The current `lastModified`-getter, which only contains a time-stamp, is a fairly crude way of detecting if the stored data has actually been changed. In particular, when the `getRawValue`-method is used, the `lastModified`-getter doesn't cope with data being modified from the "outside". To fix these issues[1], and to prevent any future bugs in this code, this patch introduces a new `AnnotationStorage.hash`-getter which computes a hash of the currently stored data. To simplify things this re-uses the existing `MurmurHash3_64`-implementation, which required moving that file into the `src/shared/`-folder, since its performance should be good enough here. --- [1] Given how the `AnnotationStorage.lastModified`-getter was used, this would have been limited to printing of forms.	2022-05-04 15:21:30 +02:00
Jonas Jenwald	8135d7ccf6	Merge pull request #14869 from calixteman/14862 [JS] Fix few bugs present in the pdf for issue #14862	2022-05-03 18:31:31 +02:00
Calixte Denizet	094ff38da0	[JS] Fix few bugs present in the pdf for issue #14862 - since resetForm function reset a field value a calculateNow is consequently triggered. But the calculate callback can itself call resetForm, hence an infinite recursive loop. So basically, prevent calculeNow to be triggered by itself. - in Firefox, the letters entered in some fields were duplicated: "AaBb" instead of "AB". It was mainly because beforeInput was triggering a Keystroke which was itself triggering an input value update and then the input event was triggered. So in order to avoid that, beforeInput calls preventDefault and then it's up to the JS to handle the event. - fields have a property valueAsString which returns the value as a string. In the implementation it was wrongly used to store the formatted value of a field (2€ when the user entered 2). So this patch implements correctly valueAsString. - non-rendered fields can be updated in using JS but when they're, they must take some properties in the annotationStorage. It was implemented for field values, but it wasn't for display, colors, ... - it fixes #14862 and #14705.	2022-05-03 15:48:44 +02:00
Jonas Jenwald	df5a4fd0a7	Support encoded dest-strings in /GoTo destination dictionaries (issue 14864) Interestingly enough this appears to be the very first case of encoded dest-strings, in /GoTo destination dictionaries, that we've actually come across. What's really fascinating is that it's less than a week after issue 14847, given that these issues are somewhat similar.	2022-05-02 10:14:32 +02:00
Jonas Jenwald	fbf6dee8ee	[api-minor] Remove the `forceClamped`-functionality in the Streams (issue 14849) As it turns out, most of the code-paths in the `PDFImage`-class won't actually pass the TypedArray (containing the image-data) to the `ColorSpace`-code. Hence we generally don't need to force the image-data to be a `Uint8ClampedArray`, and can just as well directly use a `Uint8Array` instead. In the following cases we're returning the data without any `ColorSpace`-parsing, and the exact TypedArray used shouldn't matter: - `b72a448327/src/core/image.js (L714)` - `b72a448327/src/core/image.js (L751)` In the following cases the image-data is only used internally, and again the exact TypedArray used shouldn't matter: - `b72a448327/src/core/image.js (L762)` with the actual image-data being defined (as `Uint8ClampedArray`) further below - `b72a448327/src/core/image.js (L837)` Please note: This is tagged `api-minor` because it's API-observable, given that some image/mask-data will now be returned as `Uint8Array` rather than using `Uint8ClampedArray` unconditionally. However, that seems like a small price to pay to (slightly) reduce memory usage during image-conversion.	2022-04-29 14:46:30 +02:00
Jonas Jenwald	71370d012b	Support destinations in NameTrees with encoded keys (issue 14847) Initially I considered updating the `NameOrNumberTree`-implementation to handle encoded keys, however that quickly became somewhat messy (especially in the `NameOrNumberTree.get`-method) since only NameTrees using string-keys. Hence the easiest solution, as far as I'm concerned, was thus to just update the `Catalog.destinations`-getter instead. Please note that in the referenced PDF document the `Catalog.destination`-method will thus fallback to fetch all destinations, which should be fine since this is the very first case of encoded keys that we've seen. Also changes the `NameOrNumberTree.getAll`-method to prevent a possible run-time error, although we've so far not seen such a case, for any non-Array Kids-entries found in a NameTree/NumberTree. Finally, to improve overall consistency and to hopefully prevent future bugs, the patch also updates a couple of other `NameTree` call-sites to correctly handle encoded keys. (Note that the `Catalog.attachments`-getter was already doing this.)	2022-04-27 11:19:55 +02:00
Calixte Denizet	040fcae5ab	Improve performance with image masks (bug 857031) - it aims to partially fix performance issue reported: https://bugzilla.mozilla.org/show_bug.cgi?id=857031; - the idea is too avoid to use byte arrays but use ImageBitmap which are a way faster to draw: * an ImageBitmap is Transferable which means that it can be built in the worker instead of in the main thread: - this is achieved in using an OffscreenCanvas when it's available, there is a bug to enable them for pdf.js: https://bugzilla.mozilla.org/show_bug.cgi?id=1763330; - or in using createImageBitmap: in Firefox a task is sent to the main thread to build the bitmap so it's slightly slower than using an OffscreenCanvas. * it's transfered from the worker to the main thread by "reference"; * the byte buffers used to create the image data have a very short lifetime and ergo the memory used is globally less than before. - Use the localImageCache for the mask; - Fix the pdf issue4436r.pdf: it was expected to have a binary stream for the image; - Move the singlePixel trick from operator_list to image: this way we can use this trick even if it isn't in a set as defined in operator_list.	2022-04-09 18:26:26 +02:00
Jonas Jenwald	addb4cb12b	Use `String.prototype.repeat()` in a couple of spots Rather than using a temporary Array to manually create repeated strings, we can use `String.prototype.repeat()` instead. The reason that we didn't use this from the start is most likely because some browsers, notably IE, didn't support this; note https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Global_Objects/String/repeat#browser_compatibility	2022-03-30 15:42:40 +02:00
Calixte Denizet	ad3fb71a02	[Annotations] Add support for printing/saving choice list with multiple selections - it aims to fix issue #12189.	2022-03-29 18:59:44 +02:00
Calixte Denizet	18e79e3c0b	[text selection] Add the whitespaces present in the pdf in the text chunk - it aims to fix issue #14627; - the basic idea of the recent text refactoring was to only consider the rendered visible whitespaces. But sometimes, the heuristics aren't correct and although some whitespaces are in the text stream they weren't in the text chunks because they were too small. Hence we added some exceptions, for example, we always add a whitespace when it is between two non-whitespace chars but only when in the same Tj. So basically, this patch removes the constraint to have the chars in the same Tj (in using a circular buffer to save the two last chars) but don't add a space when the visible space is really too small (hence `NOT_A_SPACE_FACTOR`).	2022-03-27 14:34:56 +02:00
Jonas Jenwald	849de5a508	Slightly improve validation of (some) parameters in `getDocument` There's a couple of `getDocument` parameters that should be numbers, but which are currently not fully validated to prevent issues elsewhere in the code-base. Also, improves validation of the `ownerDocument` parameter since we currently accept more-or-less anything here.	2022-03-21 13:32:17 +01:00
Calixte Denizet	f0b549c2a2	[JS] - Parse a date in using the given format first and then try the default date parser - it aims to fix #14672.	2022-03-19 16:07:43 +01:00
Jonas Jenwald	c0736647f9	Add general iteration support in the `RefSet` and `RefSetCache` classes This patch removes the existing `forEach` methods, in favor of making the classes properly iterable instead. Given that the classes are using a `Set` respectively a `Map` internally, implementing this is very easy/efficient and allows us to simplify some existing code.	2022-03-18 14:27:34 +01:00
Jonas Jenwald	fb345ee184	Enable the "gets fieldObjects" unit-test in Node.js (PR 14409 follow-up) Apparently this unit-test works in Node.js now, hence it's possible that the reason it didn't work previously is that there were bugs in our old `structuredClone` polyfill.	2022-03-13 10:40:57 +01:00
Tim van der Meij	bcf453cf14	Merge pull request #14656 from Snuffleupagus/mv-isSameOrigin Move the `isSameOrigin` helper function	2022-03-11 21:08:49 +01:00
Jonas Jenwald	537ed37835	Move the `isSameOrigin` helper function This function is currently placed in the `src/shared/util.js` file, which means that the code is duplicated in both of the built `pdf.js` and `pdf.worker.js` files. Furthermore, it only has a single call-site which is also specific to the `GENERIC`-build of the PDF.js library. Hence this helper function is instead moved into the `src/display/api.js` file, in such a way that it's conditionally defined but still can be unit-tested.	2022-03-10 13:51:09 +01:00

1 2 3 4 5 ...

1044 Commits