Sakurai/pdf.js - pdf.js - Gitea on kemo

Author	SHA1	Message	Date
Jonas Jenwald	317abd6d07	Change the `createPromiseCapability` helper function into a `PromiseCapability` class This is not only slightly more compact, but it also simplifies the handling of the `settled` getter.	2023-04-29 13:43:24 +02:00
Jonas Jenwald	f9c2a8d437	Introduce some optional chaining in the `src/shared/` folder	2023-04-29 13:43:24 +02:00
Tim van der Meij	c9359957e6	Merge pull request #16305 from Snuffleupagus/PDFJSDev-skip-PRODUCTION Remove the `PRODUCTION` build-target	2023-04-22 14:53:30 +02:00
Calixte Denizet	117bbf7cd9	[api-minor] Don't normalize the text used in the text layer. Some Arabic chars like \ufe94 could be searched in a pdf, hence it must be normalized when creating the search query. So to avoid duplicating the normalization code, everything is moved into the find controller. The previous code to normalize text was using NFKC but with a hardcoded map, hence it has been replaced by the use of normalize("NFKC") (it helps to reduce the bundle size by 30kb). In playing with this \ufe94 char, I noticed that the bidi algorithm wasn't taking into account some RTL unicode ranges, the generated font wasn't embedding the mapping for this char and the unicode ranges in the OS/2 table weren't up-to-date. When normalized, some chars can be replaced by several ones, which led to some extra chars in the text layer. To avoid any regression, when copying some text from the text layer, the copied string is normalized (NFKC) before being put in the clipboard (it works like this in either Acrobat or Chrome).	2023-04-17 14:31:23 +02:00
Jonas Jenwald	804aa896a7	Stop using the `PRODUCTION` build-target in the JavaScript code This special build-target is very old, and was introduced with the first pre-processor that only uses comments to enable/disable code. When the new pre-processor was added `PRODUCTION` effectively became redundant, at least in JavaScript code, since `typeof PDFJSDev === "undefined"` checks now do the same thing. This patch proposes that we remove `PRODUCTION` from the JavaScript code, since that simplifies the conditions and thus improves readability in many cases. Please note: There's not, nor has there ever been, any gulp-task that set `PRODUCTION = false` during building.	2023-04-17 12:04:34 +02:00
Jonas Jenwald	c0671ac133	Slightly increase the maximum image sizes that we'll cache The current value originated in PR 2317, and in the decade that has passed the amount of RAM available in (most) devices should have increased a fair bit. Nowadays we also do a much better job of detecting repeated images at both the page- and document-level, which helps reduce overall memory-usage in many documents. Finally the constant is also moved into the `src/shared/util.js` file, since it was implicitly used on both the main- and worker-thread previously.	2023-03-08 17:06:10 +01:00
Jonas Jenwald	2f3dcc2327	[api-minor] Remove the deprecated `onUnsupportedFeature` functionality (PR 15758 follow-up) This was deprecated in PR 15758, which has now been included in three official PDF.js releases. While PR 15880 did limit the bundle-size impact of this functionality on e.g. the Firefox PDF Viewer, it still leads to some unnecessary "bloat" that these changes remove. Furthermore, with this being deprecated there'd also be no effort put into e.g. extending the `UNSUPPORTED_FEATURES` list when handling future error cases.	2023-03-07 10:18:43 +01:00
Jonas Jenwald	6d4d402a78	Move the `arrayBuffersToBytes` helper function into the worker-thread Given that this helper function is only used on the worker-thread, there's no reason to duplicate it in both of the built `pdf.js` and `pdf.worker.js` files.	2023-02-11 21:34:37 +01:00
Jonas Jenwald	c56f25409d	Re-factor the `arraysToBytes` helper function (PR 16032 follow-up) Currently this helper function only has two call-sites, and both of them only pass in `ArrayBuffer` data. Given how it's implemented there's a couple of code-paths that are completely unused (e.g. the "string" one), and in particular the intended fast-paths don't actually work. This patch re-factors and simplifies the helper function, and it'll no longer accept anything except `ArrayBuffer` data (hence why it's also re-named). Note that at the time when `arraysToBytes` was added we still supported browsers without TypedArray functionality, and we'd then simulate them using regular Arrays.	2023-02-10 10:26:35 +01:00
Jonas Jenwald	96d338e437	Reduce usage of the `arrayByteLength` helper function We're using this helper function when reading data from the [`PDFWorkerStreamReader.read`](`a49d1d1615/src/core/worker_stream.js (L90-L98)`) and [`PDFWorkerStreamRangeReader.read`](`a49d1d1615/src/core/worker_stream.js (L122-L128)`) methods, and as can be seen they always return `ArrayBuffer` data. Hence we can simply get the `byteLength` directly, and don't need to use the helper function. Note that at the time when `arrayByteLength` was added we still supported browsers without TypedArray functionality, and we'd then simulate them using regular Arrays.	2023-02-09 15:50:38 +01:00
Jonas Jenwald	1a69d537c1	[api-minor] Limit the `PDFDocumentLoadingTask.onUnsupportedFeature` functionality to GENERIC builds (PR 15758 follow-up) This was deprecated in PR 15758 but it's unfortunately quite difficult to tell if third-party users are depending on this, e.g. to implement custom error reporting, and if so to what extent. However, thanks to the pre-processor we can limit most of this code to GENERIC builds which still seem like a worthwhile change. These changes reduce the bundle size of the Firefox PDF Viewer by 3.8 kB in total.	2023-01-01 17:53:12 +01:00
Jonas Jenwald	0c1fb4e740	[api-minor] Remove the `PDFDocumentProxy.stats` getter (PR 15758 follow-up) This was deprecated in PR 15758 and given that it's quite unlikely that any third-party users are relying on this functionality, since it was only ever added to support telemetry reporting in the Firefox PDF Viewer, it should hopefully be fine to remove this fairly quickly. These changes reduce the bundle size of the Firefox PDF Viewer by 4.5 kB in total.	2023-01-01 17:06:47 +01:00
Jonas Jenwald	82d127883d	Stop duplicating the `platform` getter in multiple files Currently both of the `AnnotationElement` and `KeyboardManager` classes contain identical `platform` getters, which seems like unnecessary duplication. With the pre-processor we can also limit the feature-testing to only GENERIC builds, since `navigator` should always be available in browsers.	2022-11-29 12:14:40 +01:00
Jonas Jenwald	9adc7859c8	Move the `escapeString` helper function into the worker-thread Given that this helper function is only used on the worker-thread, there's no reason to duplicate it in both of the `pdf.js` and `pdf.worker.js` files.	2022-11-16 12:35:48 +01:00
Jonas Jenwald	e5859e145d	Move the `isAscii` helper function into the worker-thread Given that this helper function is only used on the worker-thread, there's no reason to duplicate it in both of the `pdf.js` and `pdf.worker.js` files.	2022-11-16 12:35:48 +01:00
Jonas Jenwald	2eaa708e3a	Combine the `stringToUTF16String` and `stringToUTF16BEString` helper functions Given that these functions are virtually identical, with the latter only adding a BOM, we can combine the two. Furthermore, since both functions were only used on the worker-thread, there's no reason to duplicate this functionality in both of the `pdf.js` and `pdf.worker.js` files.	2022-11-16 12:35:44 +01:00
Calixte Denizet	3ca03603c2	[Annotation] Fix printing/saving for annotations containing some non-ascii chars and with no fonts to handle them (bug 1666824) - For text fields * when printing, we generate a fake font which contains some widths computed thanks to an OffscreenCanvas and its method measureText. In order to avoid having to lay out the glyphs ourselves, we just render all of them in one call in the showText method using the system sans-serif/monospace fonts. * when saving, we continue to create the appearance streams if the fonts contain the char but when a char is missing, we just set, in the AcroForm dict, the flag /NeedAppearances to true and remove the appearance stream. This way, we let the different readers handle the rendering of the strings. - For FreeText annotations * when printing, we use the same trick as for text fields. * there is no need to save an appearance since Acrobat is able to infer one from the Content entry.	2022-11-10 19:05:39 +01:00
Jonas Jenwald	c33b8d7692	Cache the normalized unicode-value on the `Glyph`-instance Currently, during text-extraction, we're repeatedly normalizing and (when necessary) reversing the unicode-values every time. This seems a little unnecessary, since the result won't change, hence this patch moves that into the `Glyph`-instance and makes it lazily initialized. Taking the `tracemonkey.pdf` document as an example: When extracting the text-content there's a total of 69236 characters but only 595 unique `Glyph`-instances, which means a 99.1 percent cache hit-rate. Generally speaking, the longer a PDF document is the more beneficial this should be. Please note: The old code is fast enough that it unfortunately seems difficult to measure a (clear) performance improvement with this patch, so I completely understand if it's deemed an unnecessary change.	2022-11-03 22:36:53 +01:00
Tim van der Meij	229d21b50d	Merge pull request #15553 from Snuffleupagus/rm-CMapCompressionType-STREAM Remove the unused `CMapCompressionType.STREAM` value	2022-10-09 13:33:54 +02:00
Jonas Jenwald	484e81ef6e	[api-major] Remove some deprecated constants All of these constants have been deprecated for a while, and with the upcoming major version this seems like a good time to remove them. For the string-constants we can simply remove them, but the number-constants are left commented out since we don't want to re-number the list to prevent third-party breakage.	2022-10-08 18:13:53 +02:00
Jonas Jenwald	4cc98de6d7	Remove the unused `CMapCompressionType.STREAM` value This was added in PR 8064, over five years ago, for a possible future CMap file-format that was never implemented.	2022-10-08 17:10:05 +02:00
Jonas Jenwald	4b39b1c76b	Remove the unused `Util.apply3dTransform` method This method was originally added in PR 1157 (back in 2012), however its only call-site was then removed in PR 2423 (also in 2012). Hence this method has been completely unused for nearly a decade, and it should thus be safe to remove it.	2022-10-07 13:55:36 +02:00
Jonas Jenwald	3e625994bd	Change how `src/shared/compatibility.js` is imported Currently the compatibility-file is loaded using a standard `import`-statement and while its code is enclosed in a pre-processor block, and thus is excluded in e.g. the MOZCENTRAL build-target, it still results in the built `pdf.js`/`pdf.worker.js` files having an effectively empty closure as a result. By moving the checks from `src/shared/compatibility.js` and into `src/shared/util.js` instead, we can load the file using a build-time `require`-statement and thus avoid that closure. Note that with these changes the compatibility-file will no longer be loaded in development mode, i.e. when `gulp server` is used. However, this shouldn't be a big issue given that none of its included polyfills could be loaded then anyway (since `require`-statements are being used) and that it's really only intended for the `legacy`-builds of the library.	2022-10-01 13:29:54 +02:00
Calixte Denizet	7831a100b3	[Editor] Add the possibility to change line opacity in Ink editor	2022-07-27 18:46:25 +02:00
Jonas Jenwald	4a4c6b9851	[editor] Introduce a proper `annotationEditorMode` option/preference (PR 15075 follow-up) This replaces the boolean `annotationEditorEnabled` option/preference with a "proper" `annotationEditorMode` one. This way it's not only possible for the user to control if Editing is enabled/disabled, but also which specific Editing-mode should become enabled upon PDF document load. Given that Editing is not enabled/released yet, I cannot imagine that changing the name and type of the option/preference should be an issue.	2022-06-29 11:35:58 +02:00
Calixte Denizet	1a3ef2a0aa	[editor] Add some UI elements in order to set font size & color, and ink thickness & color	2022-06-28 12:05:04 +02:00
Jonas Jenwald	bbf857d635	[api-minor] Stop using the `beginAnnotations`/`endAnnotations` operators (PR 14998 follow-up) After the changes in PR 14998, these operators are now no-ops in the `src/display/canvas.js` code and should no longer be necessary. Given that `beginAnnotations`/`endAnnotations` are not in the PDF specification, but are rather custom PDF.js operators, it seems reasonable to stop using them now that they've become no-ops.	2022-06-11 14:21:26 +02:00
Calixte Denizet	c161a86ba1	[editor] Add an Ink editor - Approximate the drawn curve by a set of Bezier curves using JS code from https://github.com/soswow/fit-curves. The code has been slightly modified in order to make the linter happy.	2022-06-09 19:35:59 +02:00
Calixte Denizet	7773b3f5be	[edition] Add support for saving a newly added FreeText	2022-06-08 14:34:09 +02:00
Jonas Jenwald	51bf928061	[editor] A couple of small FreeText-related fixes (PR 14976 follow-up) - Ensure that the modified-warning won't be displayed, when navigating away from the viewer, if the user has added custom Annotations and then removed all of them. - Ensure that the initial editor-buttons state, i.e. the `toggled`-class, is correctly displayed in the toolbar when the viewer loads. - Tweak the CSS-classes for the editor-buttons, such that they use the correct focus/hover-rules (similar to the sidebar-buttons). - Remove a no longer accurate comment from the `BaseViewer.annotationEditorMode`-setter. - Address a couple of smaller outstanding review comments, including some re-formatting changes, from PR 14976.	2022-06-04 21:48:11 +02:00
Calixte Denizet	be1aa11986	[edition] Add a FreeText editor (#14970) - add a basic UI to edit some text in a pdf; - an editor can be moved, deleted, cut, copied, pasted, selected; - add an undo/redo manager.	2022-06-04 18:20:11 +02:00
Calixte Denizet	9d82106d20	Set the text fields font size based on their height - right now we're using the font size from the pdf itself but we use another font in the annotation layer. So this size doesn't really make sense and leads to bad rendering (see pdf in #14928); - use a sans-serif font for the fields containing text (fix issue #14736); - remove useless padding in text-based fields (fix issue #14301); - text fields allow/disallow scrolling bars (see bit 24 in Ff entry), so use this value to hide/show scrollbars in annotation layer.	2022-05-28 18:00:39 +02:00
Calixte Denizet	4b7691baf6	Simplify min/max computations in constructPath (bug 1135277) - most of the time the current transform is a scaling one (modulo translation), hence it's possible to avoid applying the transform to each bbox and instead apply it a posteriori; - compute the bbox when it's possible in the worker.	2022-04-17 17:25:54 +02:00
Calixte Denizet	7501fe6f30	Improve performance of shared/utils.js::intersect - avoid calling normalizeRect, which clones the rectangles: it's useless and time consuming; - when profiling the pdf in bug 1135277, the time spent in intersect drops from ~1s to ~30ms.	2022-04-15 22:24:26 +02:00
Calixte Denizet	040fcae5ab	Improve performance with image masks (bug 857031) - it aims to partially fix the reported performance issue: https://bugzilla.mozilla.org/show_bug.cgi?id=857031; - the idea is to avoid using byte arrays and instead use ImageBitmap, which is much faster to draw: * an ImageBitmap is Transferable which means that it can be built in the worker instead of in the main thread: - this is achieved by using an OffscreenCanvas when it's available, there is a bug to enable them for pdf.js: https://bugzilla.mozilla.org/show_bug.cgi?id=1763330; - or by using createImageBitmap: in Firefox a task is sent to the main thread to build the bitmap so it's slightly slower than using an OffscreenCanvas. * it's transferred from the worker to the main thread by "reference"; * the byte buffers used to create the image data have a very short lifetime and ergo the memory used is globally less than before. - Use the localImageCache for the mask; - Fix the pdf issue4436r.pdf: it was expected to have a binary stream for the image; - Move the singlePixel trick from operator_list to image: this way we can use this trick even if it isn't in a set as defined in operator_list.	2022-04-09 18:26:26 +02:00
Jonas Jenwald	1dc4713a0b	Re-factor the `isLittleEndian`/`isEvalSupported` caching This functionality is very old, hence we should be able to improve the caching a little bit with modern JavaScript features.	2022-04-05 16:01:01 +02:00
Jonas Jenwald	537ed37835	Move the `isSameOrigin` helper function This function is currently placed in the `src/shared/util.js` file, which means that the code is duplicated in both of the built `pdf.js` and `pdf.worker.js` files. Furthermore, it only has a single call-site which is also specific to the `GENERIC`-build of the PDF.js library. Hence this helper function is instead moved into the `src/display/api.js` file, in such a way that it's conditionally defined but still can be unit-tested.	2022-03-10 13:51:09 +01:00
Jonas Jenwald	99cd24ce3e	Remove the `isString` helper function The call-sites are replaced by direct `typeof`-checks instead, which removes unnecessary function calls. Note that in the `src/`-folder we already had more `typeof`-cases than `isString`-calls.	2022-02-26 16:33:41 +01:00
Jonas Jenwald	3704283f5b	Remove the `isBool` helper function The call-sites are replaced by direct `typeof`-checks instead, which removes unnecessary function calls.	2022-02-23 13:31:03 +01:00
Jonas Jenwald	05edd91bdb	Remove the `isNum` helper function The call-sites are replaced by direct `typeof`-checks instead, which removes unnecessary function calls. Note that in the `src/`-folder we already had more `typeof`-cases than `isNum`-calls. These changes were mostly done using regular expression search-and-replace, with two exceptions: - In `Font._charToGlyph` we no longer unconditionally update the `width`, since that seems completely unnecessary. - In `PDFDocument.documentInfo`, when parsing custom entries, we now do the `typeof`-check once.	2022-02-22 11:55:34 +01:00
Jonas Jenwald	b87a243222	[api-minor] Stop exposing the `createObjectURL` helper function in the API With recent changes, specifically PR 14515 and the previous patch, the `createObjectURL` helper function is now only used with the SVG back-end. All other call-sites, throughout the code-base, are now using `URL.createObjectURL(...)` directly and it no longer seems necessary to keep exposing the helper function in the API. Finally, the `createObjectURL` helper function is moved into the `src/display/svg.js` file to avoid unnecessarily duplicating this code on both the main- and worker-threads.	2022-02-10 12:01:35 +01:00
Jonas Jenwald	0e1b93bf20	Replace some `assert` usage with `unreachable` in the `src/shared/util.js` file Inlining the checks should be a tiny bit more efficient, since it avoids having to make unconditional function calls in these fairly commonly used helper functions.	2022-01-15 13:01:25 +01:00
Jonas Jenwald	12d8f0b64d	Re-factor the `stringToPDFString` helper function for UTF-16 strings This patch changes the function to instead utilize the `TextDecoder` for both kinds of UTF-16 BOM strings.	2022-01-14 20:38:40 +01:00
Jonas Jenwald	76444888fb	Add (basic) UTF-8 support in the `stringToPDFString` helper function (issue 14449) This patch implements this by looking for the UTF-8 BOM, i.e. `\xEF\xBB\xBF`, in order to determine the encoding.[1] The actual conversion is done using the `TextDecoder` interface, which should be available in all environments/browsers that we support; please see https://developer.mozilla.org/en-US/docs/Web/API/TextDecoder#browser_compatibility --- [1] Assuming that everything lacking a UTF-16 BOM would have to be UTF-8 encoded really doesn't seem correct.	2022-01-14 18:57:07 +01:00
Jonas Jenwald	7b8794b37e	[api-minor] Move `removeNullCharacters` into the viewer This helper function has never been used in e.g. the worker-thread, hence its placement in `src/shared/util.js` led to a small amount of unnecessary duplication. After the previous patches this helper function is now only used in the viewer, hence it no longer seems necessary to expose it through the official API. Please note: It seems somewhat unlikely that third-party users were relying directly on this helper function, which is why it's not being exported as part of the viewer components. (If necessary, we can always change this later on.)	2022-01-06 12:25:33 +01:00
Jonas Jenwald	d9fac34596	Ensure that the `shadow` helper function is passed a valid property (PR 14152 follow-up) Trying to shadow a non-existent property is always an implementation mistake, since it leads to the `shadow`-call not having any effect. In PR 14152 I overlooked the fact that it's fairly easy to enforce this during development/testing, since that can help catch e.g. simple spelling bugs.	2021-12-04 10:07:21 +01:00
Calixte Denizet	7041c62ccf	Remove non-displayable chars from outline title (#14267) - it aims to fix #14267; - there is nothing about chars in range [0-1F] in the specs but Acrobat doesn't display them in any way.	2021-11-13 16:56:08 +01:00
Jonas Jenwald	52372b9378	Merge pull request #14175 from brendandahl/smask-v2 Use a new method for handling soft masks.	2021-10-23 09:27:18 +02:00
Brendan Dahl	82681ea20c	Track the clipping box and bounding box of the path. This allows us to compose much smaller regions of soft mask making them much faster. This should also allow for further optimizations in the pattern code. For example locally I see issue #6573 go from 55s to 5s with this change. Fixes #6573	2021-10-22 13:41:29 -07:00
Jonas Jenwald	ff9d2b2ab1	Prevent run-time errors in Node.js versions with `URL.createObjectURL` support (issue 14170) Apparently Node.js has added global `URL.createObjectURL` support, but not done the same thing for `Blob`. Hence we also need to check for the availability of `Blob` in the `createObjectURL` helper function, and it's probably a good idea to also update `examples/node/pdf2svg.js` to work-around this until these changes reach an official PDF.js release.	2021-10-21 10:32:44 +02:00
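
Commit 317abd6d07 above replaces the `createPromiseCapability` helper with a `PromiseCapability` class and mentions that this simplifies the `settled` getter. A minimal sketch of what such a class could look like follows. The private `#settled` field and the method bodies are illustrative assumptions, not a copy of the PDF.js source.

```javascript
// Hypothetical sketch of a promise-capability class; not the PDF.js source.
class PromiseCapability {
  // Assumed private flag backing the `settled` getter.
  #settled = false;

  constructor() {
    // Expose the promise together with its resolve/reject functions.
    this.promise = new Promise((resolve, reject) => {
      this.resolve = data => {
        this.#settled = true;
        resolve(data);
      };
      this.reject = reason => {
        this.#settled = true;
        reject(reason);
      };
    });
  }

  // True once either `resolve` or `reject` has been called.
  get settled() {
    return this.#settled;
  }
}

// Usage: hand out `capability.promise`, settle it later from elsewhere.
const capability = new PromiseCapability();
capability.promise.then(value => console.log("resolved with", value));
capability.resolve(42);
```

Keeping the flag private means callers can read `settled` but cannot overwrite it, which is one plausible reading of "simplifies the handling of the `settled` getter".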
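
Commit 117bbf7cd9 above moves normalization into the find controller and relies on the built-in `normalize("NFKC")`. The sketch below shows the general idea of normalizing both the query and the searched text before matching. `normalizedIncludes` is a hypothetical helper for illustration only, not the find controller's API.

```javascript
// Hypothetical helper: compare text and query after NFKC normalization.
function normalizedIncludes(pageText, query) {
  return pageText.normalize("NFKC").includes(query.normalize("NFKC"));
}

// The presentation form U+FE94 normalizes to the base letter U+0629,
// so a query typed with the base letter can match it.
const presentationForm = "\ufe94";
const baseLetter = "\u0629";
console.log(normalizedIncludes(presentationForm, baseLetter));

// One char can expand to several: the ligature U+FB01 becomes "f" + "i".
// This is the kind of length change the commit message warns about.
const ligature = "\ufb01";
console.log(ligature.length, ligature.normalize("NFKC").length);
```

Because normalization can change string length, match offsets computed on the normalized text do not map one-to-one onto the original text, which is why the commit keeps the text layer itself un-normalized.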
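
Commits 76444888fb and 12d8f0b64d above describe detecting a BOM and delegating the conversion to `TextDecoder`. A rough sketch of that approach, under stated assumptions: `decodeByBom` is a hypothetical name, the input is a "binary string" with one byte per character, and strings without a BOM are returned unchanged here because the log does not describe how the real helper treats them.

```javascript
// Hypothetical sketch of BOM-based decoding; not the PDF.js implementation.
function decodeByBom(str) {
  let encoding;
  if (str.startsWith("\xFE\xFF")) {
    encoding = "utf-16be";
  } else if (str.startsWith("\xFF\xFE")) {
    encoding = "utf-16le";
  } else if (str.startsWith("\xEF\xBB\xBF")) {
    encoding = "utf-8";
  }
  if (!encoding) {
    return str; // Assumption: leave BOM-less input untouched in this sketch.
  }
  // Turn the binary string into bytes, one byte per character.
  const bytes = Uint8Array.from(str, ch => ch.charCodeAt(0) & 0xff);
  // By default TextDecoder strips the BOM from its output.
  return new TextDecoder(encoding).decode(bytes);
}

// "caf\u00e9" encoded as UTF-8 bytes, prefixed with the UTF-8 BOM.
console.log(decodeByBom("\xEF\xBB\xBFcaf\xC3\xA9"));
```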
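
Commit 2eaa708e3a above combines `stringToUTF16String` and `stringToUTF16BEString`, noting that the latter only adds a BOM. One way the merged helper could look is sketched below. The `bigEndian` parameter name and its default are assumptions made for illustration.

```javascript
// Hypothetical merged helper: encode a JS string as a big-endian UTF-16
// "binary string", optionally prefixed with the BOM (bytes FE FF).
function stringToUTF16String(str, bigEndian = false) {
  const buf = [];
  if (bigEndian) {
    buf.push("\xFE\xFF");
  }
  for (let i = 0; i < str.length; i++) {
    const code = str.charCodeAt(i);
    // Emit the high byte first, then the low byte.
    buf.push(
      String.fromCharCode((code >> 8) & 0xff),
      String.fromCharCode(code & 0xff)
    );
  }
  return buf.join("");
}

// Each UTF-16 code unit becomes two one-byte characters.
console.log(stringToUTF16String("A").length);       // 2
console.log(stringToUTF16String("A", true).length); // 4
```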
