pdf.js

Author	SHA1	Message	Date
Jonas Jenwald	2f3dcc2327	[api-minor] Remove the deprecated `onUnsupportedFeature` functionality (PR 15758 follow-up) This was deprecated in PR 15758, which has now been included in three official PDF.js releases. While PR 15880 did limit the bundle-size impact of this functionality on e.g. the Firefox PDF Viewer, it still leads to some unnecessary "bloat" that these changes remove. Furthermore, with this being deprecated there'd also be no effort put into e.g. extending the `UNSUPPORTED_FEATURES` list when handling future error cases.	2023-03-07 10:18:43 +01:00
Calixte Denizet	3849063d36	[Annotation] Don't rotate an annotation when it has the NoRotate flag	2023-03-06 17:27:11 +01:00
Calixte Denizet	05b0c9d7e6	Render large images even if they're larger than the canvas limits (bug 1720282) The idea is to encode large images in BMP format (which is very simple and doesn't require computing any checksums) and then use createImageBitmap with a BMP blob (which doesn't suffer from the Canvas/ImageData limits). From a performance point of view, it isn't great (generating a large blob and decoding it on the main thread is really not ideal), but at least we have something to display, which is much better than a blank page (and note that most of the time is spent decoding the image from the PDF stream).	2023-03-05 14:07:07 +01:00
Ben Wagner	158c836e26	Correct PostScript trigonometric operators PDF 32000-1:2008 7.10.5.1 "Type 4 (PostScript Calculator) Functions" defers to the PostScript Language Reference for the description of these functions. The PostScript Language Reference, third edition chapter 8 "Operators" defines the `angle` type as a "number of degrees". Section 8.1 defines "angle `sin` real", "angle `cos` real", and "num den `atan` angle". The documentation for `atan` further states that it will return an angle in degrees between 0 and 360. Handle these operators correctly in `PostScriptEvaluator.execute`. Convert the inputs to `sin` and `cos` from degrees to radians for use with `Math.sin` and `Math.cos`. Correctly pop two values from the stack for `atan`, use `Math.atan2`, and convert from radians to (positive) degrees.	2023-03-03 17:25:11 -05:00
Jonas Jenwald	ceec93c832	[api-minor] Remove calling `getDocument` directly with a `PDFDataRangeTransport`-instance (PR 15943 follow-up) This was deprecated in PR 15943, which has now been included in two official PDF.js releases. Given that `PDFDataRangeTransport` is somewhat unlikely to be used outside of the built-in Firefox PDF Viewer, it doesn't seem necessary to wait longer before removing this. Also, removes the specific error-message for GENERIC builds to not unnecessarily "advertise" using non-objects when calling the `getDocument`-function. Please note: This patch is written using the GitHub UI, since I'm currently without a dev machine, so hopefully it works correctly.	2023-03-02 15:12:01 +01:00
Calixte Denizet	fd03cd5493	[api-minor] Generate images in the worker instead of the main thread. We introduced the use of OffscreenCanvas in #14754 and this patch aims to use it for all kinds of images. It'll slightly improve performance (and maybe slightly decrease memory use). An image can be rendered using transfer maps, but with OffscreenCanvas we don't have access to the underlying pixel array, so the transfer-map handling is re-implemented using the SVG filter feComponentTransfer.	2023-03-01 17:40:12 +01:00
Jonas Jenwald	9640add1f7	Merge pull request #16100 from Snuffleupagus/getDocument-canvasFactory [api-minor] Move the `canvasFactory` option into `getDocument`	2023-03-01 10:34:11 +01:00
Jonas Jenwald	f42a2e8451	[api-minor] Move the `canvasFactory` option into `getDocument` Rather than repeatedly initializing a `canvasFactory`-instance for every page, move it to the document-level instead. Please note: This patch is written using the GitHub UI, since I'm currently without a dev machine, so hopefully it works correctly.	2023-03-01 09:07:16 +01:00
Jonas Jenwald	45c332110e	Check `OffscreenCanvas` support once on the worker-thread Currently we repeat the `FeatureTest.isOffscreenCanvasSupported` checks all over the worker-thread code, and with upcoming changes this will become even "worse". Hence this patch, which changes the worker-thread default value for the `isOffscreenCanvasSupported`-parameter to `false` and moves the feature-testing into the `BasePdfManager`-constructor. Please note: This patch is written using the GitHub UI, since I'm currently without a dev machine, so hopefully it works correctly.	2023-02-27 12:27:28 +01:00
Jonas Jenwald	5075d0495b	Use `OffscreenCanvas` as intended for all code-paths in `src/display/text_layer.js` (PR 15722 follow-up) Currently some `getCtx` calls will have `isOffscreenCanvasSupported === undefined` set, meaning that `OffscreenCanvas` isn't being used as intended, since no `TextLayerRenderTask._isOffscreenCanvasSupported` property exists. Please note: This patch is written using the GitHub UI, since I'm currently without a dev machine, so hopefully it works correctly.	2023-02-24 11:29:58 +01:00
Calixte Denizet	3a21423386	[Acroform] Use the full path to find the node in the XFA datasets where to store the value I noticed several 'Path not found' errors because of a field called #subform[2]. According to the XFA specs, the hash is used for a class of elements in the template tree. When we're looking for a node in the datasets tree, it doesn't make sense to search for a class. Hence path elements starting with a hash are simply skipped.	2023-02-23 12:09:39 +01:00
Jonas Jenwald	1b076b7a35	Move the `ImageBitmap` clean-up into the `PDFObjects` class With upcoming changes we'll potentially start to cache `ImageBitmap` data at the document-level, in addition to just at the page-level. Hence we need to ensure that such data is actually released on clean-up, and rather than duplicating the existing manual handling this code is instead moved into the `PDFObjects.clear` method. (In my opinion, this is an overall improvement even without globally cached `ImageBitmap` data.) Please note: This patch is written using the GitHub UI, since I'm currently without a dev machine, so hopefully it's correct and makes sense.	2023-02-21 12:00:45 +01:00
Calixte Denizet	dca54c8f8a	[JS] Send a Validate action on change on Choice widget	2023-02-19 16:33:05 +01:00
Jonas Jenwald	b6ba8cc84a	[api-minor] Deprecate providing binary data as `Buffer` in Node.js environments The `Buffer`-object is Node.js specific functionality[1], thus (obviously) not found in browsers. Please note that the PDF.js library has never officially supported/documented that binary data can be passed as a `Buffer`, and that internally in the `src/core`-code we only work with standard `Uint8Array`s. This means that if, in Node.js environments, a `Buffer` is passed to the API we need to wrap it into a `Uint8Array`, which essentially means creating a copy of the data and thus increasing memory usage. --- [1] Refer to https://nodejs.org/api/buffer.html#buffer	2023-02-14 11:30:40 +01:00
Jonas Jenwald	df3b359280	Remove "else after return" from the `getUrlProp`/`getDataProp` helper functions This helps readability of this code a little bit, in my opinion, and it's actually ever so slightly less code in the built `pdf.js` file.	2023-02-14 10:50:22 +01:00
Jonas Jenwald	8026ed6b0a	Reduce duplication for reference tests with an `annotationStorage` entry Currently we duplicate the same code more than once in the `test/driver.js` file, which we can avoid by adding a new `AnnotationStorage` helper method instead.	2023-02-13 11:09:16 +01:00
Tim van der Meij	22618213c7	Merge pull request #16040 from Snuffleupagus/arrayBuffersToBytes Re-factor the `arraysToBytes` helper function (PR 16032 follow-up)	2023-02-12 11:47:57 +01:00
Jonas Jenwald	6d4d402a78	Move the `arrayBuffersToBytes` helper function into the worker-thread Given that this helper function is only used on the worker-thread, there's no reason to duplicate it in both of the built `pdf.js` and `pdf.worker.js` files.	2023-02-11 21:34:37 +01:00
Jonas Jenwald	18042163ce	Improve the consistency between the `LocalPdfManager`/`NetworkPdfManager` constructor Currently these classes take a bunch of parameters (somewhat randomly ordered), probably because this is very old code that's been extended over the years. Hence this patch changes the constructors to use parameter-objects instead, which improves consistency and (slightly) reduces the amount of code as well. Please note: Also removes the `msgHandler`-property on these classes, since I cannot find a single call-site that accesses it.	2023-02-11 13:39:52 +01:00
Jonas Jenwald	14b0e8c0b6	Ensure that "GetAnnotations" errors are propagated to the main-thread (PR 15267 follow-up) With the changes in PR 15267 we're now accidentally swallowing "GetAnnotations" errors, rather than propagating them to the main-thread as intended.	2023-02-10 12:18:35 +01:00
Jonas Jenwald	c56f25409d	Re-factor the `arraysToBytes` helper function (PR 16032 follow-up) Currently this helper function only has two call-sites, and both of them only pass in `ArrayBuffer` data. Given how it's implemented there's a couple of code-paths that are completely unused (e.g. the "string" one), and in particular the intended fast-paths don't actually work. This patch re-factors and simplifies the helper function, and it'll no longer accept anything except `ArrayBuffer` data (hence why it's also re-named). Note that at the time when `arraysToBytes` was added we still supported browsers without TypedArray functionality, and we'd then simulate them using regular Arrays.	2023-02-10 10:26:35 +01:00
Jonas Jenwald	5ba596786c	Change `WorkerTasks`, in `WorkerMessageHandler.createDocumentHandler`, to use a Set This is a tiny bit more compact, thanks to the `Set.prototype.delete` method.	2023-02-09 22:01:16 +01:00
calixteman	0fca6e187c	Merge pull request #16035 from calixteman/fix_combo_value [Annotation] A combo can have a value other than one in the options	2023-02-09 19:56:16 +01:00
Jonas Jenwald	1fc8350795	Merge pull request #16032 from Snuffleupagus/less-arrayByteLength Reduce usage of the `arrayByteLength` helper function	2023-02-09 18:56:20 +01:00
Calixte Denizet	cb1638530d	[Annotation] A combo can have a value other than one in the options When printing the pdf in #12233 in Acrobat, we can see that the combo for country is empty: it's because the V entry doesn't have to be one of the options.	2023-02-09 18:50:57 +01:00
calixteman	972744a68f	Merge pull request #16033 from calixteman/bug1640217 Ignore position of combining diacritics when getting text (bug 1640217)	2023-02-09 18:23:59 +01:00
calixteman	533a461db0	Merge pull request #16031 from calixteman/bug1770750 [Annotation] For choice widget, use the I entry instead of the V one (bug 1770750)	2023-02-09 18:01:28 +01:00
Calixte Denizet	58e4d92884	[Annotation] For choice widget, use the I entry instead of the V one (bug 1770750) This doesn't really conform to the specification, but that's how Acrobat behaves.	2023-02-09 17:26:13 +01:00
Calixte Denizet	4e9f26afa3	Ignore position of combining diacritics when getting text (bug 1640217)	2023-02-09 17:13:57 +01:00
Jonas Jenwald	96d338e437	Reduce usage of the `arrayByteLength` helper function We're using this helper function when reading data from the [`PDFWorkerStreamReader.read`](`a49d1d1615/src/core/worker_stream.js (L90-L98)`) and [`PDFWorkerStreamRangeReader.read`](`a49d1d1615/src/core/worker_stream.js (L122-L128)`) methods, and as can be seen they always return `ArrayBuffer` data. Hence we can simply get the `byteLength` directly, and don't need to use the helper function. Note that at the time when `arrayByteLength` was added we still supported browsers without TypedArray functionality, and we'd then simulate them using regular Arrays.	2023-02-09 15:50:38 +01:00
Jonas Jenwald	323d3d246a	Re-factor the `readChunk` function in `ChunkedStreamManager.sendRequest` Move the `done` branch to the top of the function, similar to how we usually format things when `ReadableStream`s are used.	2023-02-09 15:33:06 +01:00
Jonas Jenwald	9d29abdfa0	Change the `LoopbackPort` class to use a Set internally This is a tiny bit more compact, thanks to the `Set.prototype.delete` method.	2023-02-09 12:34:41 +01:00
Calixte Denizet	c92ba393c2	Fix pinch-to-zoom on mac for the Firefox builtin viewer In the mac case we don't want to care about the scaleFactor threshold, because otherwise, if it's too large, another move could start and subsequent events would no longer be treated as wheel events. It isn't really ideal and at some point we'll need to find a way, at least for the Firefox case, to get the real events instead of the fake wheel ones.	2023-02-08 15:04:41 +01:00
Calixte Denizet	a25895bf72	[Annotation] Take into account the stroke alpha for a FreeText without appearance	2023-02-07 22:15:27 +01:00
Calixte Denizet	ea7b4b4d6c	[Annotation] Avoid encrypting the appearance stream twice (bug 1815476)	2023-02-07 19:26:46 +01:00
Jonas Jenwald	0a0f3fc733	Move the main-thread CMap/StandardFontData factory initialization to `getDocument` By default we're using worker-thread fetching (in browsers) of this data nowadays, however in Node.js environments or if the user provides custom factories we still fall back to main-thread fetching. Hence it makes sense, as far as I'm concerned, to move this initialization into the `getDocument` function to ensure that the factories can actually be initialized before attempting to load the document. Also, this further reduces the amount of `getDocument` parameters that we need to pass into the `WorkerTransport` class.	2023-02-05 11:52:35 +01:00
Jonas Jenwald	ce8ac6d96a	Only pass the necessary parameters to `_fetchDocument` and `WorkerTransport` Currently we're passing all available parameters to this function and class, respectively, despite that not actually being necessary. By splitting the parameters we not only improve the structure, and basically "document" the code a little bit, but we can also simplify the `_fetchDocument` function considerably.	2023-02-05 11:52:33 +01:00
Jonas Jenwald	512aa50fdd	Re-factor the parameter parsing/validation in `getDocument` This is very old code, where we loop through the user-provided options and build an internal parameter object. To prevent errors we also need to ensure that the parameters are correct/valid, which is especially important for the ones that are sent to the worker-thread such that structured cloning won't fail.[1] Over the years this has led to more and more code being added in `getDocument` to validate the user-provided options, and at this point most of them have at least basic validation. However the way that this is implemented feels slightly backwards, since we first build the internal parameter object and only afterwards validate those parameters.[2] Hence this patch changes the `getDocument` function to instead check/validate the supported options upfront, and then explicitly build the internal parameter object with only the needed properties. --- [1] Note the supported types at https://developer.mozilla.org/en-US/docs/Web/API/Web_Workers_API/Structured_clone_algorithm#supported_types [2] The internal parameter object may also, because of the loop, end up with lots of unnecessary properties since anything that the user provides is being copied.	2023-02-05 11:52:25 +01:00
Tim van der Meij	e698664927	Merge pull request #16004 from Snuffleupagus/WorkerTransport-cacheSimpleMethod Improve how we cache Promises in `WorkerTransport`	2023-02-04 15:13:12 +01:00
Tim van der Meij	b75dafba87	Merge pull request #15987 from Snuffleupagus/onOpenWithTransport-params Remove unused parameters from the `onOpenWithTransport` method in `PDFViewerApplication.initPassiveLoading`	2023-02-04 15:07:42 +01:00
Tim van der Meij	e848a0e61c	Merge pull request #15981 from Snuffleupagus/cMapPacked-true [api-minor] Let the `cMapPacked` parameter, in `getDocument`, default to `true`	2023-02-04 15:00:26 +01:00
Jonas Jenwald	3a7fce49a3	A tiny improvement of the `MetadataParser._repair` method We can just insert the initial greater-than sign at the start of the buffer, rather than doing that manually at the end.	2023-02-04 12:43:55 +01:00
Jonas Jenwald	2de03a7d91	Improve how we cache Promises in `WorkerTransport` A number of methods have their Promises cached, to avoid repeated worker round-trips, since they're expected to be called more than once from the default viewer. The way that the caching is currently implemented means that we need to remember to manually clear these Promises on document cleanup/destruction, and it'd be nice to avoid that. With this patch the relevant Promises are now instead placed in just one `Map`, which is easy to clear, and a new helper method is also introduced to reduce duplication for simple `WorkerTransport` methods.	2023-02-04 11:57:37 +01:00
Calixte Denizet	185281957d	[Editor] Make the annotation editor layer invisible when disabled and empty This helps the browser avoid considering these layers when restyling.	2023-02-01 17:53:44 +01:00
Jonas Jenwald	cf8ee47589	Remove unused parameters from the `onOpenWithTransport` method in `PDFViewerApplication.initPassiveLoading` The only parameter that we actually need here is the `PDFDataRangeTransport`-instance, since the others are not necessary. - The `url` parameter, as passed to the `getDocument` function in the API, is simply being ignored; see `2d87a2eb1c/src/display/api.js (L447-L458)` - The `length` parameter, as passed to the `getDocument` function in the API, is always being overwritten; see `2d87a2eb1c/src/display/api.js (L519-L525)`	2023-02-01 09:33:22 +01:00
Jonas Jenwald	5e88228767	Allow, optionally, using worker-modules during local development Until PR 12563 is deemed safe to land, I'd still like to be able to use worker-modules in the viewer during local development. Hence this patch which temporarily adds a new `workerModules` hash-parameter, only available in non-PRODUCTION mode, that allows using worker-modules in the development viewer. To enable this functionality, simply use http://localhost:8888/web/viewer.html#workerModules=true	2023-01-31 12:09:44 +01:00
Jonas Jenwald	c5d6391898	[api-minor] Let the `cMapPacked` parameter, in `getDocument`, default to `true` The initial CMap support was added in PR 4259 using the "raw" Adobe files, however they were quickly deemed to be unnecessarily large. As a result PR 4470 introduced the more compact "binary" CMap format, with both of those PRs being included in the very same release (version `0.8.1334`). Please note that we've thus never shipped anything except the "binary" CMap files with the PDF library, and furthermore note that we've not even once updated the CMap files since they were originally added almost nine years ago. Requiring users to remember that `cMapPacked = true` is necessary, in addition to setting the `cMapUrl` parameter, in order for CMap loading to work feels like a less than ideal API. Hence this patch, which suggests that we simply let `cMapPacked` default to `true` now.	2023-01-30 15:35:02 +01:00
Jonas Jenwald	808ca828f1	Extend `getGlyphMapForStandardFonts` with additional entries (issue 15977)	2023-01-30 12:13:21 +01:00
Tim van der Meij	ee3be2f979	Merge pull request #15951 from Snuffleupagus/polyfill-Path2D Polyfill `Path2D` in Node.js environments	2023-01-28 19:06:54 +01:00
Tim van der Meij	e539d2da1e	Merge pull request #15964 from Snuffleupagus/getDocument-non-object Only accept non-objects passed to `getDocument` in GENERIC builds	2023-01-28 18:42:09 +01:00
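
Note on commit 158c836e26: the message describes PostScript `sin` and `cos` as taking an angle in degrees, and `atan` as taking two operands and returning an angle in degrees between 0 and 360. The following is only a minimal sketch of that conversion, not the actual `PostScriptEvaluator.execute` code; the helper names `psSin`, `psCos` and `psAtan` are hypothetical.

```javascript
// Hypothetical helpers illustrating the degree/radian handling described
// in the commit message; they are not part of the PDF.js API.

// PostScript `sin`: the operand is an angle in degrees.
function psSin(angleDeg) {
  return Math.sin((angleDeg / 180) * Math.PI);
}

// PostScript `cos`: the operand is an angle in degrees.
function psCos(angleDeg) {
  return Math.cos((angleDeg / 180) * Math.PI);
}

// PostScript `atan`: takes a numerator and a denominator and returns the
// angle, in degrees, whose tangent is num / den.
function psAtan(num, den) {
  let deg = (Math.atan2(num, den) / Math.PI) * 180;
  if (deg < 0) {
    deg += 360; // Math.atan2 yields (-180, 180]; shift negatives up.
  }
  return deg;
}
```

Using `Math.atan2` rather than `Math.atan(num / den)` keeps the quadrant information and avoids a division by zero when `den` is 0.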


6234 Commits
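
Note on commit 2de03a7d91: the message describes placing cached Promises in a single `Map` so that they can all be cleared in one step on document cleanup. The following is a rough sketch of that general pattern under stated assumptions, not the `WorkerTransport` code itself; the class name `PromiseCache` and the `fetchFn` callback are hypothetical.

```javascript
// Hypothetical illustration of caching Promises in one Map; the names used
// here are assumptions and do not come from PDF.js.
class PromiseCache {
  #promises = new Map();
  #fetchFn;

  // `fetchFn(name)` stands in for whatever performs the expensive
  // round-trip and returns a Promise.
  constructor(fetchFn) {
    this.#fetchFn = fetchFn;
  }

  // Return the cached Promise for `name`, creating it on first use.
  get(name) {
    const cached = this.#promises.get(name);
    if (cached) {
      return cached;
    }
    const promise = this.#fetchFn(name);
    this.#promises.set(name, promise);
    return promise;
  }

  // Drop every cached Promise at once, e.g. on cleanup/destruction.
  clear() {
    this.#promises.clear();
  }
}
```

Because every cached Promise lives in the same `Map`, cleanup is a single `clear()` call and there is no per-method field to remember to reset.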