pdf.js

Author	SHA1	Message	Date
Calixte Denizet	385f275ad9	Warn when pdf.js can't load an OS font	2023-05-16 14:58:38 +02:00
Calixte Denizet	4e8dd54e8e	For non-embedded fonts, don't generate the fallback several times	2023-05-15 20:02:45 +02:00
Calixte Denizet	b264e0301a	Simplify the code to generate font substitution information	2023-05-15 19:17:52 +02:00
Calixte Denizet	d4b70ec306	For missing font, use a local font if it exists even if there's no standard substitution. If the font foo is missing we just try to load local(foo) and maybe we'll be lucky.	2023-05-13 21:54:27 +02:00
Calixte Denizet	cfb908c999	Add a cache to avoid loading a local font several times. On my computer, it takes a few tenths of a second to load a local font. Since a font can be used several times in a document, the cache will improve performance.	2023-05-10 20:01:21 +02:00
calixteman	2d2f7b315e	Merge pull request #16363 from calixteman/use_local_font [api-minor] Use a local font or fallback on an embedded one (if it exists) for non-embedded fonts (bug 1766039)	2023-05-10 14:19:05 +02:00
Calixte Denizet	53134c0c0b	[api-minor] Use a local font or fallback on an embedded one (if it exists) for non-embedded fonts (bug 1766039) - Replace FoxitSans with LiberationSans: LiberationSans is already there (for XFA) and we can use it as a good replacement for FoxitSans. - For now we just try to substitute standard fonts, the strategy is the following: * we try to find a font locally from a hardcoded list; * if it fails then we use Liberation as fallback (only for Helvetica for the moment); * else we just fallback on the system serif/sans-serif/monospace font.	2023-05-10 14:10:23 +02:00
Calixte Denizet	2486536843	Compress the data when saving annotations. CompressionStream API has been added in Firefox 113 (see https://bugzilla.mozilla.org/show_bug.cgi?id=1823619) hence we can use it to compress the streams with added/modified annotations.	2023-05-09 14:46:50 +02:00
calixteman	8f2d8f62f3	Merge pull request #16397 from calixteman/issue14565 Make something similar to Acrobat when Underline annotation has no appearance	2023-05-08 21:16:49 +02:00
Tim van der Meij	bfb664b9a1	Merge pull request #16398 from Snuffleupagus/xfa-optional-chaining Introduce some optional chaining in the `src/core/xfa/` folder	2023-05-07 14:54:05 +02:00
Jonas Jenwald	1753e321cd	Remove the compatibility checks in `WorkerMessageHandler.createDocumentHandler` For some time these checks have only targeted Node.js environments, since the features in question exist in all supported browsers (even when a `legacy`-build is used). Now that we've updated the minimum supported Node.js version to 18, a number of polyfills are thus (finally) no longer necessary in that environment. Hence for certain basic functionality, such as e.g. text-extraction, it's now possible to use either a modern- or a `legacy`-build of the PDF.js library in Node.js environments. Please note: For e.g. canvas-rendering in Node.js environments it's still necessary to use a `legacy`-build, since that functionality requires various polyfills.	2023-05-07 13:43:19 +02:00
Jonas Jenwald	ed8be6f882	[api-minor] Update the minimum supported Node.js version to 18 This patch updates the minimum supported environments as follows: - Node.js 18, which was released on 2022-04-19; see https://en.wikipedia.org/wiki/Node.js#Releases Note also that Node.js 16 will soon reach EOL, and thus no longer receive any security updates.	2023-05-07 13:43:19 +02:00
Jonas Jenwald	89f768322d	Introduce some optional chaining in the `src/core/xfa/` folder After PR 12563 we're now free to use optional chaining in the worker-thread as well.	2023-05-07 12:49:07 +02:00
Calixte Denizet	6c0fdc6ec2	Make something similar to Acrobat when Underline annotation has no appearance	2023-05-06 21:19:25 +02:00
Jonas Jenwald	722e5910e1	Improve handling of JPEG images with non-standard /Decode-entries (issue 16395) The /Decode-implementation in our JPEG decoder, i.e. `src/core/jpg.js`, seems to only handle inverting of images properly. To support arbitrary /Decode-entries correctly we'll always use the `PDFImage.decodeBuffer` method, even for "simple" JPEG images, which should be fine since non-default /Decode-entries aren't a very common occurrence. Please note: This patch will lead to a little bit of movement in some existing test-cases, however it should be virtually imperceivable to the naked eye.	2023-05-06 13:55:39 +02:00
calixteman	f151a39d14	Merge pull request #16387 from calixteman/issue16384 [Annotations] Draw readonly annotations on their own canvas and show the HTML elements when there is a JS interaction (issue #16384)	2023-05-04 21:49:08 +02:00
Calixte Denizet	72da14f005	[Annotations] Draw readonly annotations on their own canvas and show the HTML elements when there is a JS interaction (issue #16384)	2023-05-04 20:08:32 +02:00
calixteman	a24e11a91c	Merge pull request #16106 from bungeman/improve_color_stop_detection Better approximate gradient color stops	2023-05-04 19:48:57 +02:00
Jonas Jenwald	667085ee33	Merge pull request #16368 from Snuffleupagus/rm-GlobalImageCache-addPageIndex Inline the `addPageIndex` method in `GlobalImageCache.shouldCache`	2023-05-04 12:09:04 +02:00
Jonas Jenwald	001acfb5ac	Merge pull request #16381 from Snuffleupagus/rm-isStandardFont-prop Remove the unused `isStandardFont` font-property (PR 15880 follow-up)	2023-05-04 00:30:05 +02:00
Jonas Jenwald	24a75bda5d	Remove the unused `isStandardFont` font-property (PR 15880 follow-up) This property was added in PR 12726 specifically for use in the `getFontType` function, indirectly used by the `PDFDocumentProxy.stats` getter in the API. In PR 15880 that functionality was removed, but I forgot to remove this now unused font-property.	2023-05-03 11:52:54 +02:00
Jonas Jenwald	88616f77ae	Remove the closure from `BitModel` in the `src/core/jpx.js` file	2023-04-29 13:49:39 +02:00
Jonas Jenwald	b0a1af306d	Simplify initialization of `static` class properties in the worker-thread Now that we no longer depend on the old Babel version in SystemJS we can remove the `static get ...` work-arounds used to define constants, which leads to slightly more compact code.	2023-04-29 13:49:38 +02:00
Jonas Jenwald	d950b91c4e	Introduce some logical assignment in the `src/core/` folder	2023-04-29 13:49:37 +02:00
Jonas Jenwald	317abd6d07	Change the `createPromiseCapability` helper function into a `PromiseCapability` class This is not only slightly more compact, but it also simplifies the handling of the `settled` getter.	2023-04-29 13:43:24 +02:00
Jonas Jenwald	94c2d08975	Revert "Add a `getArrayLookupTableFactory` helper function and use it to re-format `src/core/{glyphlist, unicode}.js`" This reverts commit 56fa6d414cb1115e03f9c1aa9f1d5bc52efcb7ac now that SystemJS is gone.	2023-04-29 13:43:24 +02:00
Jonas Jenwald	95bf9fc17f	Remove SystemJS usage, in development mode, from the worker Now that https://bugzilla.mozilla.org/show_bug.cgi?id=1247687 has landed in Firefox, we're able to use worker-modules during development :-) This removes the final piece of SystemJS usage from the PDF.js library, thus allowing a fair bit of clean-up, and we now use only native `import`/`export` statements everywhere in development mode.	2023-04-29 13:43:24 +02:00
Jonas Jenwald	bb1228cb64	Inline the `addPageIndex` method in `GlobalImageCache.shouldCache` When the `GlobalImageCache` implementation originally landed, back in PR 11912, the image handling was slightly more complex (with e.g. browser-decoding of some JPEG images). At this point it no longer seems necessary to manually handle pageIndexes in this way, and we should be able to simply inline that in the `GlobalImageCache.shouldCache` method.	2023-04-28 09:40:32 +02:00
Jonas Jenwald	e12535457f	Avoid some repeated `stringToBytes`-calls in the `src/core/crypto.js` file Currently we repeatedly lookup, and convert to bytes, the "O" and "U" encryption-dictionary entries.	2023-04-26 17:52:46 +02:00
Jonas Jenwald	74585c7c59	Remove the unused `PDF20.hash` method This method was added in PR 4938, almost nine years ago, however it doesn't appear to ever have been used. Given the similarities between the `PDF17` and `PDF20` classes, and how they're used, if the `PDF20.hash` method was actually necessary you'd also expect a similar method in the `PDF17` class.	2023-04-23 10:13:46 +02:00
Jonas Jenwald	5e0722e4c2	Remove the `PDF20` closure, in the `src/core/crypto.js` file To allow doing this the existing helper function was changed into a "private" method instead.	2023-04-23 10:08:17 +02:00
Jonas Jenwald	9cb3236ac0	Remove the remaining unnecessary closures in the `src/core/primitives.js` file	2023-04-22 15:33:04 +02:00
Tim van der Meij	e304423ba1	Merge pull request #16331 from Snuffleupagus/cmap-rm-closure Remove unnecessary closures in the CMap code	2023-04-22 14:58:13 +02:00
Tim van der Meij	c9359957e6	Merge pull request #16305 from Snuffleupagus/PDFJSDev-skip-PRODUCTION Remove the `PRODUCTION` build-target	2023-04-22 14:53:30 +02:00
Jonas Jenwald	bc7aa8a585	Re-factor some `String.fromCharCode` usage in the `src/core/binary_cmap.js` file We can replace one case of `apply` with rest parameters, and avoid doing repeated `String.fromCharCode` calls within a loop.	2023-04-21 12:21:31 +02:00
Jonas Jenwald	cabc98f310	Remove the remaining closure in the `src/core/cmap.js` file With modern JavaScript we (usually) no longer need to keep old closures, which slightly reduces the size of the code.	2023-04-21 12:21:31 +02:00
Jonas Jenwald	244002502b	Move the `BinaryCMapReader` into its own file The "binary" CMap-format is specific to the PDF.js library, and is used to reduce the size of the built-in CMap data-files. By moving this code to its own file we can remove the nowadays unnecessary closures, which helps to slightly reduce the size of this code.	2023-04-21 12:21:20 +02:00
Calixte Denizet	19ca41896e	Correctly clip the text in the text layer (fixes #16316)	2023-04-18 17:00:42 +02:00
Calixte Denizet	117bbf7cd9	[api-minor] Don't normalize the text used in the text layer. Some Arabic chars like \ufe94 could be searched in a pdf, hence they must be normalized when creating the search query. So to avoid duplicating the normalization code, everything is moved into the find controller. The previous code to normalize text was using NFKC but with a hardcoded map, hence it has been replaced by the use of normalize("NFKC") (it helps to reduce the bundle size by 30kb). In playing with this \ufe94 char, I noticed that the bidi algorithm wasn't taking into account some RTL unicode ranges, the generated font wasn't embedding the mapping for this char and the unicode ranges in the OS/2 table weren't up-to-date. When normalized, some chars can be replaced by several ones, which led to some extra chars in the text layer. To avoid any regression, when copying some text from the text layer, the copied string is normalized (NFKC) before being put in the clipboard (it works like this in either Acrobat or Chrome).	2023-04-17 14:31:23 +02:00
Jonas Jenwald	804aa896a7	Stop using the `PRODUCTION` build-target in the JavaScript code This special build-target is very old, and was introduced with the first pre-processor that only uses comments to enable/disable code. When the new pre-processor was added `PRODUCTION` effectively became redundant, at least in JavaScript code, since `typeof PDFJSDev === "undefined"` checks now do the same thing. This patch proposes that we remove `PRODUCTION` from the JavaScript code, since that simplifies the conditions and thus improves readability in many cases. Please note: There's not, nor has there ever been, any gulp-task that set `PRODUCTION = false` during building.	2023-04-17 12:04:34 +02:00
Jonas Jenwald	c79bdd6ae6	Simplify the `CFFCompiler.compileTypedArray` method Rather than manually creating the Array, we can use the now existing `Array.from` method instead.	2023-04-15 11:13:34 +02:00
Jonas Jenwald	0ce568e789	Remove `CFFCompiler.compileGlobalSubrIndex` since it's completely unused This method was originally added in PR 1320, eleven years ago, however it doesn't appear to ever have been used (not even from the start). Furthermore, this method also tries to access a property that doesn't exist (`this.out`) and then call a method that also doesn't exist (`writeByteArray`).	2023-04-15 11:13:21 +02:00
Jonas Jenwald	ab2773416b	Merge pull request #16291 from Snuffleupagus/issue-16289 Limit the `Path2D`-checks in the worker-thread to Node.js (PR 16238 follow-up, issue 16289)	2023-04-14 21:26:12 +02:00
Calixte Denizet	5eab8ec610	Avoid using Array.concat, when possible, when compiling a CFF font. In looking at https://bugs.ghostscript.com/show_bug.cgi?id=706451 I noticed that bug2.pdf was pretty slow to load for such a basic file. In profiling I noticed that a lot of time is spent in Array.concat, hence this patch uses Array.push when possible (it's now ~3 times faster).	2023-04-14 19:01:01 +02:00
Jonas Jenwald	edd13895dd	Limit the `Path2D`-checks in the worker-thread to Node.js (PR 16238 follow-up, issue 16289) The changes in PR 16238 were intended specifically for Node.js environments, however they accidentally applied to older browsers as well. Please note: In up-to-date browsers `Path2D` is available in Workers, which should be connected to the introduction of `OffscreenCanvas`.	2023-04-14 11:51:11 +02:00
Jonas Jenwald	3a36a9d337	Merge pull request #16268 from Snuffleupagus/RegionalImageCache Attempt to also cache images at the "page"-level (issue 16263)	2023-04-11 12:06:29 +02:00
calixteman	c1c372c320	Merge pull request #16225 from calixteman/16224 Thin whitespaces must have their own span	2023-04-11 11:13:16 +02:00
Jonas Jenwald	9881dbf927	Attempt to also cache images at the "page"-level (issue 16263) Currently we have two separate image-caches on the worker-thread: - A local one, which is unique to each `PartialEvaluator.getOperatorList` invocation. This one caches both names and references, since image-resources may be accessed in either way. - A global one, which applies to the entire PDF document and all its pages. This one only caches references, since nothing else would work. This patch introduces a third image-cache, which essentially sits "between" the two existing ones. The new `RegionalImageCache`[1] will be usable throughout a `PartialEvaluator` instance, and consequently it only caches references, which thus allows us to keep track of repeated image-resources found in e.g. different /Form and /SMask objects. --- [1] For lack of a better word, since naming things is hard...	2023-04-10 11:34:41 +02:00
Tim van der Meij	13f2426aab	Merge pull request #16238 from Snuffleupagus/update-Node-compat-check Update the Node.js compatibility-check in the worker-thread	2023-04-01 14:20:33 +02:00
Jonas Jenwald	57a307d0cd	Update the Node.js compatibility-check in the worker-thread Please note: In Node.js environments a `legacy`-build must be used since only those versions include any polyfills. Previously we'd only check if `ReadableStream` is natively supported, however since Node.js version 18 that's now been implemented; please see https://developer.mozilla.org/en-US/docs/Web/API/ReadableStream#browser_compatibility Hence we'll also check for the availability of `Path2D`, since that's browser-specific functionality not expected to be available in Node.js environments; please see https://developer.mozilla.org/en-US/docs/Web/API/Path2D#browser_compatibility	2023-03-30 18:36:15 +02:00
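
Commit 117bbf7cd9 above describes normalizing search text with normalize("NFKC") and notes that one character can expand into several. The snippet below is only a minimal sketch of that behaviour using the standard `String.prototype.normalize` method; the helper name `normalizeForSearch` is hypothetical and is not taken from the PDF.js find controller.

```javascript
// Hypothetical helper (an assumption for illustration, not PDF.js code):
// apply Unicode NFKC normalization to a search query or copied string.
function normalizeForSearch(text) {
  return text.normalize("NFKC");
}

// \ufe94 is the Arabic presentation form mentioned in the commit message;
// NFKC maps it to the base letter \u0629.
const arabicChar = normalizeForSearch("\ufe94");
console.log(arabicChar === "\u0629");

// \ufb01 is the single-character "fi" ligature; NFKC expands it to two
// characters, which is why a normalized string can be longer than its source.
const ligature = normalizeForSearch("\ufb01");
console.log(ligature, ligature.length);
```

Because the normalized string may differ in length from the original, any code that maps search matches back to positions in the original text needs to account for that offset.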

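Commit 317abd6d07 above turns the `createPromiseCapability` helper into a `PromiseCapability` class with a `settled` getter. The following is a rough sketch of what such a class could look like, assuming a private field tracks the settled state; it is an illustration of the idea and not necessarily the code that landed in PDF.js.

```javascript
// Sketch only: a promise bundled with its resolve/reject functions,
// plus a read-only flag that reports whether it has been settled.
class PromiseCapability {
  #settled = false;

  constructor() {
    this.promise = new Promise((resolve, reject) => {
      this.resolve = data => {
        this.#settled = true;
        resolve(data);
      };
      this.reject = reason => {
        this.#settled = true;
        reject(reason);
      };
    });
  }

  get settled() {
    return this.#settled;
  }
}

// Usage: the flag flips synchronously when resolve() is called, while the
// `then` callback runs later as a microtask.
const capability = new PromiseCapability();
console.log(capability.settled);
capability.promise.then(value => console.log("resolved with", value));
capability.resolve(42);
console.log(capability.settled);
```

Keeping the flag in a private field means callers can read `settled` but cannot overwrite it by accident.
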
2801 Commits