pdf.js

Author	SHA1	Message	Date
Calixte Denizet	35a58ed987	Extract all the text of text annotations	2023-05-25 23:11:42 +02:00
calixteman	8d5da54cd5	Merge pull request #16467 from calixteman/non_null_ultimate Avoid to have a null fallback if none has been provided	2023-05-24 17:00:14 +02:00
Jonas Jenwald	5a7beb9f30	Attempt to improve non-embedded Wingdings font support (bug 1652224) Now that font-substitution has been implemented, we should be able to do much a better job at supporting non-embedded Wingdings fonts. Given that this is a Windows-specific font, see https://en.wikipedia.org/wiki/Wingdings, this is however not guaranteed to work (well) on other platforms.	2023-05-24 14:59:13 +02:00
Calixte Denizet	7dce0a27f6	Avoid to have a null fallback if none has been provided	2023-05-24 14:44:36 +02:00
Jonas Jenwald	aeed6f2b67	Ignore named encoding for non-embedded symbol fonts (issue 16464) The affected font is non-embedded ZapfDingbats, however the PDF document for some inexplicable reason specifies the encoding as "WinAnsiEncoding" (which is obviously wrong). To work-around this bug in the PDF generator, we'll simply ignore any explicitly specified named encoding for non-embedded symbol fonts.	2023-05-24 10:48:47 +02:00
Jonas Jenwald	a6f9505a39	Merge pull request #16461 from Snuffleupagus/issue-16454 Improve "EI" detection in inline images (PR 12028 follow-up, issue 16454)	2023-05-23 22:23:22 +02:00
Calixte Denizet	a76a69e1ed	Take into account the final space if any in the TJ command The final space was just ignored and that led to wrongly position the next chunk of text.	2023-05-23 17:09:32 +02:00
Jonas Jenwald	dfbbb8c0ac	Improve "EI" detection in inline images (PR 12028 follow-up, issue 16454) Given that inline images may contain "EI"-sequences in the image-data itself, actually finding the end-of-image operator isn't always straightforward. Here we extend the implementation from PR 12028 to potentially check all of the following bytes, rather than stopping immediately. While we have fairly decent test-coverage for this code, whenever you're changing it there's unfortunately a slightly higher than normal risk of regressions. (You'd really wish that PDF generators just stop using inline images.)	2023-05-23 17:04:51 +02:00
Calixte Denizet	ca12bca276	Sanitize the glyph bounding box - if the contours count is lower than -1, the glyph is really likely wrong so just remove it from the font; - if a contour has the repeat flag then repeats count mustn't be 0.	2023-05-21 16:24:41 +02:00
Jonas Jenwald	f657de7de2	Extend `getNonStdFontMap` for non-embedded Impact fonts (bug 1365930) According to https://en.wikipedia.org/wiki/Impact_(typeface) this font should be available on all current versions of Windows, and with the recently added font-substitution we should actually be able to render it correctly (at least on Windows).	2023-05-19 18:40:03 +02:00
Jonas Jenwald	8c4821ceda	[api-minor] Slightly shorten the marked-content ids used in the textLayer Generally we try to keep the ids that we create short, hence we can slightly shorten the "static" parts of them.	2023-05-18 22:32:10 +02:00
Jonas Jenwald	04de155aaa	Slightly shorten the `loadedName`-ids used with font-substitutions Generally we try to keep the ids that we create short, hence we can slightly shorten the "static" part of them.	2023-05-18 22:27:11 +02:00
Jonas Jenwald	3be66f59d6	Merge pull request #16440 from Snuffleupagus/more-modern-JS Introduce even more modern JavaScript features in the code-base	2023-05-18 20:56:00 +02:00
Calixte Denizet	3091e70aad	Flush the current chunk when the font changed because of a restore op (issue #14755 )	2023-05-18 19:37:16 +02:00
Jonas Jenwald	e8030752f3	Introduce even more modern JavaScript features in the code-base After PR 12563 we're now free to use e.g. logical OR assignment, nullish coalescing, and optional chaining in the entire code-base.	2023-05-18 18:55:41 +02:00
Jonas Jenwald	4355e76c60	Simplify the `fontID` handling in `PartialEvaluator.loadFont` The `fontID` handling is quite old and predates the use of the `idFactory` to generate a unique id for each font, hence we can simplify this code a little bit.	2023-05-18 13:09:08 +02:00
Tim van der Meij	ac8032628b	Merge pull request #16424 from Snuffleupagus/core-optional-chaining Introduce more optional chaining in the `src/core/` folder	2023-05-18 12:40:08 +02:00
calixteman	839be801a0	Merge pull request #16433 from calixteman/bug1825002 For text widgets, get the text from the AP stream instead of from the format callback (bug 1825002)	2023-05-17 16:48:59 +02:00
Calixte Denizet	177036e6ae	For text widgets, get the text from the AP stream instead of from the format callback (bug 1825002) When fixing bug 1766987, I thought the field formatted value came from the result of the format callback: I was wrong. The format callback is ran but the value is unused (maybe it's useful to set some global vars... or it's just a bug in Acrobat). Anyway the value to display is the one rendered in the AP stream. The field value setter has been simplified and that fixes issue #16409.	2023-05-17 14:07:28 +02:00
Jonas Jenwald	bfb374dbf6	Attempt to fallback to a default font, for non-available ones, in more cases (issue 16432) This essentially extends PR 11218 to also apply when looking up the final font-reference, via the XRef-table, fails because the font isn't available. This patch also changes `PartialEvaluator.fallbackFontDict` to simply use "Helvetica" as the default font-name, since that seems generally reasonable given the now existing font-substitution code.	2023-05-17 11:41:08 +02:00
Calixte Denizet	385f275ad9	Warn when pdf.js can't load an OS font	2023-05-16 14:58:38 +02:00
Calixte Denizet	4e8dd54e8e	For non-embedded fonts, don't generate the fallback several times	2023-05-15 20:02:45 +02:00
Calixte Denizet	b264e0301a	Simplify the code to generate font substitution information	2023-05-15 19:17:52 +02:00
Jonas Jenwald	1b4a7c5965	Introduce more optional chaining in the `src/core/` folder After PR 12563 we're now free to use optional chaining in the worker-thread as well. (This patch also fixes one previously "missed" case in the `web/` folder.) For the MOZCENTRAL build-target this patch reduces the total bundle-size by `1.6` kilobytes.	2023-05-15 12:38:28 +02:00
Calixte Denizet	d4b70ec306	For missing font, use a local font if it exists even if there's no standard substitution If the font foo is missing we just try lo load local(foo) and maybe we'll be lucky.	2023-05-13 21:54:27 +02:00
Calixte Denizet	cfb908c999	Add a cache to avoid to load several times a local font On my computer, it takes few tenths of a second to load a local font. Since a font can be used several times in a document, the cache will improve performances.	2023-05-10 20:01:21 +02:00
calixteman	2d2f7b315e	Merge pull request #16363 from calixteman/use_local_font [api-minor] Use a local font or fallback on an embedded one (if it exists) for non-embedded fonts (bug 1766039)	2023-05-10 14:19:05 +02:00
Calixte Denizet	53134c0c0b	[api-minor] Use a local font or fallback on an embedded one (if it exists) for non-embedded fonts (bug 1766039) - Replace FoxitSans with LiberationSans: LiberationSans is already there (for XFA) and we can use it as a good replacement of FoxitSans. - For now we just try to substitue standard fonts, the strategy is the following: * we try to find a font locally from a hardcoded list; * if it fails then we use Liberation as fallback (only for Helvetica for the moment); * else we just fallback on the system serif/sansserif/monospace font.	2023-05-10 14:10:23 +02:00
Calixte Denizet	2486536843	Compress the data when saving annotions CompressionStream API has been added in Firefox 113 (see https://bugzilla.mozilla.org/show_bug.cgi?id=1823619) hence we can use it to compress the streams with added/modified annotations.	2023-05-09 14:46:50 +02:00
calixteman	8f2d8f62f3	Merge pull request #16397 from calixteman/issue14565 Make something similar to Acrobat when Underline annotation has no appearance	2023-05-08 21:16:49 +02:00
Tim van der Meij	bfb664b9a1	Merge pull request #16398 from Snuffleupagus/xfa-optional-chaining Introduce some optional chaining in the `src/core/xfa/` folder	2023-05-07 14:54:05 +02:00
Jonas Jenwald	1753e321cd	Remove the compatibility checks in `WorkerMessageHandler.createDocumentHandler` For some time these checks have only targeted Node.js environments, since the features in question exist in all supported browsers (even when a `legacy`-build is used). Now that we've updated the minimum supported Node.js version to 18, a number of polyfills are thus (finally) no longer necessary in that environment. Hence for certain basic functionality, such as e.g. text-extraction, it's now possible to use either a modern- or a `legacy`-build of the PDF.js library in Node.js environments. Please note: For e.g. canvas-rendering in Node.js environments it's still necessary to use a `legacy`-build, since that functionality requires various polyfills.	2023-05-07 13:43:19 +02:00
Jonas Jenwald	ed8be6f882	[api-minor] Update the minimum supported Node.js version to 18 This patch updates the minimum supported environments as follows: - Node.js 18, which was released on 2022-04-19; see https://en.wikipedia.org/wiki/Node.js#Releases Note also that Node.js 16 will soon reach EOL, and thus no longer receive any security updates.	2023-05-07 13:43:19 +02:00
Jonas Jenwald	89f768322d	Introduce some optional chaining in the `src/core/xfa/` folder After PR 12563 we're now free to use optional chaining in the worker-thread as well.	2023-05-07 12:49:07 +02:00
Calixte Denizet	6c0fdc6ec2	Make something similar to Acrobat when Underline annotation has no appearance	2023-05-06 21:19:25 +02:00
Jonas Jenwald	722e5910e1	Improve handling of JPEG images with non-standard /Decode-entries (issue 16395) The /Decode-implementation in the our JPEG decoder, i.e. `src/core/jpg.js`, seems to only handle inverting of images properly. To support arbitrary /Decode-entries correctly we'll always use the `PDFImage.decodeBuffer` method, even for "simple" JPEG images, which should be fine since non-default /Decode-entries aren't a very common occurrence. Please note: This patch will lead to a little bit of movement in some existing test-cases, however it should be virtually imperceivable to the naked eye.	2023-05-06 13:55:39 +02:00
calixteman	f151a39d14	Merge pull request #16387 from calixteman/issue16384 [Annotations] Draw readonly annotations on their own canvas and show the HTML elements when there is a JS interaction (issue #16384)	2023-05-04 21:49:08 +02:00
Calixte Denizet	72da14f005	[Annotations] Draw readonly annotations on their own canvas and show the HTML elements when there is a JS interaction (issue #16384 )	2023-05-04 20:08:32 +02:00
calixteman	a24e11a91c	Merge pull request #16106 from bungeman/improve_color_stop_detection Better approximate gradient color stops	2023-05-04 19:48:57 +02:00
Jonas Jenwald	667085ee33	Merge pull request #16368 from Snuffleupagus/rm-GlobalImageCache-addPageIndex Inline the `addPageIndex` method in `GlobalImageCache.shouldCache`	2023-05-04 12:09:04 +02:00
Jonas Jenwald	001acfb5ac	Merge pull request #16381 from Snuffleupagus/rm-isStandardFont-prop Remove the unused `isStandardFont` font-property (PR 15880 follow-up)	2023-05-04 00:30:05 +02:00
Jonas Jenwald	24a75bda5d	Remove the unused `isStandardFont` font-property (PR 15880 follow-up) This property was added in PR 12726 specifically for use in the `getFontType` function, indirectly used by the `PDFDocumentProxy.stats` getter in the API. In PR 15880 that functionality was removed, but I forgot to remove this now unused font-property.	2023-05-03 11:52:54 +02:00
Jonas Jenwald	88616f77ae	Remove the closure from `BitModel` in the `src/core/jpx.js` file	2023-04-29 13:49:39 +02:00
Jonas Jenwald	b0a1af306d	Simplify initialization of `static` class properties in the worker-thread Now that we no longer depend on the old Babel version in SystemJS we can remove the `static get ...` work-arounds used to define constants, which leads to slightly more compact code.	2023-04-29 13:49:38 +02:00
Jonas Jenwald	d950b91c4e	Introduce some logical assignment in the `src/core/` folder	2023-04-29 13:49:37 +02:00
Jonas Jenwald	317abd6d07	Change the `createPromiseCapability` helper function into a `PromiseCapability` class This is not only slightly more compact, but it also simplifies the handling of the `settled` getter.	2023-04-29 13:43:24 +02:00
Jonas Jenwald	94c2d08975	Revert "Add a `getArrayLookupTableFactory` helper function and use it to re-format `src/core/{glyphlist, unicode}.js`" This reverts commit 56fa6d414cb1115e03f9c1aa9f1d5bc52efcb7ac now that SystemJS is gone.	2023-04-29 13:43:24 +02:00
Jonas Jenwald	95bf9fc17f	Remove SystemJS usage, in development mode, from the worker Now that https://bugzilla.mozilla.org/show_bug.cgi?id=1247687 has landed in Firefox, we're able to use worker-modules during development :-) This removes the final piece of SystemJS usage from the PDF.js library, thus allowing a fair bit of clean-up, and we now use only native `import`/`export` statements everywhere in development mode.	2023-04-29 13:43:24 +02:00
Jonas Jenwald	bb1228cb64	Inline the `addPageIndex` method in `GlobalImageCache.shouldCache` When the `GlobalImageCache` implementation originally landed, back in PR 11912, the image handling was slightly more complex (with e.g. browser-decoding of some JPEG images). At this point it no longer seems necessary to manually handle pageIndexes in this way, and we should be able to simply inline that in the `GlobalImageCache.shouldCache` method.	2023-04-28 09:40:32 +02:00
Jonas Jenwald	e12535457f	Avoid some repeated `stringToBytes`-calls in the `src/core/crypto.js` file Currently we repeatedly lookup, and convert to bytes, the "O" and "U" encryption-dictionary entries.	2023-04-26 17:52:46 +02:00

1 2 3 4 5 ...

2872 Commits