pdf.js

Author	SHA1	Message	Date
Jonas Jenwald	04de155aaa	Slightly shorten the `loadedName`-ids used with font-substitutions Generally we try to keep the ids that we create short, hence we can slightly shorten the "static" part of them.	2023-05-18 22:27:11 +02:00
Calixte Denizet	3091e70aad	Flush the current chunk when the font changed because of a restore op (issue #14755)	2023-05-18 19:37:16 +02:00
calixteman	839be801a0	Merge pull request #16433 from calixteman/bug1825002 For text widgets, get the text from the AP stream instead of from the format callback (bug 1825002)	2023-05-17 16:48:59 +02:00
Calixte Denizet	177036e6ae	For text widgets, get the text from the AP stream instead of from the format callback (bug 1825002) When fixing bug 1766987, I thought the field formatted value came from the result of the format callback: I was wrong. The format callback is run but the value is unused (maybe it's useful to set some global vars... or it's just a bug in Acrobat). Anyway, the value to display is the one rendered in the AP stream. The field value setter has been simplified and that fixes issue #16409.	2023-05-17 14:07:28 +02:00
Jonas Jenwald	bfb374dbf6	Attempt to fallback to a default font, for non-available ones, in more cases (issue 16432) This essentially extends PR 11218 to also apply when looking up the final font-reference, via the XRef-table, fails because the font isn't available. This patch also changes `PartialEvaluator.fallbackFontDict` to simply use "Helvetica" as the default font-name, since that seems generally reasonable given the now existing font-substitution code.	2023-05-17 11:41:08 +02:00
Calixte Denizet	385f275ad9	Warn when pdf.js can't load an OS font	2023-05-16 14:58:38 +02:00
Jonas Jenwald	cb1a10e358	Check the `css` property in the `getFontSubstitution` unit-tests Given that the `css` property isn't constant, since it contains document/font ids, we cannot just check it directly. However, we can make use of regular expressions to ensure that the format is generally correct.	2023-05-14 19:11:35 +02:00
calixteman	4101128c09	Merge pull request #16421 from calixteman/font_subst_test Add tests for the font substitution	2023-05-14 18:23:12 +02:00
Calixte Denizet	89140fcd98	Add tests for the font substitution	2023-05-14 18:07:03 +02:00
Jonas Jenwald	8fbd6755eb	Enable the `unicorn/no-useless-promise-resolve-reject` ESLint plugin rule Please see https://github.com/sindresorhus/eslint-plugin-unicorn/blob/main/docs/rules/no-useless-promise-resolve-reject.md Note that this patch also re-sorts the existing `unicorn`-rules in proper alphabetical order.	2023-05-13 11:30:25 +02:00
Jonas Jenwald	8f3940fbf3	Move the sidebar-resizing handling into the `PDFSidebar` class Originally the `PDFSidebarResizer` class was slightly larger, since the code used to contain e.g. feature testing for older (and no longer supported) browsers. Given that there's some amount of overlap, when it comes to which DOM-elements and state these classes need, it now seems reasonable to simply move the sidebar-resizing into the `PDFSidebar` class. For the MOZCENTRAL build-target this patch reduces the size of the built `web/viewer.js` file by just over `1.1` kilobytes.	2023-05-12 10:00:12 +02:00
Calixte Denizet	cfb908c999	Add a cache to avoid loading a local font several times On my computer, it takes a few tenths of a second to load a local font. Since a font can be used several times in a document, the cache will improve performance.	2023-05-10 20:01:21 +02:00
Calixte Denizet	2486536843	Compress the data when saving annotations The CompressionStream API has been added in Firefox 113 (see https://bugzilla.mozilla.org/show_bug.cgi?id=1823619), hence we can use it to compress the streams with added/modified annotations.	2023-05-09 14:46:50 +02:00
calixteman	8f2d8f62f3	Merge pull request #16397 from calixteman/issue14565 Make something similar to Acrobat when Underline annotation has no appearance	2023-05-08 21:16:49 +02:00
Jonas Jenwald	dcd55a7164	Enable `unicorn/prefer-at` unconditionally (PR 15014 follow-up) Now that Node.js version 18 is required, we should be able to use `Array.prototype.at()` everywhere in the code-base.	2023-05-07 13:43:19 +02:00
Calixte Denizet	6c0fdc6ec2	Make something similar to Acrobat when Underline annotation has no appearance	2023-05-06 21:19:25 +02:00
Jonas Jenwald	722e5910e1	Improve handling of JPEG images with non-standard /Decode-entries (issue 16395) The /Decode-implementation in our JPEG decoder, i.e. `src/core/jpg.js`, seems to only handle inverting of images properly. To support arbitrary /Decode-entries correctly we'll always use the `PDFImage.decodeBuffer` method, even for "simple" JPEG images, which should be fine since non-default /Decode-entries aren't a very common occurrence. Please note: This patch will lead to a little bit of movement in some existing test-cases, however it should be virtually imperceivable to the naked eye.	2023-05-06 13:55:39 +02:00
calixteman	f151a39d14	Merge pull request #16387 from calixteman/issue16384 [Annotations] Draw readonly annotations on their own canvas and show the HTML elements when there is a JS interaction (issue #16384)	2023-05-04 21:49:08 +02:00
Calixte Denizet	72da14f005	[Annotations] Draw readonly annotations on their own canvas and show the HTML elements when there is a JS interaction (issue #16384)	2023-05-04 20:08:32 +02:00
calixteman	a24e11a91c	Merge pull request #16106 from bungeman/improve_color_stop_detection Better approximate gradient color stops	2023-05-04 19:48:57 +02:00
Jonas Jenwald	f31b320113	Merge pull request #12563 from Snuffleupagus/rm-SystemJS-worker [api-minor] Remove SystemJS usage, in development mode, from the worker	2023-05-03 23:57:17 +02:00
Calixte Denizet	c07149a44f	Apply HCM filters on annotations which have their own canvas (bug 1830850)	2023-05-03 10:19:59 +02:00
Jonas Jenwald	317abd6d07	Change the `createPromiseCapability` helper function into a `PromiseCapability` class This is not only slightly more compact, but it also simplifies the handling of the `settled` getter.	2023-04-29 13:43:24 +02:00
Calixte Denizet	b4264e9648	Fix two intermittent issues in integration tests	2023-04-25 12:31:36 +02:00
Jonas Jenwald	58b5eb89b8	Merge pull request #16315 from Snuffleupagus/annotationLayer-CSS-is Introduce some `:is` usage in the annotationLayer CSS	2023-04-19 15:32:10 +02:00
Jonas Jenwald	5119e7fd6a	Merge pull request #16313 from Snuffleupagus/textLayer-CSS-is Introduce some `:is` usage in the textLayer CSS	2023-04-19 15:17:22 +02:00
Calixte Denizet	19ca41896e	Correctly clip the text in the text layer (fixes #16316)	2023-04-18 17:00:42 +02:00
Jonas Jenwald	fcc535706a	Introduce some `:is` usage in the annotationLayer CSS While this slightly reduces duplication in the CSS rules, some of the auto-formatting done by Prettier is perhaps not great. (Given the overall advantage of using Prettier, we'll probably have to simply accept this.)	2023-04-18 12:42:13 +02:00
Jonas Jenwald	5cb99321d7	Introduce some `:is` usage in the textLayer CSS	2023-04-18 11:39:09 +02:00
Calixte Denizet	117bbf7cd9	[api-minor] Don't normalize the text used in the text layer. Some Arabic chars like \ufe94 can be searched for in a PDF, hence they must be normalized when creating the search query. To avoid duplicating the normalization code, everything is moved into the find controller. The previous code to normalize text was using NFKC but with a hardcoded map, hence it has been replaced by the use of normalize("NFKC") (it helps to reduce the bundle size by 30kb). While playing with this \ufe94 char, I noticed that the bidi algorithm wasn't taking into account some RTL unicode ranges, the generated font wasn't embedding the mapping for this char, and the unicode ranges in the OS/2 table weren't up-to-date. When normalized, some chars can be replaced by several ones, which led to some extra chars in the text layer. To avoid any regression, when copying some text from the text layer, the copied string is normalized (NFKC) before being put in the clipboard (it works like this in both Acrobat and Chrome).	2023-04-17 14:31:23 +02:00
calixteman	3e08eee511	Merge pull request #16301 from calixteman/issue16278 [Editor] Take into account the initial rotation (issue #16278)	2023-04-17 09:42:07 +02:00
Calixte Denizet	8e5f4c0622	[Editor] Take into account the initial rotation (issue #16278)	2023-04-16 21:36:26 +02:00
Tim van der Meij	f46ed43b81	Merge pull request #16247 from Snuffleupagus/issue-7442 [api-minor] Add support, in `PDFFindController`, for mixing phrase/word searches (issue 7442)	2023-04-16 14:23:41 +02:00
Calixte Denizet	ca54ea12b3	Add the possibility to copy all the pdf text whatever the rendered pages are (bug 1788035)	2023-04-15 18:59:40 +02:00
Jonas Jenwald	0e19c3a120	[api-minor] Add support, in `PDFFindController`, for mixing phrase/word searches (issue 7442) Please note: This patch only extends the `PDFFindController` implementation itself to support this functionality, however it's purposely not exposed in the default viewer. This replaces the previous `phraseSearch`-parameter, and a `query`-string will now always be interpreted as a phrase-search. To enable searching for individual words, the `query`-parameter must instead consist of an Array of strings. This way it's now also possible to combine phrase/word searches, with a `query`-parameter looking something like `["Lorem ipsum", "foo", "bar"]` which will search for the phrase "Lorem ipsum" and the words "foo" and "bar", respectively.	2023-04-15 13:32:37 +02:00
calixteman	7571842d84	Merge pull request #16275 from calixteman/ifx_search_with_fractions Fix search of numbers inside fractions	2023-04-11 21:52:56 +02:00
Calixte Denizet	d8795f9f8f	Fix search of numbers inside fractions	2023-04-11 20:57:26 +02:00
Jonas Jenwald	3a36a9d337	Merge pull request #16268 from Snuffleupagus/RegionalImageCache Attempt to also cache images at the "page"-level (issue 16263)	2023-04-11 12:06:29 +02:00
calixteman	c1c372c320	Merge pull request #16225 from calixteman/16224 Thin whitespaces must have their own span	2023-04-11 11:13:16 +02:00
Jonas Jenwald	9881dbf927	Attempt to also cache images at the "page"-level (issue 16263) Currently we have two separate image-caches on the worker-thread: - A local one, which is unique to each `PartialEvaluator.getOperatorList` invocation. This one caches both names and references, since image-resources may be accessed in either way. - A global one, which applies to the entire PDF document and all its pages. This one only caches references, since nothing else would work. This patch introduces a third image-cache, which essentially sits "between" the two existing ones. The new `RegionalImageCache`[1] will be usable throughout a `PartialEvaluator` instance, and consequently it only caches references, which thus allows us to keep track of repeated image-resources found in e.g. different /Form and /SMask objects. --- [1] For lack of a better word, since naming things is hard...	2023-04-10 11:34:41 +02:00
Jonas Jenwald	5063a6f2a9	[api-minor] Remove the `disableCombineTextItems` option Please note: This parameter has never been used within the PDF.js library/viewer itself, and it was only ever added for backwards compatibility reasons. This parameter was added in PR 7475, over six years ago, to try and optionally maintain the previous default text-extraction behaviour. However as part of the general text-extraction improvements in PR 13257, almost two years ago, the `disableCombineTextItems` functionality was accidentally "broken" in various ways. Note how the only (very basic) unit-test was updated in a way that doesn't really make sense, since generally speaking you'd expect that using the option should result in more (or at least the same number of) text-items. Furthermore there's also the recent issue 16209, where the option causes almost all textContent to be concatenated together. Hence this patch proposes that we simply remove the `disableCombineTextItems` option since it's essentially unused/untested functionality, as evident from the fact that it took almost two years for someone to notice that it's broken.	2023-03-30 14:23:38 +02:00
Calixte Denizet	4b7eb1436d	Thin whitespaces must have their own span	2023-03-29 11:23:58 +02:00
Calixte Denizet	a96f10e55d	Create a new chunk when the char is too raised compared to the previous one	2023-03-28 13:56:46 +02:00
Jonas Jenwald	a4dfa04a0b	Enable the `declaration-block-no-redundant-longhand-properties` Stylelint rule Note that these changes were done automatically, using `gulp lint --fix`. This rule will help avoid unnecessary repetition in the CSS; please see https://stylelint.io/user-guide/rules/declaration-block-no-redundant-longhand-properties/	2023-03-25 10:08:27 +01:00
Jonas Jenwald	1fc09f0235	Enable the `unicorn/prefer-string-replace-all` ESLint plugin rule Note that the `replaceAll` method still requires that a global regular expression is used, however by using this method it's immediately obvious when looking at the code that all occurrences will be replaced; please see https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Global_Objects/String/replaceAll#parameters Please find additional details at https://github.com/sindresorhus/eslint-plugin-unicorn/blob/main/docs/rules/prefer-string-replace-all.md	2023-03-23 12:57:10 +01:00
Jonas Jenwald	5f64621d46	Use `String.prototype.replaceAll()` where appropriate This fairly new method allows replacing multiple occurrences within a string without having to use regular expressions. Please refer to: - https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Global_Objects/String/replaceAll - https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Global_Objects/String/replaceAll#browser_compatibility	2023-03-22 15:31:10 +01:00
Jonas Jenwald	915bdd6576	Merge pull request #16173 from Snuffleupagus/inset Introduce `inset` usage in the CSS files	2023-03-22 12:57:57 +01:00
Jonas Jenwald	137a2d6e30	Add even more non-standard ligatures (PR 15517 follow-up) Given that we already create multi-byte ToUnicode entries in other cases, see e.g. the `getNormalizedUnicodes` table, this is hopefully fine.	2023-03-22 10:42:52 +01:00
Jonas Jenwald	9321758d91	Merge pull request #16186 from Snuffleupagus/issue-16176 Support multi-byte ToUnicode entries, when using predefined CMaps (issue 16176)	2023-03-21 22:17:18 +01:00
Jonas Jenwald	d4bcfe8c16	Support multi-byte ToUnicode entries, when using predefined CMaps (issue 16176) Hopefully this makes sense, since we already "create" multi-byte ToUnicode entries in other cases (see e.g. the `getNormalizedUnicodes` table).	2023-03-21 21:35:57 +01:00
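
Note on commit 117bbf7cd9 (listed above): the move from a hardcoded map to `normalize("NFKC")` can be illustrated with standard JavaScript alone. The sketch below is not code from PDF.js; `normalizeForSearch` is a hypothetical helper name used only for illustration. It shows the two effects the commit message describes: a presentation-form Arabic char such as \ufe94 maps to its base char, and a single char can expand into several.

```javascript
// Hypothetical helper (not part of PDF.js): normalize a search query with
// the built-in String.prototype.normalize, using the NFKC form.
function normalizeForSearch(text) {
  return text.normalize("NFKC");
}

// U+FE94 (ARABIC LETTER TEH MARBUTA FINAL FORM) is a compatibility
// presentation form; NFKC maps it to the base letter U+0629.
const arabic = normalizeForSearch("\ufe94");
console.log(arabic === "\u0629");

// A single char can be replaced by several ones: the "fi" ligature
// (U+FB01) becomes the two chars "f" and "i".
const ligature = normalizeForSearch("\ufb01");
console.log(ligature, ligature.length);
```

Because normalization can change the length of a string, offsets computed on the normalized text do not necessarily line up with offsets in the original text.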

3176 Commits
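
Note on commit 317abd6d07 (listed above): the message says that turning `createPromiseCapability` into a `PromiseCapability` class simplifies the handling of the `settled` getter. A minimal sketch of what such a class might look like follows. The `promise`, `resolve` and `reject` members and the private `#settled` field are assumptions made for illustration, and this is not necessarily the actual PDF.js source.

```javascript
// Sketch only, under the assumptions stated above.
class PromiseCapability {
  // Assumed private flag backing the `settled` getter.
  #settled = false;

  constructor() {
    // Expose the promise together with its resolve/reject functions.
    this.promise = new Promise((resolve, reject) => {
      this.resolve = data => {
        this.#settled = true;
        resolve(data);
      };
      this.reject = reason => {
        this.#settled = true;
        reject(reason);
      };
    });
  }

  // True once resolve() or reject() has been called.
  get settled() {
    return this.#settled;
  }
}

// Usage: the capability can be settled from outside the Promise executor.
const capability = new PromiseCapability();
const before = capability.settled;
capability.resolve("done");
const after = capability.settled;
capability.promise.then(value => console.log(before, after, value));
```

Keeping the flag in a private field means callers can read `settled` but cannot overwrite it.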