pdf.js

Author	SHA1	Message	Date
Calixte Denizet	599b9498f2	[Editor] Add support for printing/saving newly added Stamp annotations In order to minimize the size the of a saved pdf, we generate only one image and use a reference in each annotation using it. When printing, it's slightly different since we have to render each page independantly but we use the same image within a page.	2023-06-26 15:47:05 +02:00
Calixte Denizet	be25ee12bb	Add a container for Signature with their own canvas	2023-06-15 16:11:52 +02:00
Calixte Denizet	1a047f843c	[Editor] Add the possibility to update an existing annotation with some new properties when saving or printing	2023-06-09 17:14:53 +02:00
Calixte Denizet	3d0ce1cff2	Concat data when push fails in the CFF compiler	2023-06-09 15:48:01 +02:00
Jonas Jenwald	459d26edec	Improve handling of mismatching /BaseFont and /FontName entries for non-embedded fonts (issue 7454) This patch is the result of me going through some old issues regarding non-embedded Wingdings support. There's a few different things wrong in the referenced PDF document: - The /BaseFont and /FontName entries don't agree on the name of the fonts, with one font using `/BaseFont /Wingdings-Regular` and `/FontName /wg09np` which obviously makes no sense. To address this we'll compare the font-names against our lists of known ones and ignore /FontName entries that don't make sense iff the /BaseFont entry is a known font-name. - The non-embedded Wingdings font also set an incorrect /Encoding, in this case /MacRomanEncoding, which should have been fixed by PR 16465. However this doesn't work since the font has bogus font-flags, that fail to categorize the font as Symbolic. To address this we'll also compare the font-name against the list of known symbol fonts.	2023-06-02 17:10:25 +02:00
Calixte Denizet	0e610cab04	Try to not omit some values when printing a choice list with several selected items	2023-05-31 21:17:22 +02:00
Calixte Denizet	133d103186	[Editor] Add few more info when saving ink data (thickness, opacity, ...) Fix the InkList entry: the coordinates were relative to the page and not to the bounding box of the annotation.	2023-05-31 15:43:07 +02:00
Calixte Denizet	78e6020a6e	[OTS] Remove cntrmask instruction with no stem in charstring (bug 1529502)	2023-05-28 19:03:37 +02:00
Calixte Denizet	35a58ed987	Extract all the text of text annotations	2023-05-25 23:11:42 +02:00
Jonas Jenwald	5a7beb9f30	Attempt to improve non-embedded Wingdings font support (bug 1652224) Now that font-substitution has been implemented, we should be able to do much a better job at supporting non-embedded Wingdings fonts. Given that this is a Windows-specific font, see https://en.wikipedia.org/wiki/Wingdings, this is however not guaranteed to work (well) on other platforms.	2023-05-24 14:59:13 +02:00
Jonas Jenwald	aeed6f2b67	Ignore named encoding for non-embedded symbol fonts (issue 16464) The affected font is non-embedded ZapfDingbats, however the PDF document for some inexplicable reason specifies the encoding as "WinAnsiEncoding" (which is obviously wrong). To work-around this bug in the PDF generator, we'll simply ignore any explicitly specified named encoding for non-embedded symbol fonts.	2023-05-24 10:48:47 +02:00
Jonas Jenwald	a6f9505a39	Merge pull request #16461 from Snuffleupagus/issue-16454 Improve "EI" detection in inline images (PR 12028 follow-up, issue 16454)	2023-05-23 22:23:22 +02:00
Calixte Denizet	a76a69e1ed	Take into account the final space if any in the TJ command The final space was just ignored and that led to wrongly position the next chunk of text.	2023-05-23 17:09:32 +02:00
Jonas Jenwald	dfbbb8c0ac	Improve "EI" detection in inline images (PR 12028 follow-up, issue 16454) Given that inline images may contain "EI"-sequences in the image-data itself, actually finding the end-of-image operator isn't always straightforward. Here we extend the implementation from PR 12028 to potentially check all of the following bytes, rather than stopping immediately. While we have fairly decent test-coverage for this code, whenever you're changing it there's unfortunately a slightly higher than normal risk of regressions. (You'd really wish that PDF generators just stop using inline images.)	2023-05-23 17:04:51 +02:00
Calixte Denizet	ca12bca276	Sanitize the glyph bounding box - if the contours count is lower than -1, the glyph is really likely wrong so just remove it from the font; - if a contour has the repeat flag then repeats count mustn't be 0.	2023-05-21 16:24:41 +02:00
Jonas Jenwald	f657de7de2	Extend `getNonStdFontMap` for non-embedded Impact fonts (bug 1365930) According to https://en.wikipedia.org/wiki/Impact_(typeface) this font should be available on all current versions of Windows, and with the recently added font-substitution we should actually be able to render it correctly (at least on Windows).	2023-05-19 18:40:03 +02:00
Jonas Jenwald	bfb374dbf6	Attempt to fallback to a default font, for non-available ones, in more cases (issue 16432) This essentially extends PR 11218 to also apply when looking up the final font-reference, via the XRef-table, fails because the font isn't available. This patch also changes `PartialEvaluator.fallbackFontDict` to simply use "Helvetica" as the default font-name, since that seems generally reasonable given the now existing font-substitution code.	2023-05-17 11:41:08 +02:00
Calixte Denizet	2486536843	Compress the data when saving annotions CompressionStream API has been added in Firefox 113 (see https://bugzilla.mozilla.org/show_bug.cgi?id=1823619) hence we can use it to compress the streams with added/modified annotations.	2023-05-09 14:46:50 +02:00
Calixte Denizet	6c0fdc6ec2	Make something similar to Acrobat when Underline annotation has no appearance	2023-05-06 21:19:25 +02:00
Jonas Jenwald	722e5910e1	Improve handling of JPEG images with non-standard /Decode-entries (issue 16395) The /Decode-implementation in the our JPEG decoder, i.e. `src/core/jpg.js`, seems to only handle inverting of images properly. To support arbitrary /Decode-entries correctly we'll always use the `PDFImage.decodeBuffer` method, even for "simple" JPEG images, which should be fine since non-default /Decode-entries aren't a very common occurrence. Please note: This patch will lead to a little bit of movement in some existing test-cases, however it should be virtually imperceivable to the naked eye.	2023-05-06 13:55:39 +02:00
calixteman	f151a39d14	Merge pull request #16387 from calixteman/issue16384 [Annotations] Draw readonly annotations on their own canvas and show the HTML elements when there is a JS interaction (issue #16384)	2023-05-04 21:49:08 +02:00
Calixte Denizet	72da14f005	[Annotations] Draw readonly annotations on their own canvas and show the HTML elements when there is a JS interaction (issue #16384 )	2023-05-04 20:08:32 +02:00
calixteman	a24e11a91c	Merge pull request #16106 from bungeman/improve_color_stop_detection Better approximate gradient color stops	2023-05-04 19:48:57 +02:00
Calixte Denizet	c07149a44f	Apply HCM filters on annotations which have their own canvas (bug 1830850)	2023-05-03 10:19:59 +02:00
Jonas Jenwald	3a36a9d337	Merge pull request #16268 from Snuffleupagus/RegionalImageCache Attempt to also cache images at the "page"-level (issue 16263)	2023-04-11 12:06:29 +02:00
Jonas Jenwald	9881dbf927	Attempt to also cache images at the "page"-level (issue 16263) Currently we have two separate image-caches on the worker-thread: - A local one, which is unique to each `PartialEvaluator.getOperatorList` invocation. This one caches both names and references, since image-resources may be accessed in either way. - A global one, which applies to the entire PDF documents and all its pages. This one only caches references, since nothing else would work. This patch introduces a third image-cache, which essentially sits "between" the two existing ones. The new `RegionalImageCache`[1] will be usable throughout a `PartialEvaluator` instance, and consequently it only caches references, which thus allows us to keep track of repeated image-resources found in e.g. different /Form and /SMask objects. --- [1] For lack of a better word, since naming things is hard...	2023-04-10 11:34:41 +02:00
Calixte Denizet	4b7eb1436d	Thin whitespaces must have their own span	2023-03-29 11:23:58 +02:00
Calixte Denizet	a96f10e55d	Create a new chunk when the char is too rised compared to the previouse one	2023-03-28 13:56:46 +02:00
Jonas Jenwald	137a2d6e30	Add even more non-standard ligatures (PR 15517 follow-up) Given that we already create multi-byte ToUnicode entries in other cases, see e.g. the `getNormalizedUnicodes` table, this is hopefully fine.	2023-03-22 10:42:52 +01:00
Calixte Denizet	2d0f30a67c	Use the position of the previous xref stream if any when saving a pdf (bug 1823296)	2023-03-21 19:27:24 +01:00
Jonas Jenwald	fc055dbd80	[api-minor] Extend general transfer function support to browsers without `OffscreenCanvas` This patch extends PR 16115 to work in all browsers, regardless of their `OffscreenCanvas` support, such that transfer functions will be applied to general rendering (and not just image data). In order to do this we introduce the `BaseFilterFactory` that is then extended in browsers/Node.js environments, similar to all the other factories used in the API, such that we always have the necessary factory available in `src/display/canvas.js`. These changes help simplify the existing `putBinaryImageData` function, and the new method can easily be stubbed-out in the Firefox PDF Viewer. Please note: This patch removes the old partial transfer function support, which only applied to image data, from Node.js environments since the `node-canvas` package currently doesn't support filters. However, this should hopefully be fine given that: - Transfer functions are not very commonly used in PDF documents. - Browsers in general, and Firefox in particular, are the primary development target for the PDF.js library. - The FAQ only lists Node.js as mostly supported, see https://github.com/mozilla/pdf.js/wiki/Frequently-Asked-Questions#faq-support	2023-03-14 13:09:08 +01:00
calixteman	b2a86350fc	Merge pull request #16096 from bungeman/fix_trig_functions Correct PostScript trigonometric operators	2023-03-11 14:32:23 +01:00
Calixte Denizet	07b094729e	Fix search in pdf a containing some UTF-32 characters (bug 1820909) Some chars were supposed to have a length equals to 1 but UTF-32 chars can be longuer.	2023-03-09 15:03:01 +01:00
Ben Wagner	5fad91a680	Better approximate gradient color stops PDF gradients do not have color stops but an arbitrary PDF function of the type f(t) -> color. CSS gradients are only based on color stops. Most PDF gradient functions are produced from color stop oriented gradients. Take advantage of this by sampling the PDF function at a higher frequency but not converting any samples which could be interpolated to color stops. The sampling frequency is chosen to be the least common multiple of as many values as practical to exactly re-create the common case of the PDF function implementing equally spaced linearly interpolated stops in RGB color space. This also allows for better approximation of other smooth PDF functions (non-linear, or non-equally spaced, or in different color space). Fixes: #10572, #14165	2023-03-09 08:49:50 -05:00
calixteman	a0ef5a4ae1	Merge pull request #16115 from calixteman/issue16114 Apply transfer filters to any graphic commands	2023-03-08 14:53:41 +01:00
Jonas Jenwald	471aef5fc6	Support (rare) Type3 fonts with Pattern resources (issue 16127) This simply extends the approach in PR 10727 to also cover Patterns, which shouldn't be a common occurrence in Type3 fonts (since this is the first issue we've seen).	2023-03-08 09:20:52 +01:00
Calixte Denizet	8304df2520	Apply transfer filters to any graphic commands	2023-03-07 22:17:19 +01:00
Calixte Denizet	b8dda089e2	Slightly modify the max width of a tracking space	2023-03-07 19:38:49 +01:00
Calixte Denizet	8db77cc361	Use appearance stream to render locked annotations (bug 1723568)	2023-03-07 15:01:31 +01:00
Calixte Denizet	3849063d36	[Annotation] Don't rotate an annotation when it has the NoRotate flag	2023-03-06 17:27:11 +01:00
Calixte Denizet	05b0c9d7e6	Render large images even if they're larger than the canvas limits (bug 1720282) The idea is to encode large image in BMP format (which is very simple and doesn't require to compute any checksums) and then use createImageBitmap with a BMP blob (which doesn't suffer of the Canvas/ImageData limits). From a performance point of view, it isn't crazy (generating a large blob + decoding it on the main thread is really not ideal) but at least we've something to display which is a way better than a blank page (and one can notice that most of the time is spent in decoding the image from the pdf stream).	2023-03-05 14:07:07 +01:00
Ben Wagner	158c836e26	Correct PostScript trigonometric operators PDF 32000-1:2008 7.10.5.1 "Type 4 (PostScript Calculator) Functions" defers to the PostScript Language Reference for the description of these functions. The PostScript Language Reference, third edition chapter 8 "Operators" defines the `angle` type as a "number of degrees". Section 8.1 defines "angle `sin` real", "angle `cos` real", and "num den `atan` angle". The documentation for `atan` further states that it will return an angle in degrees between 0 and 360. Handle these operators correctly in `PostScriptEvaluator.execute`. Convert the inputs to `sin` and `cos` from degrees to radians for use with `Math.sin` and `Math.cos`. Correctly pop two values from the stack for `atan`, use `Math.atan2`, and convert from radians to (positive) degrees.	2023-03-03 17:25:11 -05:00
Calixte Denizet	3a21423386	[Acroform] Use the full path to find the node in the XFA datasets where to store the value I noticed several 'Path not found' errors because of a field called #subform[2]. From the XFA specs, the hash is used for a class of elements in the template tree. When we're looking for a node in the datasets tree, it doesn't make sense to search for a class. Hence the path element starting with a hash are just skipped.	2023-02-23 12:09:39 +01:00
Calixte Denizet	58e4d92884	[Annotation] For choice widget, use the I entry instead of the V one (bug 1770750) It isn't really conform to the specifications but Acrobat is working like that...	2023-02-09 17:26:13 +01:00
Calixte Denizet	a25895bf72	[Annotation] Take into account the stroke alpha for a FreeText without appearance	2023-02-07 22:15:27 +01:00
Calixte Denizet	ea7b4b4d6c	[Annotation] Avoid to encrypt the appearance stream two times (bug 1815476)	2023-02-07 19:26:46 +01:00
Jonas Jenwald	808ca828f1	Extend `getGlyphMapForStandardFonts` with additional entries (issue 15977)	2023-01-30 12:13:21 +01:00
Jonas Jenwald	40a46e4397	Tweak `adjustType1ToUnicode` for fonts with a predefined named encoding (bug 1811668, PR 14050 follow-up) Please note: I cannot reproduce the problem reported in bug 1811668, regarding the context menu, and in any case it's not clear that that part is even a PDF Viewer bug. Looking at bug 1811668 I couldn't help but noticing that the textLayer isn't correct, and it's unfortunately once again a problem with the `adjustType1ToUnicode` function. That's intended to help improve text-selection for fonts without a /ToUnicode-entry, and in many cases it does help (the original PR fixed lots of issues) however it's also caused some problems. In order to improve text-selection in bug 1811668, we'll now properly ignore fonts that have a predefined named encoding specified since that's really the intention with PR 14050.	2023-01-21 12:21:21 +01:00
Jonas Jenwald	f2fce93826	[JBIG2] Ensure that the `decodeInteger` function returns valid integers (issue 15942) The JBIG2 images in this PDF document are corrupt enough that even Adobe Reader warns about it when opening the file. Please note: I don't really know the JBIG2 image format at all, however from a very brief look at the specification it seems that integers should be 32-bit.	2023-01-19 17:14:17 +01:00
Jonas Jenwald	d6be5141e9	Fallback to using the `name` table to infer the encoding for TrueType fonts missing such data (issue 15910) The relevant TrueType font is missing both /ToUnicode and /Encoding entires, either of which would have prevented the (current) broken textLayer rendering. My first idea was that we could use the `post` table in the TrueType font, see https://developer.apple.com/fonts/TrueType-Reference-Manual/RM06/Chap6post.html, to get the actual glyphNames and amend the fallback ToUnicode-map that way. Unfortunately that didn't work, since the `post` table only contained ".notdef" and "" (i.e. empty string) entries. Instead we try to use the `name` table in the TrueType font, see https://developer.apple.com/fonts/TrueType-Reference-Manual/RM06/Chap6name.html, to determine if the platform is Windows and thus fallback to generate a ToUnicode-map from the `WinAnsiEncoding`.	2023-01-17 16:04:51 +01:00

1 2 3 4 5 ...

1264 Commits