In this particular case, the `CMap` data that we create contains only numbers and no strings, which causes `PartialEvaluator.readToUnicode` to create a ToUnicode map consisting only of empty strings.
*Please note:* This is yet another case where I don't know if it's necessarily the best and most correct solution, but it does fix the referenced issue.
- it aims to fix https://bugzilla.mozilla.org/show_bug.cgi?id=1722036.
- AS and V should share the same value for checkboxes: at least that's what the specs say;
- the PDF in the above bug opens correctly in Acrobat, so it likely means that AS is chosen over V.
*Please note:* All of this feels very handwavy, but at least it passes all tests locally. Hopefully we have enough tests for this part of the font code.
For non-embedded composite standard fonts with an "incomplete" /CIDToGIDMap, we'll now fall back to an *explicitly defined* /ToUnicode map even when that one happens to be an /Identity-H or /Identity-V map.
The `Font.fallbackToSystemFont` method is unfortunately accumulating more and more special cases, however that might be unavoidable given all the weird non-embedded fonts found in the wild :-(
While some of the output looks worse to my eye, this behavior more
closely matches what I see when I open the PDFs in Adobe Acrobat.
Fixes: #4706, #9713, #8245, #1344
Currently it's possible to accidentally, e.g. by simply copy-and-pasting from an existing test-case, add an unnecessary `"link": true`-entry for locally available PDF files.
This leads to inconsistencies in the manifest file, and doesn't feel like a great developer experience. However, we can easily fix it by having `verifyManifestFiles` fail in this situation, and doing so actually turned up a couple of existing cases.
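A minimal sketch of the kind of check involved (the manifest shape and helper name here are assumptions, not the actual `verifyManifestFiles` code):
```
const fs = require("fs");
const path = require("path");

// Flag manifest entries that mark a locally available PDF file as linked.
function findUnnecessaryLinkEntries(manifest, pdfDir) {
  const errors = [];
  for (const entry of manifest) {
    const localFile = path.join(pdfDir, entry.file);
    if (entry.link === true && fs.existsSync(localFile)) {
      errors.push(`Unnecessary "link": true entry for "${entry.file}".`);
    }
  }
  return errors; // a non-empty array should fail the verification step
}
```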
In the referenced PDF document the /Contents stream contains MarkedContent-operators, however no optional content dictionary exists; according to [the specification](https://www.adobe.com/content/dam/acom/en/devnet/pdf/pdfs/PDF32000_2008.pdf#G7.3883825):
> Null values or references to deleted objects shall be ignored. If this entry is
> not present, is an empty array, or contains references only to null or deleted
> objects, the membership dictionary shall have no effect on the visibility of
> any content.
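A small sketch of that behaviour (the names are illustrative, not the actual `PartialEvaluator` code): marked content whose optional-content reference can't be resolved simply stays visible.
```
// Decide visibility for a BDC /OC marked-content section.
function isContentVisible(ocRef, ocGroups) {
  if (!ocRef || !ocGroups || !ocGroups.has(ocRef)) {
    // Missing or unresolvable optional-content data "shall have no
    // effect on the visibility of any content", so keep it visible.
    return true;
  }
  return ocGroups.get(ocRef).visible;
}

const groups = new Map([["oc1", { visible: false }]]);
console.log(isContentVisible("oc1", groups)); // false
console.log(isContentVisible("missing", groups)); // true (no usable OC data)
```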
In the PDF document some of the glyphs have bogus `differences`-entries[1] that cannot be resolved to valid glyph names, thus causing the glyph mapping to fail.
My initial idea was to use an approach similar to the one in the `PartialEvaluator._simpleFontToUnicode` method, i.e. to extract the charCodes from those entries; however it turned out that that didn't actually help in this case (the mapping was still wrong).
To fix this I'm thus proposing that we fall back to the /ToUnicode map when no other usable data exists (e.g. no post-table), since it *hopefully* shouldn't make things any worse than leaving parts of the glyph map empty (which currently happens); a sketch follows the footnote below.
---
[1] As can be seen below, some of the entries are completely normal while others are non-standard:
```
Differences (array)
0 = 65
1 = /g5167
2 = /space
3 = /g11927
4 = /g17737
5 = /g11540
6 = /g2180
7 = /K
8 = /P
9 = /two
10 = /zero
11 = /one
12 = /five
13 = /four
14 = /g6932
15 = /g7246
16 = /g1691
17 = /g2343
18 = /g14792
19 = /g3325
20 = /g4280
21 = /g20383
22 = /g18166
23 = /g16988
24 = /g17943
25 = /g19223
26 = /g10830
27 = 97
28 = /g982
29 = /g1226
30 = /g5059
31 = /g2677
32 = /g1042
33 = /g11568
34 = /L
35 = /three
36 = /seven
37 = /g2364
38 = /g12063
39 = /g5356
40 = /g2173
41 = /g17877
42 = /g7273
43 = /g7647
44 = /g7224
45 = /g19327
46 = /g5054
47 = /g2342
48 = /g10136
49 = /g6856
50 = /g13381
51 = /g7257
52 = /g12093
53 = /g2359
```
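A sketch of the proposed fallback, under assumed data shapes (charCode → string in the /ToUnicode map, code point → glyphId in the font's cmap); the real glyph-mapping code is more involved:
```
function fillGlyphMapFromToUnicode(glyphMap, toUnicode, cmap) {
  for (const [charCode, str] of toUnicode) {
    if (glyphMap[charCode] !== undefined || !str) {
      continue; // keep entries that resolved via /Differences et al.
    }
    const glyphId = cmap.get(str.codePointAt(0));
    if (glyphId !== undefined) {
      glyphMap[charCode] = glyphId; // better than leaving the slot empty
    }
  }
  return glyphMap;
}
```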
Despite its name, the fonts in the ItcSymbol family are "regular" fonts and not Symbol ones. However, given that the font name contains the word "Symbol" we ended up picking the wrong code-path in the `Font.fallbackToSystemFont` method.
*Please note:* While this patch ensures that the text becomes readable, by falling back to a standard font, the rendering will obviously not be perfect. However, that's the PDF generator's "fault", since non-embedded fonts cannot be guaranteed to render correctly in all environments.
Currently, in the `PartialEvaluator`, we only support Optional Content in Form-/XObjects. Hence this patch adds support for Image-/XObjects as well, which looks like a simple oversight in PR 12095 since the canvas-implementation already contains the necessary code to support this.
- All the scale factors for the substitution font were wrong because of different glyph positions between Liberation and the other ones:
- regenerate all the factors
- Text may contain Polish characters, for example, and in this case the glyph widths were wrong:
- treat substitution font as a composite one
- add a glyph-index-to-Unicode map for Liberation in order to generate the width array for the CID font
- In order to better compute text field sizes, use line height with no gaps (consequently, guessed heights for text are slightly better in general).
- Fix default background color in fields.
- an Image element was created and attached to its parent, but the $globalData property was not set, and that led to an error.
- the PDF in bug 1722029 has 27 rendered rows (checked in Acrobat) when only one was displayed: this patch fixes some binding issues around the occur element.
This patch makes use of the existing `ignoreErrors` option, thus allowing a page to continue parsing/rendering even if (some of) its sub-streams are corrupt. Obviously this may cause *part* of a page to be broken/missing, however it should be better than (potentially) rendering nothing.
Also, to the best of my knowledge, this is the first bug of its kind that we've encountered.
To avoid having to pass in a bunch of parameters that are mostly unrelated to a `BaseStream`-instance when initializing a `StreamsSequenceStream`-instance, I settled on utilizing a callback function instead to allow conditional Error-suppression.
Note that the `StreamsSequenceStream`-class is a *special* stream-implementation that we only use when the `/Contents`-entry, in the `/Page`-dictionary, consists of an Array with streams.
In cases where even the very *first* attempt at reading from an object will throw, simply ignoring such objects will help improve rendering of *some* corrupt documents.
Note that this will lead to more parsing in some cases, but considering that this only applies to *corrupt* documents that shouldn't be a big deal.
Bug 1721218 has a shading pattern that was used thousands of times.
To improve performance of this PDF:
- add a cache for patterns in the evaluator and only send the IR form once
to the main thread (this also makes caching in canvas easier)
- cache the created canvas radial/axial patterns
- for shading fill radial/axial use the pattern directly instead of creating temporary
canvas
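An illustrative sketch of the caching idea from the first bullet (hypothetical names, not the actual evaluator code): remember which pattern Refs have already been sent, so the IR form crosses the thread boundary only once.
```
class PatternIRCache {
  constructor(sendToMainThread) {
    this._send = sendToMainThread;
    this._ids = new Map(); // pattern Ref -> id of the IR already sent
  }

  getId(patternRef, buildIR) {
    let id = this._ids.get(patternRef);
    if (id === undefined) {
      id = `pattern_${this._ids.size}`;
      this._send(id, buildIR()); // the IR is transferred exactly once
      this._ids.set(patternRef, id);
    }
    return id; // later fill operations just reference the cached id
  }
}

const cache = new PatternIRCache((id, ir) => console.log("sent", id));
cache.getId("42R", () => ["RadialAxial" /* IR data */]);
cache.getId("42R", () => ["RadialAxial"]); // cache hit: nothing is sent
```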
- in real life some XFA documents contain XML like <xfa:data><xfa:Foo><xfa:Bar>...</xfa:data>;
since there are no Foo or Bar elements in the xfa namespace, the JS representations are empty
and that leads to errors.
- so the idea is to make all nodes under xfa:data namespace-agnostic, which means
that namespaces are removed from nodes in the parser, but only for xfa:data descendants.
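A sketch of the parser idea (simplified; the real code tracks the xfa:data subtree while parsing):
```
// Once inside <xfa:data>, drop the namespace prefix from element names so
// e.g. <xfa:Foo> matches the prefix-less "Foo" the binder expects.
function normalizeNodeName(rawName, insideXfaData) {
  if (!insideXfaData) {
    return rawName;
  }
  const colon = rawName.indexOf(":");
  return colon === -1 ? rawName : rawName.substring(colon + 1);
}

console.log(normalizeNodeName("xfa:Foo", true)); // "Foo"
console.log(normalizeNodeName("xfa:template", false)); // "xfa:template"
```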
*Please note:* The PDF document in issue 13751 is *dynamically* created (in e.g. Adobe Reader), with pages added when certain buttons are clicked, hence this patch simply fixes the breaking error and nothing more.
It looks like the current code contains a little bit too much copy-and-paste from the *similar* `index` branch above, since we cannot set the `startIndex` to a negative value. Note how it's being used to initialize the loop variable, which is then used to look up values in an Array, and accessing the `-1`th element of an Array obviously makes no sense.
*This is yet another case where I've got no idea if the patch is correct, but it does at least fix a breaking error :-)*
Note how in the [`Binder._bindValue` method](683ce66a48/src/core/xfa/bind.js (L92-L93)), we're assuming that if a `data`-value exists then it'll also be possible to actually access it. For the `XFAAttribute`-implementation however, the second method is missing and that's what causes the breaking errors in issue 13748.
Please note that another possible way of "fixing" the error would've been to simply change the exists-check to return `false`, and I could see that being a preferred solution.
However, the reason for submitting the current patch is that we get *fewer* warnings about Nodes with mis-matched types this way.
As can be seen in the code (see below), the `searchNode` helper function will return `null` in some cases and all of its call-sites should protect against that before attempting to access the returned data.
While only one of these changes was necessary to fix the breaking errors in issue 13756, in order to prevent future bugs I've added similar defensive code throughout this file.
- 07955fa1d3/src/core/xfa/som.js (L169)
- 07955fa1d3/src/core/xfa/som.js (L239)
- 07955fa1d3/src/core/xfa/som.js (L254)
- font line height is taken into account by Acrobat when it isn't by MasterPDFEditor: I extracted a font from a PDF, modified some ascent/descent properties thanks to ttx and then reinjected the font into the PDF: only Acrobat takes it into account. So in this patch, line heights for some substituted fonts are added.
- it seems that Acrobat is using a line height of 1.2 when the line height in the font is not enough (it's the only way I found to correctly fix bug 1718741).
- don't use flex in the wrapper container (which was causing a horizontal overflow in the above bug).
- consequently, the above fixes introduced a lot of small regressions, so in order to see real improvements on reftests, I fixed the regressions in this patch:
- replace margin by padding in some cases where padding is part of a container's dimensions;
- remove some flex displays: some containers are wrongly sized when rendered;
- set letter-spacing to 0.01px: it helps ensure that text is not broken in Firefox because of insufficient width.
Previously, when we filled image masks we didn't copy over the current transformation,
this caused patterns to be misaligned when painted. Now we create a temporary
canvas with the mask and have the transform copied over and offset it relative to
where the mask would be painted. We also weren't properly offsetting tiling patterns.
This isn't usually noticeable since patterns repeat, but in the case of #13561 the pattern
is only drawn once and has to be in the correct position to line up with the mask image.
These fixes broke #11473, but highlighted that we were drawing that correctly by
accident and not correctly handling negative bounding boxes on tiling patterns.
Fixes #6297, #13561, #13441
Partially fixes #1344 (still blurry, but boxes are in the correct position now)
- Fix a typo in order to open the PDF in issue #13679
- After fixing the default fill color there were some regressions because of z-index,
and when fixing z-index there were some regressions because of borders.
- So fix the border rendering.
- it aims to fix #13583;
- fix the switch to breakBefore target;
- force the layout of an unsplittable element on an empty page;
- don't fail when there is a horizontal overflow (except in lr-tb);
- correctly handle overflow in the same content area (bug 1717805, bug 1717668);
- fix a typo in the radial gradient's first argument.
- the break element has been deprecated in XFA 2.4, but some old documents can use it, so replace it with one (or more) of its possible substitutions:
- breakBefore;
- breakAfter;
- overflow.
- when binding (after parsing) we get a map between some template nodes and some data nodes;
- so set user data in input handlers using data node uids in the annotation storage;
- to save the form, just put the values we have in the storage into the correct data nodes, serialize the XML as a string and then write the string at the end of the PDF using src/core/writer.js;
- fix a few bugs around data bindings:
- the "Off" issue in Bug 1716980.
According to a comment in `readCmapTable`, we're assuming that the cmap tables (when more than one exist) are sorted in ascending order. If that's not the case, keep checking the following cmap tables in order to fix the referenced issue.
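A sketch of that idea (illustrative, not the actual `readCmapTable` code): instead of stopping at the first candidate, rank every subtable and keep the best one.
```
function pickCmapTable(tables) {
  let best = null;
  for (const table of tables) {
    const score = rankCmapTable(table.platformId, table.encodingId);
    if (score !== -1 && (best === null || score > best.score)) {
      best = { table, score };
    }
  }
  return best && best.table;
}

function rankCmapTable(platformId, encodingId) {
  if (platformId === 3 && encodingId === 1) {
    return 2; // (3, 1): Windows, Unicode BMP -- the preferred table
  }
  if (platformId === 0) {
    return 1; // (0, x): Unicode
  }
  return -1; // not a table we can use here
}
```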
- when the CSS line-height property is set to 'normal', the value depends on the user agent. So use a line height based on the font itself, and if for any reason this value is not available, use 1.2 as the default.
- it's a partial fix for https://bugzilla.mozilla.org/show_bug.cgi?id=1717681.
Apparently some really bad PDF software can create documents with *empty* `Name`-entries, which we thus need to somehow deal with.
While I don't know if this patch is necessarily the best solution, it should at least ensure that the *empty* `Name`-instance cannot accidentally match a proper `Name`-instance (and it doesn't require changes to a lot of existing code).[1]
---
[1] I briefly considered using a `Symbol` rather than an Object, but quickly decided against that since the former one [is not clonable](https://developer.mozilla.org/en-US/docs/Web/API/Web_Workers_API/Structured_clone_algorithm#supported_types) and `Name`-instances may be sent to the API.
- it aims to fix https://bugzilla.mozilla.org/show_bug.cgi?id=1716838.
- some fonts in the PDF in the bug were bold when they shouldn't be, so write the font properties in the HTML to avoid using some wrong inherited ones.
- partial fix for https://bugzilla.mozilla.org/show_bug.cgi?id=1716980;
- some PDFs can contain an invalid font family (e.g. 'Windings 3'), so in this case remove the space;
- the font family in typeface attribute doesn't always match the one defined in the FontDescriptor dictionary.
- PR #13554 is buggy, so this patch aims to fix those bugs.
- check if a component fits into its parent, taking the parent layout into account.
- introduce the method isSplittable for template nodes, to know if a component can be split in case of overflow.
- some containers don't always have both of their dimensions set, and those dimensions are based on their contents;
- so in order to measure text, we must get the glyph widths (for the xfa fonts) before starting the layout;
- implement a word-wrap algorithm;
- handle font change during text layout.
Note, this only really fixes Radial/Axial shading patterns with masks.
I'm guessing tiling patterns and mesh patterns would also be broken
if applied like the test pdf. Hopefully I'll have some time to make
test cases for the other shadings.
Fixes #13372
- and fix a few bugs:
- avoid an infinite loop when laying out the document;
- avoid confusion between break and layout failure;
- don't add margin width in tb layout when getting available space.
- it aims to avoid looping forever when opening the PDF in #13213;
- the idea is to consider a subformSet as non-existent when walking the tree. So if we have subformA > subformSet > subformB, then subformB will be visited as a direct child of subformA.
- Some js files contain scale factors for each glyph in order to rescale Liberation to have a final font with the correct width.
- A lot of XFA documents have containers whose dimensions are based on their text content, so using the default font from the browser can lead to an almost unreadable PDF.
This patch uses the new option added in PR 12726 to *also* allow fetching binary CMap data directly in the worker-thread in browsers.
Given that these changes remove the need to transfer data between threads for the default (browser) use-case, we can also revert the changes in PR 11118 since that simplifies the overall implementation.
For Type3 fonts where the /CharProcs-streams of the individual glyphs start with a `d1` operator, we can use that to build a fallback bounding box for the font and thus improve text-selection in some cases.
For HighlightAnnotations with a built-in appearance stream, we still rely on it to specify the opacity correctly via a suitable blend mode. However, if the Annotation-drawing operators are placed *within* a /XObject of the /Form-type, the /ExtGState won't apply to the final rendering and the result is that the highlighting obscures the underlying text.
The more *correct* and general solution would likely be to somehow modify the implementation in `src/display/canvas.js`, to special-case the handling of /Form-type /XObjects when rendering Annotations. Since we can very easily work around this problem for now by using the "no appearance stream" code-path, doing *something* here ought to be preferable.
This patch is (obviously) merely a work-around, but given that the referenced issue is (as far as I know) the first case we've seen of this problem a simple solution will hopefully suffice for now.
This fixes the colours, by respecting the strokeAlpha/fillAlpha-values, for a couple of Annotations in the PDF document from issue 13447.[1]
---
[1] Some of the annotations still won't render at all, when compared with Adobe Reader, but that could/should probably be handled separately.
According to the specification, see https://web.archive.org/web/20210404042322if_/https://www.adobe.com/content/dam/acom/en/devnet/acrobat/pdfs/PDF32000_2008.pdf#G6.2384179, the keys of a NameTree/NumberTree should be ordered.
For corrupt PDF files, which violate this assumption, it's thus possible that trying to look up a single entry fails.
Previously, in PR 10274, we implemented a fallback that only applies to the "bottom" node of a NameTree/NumberTree, which in general might not actually help for sufficiently corrupt NameTree/NumberTree data.
Instead we remove the current *limited* fallback from `NameOrNumberTree.get`, and defer to the call-site to handle this case explicitly e.g. by using `NameOrNumberTree.getAll` for data where that makes sense. For well-formed documents, these changes should *not* lead to any additional data fetching/parsing.
Finally, as part of these changes, the validation of named destination data is improved in the `Catalog` and a new unit-test is also added.
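A sketch of the new call-site pattern (hypothetical names; `getAll` is assumed to return a Map-like object here):
```
function lookupNamedDestination(nameTree, name) {
  // Fast path: binary search, which assumes the keys are ordered.
  let dest = nameTree.get(name);
  if (dest === undefined) {
    // Corrupt/unordered tree: parse all of the leaves once instead.
    dest = nameTree.getAll().get(name);
  }
  return dest;
}
```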
- Right now, a glyph with an erroneous outline is replaced by an empty glyph.
If the error is far enough from the start there's likely something to render,
so the idea is to replace a command expecting arguments with an endchar when no args are
on the stack: this way OTS is likely happy (no remaining args on the stack) and we
can draw something, which is likely better than nothing.
Previously, we set the base transformation and pattern matrix
directly to the main rendering ctx of the page, however doing this
caused the current transform to be lost. This would cause issues
with things like shear missing so the pattern was misaligned or when
stroke was used the scale of the line width or dash would be wrong.
Instead we should leave the current transform and use setTransform
on the pattern so it is applied correctly. For axial and radial shadings I had
to create a temporary canvas to draw the shading so I could in turn
use setTransform.
Fixes: #13325, #6769, #7847, #11018, #11597, #11473
The following tests, already in the corpus, are improved:
issue8078-page1
issue1877-page1
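A sketch of the approach using the plain canvas API (which supports exactly this via `CanvasPattern.setTransform`): the pattern-space matrix is applied to the pattern itself, so the context's own transform survives.
```
const canvas = document.createElement("canvas");
const ctx = canvas.getContext("2d");

// One tile of the pattern, drawn on a small temporary canvas.
const tile = document.createElement("canvas");
tile.width = tile.height = 16;
tile.getContext("2d").fillRect(0, 0, 8, 8);

const pattern = ctx.createPattern(tile, "repeat");
// Position the pattern, not the context: shear/scale on ctx (and the
// line width/dash space) are left intact.
pattern.setTransform(new DOMMatrix([2, 0, 0.5, 2, 10, 10]));
ctx.fillStyle = pattern;
ctx.fillRect(0, 0, canvas.width, canvas.height);
```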
Without this some *composite* fonts may incorrectly end up with matching `hash`es, thus breaking rendering since we'll not actually try to load/parse some of the fonts.
*Please note:* Given that the document, in the referenced issue, doesn't embed *any* of its fonts there's no guarantee that it renders correctly in all configurations even with this patch.
- but don't validate them for now;
- Firefox will display a bar to warn that the signature validation is not supported (see https://bugzilla.mozilla.org/show_bug.cgi?id=854315)
- almost all (all?) PDF readers display signatures;
- validation is done in edge but for now it's behind a pref.
The fontName, as defined in the PDF document, cannot be found in *any* of the "name"-tables in the TrueType Collection font. To work-around that, this patch adds a *fallback* code-path to allow using an approximately matching fontName rather than outright failing.
Fixes #13107
In the issue, some TrueType glyph names have the format `uniXXXX`.
The font's `Encoding` dictionary has the entry `Differences` but no
`BaseEncoding`. `uniXXXX` names are converted to glyph indices
using the font's `post` table, but currently that is done only when
`BaseEncoding` exists. We must enable the conversion also when only
`Differences` exists.
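A sketch of the name-to-code-point step (illustrative): once the code point is recovered, it can be looked up via the font's `post` table regardless of whether a `BaseEncoding` exists.
```
function parseUniGlyphName(glyphName) {
  const match = /^uni([0-9A-Fa-f]{4})$/.exec(glyphName);
  return match ? parseInt(match[1], 16) : -1;
}

console.log(parseUniGlyphName("uni0041")); // 65, i.e. "A"
console.log(parseUniGlyphName("space")); // -1 (not a uniXXXX name)
```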
* JS - Correctly handle the hierarchy of fields
- it aims to fix #13132;
- annotations can inherit their actions from the parent field;
- there are some fields which act as a container for other fields:
- they can be accessed through JS, so we need to add them with an empty type (nothing in the spec about that, but checked in Acrobat);
- the calculation order list (CO) can reference them, so we need to make them reachable through this.getField;
- getArray method must return kids.
- field values are numbers, strings, ... depending on their type, but there's nothing in the spec on how to know what the type is:
- according to the comment for Canonical Format: https://www.adobe.com/content/dam/acom/en/devnet/pdf/pdfs/PDF32000_2008.pdf#page=461
- it seems that this "type" can be guessed from the JS action Format (when setting a type in Acrobat DC, the only affected thing is this action); a sketch of such a guess follows this list.
- util.scand with an empty string returns the current date.
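A sketch of such a guess, assuming the Format action's JavaScript source is available as a string (the `AF*` names are the standard AcroForm formatting helpers):
```
function guessFieldType(formatSource) {
  if (!formatSource) {
    return "string"; // no Format action: leave the value untyped
  }
  if (formatSource.includes("AFNumber_Format")) {
    return "number";
  }
  if (formatSource.includes("AFPercent_Format")) {
    return "percent";
  }
  if (formatSource.includes("AFDate_Format") ||
      formatSource.includes("AFTime_Format")) {
    return "date"; // also matches the AFDate_FormatEx variant
  }
  return "string";
}
```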
This extends PR 13033 slightly, with a heuristic to support corrupt PDF documents where the `LineAnnotation`s have an empty /Rect-entry. Please note that while I have no idea if this is "correct", this patch at least makes us output the same /BBox as re-saving in Adobe Reader does.
*As far as I can tell, this has been broken ever since PR 3289 (back in 2013) without anyone noticing.*
For any non-`MissingDataException` errors encountered in `ObjectLoader._walk`, we're simply throwing immediately which thus has the potential to *completely* break rendering of an entire page.
In practice this is obviously only an issue for PDF documents which are in one way or another corrupt, since that's the only way that `XRef.fetch` will throw non-`MissingDataException` errors. To make matters worse these errors are *intermittent*, since they can only occur if the document is still loading when the `ObjectLoader`-code runs (note the early return in `ObjectLoader.load`).
Please note that we cannot simply catch the error and let "normal" parsing continue in `ObjectLoader._walk`, since that could lead to errors elsewhere given that resources "below" the current one (in the graph) might not be checked as intended then.
All-in-all, the only way to make absolutely sure that we won't cause *unexpected* `MissingDataException`s somewhere else in the code-base is to fallback to fetching the *entire* document in this edge-case.
- Remove a *duplicated* reference test, see "issue12810", from the manifest.
- Improve the spelling in a couple of comments in `src/core/canvas.js`, most notably of the word "parallelogram".
- Update a comment, also in `src/core/canvas.js`, to actually agree with the value used to reduce confusion when reading the code.
While PR 12725 fixed bug 1671312 as reported, i.e. the "In the upper right corner "Purposes' has bad kerning."-part, it however broke other parts of the text rendering.
Note in particular the tables, e.g. on page 2 and beyond, where the glyphs are now rendered too close together. The reason for this is that the fonts in question are non-embedded ArialNarrow, which we just replace with Helvetica which obviously is not narrow. Given that the font replacement isn't a perfect fit for non-embedded ArialNarrow, we still need to re-measure the glyph widths in this case.
* add a comment to explain how minimal linewidth is computed.
* when context.lineWidth < 1 after the transform, Firefox and Chrome
don't render in the same way (issue #12810).
* set lineWidth to 1 after transform and before stroking
- aims to fix issue #12295
- a pixel can be transformed into a rectangle with both dimensions < 1.
A single rescale leads to a rectangle with one dimension equal to 1 and
the other greater than 1.
* change the way we render rectangles with null dimensions:
- right now we rely on the lineWidth set before "re" but
it can be set after "re" and before "S" and in this case the rendering
will be wrong.
- render such rectangles as a single line.
Given that the PDF document in the issue contains the same very large JPEG image *three* times, this patch includes a test-case where only the first page has been extracted from it.
Currently any errors thrown in `preEvaluateFont`, which is a *synchronous* method, will not be handled at all in the `loadFont` method and we were thus failing to return an `ErrorFont`-instance as intended here.
Also, add an *explicit* check in `PartialEvaluator.preEvaluateFont` to ensure that Type0-fonts always have a *valid* dictionary.
Similar to other markers that we currently skip, by ignoring unsupported Coding style default (COD) options we'll at least render *something* here (although some JPEG 2000 images may look slightly wrong).
Note that if the unsupported COD options lead to additional errors, during parsing, we'll still abort parsing of the JPEG 2000 image.
* the goal is to execute actions like Open or OpenAction
* can be tested with issue6106.pdf (auto-print)
* once #12701 is merged, we can add page actions
Similar to other markers that we currently skip, by ignoring the Coding style component (COC) marker we'll at least prevent outright errors (although some JPEG 2000 images may look slightly wrong).
It appears that the PDF document in [bug 1292316](https://bugzilla.mozilla.org/show_bug.cgi?id=1292316) now renders "correctly"[1] when compared to e.g. Adobe Reader and PDFium. Most likely this bug was fixed by a *somewhat* recent patch, or patches, to the `XRef.indexObjects` method.
Before just closing [bug 1292316](https://bugzilla.mozilla.org/show_bug.cgi?id=1292316) as WFM, I figured that it probably can't hurt to add it as a new test-case to avoid accidentally regressing this document in the future.
---
[1] Given that the XRef table is corrupt, and that we're forced to recover, there's generally speaking probably some question as to what actually constitutes "correct" in this case.
There doesn't seem to be anything definitive about this in
the spec, but from experimenting, it seems Acrobat lets
PDFs override the widths of the standard fonts.
In addition to the existing /Root and /Pages validation, also check that the /Pages-entry actually is a dictionary and that it has a valid /Count-entry.
This way we can avoid picking a trailer candidate which e.g. the `Catalog.numPages` getter will just end up rejecting, thus breaking PDF document loading completely.
* remove the 1st param of _createPopup (almost useless for a method)
* prepend the popup div to avoid having popups on top of some highlights (and thus partially "disabling" mouse events)
* add a ref test for issue #12504
* in some PDFs, there are actions with "event.source.hidden = ..."
* in order to handle visibility when printing, annotationStorage is extended to store multiple properties (value, hidden, editable, ...)
Different fonts incorrectly end up with *identical* hashes, despite having different /ToUnicode data.
The issue, and it's very interesting that we've apparently not seen it before, appears to be caused by the fact that different /ToUnicode entries share the *same* underlying `ArrayBuffer`, which thus becomes problematic at the `const dataUint32 = new Uint32Array(data.buffer, 0, blockCounts);` line. The simplest solution thus seems to be to just *copy* the input, when it's an `ArrayBuffer`, rather than using it as-is. (Note that if we'd stringified the input, when calling `MurmurHash3_64.update`, the issue would also have been fixed. In this case, we're already creating a unique TypedArray.)
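A sketch of that copy (illustrative, not the actual `MurmurHash3_64.update` code):
```
// Take a private copy when the input is an ArrayBuffer, since several
// /ToUnicode streams may share one underlying buffer, and a view created
// with `new Uint32Array(data.buffer, 0, blockCounts)` would then hash
// the wrong bytes.
function toHashableData(input) {
  if (input instanceof ArrayBuffer) {
    return input.slice(0); // copy; the original buffer stays shared
  }
  return input;
}
```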
This changes the `transformOrigin` calculations in `AnnotationElement._createContainer` and `PopupAnnotationElement.render`, to ensure that e.g. the clickable area of annotations and/or popups are both positioned correctly.
The problem occurs for *negative* values, since they're not negated correctly because of how the `transformOrigin` strings were built; see issue 12406 for a more in-depth explanation. Previously, for negative values, the `transformOrigin` strings would thus be ignored since they're not valid.
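A sketch of the corrected string building: negate the numeric values before formatting, so a negative offset can never produce an invalid "--10px".
```
function makeTransformOrigin(x, y) {
  return `${-x}px ${-y}px`;
}

console.log(makeTransformOrigin(12.5, 30)); // "-12.5px -30px"
console.log(makeTransformOrigin(-12.5, 30)); // "12.5px -30px" (was "--12.5px -30px")
```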
This patch contains a possible approach for fixing issue 12294, which compared to other PRs is purposely limited to the affected `WidgetAnnotation` code.
As mentioned elsewhere, considering that we're (at least for now) trying to fix *one specific* case, I think that we should avoid modifying the `Dict` primitive[1] and/or avoid a solution that (indirectly) modifies an existing `Dict`-instance[2].
This patch simply fixes the issue at hand, since that seems easiest for now, and I'd suggest that we worry about a more general approach if/when that actually becomes necessary.
Hence the solution implemented here, for `WidgetAnnotation`, is to simply use a combination of the local *and* AcroForm /DR resources during OperatorList-parsing to ensure that things work correctly regardless of where a particular /Font resource is found.
For saving of form-data, on the other hand, we want to avoid increasing the file-size unnecessarily and need to be smarter than just merging all of the available resources. To achieve this, a new `WidgetAnnotation._getSaveFieldResources` method will, when necessary, produce a combined resources `Dict` with only the minimum amount of data from the AcroForm /DR resources included.
---
[1] You want to avoid anything that could cause the general `Dict` implementation to become slower, or more complex, just for handling an edge-case in my opinion.
[2] If an existing `Dict`-instance is modified unexpectedly, that could very easily lead to problems elsewhere since e.g. `Dict`-instances created during parsing are not expected to be changed.
In issue 12120, the font has a 1,0 cmap and is marked symbolic which
according to the spec means we should directly use the cmap instead of
the extra steps that are defined in 9.6.6.4.
However, just fixing that caused bug 1057544 to break. The font in bug
1057544 has a 0,1 cmap (Unicode 1.1) which we were not using, but is
easy to support. We're also easily able to support some of the other
unicode cmaps, so I added those as well.
There was also a second issue with bug 1057544, the cmap doesn't have
a mapping for the "quoteright" glyph, but it is defined in the post
table. To handle this, I've made the post table lookup a fallback for any
font that has an encoding.
In addition to the unit tests these reference tests make sure that this
document, that triggered some edge cases in our code, can be rendered
and printed successfully now.
This is *similar* to the existing transfer function support for SMasks, but extended to simple image data.
Please note that the extra amount of data now being sent to the worker-thread, for affected /ExtGState entries, is limited to *at most* 4 `Uint8Array`s each with a length of 256 elements.
Refer to https://www.adobe.com/content/dam/acom/en/devnet/acrobat/pdfs/PDF32000_2008.pdf#G9.1658137 for additional details.
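A sketch of how such 256-entry maps apply to packed 8-bit RGB data (illustrative; the real code also handles other color spaces):
```
function applyTransferMaps(rgbData, [rMap, gMap, bMap]) {
  for (let i = 0; i < rgbData.length; i += 3) {
    rgbData[i] = rMap[rgbData[i]];
    rgbData[i + 1] = gMap[rgbData[i + 1]];
    rgbData[i + 2] = bMap[rgbData[i + 2]];
  }
  return rgbData;
}

// Example: an inverting transfer function for every channel.
const invert = new Uint8Array(256).map((_, index) => 255 - index);
applyTransferMaps(new Uint8Array([0, 128, 255]), [invert, invert, invert]);
// -> Uint8Array [255, 127, 0]
```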
Some fonts have loca tables that aren't sorted or use 0 as an offset to
signal a missing glyph. This fixes the bad loca tables by sorting them
and then rewriting the loca table and potentially re-ordering the glyf
table to match.
Fixes #11131 and bug 1650302.
Issue 4398 was fixed by PR 4437, however a test-case wasn't included as far as I can tell. Given that PR 12186 is now in the process of re-factoring that code, adding a test-case cannot hurt as far as I'm concerned.
Add a new method to the API to get the optional content configuration. Add
a new render task param that accepts the above configuration.
For now, the optional content is not controllable by the user in
the viewer, but renders with the default configuration in the PDF.
All of the test files added exhibit different uses of optional content.
Fixes#269.
Fix test to work with optional content.
- Change the stopAtErrors test to ensure the operator list has something,
instead of asserting the exact number of operators.
The default viewer, and thus Firefox, depends on the `RenderTask.onContinue` functionality to pause/continue rendering (such that the most visible page always renders first).
Despite this functionality thus being very important, it has however never actually been tested *at all* as far as I can tell. Hence this patch which adds a new boolean `renderTaskOnContinue` parameter (`false` by default), that can be used to force a reference-test to use the `RenderTask.onContinue` code-path in the `InternalRenderTask` class.
Note that I purposely made this new reference-test behaviour *optional*, since I didn't want to negatively affect the general runtime of the tests (given that there's a slight delay added to the rendering). Also, for e.g. benchmarking you'd most likely want to stay away from the `RenderTask.onContinue` functionality for similar reasons.
This should reduce the possibility of accidentally truncating some inline images, while *not* causing the "EI" detection to become significantly slower.[1]
There's obviously a possibility that these added checks are not sufficient to catch *every* single case of "EI" sequences within the actual inline image data, but without specific test-cases I decided against over-engineering the solution here; a sketch of the boundary check follows the footnote below.
*Please note:* The interpolation issues are somewhat orthogonal to the main issue here, which is the truncated image, and it's already tracked elsewhere.
---
[1] I've looked at the issue a few times, and this is the first approach that I was able to come up with that didn't cause *unacceptable* performance regressions in e.g. issue 2618.
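A sketch of the kind of boundary check meant above (illustrative; the actual heuristics are more involved):
```
// Only treat "EI" as the end of the inline image when it sits on a
// whitespace/EOF boundary, making a match inside raw image data less likely.
function looksLikeEIMarker(bytes, pos) {
  if (bytes[pos] !== 0x45 /* E */ || bytes[pos + 1] !== 0x49 /* I */) {
    return false;
  }
  const next = bytes[pos + 2];
  return (
    next === undefined || // end of data
    next === 0x20 || next === 0x0a || next === 0x0d || next === 0x09
  );
}
```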
*First of all, I should mention that my understanding of the finer details of the `QueueOptimizer` (and its related `CanvasGraphics` methods) is somewhat limited.*
Hence I'm not sure if there's actually a very good reason for *only* considering ImageMasks where the "skew" transformation matrix elements are zero as *repeated*, however simply looking at the code I just don't see why these elements cannot be non-zero as long as they are *all identical* for the ImageMasks.
Furthermore, looking at the *group* case (which is what we're currently falling back to), there's no particular limitation placed upon the transformation matrix elements.
While this patch obviously isn't enough to *completely* fix the issue, since there should be a visible Pattern rendered as well[1], it seems (at least to me) like enough of an improvement that submitting this is justified.
With these changes the referenced PDF document will no longer hang the *entire* browser, and rendering also finishes in a *reasonable* time (< 10 seconds for me) which seems fine given the *huge* number of identical inline images present.[2]
---
[1] Temporarily changing the Pattern to a solid color *does* render the correct/expected area, which suggests that the remaining problem is a pre-existing issue related to the Pattern-handling itself rather than the `QueueOptimizer` functionality.
[2] The document isn't exactly rendered immediately in e.g. Adobe Reader either.
Because of a really stupid `Promise`-related mistake on my part, when re-factoring `PDFImage.buildImage` during the `NativeImageDecoder` removal, we're no longer re-throwing errors occurring during image parsing/decoding as intended.
The result is that some (fairly) corrupt documents will never finish loading, and unfortunately there were apparently no sufficiently corrupt images in the test-suite to catch this.
On ISO/IEC 10918-6:2013 (E), section 6.1: (http://www.itu.int/rec/T-REC-T.872-201206-I/en)
"Images encoded with three components are assumed to be RGB data encoded as YCbCr unless the image contains an APP14 marker segment as specified in 6.5.3, in which case the colour encoding is considered either RGB or YCbCr according to the application data of the APP14 marker segment"
But common JPEG libraries consider RGB too if the component IDs are ASCII R (0x52), G (0x47) and B (0x42): https://stackoverflow.com/questions/50798014/determining-color-space-for-jpeg/50861048
Issue #11931
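A sketch of that heuristic:
```
// With no APP14 marker present, treat a three-component JPEG as RGB when
// the component identifiers are the ASCII bytes "R", "G" and "B".
function isRgbByComponentIds(components) {
  return (
    components.length === 3 &&
    components[0].id === 0x52 && // "R"
    components[1].id === 0x47 && // "G"
    components[2].id === 0x42 // "B"
  );
}

console.log(isRgbByComponentIds([{ id: 0x52 }, { id: 0x47 }, { id: 0x42 }])); // true
console.log(isRgbByComponentIds([{ id: 1 }, { id: 2 }, { id: 3 }])); // false -> YCbCr
```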
Currently some JPEG images are decoded by the built-in PDF.js decoder in `src/core/jpg.js`, while others attempt to use the browser JPEG decoder. This inconsistency seems unfortunate for a number of reasons:
- It adds, compared to the other image formats supported in the PDF specification, a fair amount of code/complexity to the image handling in the PDF.js library.
- The PDF specification supports JPEG images with features, e.g. certain ColorSpaces, that browsers are unable to decode natively. Hence, determining if a JPEG image is possible to decode natively in the browser requires a non-trivial amount of parsing. In particular, we're parsing (part of) the raw JPEG data to extract certain marker data and we also need to parse the ColorSpace for the JPEG image.
- While some JPEG images may, for all intents and purposes, appear to be natively supported there's still cases where the browser may fail to decode some JPEG images. In order to support those cases, we've had to implement a fallback to the PDF.js JPEG decoder if there's any issues during the native decoding. This also means that it's no longer possible to simply send the JPEG image to the main-thread and continue parsing, but you now need to actually wait for the main-thread to indicate success/failure first.
In practice this means that there's a code-path where the worker-thread is forced to wait for the main-thread, while the reverse should *always* be the case.
- The native decoding, for anything except the *simplest* of JPEG images, results in increased peak memory usage because there's a handful of short-lived copies of the JPEG data (see PR 11707).
Furthermore this also leads to data being *parsed* on the main-thread, rather than the worker-thread, which you usually want to avoid for e.g. performance and UI-responsiveness reasons.
- Not all environments, e.g. Node.js, fully support native JPEG decoding. This has, historically, led to some issues and support requests.
- Different browsers may use different JPEG decoders, possibly leading to images being rendered slightly differently depending on the platform/browser where the PDF.js library is used.
Originally the implementation in `src/core/jpg.js` was unable to handle all of the JPEG images in the test-suite, but over the last couple of years I've fixed (hopefully) all of those issues.
At this point in time, there are two kinds of failures with this patch:
- Changes which are basically imperceivable to the naked eye, where some pixels in the images are essentially off-by-one (in all components), which could probably be attributed to things such as different rounding behaviour in the browser/PDF.js JPEG decoder.
This type of "failure" accounts for the *vast* majority of the total number of changes in the reference tests.
- Changes where the JPEG images now looks *ever so slightly* blurrier than with the native browser decoder. For quite some time I've just assumed that this pointed to a general deficiency in the `src/core/jpg.js` implementation, however I've discovered when comparing two viewers side-by-side that the differences vanish at higher zoom levels (usually around 200% is enough).
Basically if you disable [this downscaling in canvas.js](8fb82e939c/src/display/canvas.js (L2356-L2395)), which is what happens when zooming in, the differences simply vanish!
Hence I'm pretty satisfied that there's no significant problems with the `src/core/jpg.js` implementation, and the problems are rather tied to the general quality of the downscaling algorithm used. It could even be seen as a positive that *all* images now share the same downscaling behaviour, since this actually fixes one old bug; see issue 7041.
This should ensure that a page will always render successfully, even if there's errors during the Annotation fetching/parsing.
Additionally the `OperatorList.addOpList` method is also adjusted to ignore invalid data, to make it slightly more robust.
- Add a reduced test-case for issue 11768, to prevent future regressions.
(Given that PR 11769 is only a work-around, rather than a proper solution, it may not be entirely accurate for the issue to be closed as fixed.)
- Add more validation of the charCode, as found by the heuristics, in `PartialEvaluator._buildSimpleFontToUnicode` to prevent future issues.
At this point in time, compared to when the "ignore single-char" code was added, we *should* generally be doing a much better job of combining text into as few chunks as possible.
However, there are still bad cases where we're not able to combine text as much as one would like, which is why I'm *not* proposing to simply measure/scale all text. Instead this patch will only measure/scale single-char text in cases where the horizontal/vertical scale is off significantly, since that's where you'd expect bad text-selection behaviour otherwise.
Note that most of the movement caused by this patch is with Type3 fonts, which is a somewhat special font type and one where our current text-selection behaviour is probably the least good.
Fixes #11718, in which the `ff` ligature glyph is at index zero in a CFF font. Because this is a CIDFont, glyph names are CIDs, which are integers. Thus the string `".notdef"` is not correct. The rest of the charset data is already parsed correctly as integers when the boolean argument `cid` is true.
The /Differences array of the problematic font contains a `/c.1` entry, which is consequently detected as a *possible* Cdd{d}/cdd{d} glyphName by the existing heuristics.
Because of how the base 10 conversion is implemented, which is necessary for the base 16 special case, the parsed charCode becomes `0.1` thus causing `String.fromCodePoint` to throw since that obviously isn't a valid code point.
To fix the referenced issue, and to hopefully prevent similar ones in the future, the patch adds *additional* validation of the charCode found by the heuristics.
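A sketch of the added validation (illustrative; the real heuristics handle more name formats):
```
// Only accept the parsed charCode when it's a valid integer code point,
// so a name like "/c.1" can no longer crash String.fromCodePoint.
function cddGlyphNameToUnicode(glyphName) {
  const code = parseFloat(glyphName.substring(1)); // strip the "c"/"C" prefix
  if (!Number.isInteger(code) || code < 0 || code > 0x10ffff) {
    return ""; // not a usable Cdd{d}/cdd{d}-style name
  }
  return String.fromCodePoint(code);
}

console.log(cddGlyphNameToUnicode("c65")); // "A"
console.log(cddGlyphNameToUnicode("c.1")); // "" (previously threw)
```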
The PDF document in question is *corrupt*, since it contains an XObject with a truncated dictionary and where the stream contents start without a "stream" operator.
Fixes #11477
The PDF draws many space characters but the embedded fonts don't have a glyph named `space`, so `.notdef` should be drawn instead. PDF.js assumed that Type1 fonts define `.notdef` as the first glyph (index 0). However, now the fonts have the glyph `A` at index 0 and `.notdef` is the last one, so `A` appears where spaces are expected.
Because the rest of the font machinery in `core/fonts.js` assumes `.notdef` is at index zero, it's easiest to modify `core/type1_parser.js` so that it "repairs" fonts and makes sure `.notdef` is at index 0.
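A sketch of the repair, assuming an ordered list of glyph records (the actual `core/type1_parser.js` change also has to keep the charstrings and encoding consistent):
```
function ensureNotdefFirst(glyphs) {
  const index = glyphs.findIndex(glyph => glyph.name === ".notdef");
  if (index > 0) {
    const [notdef] = glyphs.splice(index, 1);
    // Everything that referenced the shifted indices must be remapped
    // by the caller as well.
    glyphs.unshift(notdef);
  }
  return glyphs;
}

console.log(
  ensureNotdefFirst([{ name: "A" }, { name: ".notdef" }]).map(g => g.name)
); // [".notdef", "A"]
```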
The PDF document in question is *corrupt*, since it contains multiple instances of incorrect operators.
We obviously don't want to slow down parsing of *all* documents (since most are valid), just to accommodate a particular bad PDF generator, hence the reason for the inline check before calling the `ensureStateFont` method.
*This whole patch feels somewhat arbitrary, and I'd be slightly worried about possibly breaking something else.*
To limit the impact of these changes, we only re-parse JPEG images using a reduced `scanLines` value if and only if: An unexpected EOI (End of Image) marker was encountered during decoding of Scan data *and* the "actual" `scanLines` value is at least one order of magnitude smaller than expected.
In the current `AnnotationLayer` implementation, Popup annotations require that the parent annotation have already been rendered (otherwise they're simply ignored).
Usually the annotations are ordered, in the `/Annots` array, in such a way that this isn't a problem, however there's obviously no guarantee that all PDF generators actually do so. Hence we simply ensure, when rendering the `AnnotationLayer`, that the Popup annotations are handled last.
- Re-factor the "incorrect encoding" check, since this can be easily achieved using the general `findNextFileMarker` helper function (with a suitable `startPos` argument).
- Tweak a condition, to make it easier to see that the end of the data has been reached.
- Add a reference test for issue 1877, since it's what prompted the "incorrect encoding" check.
Fixes #11403
The PDF uses the non-embedded Type1 font Helvetica. Character codes 194 and 160 (`Â` and `NBSP`) are encoded as `.notdef`. We shouldn't show those glyphs because it seems that Acrobat Reader doesn't draw glyphs that are named `.notdef` in fonts like this.
In addition to testing `glyphName === ".notdef"`, we must test also `glyphName === ""` because the name `""` is used in `core/encodings.js` for undefined glyphs in encodings like `WinAnsiEncoding`.
The solution above hides the `Â` characters, but now the replacement character (space) appears to be too wide. I found out that PDF.js ignores the font's `Widths` array if the font has no `FontDescriptor` entry. That happens in #11403, so the default widths of Helvetica were used as specified in `core/metrics.js` and `.notdef` got a width of 333. The correct width is 0, as specified by the `Widths` array in the PDF. Thus we must never ignore `Widths`.
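A sketch combining both fixes (illustrative names):
```
// Never draw glyphs whose resolved name is ".notdef", or "" (the name used
// in core/encodings.js for undefined slots).
function shouldDrawGlyph(glyphName) {
  return glyphName !== ".notdef" && glyphName !== "";
}

// Always prefer the /Widths array from the PDF, even when the font has no
// /FontDescriptor; only fall back to the metrics defaults when it's absent.
function getGlyphWidth(charCode, widths, defaultWidths) {
  return widths && widths[charCode] !== undefined
    ? widths[charCode] // 0 is a perfectly valid width here
    : defaultWidths[charCode];
}
```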
In the PDF document in question, there's an ASCII85Decode inline image where the '>' part of EOD (end-of-data) marker is missing; hence the PDF document is corrupt.
For documents with a Linearization dictionary the computed `startXRef` position will be relative to the raw file, rather than the actual PDF document itself (which begins with `%PDF-`).
Hence it's necessary to subtract `stream.start` in this case, since otherwise the `XRef.readXRef` method will increment the position too far resulting in parsing errors.
This will allow us to attempt to recover as much as possible of a page, rather than immediately failing, when a broken/unsupported ColorSpace is encountered. This patch thus extends the framework added in PRs such as e.g. 8240 and 8922, to also cover parsing of ColorSpaces.
Obviously this won't look exactly right, but considering that the PDF file doesn't bother embedding non-standard fonts this is the best that we can do here.
Originally only `skipPages` existed, but given that `firstPage`/`lastPage` has existed for a long time now using them whenever possible looks simpler overall.
- In the `ibwa-bad` case the sixteenth page contains corrupt/incomplete commands, but given that we're suppressing `Error`s by default now skipping hardly seems warranted any more.
- In the `geothermal.pdf` case the first page contains an unsupported ColourSpace, but again we're suppressing `Error`s by default now and skipping hardly seems warranted any more.
This patch is making me somewhat worried about future regressions, since it's certainly easy to imagine this completely breaking certain kinds of corrupt/edited PDF documents while fixing others.[1]
Obviously it passes all existing reference tests (and even improves one), however compared to many other patches there's no telling how much it could break.
The only reason that I'm even submitting this patch, is because of the number of open issues that it would address.
Generally speaking though, the best course of action would probably be if `XRef.indexObjects` was re-written to be much more robust (since it currently feels somewhat hand-wavy in parts). E.g. by actually checking/validating more of the objects before committing to them.
---
[1] Especially given that it's reverting part of PR 5910, however in the case of issue 5909 it seems that other (more recent) changes have actually made that PR redundant.
As part of attempting to fix a number of issues containing PDF documents with corrupt XRef tables, I'd like to improve the reference test-coverage slightly *first*.
Obviously this will increase the runtime of the tests a bit, however I'd rather "waste" resources on the bots instead of developer time fixing regressions which could have been avoided.
*Please note:* I've been thinking about possible ways of addressing this issue for a while now, but all of the solutions I came up with became too complicated and thus hurt readability of the code.
However, it occurred to me that we're essentially trying to add a heuristic *on top* of another heuristic, and that it shouldn't matter how efficient the code is as long as it works.
In the PDF file in the issue the Encoding contains glyphNames of the `Cdd` format, which our existing heuristics will treat as base 10 values. However, in this particular file they actually contain base 16 values, which we thus attempt to detect and fix such that text-selection works.
Hopefully this patch makes sense, and in order to reduce the regression risk the implementation ensures that only completely missing widths are being replaced.
This is based on a real-world PDF file I encountered very recently[1], although I'm currently unable to recall where I saw it.
Note that different PDF viewers handle these sort of errors differently, with Adobe Reader outright failing to render the attached PDF file whereas PDFium mostly handles it "correctly".
The patch makes the following notable changes:
- Refactor the `cropBox` and `mediaBox` getters, on the `Page`, to reduce unnecessary duplication. (This will also help in the future, if support for extracting additional page bounding boxes are added to the API.)
- Ensure that the page bounding boxes, i.e. `cropBox` and `mediaBox`, are never empty to prevent issues/weirdness in the viewer.
- Ensure that the `view` getter on the `Page` will never return an empty intersection of the `cropBox` and `mediaBox`.
- Add an *optional* parameter to `Util.intersect`, to allow checking that the computed intersection isn't actually empty.
- Change `Util.intersect` to have consistent return types, since Arrays are of type `Object` and falling back to returning a `Boolean` thus seem strange.
---
[1] In that case I believe that only the `cropBox` was empty, but it seemed like a good idea to attempt to fix a bunch of related cases all at once.
This patch will not incur any (measurable) overhead, since the glyphlist is already quite long and one more entry won't really matter, which is important given that this sort of PDF corruption ought to be very rare.
Furthermore, this patch purposely does *not* add a bunch of similarly modified ligature names on pure speculation. Any similar additions, for other ligatures, should only be made if there's real-world examples of PDF files where that's actually necessary.
The border `width` will instead fall back to the default value of `1`, rather than being ignored altogether, to also ensure that e.g. `LinkAnnotation`s become clickable as intended.
Fixes https://bugzilla.mozilla.org/show_bug.cgi?id=1552113