pdf.js

Author	SHA1	Message	Date
Jonas Jenwald	8d5689387b	Improve handling of named destinations in out-of-order NameTrees (PR 10274 follow-up) According to the specification, see https://web.archive.org/web/20210404042322if_/https://www.adobe.com/content/dam/acom/en/devnet/acrobat/pdfs/PDF32000_2008.pdf#G6.2384179, the keys of a NameTree/NumberTree should be ordered. For corrupt PDF files, which violate this assumption, it's thus possible that trying to lookup a single entry fails. Previously, in PR 10274, we implemented a fallback that only applies to the "bottom" node of a NameTree/NumberTree, which in general might not actually help for sufficiently corrupt NameTree/NumberTree data. Instead we remove the current limited fallback from `NameOrNumberTree.get`, and defer to the call-site to handle this case explicitly e.g. by using `NameOrNumberTree.getAll` for data where that makes sense. For well-formed documents, these changes should not lead to any additional data fetching/parsing. Finally, as part of these changes, the validation of named destination data is improved in the `Catalog` and a new unit-test is also added.	2021-05-21 15:48:37 +02:00
Brendan Dahl	53991d0924	Fix tiling pattern with smask. After drawing a tiling pattern we were not calling endDrawing, which handles compositing any active smasks. Fixes #8565.	2021-05-12 11:42:08 -07:00
Tim van der Meij	ba99e54c66	Merge pull request #13361 from brendandahl/patterns-fixes Fix several issues with radial/axial shadings and tiling patterns.	2021-05-12 20:27:37 +02:00
Brendan Dahl	ac44afa70e	Fix several issues with radial/axial shadings and tiling patterns. Previously, we set the base transformation and pattern matrix directly to the main rendering ctx of the page, however doing this caused the current transform to be lost. This would cause issues with things like shear missing so the pattern was misaligned or when stroke was used the scale of the line width or dash would be wrong. Instead we should leave the current transform and use setTransfrom on the pattern so it is applied correctly. For axial and radial shadings I had to create a temporary canvas to draw the shading so I could in turn use setTransform. Fixes: #13325, #6769, #7847, #11018, #11597, #11473 The following already in the corpus are improved: issue8078-page1 issue1877-page1	2021-05-11 16:32:24 -07:00
Jonas Jenwald	fc59a5f709	Take the `W` array into account when computing the hash, in `PartialEvaluator.preEvaluateFont`, for composite fonts (issue 13343) Without this some composite fonts may incorrectly end up with matching `hash`es, thus breaking rendering since we'll not actually try to load/parse some of the fonts. Please note: Given that the document, in the referenced issue, doesn't embed any of its fonts there's no guarantee that it renders correctly in all configurations even with this patch.	2021-05-07 21:22:36 +02:00
Calixte Denizet	3f29892d63	[JS] Fix several issues found in pdf in #13269 - app.alert and few other function can use an object as parameter ({cMsg: ...}); - support app.alert with a question and a yes/no answer; - update field siblings when one is changed in an action; - stop calculation if calculate is set to false in the middle of calculations; - get a boolean for checkboxes when they've been set through annotationStorage instead of a string.	2021-05-04 19:21:51 +02:00
Calixte Denizet	549aae6c3d	JS -- add support for page property in field	2021-05-03 15:46:29 +02:00
Brendan Dahl	d10da907da	Fix position of highlighted all text. (#13306 ) Adds a new integration test to ensure we don't regress this again.	2021-04-28 10:15:31 +02:00
Tim van der Meij	60ab15427f	Implement rendering polyline/polygon annotations without appearance stream	2021-04-27 19:02:20 +02:00
Jonas Jenwald	6f4394fcd8	Support `InkAnnotation`s without appearance streams (issue 13298) (#13301 ) For now, we keep things purposely simple by using straight lines (rather than curves); please see https://www.adobe.com/content/dam/acom/en/devnet/acrobat/pdfs/PDF32000_2008.pdf#G11.2096579	2021-04-27 11:49:03 +02:00
Calixte Denizet	e868ab0051	Update all the text widgets having the same name with the same value	2021-04-20 20:03:19 +02:00
Jani Pehkonen	3a96977ea8	Implement visibility expressions for optional content	2021-04-14 17:39:41 +03:00
Tim van der Meij	d9d626a5e1	Merge pull request #13214 from calixteman/signatures Display widget signature	2021-04-10 19:35:16 +02:00
Calixte Denizet	5875ebb1ca	Display widget signature - but don't validate them for now; - Firefox will display a bar to warn that the signature validation is not supported (see https://bugzilla.mozilla.org/show_bug.cgi?id=854315) - almost all (all ?) pdf readers display signatures; - validation is done in edge but for now it's behind a pref.	2021-04-10 19:13:28 +02:00
Brendan Dahl	fc9501a637	Add support for basic structure tree for accessibility. When a PDF is "marked" we now generate a separate DOM that represents the structure tree from the PDF. This DOM is inserted into the <canvas> element and allows screen readers to walk the tree and have more information about headings, images, links, etc. To link the structure tree DOM (which is empty) to the text layer aria-owns is used. This required modifying the text layer creation so that marked items are now tracked.	2021-04-09 09:56:28 -07:00
Jonas Jenwald	f986ccdf0e	Fuzzy-match the fontName, for TrueType Collection fonts, where the "name"-table is wrong (issue 13193) The fontName, as defined in the PDF document, cannot be found in any of the "name"-tables in the TrueType Collection font. To work-around that, this patch adds a fallback code-path to allow using an approximately matching fontName rather than outright failing.	2021-04-07 15:25:32 +02:00
Jani Pehkonen	0117ee5071	Use post table when Encoding has only Differences Fixes #13107 In the issue, some TrueType glyph names have the format `uniXXXX`. Font's `Encoding` dictionary has the entry `Differences` but no `BaseEncoding`. `uniXXXX` names are converted to glyph indices using font's `post` table but currently that is done only when `BaseEncoding` exists. We must enable the conversion also when only `Differences` exists.	2021-03-31 17:58:44 +03:00
calixteman	84d7cccb1d	JS - Handle correctly hierarchy of fields (#13133 ) * JS - Handle correctly hierarchy of fields - it aims to fix #13132; - annotations can inherit their actions from the parent field; - there are some fields which act as a container for other fields: - they can be access through js so need to add them with an empty type (nothing in the spec about that but checked in Acrobat); - calculation order list (CO) can reference them so need make them through this.getField; - getArray method must return kids. - field values are number, string, ... depending of their type but nothing in the spec on how to know what's the type: - according to the comment for Canonical Format: https://www.adobe.com/content/dam/acom/en/devnet/pdf/pdfs/PDF32000_2008.pdf#page=461 - it seems that this "type" can be guessed from js action Format (when setting a type in Acrobat DC, the only affected thing is this action). - util.scand with an empty string returns the current date.	2021-03-30 08:50:35 -07:00
Calixte Denizet	9296ee6986	Skip extra objects in object stream in using offsets	2021-03-28 13:03:05 +02:00
calixteman	81c602c61c	Set CFF header to 4 when writing it because it contains 4 elements (#13149 )	2021-03-26 18:23:18 +01:00
Jonas Jenwald	5099f1977f	Support `LineAnnotation`s with empty /Rect-entries (issue 6564) This extends PR 13033 slightly, with a heuristic to support corrupt PDF documents where the `LineAnnotation`s have an empty /Rect-entry. Please note that while I have no idea if this is "correct", this patch at least makes us output the same /BBox as re-saving in Adobe Reader does.	2021-03-15 16:33:43 +01:00
Tim van der Meij	5828ff6cb0	Implement rendering line annotations without appearance stream	2021-02-28 18:57:58 +01:00
Tim van der Meij	fa6cebf045	Implement rendering square/circle annotations without appearance stream	2021-02-27 19:05:12 +01:00
Calixte Denizet	4a5f1d1b7a	JS - Fix setting a color on an annotation - strokeColor corresponds to borderColor; - support fillColor and textColor; - support colors on the different annotations; - fix typo in aforms (+test).	2021-02-20 15:24:37 +01:00
Calixte Denizet	0fc8267576	Avoid infinite loop when getting annotation field name - aims to fix issue #12963; - use a Set to track already visited objects; - remove the loop limit in getInheritableProperty and use a RefSet too.	2021-02-14 19:58:19 +01:00
dhufnagel	fc925827b2	fix initial state of checkboxes in display layer (#12904 ) consider the export value when multiple checkboxes have the same name	2021-02-12 11:22:54 +01:00
Jonas Jenwald	d3e65f24e3	Request all data, rather than throwing, when encountering general errors in `ObjectLoader._walk` (issue 9462, PR 3289 follow-up) As far as I can tell, this has been broken ever since PR 3289 (back in 2013) without anyone noticing. For any non-`MissingDataException` errors encountered in `ObjectLoader._walk`, we're simply throwing immediately which thus has the potential to completely break rendering of an entire page. In practice this is obviously only an issue for PDF documents which are in one way or another corrupt, since that's the only way that `XRef.fetch` will throw non-`MissingDataException` errors. To make matters worse these errors are intermittent, since they can only occur if the document is still loading when the `ObjectLoader`-code runs (note the early return in `ObjectLoader.load`). Please note that we cannot simply catch the error and let "normal" parsing continue in `ObjectLoader._walk`, since that could lead to errors elsewhere given that resources "below" the current one (in the graph) might not be checked as intended then. All-in-all, the only way to make absolutely sure that we won't cause unexpected `MissingDataException`s somewhere else in the code-base is to fallback to fetching the entire document in this edge-case.	2021-02-06 14:33:50 +01:00
Tim van der Meij	286271152f	Merge pull request #12910 from calixteman/bidi Add back dir property in spans in text layer	2021-01-27 22:09:00 +01:00
Calixte Denizet	539256c351	Add back dir property in spans in text layer - aims to fix #12909	2021-01-26 12:00:05 +01:00
calixteman	a3f6882b06	JS -- add support for choice widget (#12826 )	2021-01-25 23:40:57 +01:00
Dominik Hufnagel	c5083cda02	set font size and color on annotation layer use the default appearance to set the font size and color of a text annotation widget	2021-01-22 23:12:14 +01:00
Brendan Dahl	2cba290361	Merge pull request #12836 from calixteman/update_buttons JS -- update radio/checkbox values even if there are no actions	2021-01-21 14:00:26 -08:00
Calixte Denizet	0d1b19632d	Enforce linewidth to 1px when at least one of scale factor is lower than 1	2021-01-15 13:18:24 +01:00
Jonas Jenwald	cf7eb87934	Remove a duplicated reference test (PR 12812 follow-up) - Remove a duplicated reference test, see "issue12810", from the manifest. - Improve the spelling in a couple of comments in `src/core/canvas.js`, most notable of the word "parallelogram". - Update a comment, also in `src/core/canvas.js`, to actually agree with the value used to reduce confusion when reading the code.	2021-01-15 10:57:15 +01:00
Brendan Dahl	6619f1f3f2	Merge pull request #12812 from calixteman/too_thin Enforce line width to be at least 1px after applied transform	2021-01-14 15:21:44 -08:00
Jonas Jenwald	2600e59acb	Always re-measure non-embedded ArialNarrow fonts (bug 1671312, PR 12725 follow-up) While PR 12725 fixed bug 1671312 as reported, i.e. the "In the upper right corner "Purposes' has bad kerning."-part, it however broke other parts of the text rendering. Note in particular the tables, e.g. on page 2 and beyond, where the glyphs are now rendered too close together. The reason for this is that the fonts in question are non-embedded ArialNarrow, which we just replace with Helvetica which obviously is not narrow. Given that the font replacement isn't a perfect fit for non-embedded ArialNarrow, we still need to re-measure the glyph widths in this case.	2021-01-14 15:51:48 +01:00
Ross Johnson	6dae2677d5	[api-minor] Highlight search results correctly for normalized text (PR 9448) This patch is a rebased and refactored version of PR 9448, such that it applies cleanly given that `PDFFindController` has changed since that PR was opened; obviously keeping the original author information intact. This patch will thus ensure that e.g. fractions, and other things that we normalize before searching, will still be highlighted correctly in the textLayer. Furthermore, this patch also adds basic unit-tests for this functionality. Note: The `[api-minor]` tag is added, since third-party implementations of the `PDFFindController` must now always use the `pageMatchesLength` property to get accurate length information (see the `web/text_layer_builder.js` changes). Co-authored-by: Ross Johnson <ross@mazira.com> Co-authored-by: Jonas Jenwald <jonas.jenwald@gmail.com>	2021-01-12 18:08:08 +01:00
calixteman	1de1ae0be6	Merge pull request #12838 from calixteman/authors [api-minor] Change the "dc:creator" Metadata field to an Array	2021-01-12 02:44:58 -08:00
Calixte Denizet	43d5512f5c	[api-minor] Change the "dc:creator" Metadata field to an Array - add scripting support for doc.info.authors - doc.info.metadata is the raw string with xml code	2021-01-11 21:34:07 +01:00
Calixte Denizet	b3dccd66ab	Enforce line width to be at least 1px after applied transform * add a comment to explain how minimal linewidth is computed. * when context.linewidth < 1 after transform, firefox and chrome don't render in the same way (issue #12810). * set lineWidth to 1 after transform and before stroking - aims fix issue #12295 - a pixel can be transformed into a rectangle with both heights < 1. A single rescale leads to a rectangle with dim equals to 1 and the other to something greater than 1. * change the way to render rectangle with null dimensions: - right now we rely on the lineWidth set before "re" but it can be set after "re" and before "S" and in this case the rendering will be wrong. - render such rectangles as a single line.	2021-01-10 18:02:12 +01:00
Jonas Jenwald	cd9422a075	Improve handling of JPEG images without an EOI marker (issue 12841) Given that the PDF document in the issue contains the same very large JPEG image three times, this patch includes a test-case where only the first page has been extracted from it.	2021-01-09 20:19:39 +01:00
Calixte Denizet	7172f0a928	JS -- update radio/checkbox values even if there are no actions	2021-01-08 16:43:16 +01:00
Jonas Jenwald	78c32c2697	Improve the handling of errors, in `PartialEvaluator.loadFont`, occuring in `PartialEvaluator.preEvaluateFont` (issue 12823) Currently any errors thrown in `preEvaluateFont`, which is a synchronous method, will not be handled at all in the `loadFont` method and we were thus failing to return an `ErrorFont`-instance as intended here. Also, add an explicit check in `PartialEvaluator.preEvaluateFont` to ensure that Type0-fonts always have a valid dictionary.	2021-01-07 11:38:38 +01:00
Tim van der Meij	ca18af6af3	Merge pull request #12774 from calixteman/doc_action_test JS -- Add tests for print/save actions	2021-01-03 18:46:37 +01:00
Tim van der Meij	50303fc8f4	Merge pull request #12766 from Snuffleupagus/issue-11004 Ignore, rather than throwing on, unsupported Coding style default (COD) options in JPEG 2000 images (issue 11004)	2020-12-28 20:26:10 +01:00
Calixte Denizet	ffd4bc790c	JS -- Add tests for print/save actions * change PDFDocument::hasJSActions to return true when there are JS actions in catalog.	2020-12-24 18:51:00 +01:00
Calixte Denizet	7c3facb174	JS -- Add support for buttons * radio buttons * checkboxes	2020-12-22 16:41:51 +01:00
Jonas Jenwald	cffb7af3b0	Ignore, rather than throwing on, unsupported Coding style default (COD) options in JPEG 2000 images (issue 11004) Similar to other markers that we currently skip, by ignoring unsupported Coding style default (COD) options we'll at least render something here (although some JPEG 2000 images may look slightly wrong). Note that if the unsupported COD options lead to additional errors, during parsing, we'll still abort parsing of the JPEG 2000 image.	2020-12-21 20:35:52 +01:00
Brendan Dahl	3ea1c43b15	Merge pull request #12751 from calixteman/da_not_a_string Add a default DA for textfield to avoid issues when printing or saving	2020-12-21 09:44:08 -08:00
Calixte Denizet	a7c682c600	Add a default DA for textfield to avoid issues when printing or saving * it aims to fix issue #12750	2020-12-19 23:38:45 +01:00

1 2 3 4 5 ...

1002 Commits