pdf.js

Author	SHA1	Message	Date
Tim van der Meij	5828ff6cb0	Implement rendering line annotations without appearance stream	2021-02-28 18:57:58 +01:00
Tim van der Meij	fa6cebf045	Implement rendering square/circle annotations without appearance stream	2021-02-27 19:05:12 +01:00
Calixte Denizet	4a5f1d1b7a	JS - Fix setting a color on an annotation - strokeColor corresponds to borderColor; - support fillColor and textColor; - support colors on the different annotations; - fix typo in aforms (+test).	2021-02-20 15:24:37 +01:00
Calixte Denizet	0fc8267576	Avoid infinite loop when getting annotation field name - aims to fix issue #12963; - use a Set to track already visited objects; - remove the loop limit in getInheritableProperty and use a RefSet too.	2021-02-14 19:58:19 +01:00
calixteman	a3f6882b06	JS -- add support for choice widget (#12826 )	2021-01-25 23:40:57 +01:00
Dominik Hufnagel	c5083cda02	set font size and color on annotation layer use the default appearance to set the font size and color of a text annotation widget	2021-01-22 23:12:14 +01:00
Calixte Denizet	0d1b19632d	Enforce linewidth to 1px when at least one of scale factor is lower than 1	2021-01-15 13:18:24 +01:00
Jonas Jenwald	cf7eb87934	Remove a duplicated reference test (PR 12812 follow-up) - Remove a duplicated reference test, see "issue12810", from the manifest. - Improve the spelling in a couple of comments in `src/core/canvas.js`, most notable of the word "parallelogram". - Update a comment, also in `src/core/canvas.js`, to actually agree with the value used to reduce confusion when reading the code.	2021-01-15 10:57:15 +01:00
Brendan Dahl	6619f1f3f2	Merge pull request #12812 from calixteman/too_thin Enforce line width to be at least 1px after applied transform	2021-01-14 15:21:44 -08:00
Jonas Jenwald	2600e59acb	Always re-measure non-embedded ArialNarrow fonts (bug 1671312, PR 12725 follow-up) While PR 12725 fixed bug 1671312 as reported, i.e. the "In the upper right corner "Purposes' has bad kerning."-part, it however broke other parts of the text rendering. Note in particular the tables, e.g. on page 2 and beyond, where the glyphs are now rendered too close together. The reason for this is that the fonts in question are non-embedded ArialNarrow, which we just replace with Helvetica which obviously is not narrow. Given that the font replacement isn't a perfect fit for non-embedded ArialNarrow, we still need to re-measure the glyph widths in this case.	2021-01-14 15:51:48 +01:00
Ross Johnson	6dae2677d5	[api-minor] Highlight search results correctly for normalized text (PR 9448) This patch is a rebased and refactored version of PR 9448, such that it applies cleanly given that `PDFFindController` has changed since that PR was opened; obviously keeping the original author information intact. This patch will thus ensure that e.g. fractions, and other things that we normalize before searching, will still be highlighted correctly in the textLayer. Furthermore, this patch also adds basic unit-tests for this functionality. Note: The `[api-minor]` tag is added, since third-party implementations of the `PDFFindController` must now always use the `pageMatchesLength` property to get accurate length information (see the `web/text_layer_builder.js` changes). Co-authored-by: Ross Johnson <ross@mazira.com> Co-authored-by: Jonas Jenwald <jonas.jenwald@gmail.com>	2021-01-12 18:08:08 +01:00
calixteman	1de1ae0be6	Merge pull request #12838 from calixteman/authors [api-minor] Change the "dc:creator" Metadata field to an Array	2021-01-12 02:44:58 -08:00
Calixte Denizet	43d5512f5c	[api-minor] Change the "dc:creator" Metadata field to an Array - add scripting support for doc.info.authors - doc.info.metadata is the raw string with xml code	2021-01-11 21:34:07 +01:00
Calixte Denizet	b3dccd66ab	Enforce line width to be at least 1px after applied transform * add a comment to explain how minimal linewidth is computed. * when context.linewidth < 1 after transform, firefox and chrome don't render in the same way (issue #12810). * set lineWidth to 1 after transform and before stroking - aims fix issue #12295 - a pixel can be transformed into a rectangle with both heights < 1. A single rescale leads to a rectangle with dim equals to 1 and the other to something greater than 1. * change the way to render rectangle with null dimensions: - right now we rely on the lineWidth set before "re" but it can be set after "re" and before "S" and in this case the rendering will be wrong. - render such rectangles as a single line.	2021-01-10 18:02:12 +01:00
Jonas Jenwald	cd9422a075	Improve handling of JPEG images without an EOI marker (issue 12841) Given that the PDF document in the issue contains the same very large JPEG image three times, this patch includes a test-case where only the first page has been extracted from it.	2021-01-09 20:19:39 +01:00
Jonas Jenwald	78c32c2697	Improve the handling of errors, in `PartialEvaluator.loadFont`, occuring in `PartialEvaluator.preEvaluateFont` (issue 12823) Currently any errors thrown in `preEvaluateFont`, which is a synchronous method, will not be handled at all in the `loadFont` method and we were thus failing to return an `ErrorFont`-instance as intended here. Also, add an explicit check in `PartialEvaluator.preEvaluateFont` to ensure that Type0-fonts always have a valid dictionary.	2021-01-07 11:38:38 +01:00
Calixte Denizet	ffd4bc790c	JS -- Add tests for print/save actions * change PDFDocument::hasJSActions to return true when there are JS actions in catalog.	2020-12-24 18:51:00 +01:00
Calixte Denizet	7c3facb174	JS -- Add support for buttons * radio buttons * checkboxes	2020-12-22 16:41:51 +01:00
Brendan Dahl	3ea1c43b15	Merge pull request #12751 from calixteman/da_not_a_string Add a default DA for textfield to avoid issues when printing or saving	2020-12-21 09:44:08 -08:00
Calixte Denizet	a7c682c600	Add a default DA for textfield to avoid issues when printing or saving * it aims to fix issue #12750	2020-12-19 23:38:45 +01:00
Calixte Denizet	1e2173f038	JS - Collect and execute actions at doc and pages level * the goal is to execute actions like Open or OpenAction * can be tested with issue6106.pdf (auto-print) * once #12701 is merged, we can add page actions	2020-12-18 20:03:59 +01:00
Calixte Denizet	c6c9cc96bf	Follow-up of #12707 : Add an integration test for checkboxes as radio buttons * Integration tests: Add a function to load a pdf and wait for a selected element * Integration tests: Add a function to close all the open pages	2020-12-15 00:00:04 +01:00
Tim van der Meij	d1848f5022	Merge pull request #12725 from brendandahl/remeasure-std Use widths defined by font for standard fonts.	2020-12-11 20:36:19 +01:00
Jonas Jenwald	67e5db75d8	Ignore color-operators in Type3 glyphs beginning with a `d1` operator (issue 12705) Please refer to the PDF specification at https://www.adobe.com/content/dam/acom/en/devnet/acrobat/pdfs/PDF32000_2008.pdf#G8.1977497 and https://www.adobe.com/content/dam/acom/en/devnet/acrobat/pdfs/PDF32000_2008.pdf#G7.3998470 This patch removes the color-operators in the evaluator, since that should be more efficient than doing it repeatedly in the main-thread when rendering the Type3 glyphs.	2020-12-11 15:49:13 +01:00
Brendan Dahl	45d9ab6e45	Use widths defined by font for standard fonts. There doesn't seem to be anything definitive about this in the spec, but from experimenting, it seems acrobat lets PDFs override the widths of the standard fonts.	2020-12-10 15:30:39 -08:00
Tim van der Meij	012e15f7a3	Fix non-standard quadpoints orders for annotations This change requires us to use valid quadpoints arrays in the existing unit tests too due to the normalization.	2020-12-06 16:02:41 +01:00
Brendan Dahl	4ba28de260	Merge pull request #12567 from calixteman/hidden [api-minor] JS -- hidden annotations must be built in case a script show them	2020-11-19 08:35:47 -08:00
Calixte Denizet	611207d2c9	Fix popup for highlights without popup (follow-up of #12505 ) * remove 1st param of _createPopup (almost useless for a method) * prepend popup div to avoid to have them on top of some highlights (and so "disable" partially mouse events) * add a ref test for issue #12504	2020-11-10 17:33:54 +01:00
Calixte Denizet	b11592a756	JS -- hidden annotations must be built in case a script show them * in some pdf, there are actions with "event.source.hidden = ..." * in order to handle visibility when printing, annotationStorage is extended to store multiple properties (value, hidden, editable, ...)	2020-11-10 12:48:34 +01:00
Jakub Olek	b642d49108	Make sure that Popup is rendered next to trigger for textAnnotation	2020-11-05 06:45:17 +01:00
Calixte Denizet	6be2f84b4e	Render not displayed annotations in using normal appearance when printing	2020-10-27 19:00:31 +01:00
Calixte Denizet	37c86b2daa	Fallback font for buttons must be ZapfDingbats. Fix bug https://bugzilla.mozilla.org/show_bug.cgi?id=1669099.	2020-10-24 12:00:03 +02:00
Jani Pehkonen	935568c2f1	Fix invalid `XUID` entries in CFF fonts In CFF fonts, entry `XUID` should be an array that has no more than 16 elements. In the issue, the length is 20, which causes the fonts to fail. See Appendix B, "Implementation Limits" in PostScript Language Reference Manual https://web.archive.org/web/20170218093716/https://www.adobe.com/products/postscript/pdfs/PLRM.pdf Actually entries `XUID` and `UniqueID` are obsolete altogether. https://blogs.adobe.com/CCJKType/2016/06/no-more-xuid-arrays.html	2020-10-05 17:38:01 +03:00
Jonas Jenwald	bd3b15b897	Use the `cidToGidMap`, if it exists, when building the glyph mapping for non-embedded composite fonts (issue 12418)	2020-09-28 14:40:43 +02:00
Tim van der Meij	3ecd984758	Implement resetting of created streams for annotations	2020-09-14 23:08:50 +02:00
Tim van der Meij	20c891542b	Merge pull request #12269 from calixteman/highlight Add support for missing appearances for hightlights, strikeout, squiggly and underline annotations.	2020-09-06 22:25:36 +02:00
Calixte Denizet	65ecd981fe	Add support for missing appearances for hightlights, strikeout, squiggly and underline annotations.	2020-09-06 15:40:15 +02:00
Brendan Dahl	45e8a31cc0	Fix handling of symbolic fonts and unicode cmaps. In issue 12120, the font has a 1,0 cmap and is marked symbolic which according to the spec means we should directly use the cmap instead of the extra steps that are defined in 9.6.6.4. However, just fixing that caused bug 1057544 to break. The font in bug 1057544 has a 0,1 cmap (Unicode 1.1) which we were not using, but is easy to support. We're also easily able to support some of the other unicode cmaps, so I added those as well. There was also a second issue with bug 1057544, the cmap doesn't have a mapping for the "quoteright" glyph, but it is defined in the post table. To handle this, I've moved post table as a fallback for any font that has an encoding.	2020-08-27 14:33:11 -07:00
Jonas Jenwald	1058f16605	Add (basic) support for transfer functions to Images (issue 6931, bug 1149713) This is similar to the existing transfer function support for SMasks, but extended to simple image data. Please note that the extra amount of data now being sent to the worker-thread, for affected /ExtGState entries, is limited to at most 4 `Uint8Array`s each with a length of 256 elements. Refer to https://www.adobe.com/content/dam/acom/en/devnet/acrobat/pdfs/PDF32000_2008.pdf#G9.1658137 for additional details.	2020-08-17 10:34:12 +02:00
Brendan Dahl	7fb01f9f2a	Merge pull request #12186 from brendandahl/loca-2 Fix bad truetype loca tables.	2020-08-10 20:34:19 -07:00
Brendan Dahl	f6dff81223	Fix bad truetype loca tables. Some fonts have loca tables that aren't sorted or use 0 as an offset to signal a missing glyph. This fixes the bad loca tables by sorting them and then rewriting the loca table and potentially re-ordering the glyf table to match. Fixes #11131 and bug 1650302.	2020-08-10 14:15:49 -07:00
Jonas Jenwald	128e41d6be	Add a test-case for issue 4398 Issue 4398 was fixed by PR 4437, however a test-case wasn't included as far as I can tell. Given that PR 12186 is now in the process of re-factoring that code, adding a test-case cannot hurt as far as I'm concerned.	2020-08-08 11:50:19 +02:00
Brendan Dahl	ac494a2278	Add support for optional marked content. Add a new method to the API to get the optional content configuration. Add a new render task param that accepts the above configuration. For now, the optional content is not controllable by the user in the viewer, but renders with the default configuration in the PDF. All of the test files added exhibit different uses of optional content. Fixes #269. Fix test to work with optional content. - Change the stopAtErrors test to ensure the operator list has something, instead of asserting the exact number of operators.	2020-08-04 09:26:55 -07:00
Jonas Jenwald	fef24658e7	Adjust the heuristics used when dealing with rectangles, i.e. `re` operators, with zero width/height (issue 12010)	2020-07-02 00:02:49 +02:00
Jonas Jenwald	e451cabe37	Add a reduced test-case for issue 4260 (PR 4521 follow-up)	2020-06-30 09:26:41 +02:00
Jonas Jenwald	28d2ada59c	Attempt to detect inline images which contain "EI" sequence in the actual image data (issue 11124) This should reduce the possibility of accidentally truncating some inline images, while not causing the "EI" detection to become significantly slower.[1] There's obviously a possibility that these added checks are not sufficient to catch every single case of "EI" sequences within the actual inline image data, but without specific test-cases I decided against over-engineering the solution here. Please note: The interpolation issues are somewhat orthogonal to the main issue here, which is the truncated image, and it's already tracked elsewhere. --- [1] I've looked at the issue a few times, and this is the first approach that I was able to come up with that didn't cause unacceptable performance regressions in e.g. issue 2618.	2020-06-26 13:15:06 +02:00
Carlos Rodríguez	802aa14a99	Jpeg encoded with RGB -instead of YCbCr- write the components index as "RGB" in ASCII to say it so On ISO/IEC 10918-6:2013 (E), section 6.1: (http://www.itu.int/rec/T-REC-T.872-201206-I/en) "Images encoded with three components are assumed to be RGB data encoded as YCbCr unless the image contains an APP14 marker segment as specified in 6.5.3, in which case the colour encoding is considered either RGB or YCbCr according to the application data of the APP14 marker segment" But common jpeg libraries consider RGB too if components index are ASCII R (0x52), G (0x47) and B (0x42): https://stackoverflow.com/questions/50798014/determining-color-space-for-jpeg/50861048 Issue #11931	2020-06-04 15:08:47 +02:00
Jonas Jenwald	56ebf01ae0	Avoid hanging the worker-thread for CMap data with ridiculously large ranges (issue 11922) This patch was inspired by `ad2b64f124/xpdf/CharCodeToUnicode.cc (L480-L484)`	2020-05-22 15:23:17 +02:00
Jonas Jenwald	dda6626f40	Attempt to cache repeated images at the document, rather than the page, level (issue 11878) Currently image resources, as opposed to e.g. font resources, are handled exclusively on a page-specific basis. Generally speaking this makes sense, since pages are separate from each other, however there's PDF documents where many (or even all) pages actually references exactly the same image resources (through the XRef table). Hence, in some cases, we're decoding the same images over and over for every page which is obviously slow and wasting both CPU and memory resources better used elsewhere.[1] Obviously we cannot simply treat all image resources as-if they're used throughout the entire PDF document, since that would end up increasing memory usage too much.[2] However, by introducing a `GlobalImageCache` in the worker we can track image resources that appear on more than one page. Hence we can switch image resources from being page-specific to being document-specific, once the image resource has been seen on more than a certain number of pages. In many cases, such as e.g. the referenced issue, this patch will thus lead to reduced memory usage for image resources. Scrolling through all pages of the document, there's now only a few main-thread copies of the same image data, as opposed to one for each rendered page (i.e. there could theoretically be twenty copies of the image data). While this obviously benefit both CPU and memory usage in this case, for very large image data this patch may possibly increase persistent main-thread memory usage a tiny bit. Thus to avoid negatively affecting memory usage too much in general, particularly on the main-thread, the `GlobalImageCache` will only cache a certain number of image resources at the document level and simply fallback to the default behaviour. Unfortunately the asynchronous nature of the code, with ranged/streamed loading of data, actually makes all of this much more complicated than if all data could be assumed to be immediately available.[3] Please note: The patch will lead to small movement in some existing test-cases, since we're now using the built-in PDF.js JPEG decoder more. This was done in order to simplify the overall implementation, especially on the main-thread, by limiting it to only the `OPS.paintImageXObject` operator. --- [1] There's e.g. PDF documents that use the same image as background on all pages. [2] Given that data stored in the `commonObjs`, on the main-thread, are only cleared manually through `PDFDocumentProxy.cleanup`. This as opposed to data stored in the `objs` of each page, which is automatically removed when the page is cleaned-up e.g. by being evicted from the cache in the default viewer. [3] If the latter case were true, we could simply check for repeat images before parsing started and thus avoid handling any duplicate image resources.	2020-05-21 18:13:45 +02:00
Jonas Jenwald	44b4a74f48	A couple of small `String.fromCodePoint` improvements (PR 11698 and 11769 follow-up) - Add a reduced test-case for issue 11768, to prevent future regressions. (Given that PR 11769 is only a work-around, rather than a proper solution, it may not be entirely accurate for the issue to be closed as fixed.) - Add more validation of the charCode, as found by the heuristics, in `PartialEvaluator._buildSimpleFontToUnicode` to prevent future issues.	2020-04-15 13:45:08 +02:00

1 2 3 4 5 ...

500 Commits