pdf.js

Author	SHA1	Message	Date
Calixte Denizet	5dc7f4ade8	XFA - CDATA can be xml so parse it when required	2021-06-07 10:38:39 +02:00
Calixte Denizet	112645ea3d	XFA - Don't bind a form node with an empty value when the data node doesn't exist	2021-06-06 17:59:01 +02:00
Jonas Jenwald	20770cb06a	Improve text-selection for Type3 fonts with empty /FontBBox-entries (issue 6605) For Type3 fonts where the /CharProcs-streams of the individual glyph starts with a `d1` operator, we can use that to build a fallback bounding box for the font and thus improve text-selection in some cases.	2021-06-05 08:09:29 +02:00
Brendan Dahl	6255c2a8f3	Merge pull request #13376 from calixteman/6132 Replace command with not enough args by an endchar in CFF font	2021-06-04 14:00:51 -07:00
Calixte Denizet	11573ddd16	XFA - Implement usehref support - attribute 'use' was already implemented but not usehref - in general, usehref should make reference to current document - add support for SOM expressions in use and usehref to search a node. - get prototype for all nodes if any.	2021-06-04 14:57:05 +02:00
Jonas Jenwald	af78ba64bd	Don't change options of the globally used `PartialEvaluator` in the "should render checkbox with fallback font for printing" unit-test Given that the same `PartialEvaluator`-instance is used for a lot of these unit-tests, manually changing the options in any one test-case could lead to intermittently failing unit-tests since they're run in a random order. To fix this, we simply have to use the existing method to clone the `PartialEvaluator`-instance but with the custom options.	2021-05-31 12:14:58 +02:00
calixteman	8c53bf8647	Merge pull request #13437 from calixteman/xfa_mv_root XFA - Move the fake HTML representation of XFA from the worker to the main thread	2021-05-31 10:14:15 +02:00
Tim van der Meij	a0ce3cb3b4	Merge pull request #13448 from Snuffleupagus/_setDefaultAppearance-alpha Support strokeAlpha/fillAlpha when creating a fallback appearance stream (issue 6810)	2021-05-28 23:39:36 +02:00
Jonas Jenwald	707a9e3b02	Work-around for HighlightAnnotations without a top-level /ExtGState-entry (issue 13242) For HighlightAnnotations with a built-in appearance stream, we still rely on it to specify the opacity correctly via a suitable blend mode. However, if the Annotation-drawing operators are placed within a /XObject of the /Form-type, the /ExtGState won't apply to the final rendering and the result is that the highlighting obscures the underlying text. The more correct and general solution would likely be to somehow modify the implementation in `src/display/canvas.js`, to special-case handling of /Form-type /XObjects when rendering Annotations. Since we can very easily work-around this problem for now by using the "no appearance stream" code-path, doing something here ought to be preferable. This patch is (obviously) merely a work-around, but given that the referenced issue is (as far as I know) the first case we've seen of this problem a simple solution will hopefully suffice for now.	2021-05-28 13:49:27 +02:00
Jonas Jenwald	a6447f2ca2	Support strokeAlpha/fillAlpha when creating a fallback appearance stream (issue 6810) This fixes the colours, by respecting the strokeAlpha/fillAlpha-values, for a couple of Annotations in the PDF document from issue 13447.[1] --- [1] Some of the annotations still won't render at all, when compared with Adobe Reader, but that could/should probably be handled separately.	2021-05-27 16:23:18 +02:00
Calixte Denizet	45c3f00a27	XFA - Move the fake HTML representation of XFA from the worker to the main thread - the only goal of this patch is to be able to get synchronously the fake html when printing from firefox: - in order to print we need to inject some html in beforeprint callback but we cannot block in waiting for all the pages. - from a memory point of view: it doesn't change anything since the fake HTML is deleted in the worker; - this way we don't break any assumptions.	2021-05-25 19:33:07 +02:00
Calixte Denizet	7cebdbd58c	XFA - Fix lot of layout issues - I thought it was possible to rely on browser layout engine to handle layout stuff but it isn't possible - mainly because when a contentArea overflows, we must continue to layout in the next contentArea - when no more contentArea is available then we must go to the next page... - we must handle breakBefore and breakAfter which allows to "break" the layout to go to the next container - Sometimes some containers don't provide their dimensions so we must compute them in order to know where to put them in their parents but to compute those dimensions we need to layout the container itself... - See top of file layout.js for more explanations about layout. - fix few bugs in other places I met during my work on layout.	2021-05-25 17:51:36 +02:00
Tim van der Meij	99430225b0	Drop obsolete logic from the `downloadFile` function in `test/downloadutils.js` This code is old and predates the improvements we made to the test manifest to only contain working URLs (either Web Archive or GitHub/Bugzilla links), so the fallback logic to try the Web Archive is no longer necessary. This greatly simplifies the function and also makes sure that we fail directly in case a bad URL is added to the manifest, instead of having it work "accidentally" because of this logic, since we want the manifest to be correct at all times (and otherwise fail loudly).	2021-05-22 14:45:42 +02:00
Tim van der Meij	d1d9b9043d	Merge pull request #13415 from Snuffleupagus/getDestination-out-of-order Improve handling of named destinations in out-of-order NameTrees (PR 10274 follow-up)	2021-05-21 20:15:09 +02:00
Jonas Jenwald	8d5689387b	Improve handling of named destinations in out-of-order NameTrees (PR 10274 follow-up) According to the specification, see https://web.archive.org/web/20210404042322if_/https://www.adobe.com/content/dam/acom/en/devnet/acrobat/pdfs/PDF32000_2008.pdf#G6.2384179, the keys of a NameTree/NumberTree should be ordered. For corrupt PDF files, which violate this assumption, it's thus possible that trying to lookup a single entry fails. Previously, in PR 10274, we implemented a fallback that only applies to the "bottom" node of a NameTree/NumberTree, which in general might not actually help for sufficiently corrupt NameTree/NumberTree data. Instead we remove the current limited fallback from `NameOrNumberTree.get`, and defer to the call-site to handle this case explicitly e.g. by using `NameOrNumberTree.getAll` for data where that makes sense. For well-formed documents, these changes should not lead to any additional data fetching/parsing. Finally, as part of these changes, the validation of named destination data is improved in the `Catalog` and a new unit-test is also added.	2021-05-21 15:48:37 +02:00
Jonas Jenwald	1a8d05fdcf	Remove some, with Prettier `2.3.0`, unnecessary `// prettier-ignore` comments To get the maximum benefit from something like Prettier, you obviously don't want to disable the automatic formatting unless absolutely necessary. When we added Prettier there were a number of cases, mostly involving larger Arrays, which required disabling of the automatic formatting for overall readability and/or to not break inline comments. With changes in Prettier version `2.3.0`, see [the release notes](https://prettier.io/blog/2021/05/09/2.3.0.html#concise-formatting-of-number-only-arrays-10106httpsgithubcomprettierprettierpull10106-10160httpsgithubcomprettierprettierpull10160-by-thorn0httpsgithubcomthorn0), there's now better formatting support for Arrays containing only numbers. Hence we can now remove a number of `// prettier-ignore` comments, and thus get the benefit of automatic formatting in (slightly) more of the code-base.	2021-05-19 11:36:03 +02:00
Calixte Denizet	4544ebf38a	Handle PI with no value in xml parser - an XML PI contains a target and optionally some content (see https://en.wikipedia.org/wiki/Processing_Instruction) - the parser expected to always have some content and so it could lead to wrong parsing.	2021-05-18 10:22:18 +02:00
Brendan Dahl	17e9cfcd2a	Merge pull request #13328 from calixteman/js_display1 JS - Add support for display property	2021-05-17 08:47:13 -07:00
Jonas Jenwald	8943bcd3c3	Account for formatting changes in Prettier version `2.3.0` With the exception of one tweaked `eslint-disable` comment, in `web/generic_scripting.js`, this patch was generated automatically using `gulp lint --fix`. Please find additional information at: - https://github.com/prettier/prettier/releases/tag/2.3.0 - https://prettier.io/blog/2021/05/09/2.3.0.html	2021-05-16 11:44:05 +02:00
Calixte Denizet	1a2cea21a5	Replace command with not enough args by an endchar in CFF font - Right now, a glyph with an erroneous outline is replaced by an empty glyph if the error is far enough from the start there's likely something to render so the idea is to replace a command with args by an endchar when no args are on the stack: this way OTS is likely happy (no remaining args on stack) and we can draw something which is likely better than nothing.	2021-05-14 13:45:45 +02:00
Brendan Dahl	53991d0924	Fix tiling pattern with smask. After drawing a tiling pattern we were not calling endDrawing, which handles compositing any active smasks. Fixes #8565.	2021-05-12 11:42:08 -07:00
Tim van der Meij	ba99e54c66	Merge pull request #13361 from brendandahl/patterns-fixes Fix several issues with radial/axial shadings and tiling patterns.	2021-05-12 20:27:37 +02:00
Jonas Jenwald	757636d519	Convert the remaining functions in `src/core/primitives.js` to use standard classes This patch was tested using the PDF file from issue 2618, i.e. https://bug570667.bugzilla-attachments.gnome.org/attachment.cgi?id=226471, with the following manifest file: ``` [ { "id": "issue2618", "file": "../web/pdfs/issue2618.pdf", "md5": "", "rounds": 50, "type": "eq" } ] ``` which gave the following results when comparing this patch against the `master` branch: ``` -- Grouped By browser, stat -- browser \| stat \| Count \| Baseline(ms) \| Current(ms) \| +/- \| % \| Result(P<.05) ------- \| ------------ \| ----- \| ------------ \| ----------- \| --- \| ---- \| ------------- firefox \| Overall \| 50 \| 3417 \| 3426 \| 9 \| 0.27 \| firefox \| Page Request \| 50 \| 1 \| 1 \| 0 \| 5.41 \| firefox \| Rendering \| 50 \| 3416 \| 3426 \| 9 \| 0.27 \| ``` Based on these results, there's no significant performance regression from using standard classes and this patch should thus be OK.	2021-05-12 09:36:28 +02:00
Brendan Dahl	ac44afa70e	Fix several issues with radial/axial shadings and tiling patterns. Previously, we set the base transformation and pattern matrix directly to the main rendering ctx of the page, however doing this caused the current transform to be lost. This would cause issues with things like shear missing so the pattern was misaligned or when stroke was used the scale of the line width or dash would be wrong. Instead we should leave the current transform and use setTransfrom on the pattern so it is applied correctly. For axial and radial shadings I had to create a temporary canvas to draw the shading so I could in turn use setTransform. Fixes: #13325, #6769, #7847, #11018, #11597, #11473 The following already in the corpus are improved: issue8078-page1 issue1877-page1	2021-05-11 16:32:24 -07:00
Calixte Denizet	38503d1c5f	Fix some integration tests	2021-05-08 16:27:45 +02:00
Jonas Jenwald	fc59a5f709	Take the `W` array into account when computing the hash, in `PartialEvaluator.preEvaluateFont`, for composite fonts (issue 13343) Without this some composite fonts may incorrectly end up with matching `hash`es, thus breaking rendering since we'll not actually try to load/parse some of the fonts. Please note: Given that the document, in the referenced issue, doesn't embed any of its fonts there's no guarantee that it renders correctly in all configurations even with this patch.	2021-05-07 21:22:36 +02:00
Calixte Denizet	af125cd299	JS - Add support for display property - in annotation_layer, move common properties treatment in a common method instead having duplicated code in each widget.	2021-05-06 11:15:38 +02:00
Tim van der Meij	afb8c4fd25	Merge pull request #13327 from Snuffleupagus/split-fonts Split the functionality in `src/core/fonts.js` into multiple files, and use standard classes	2021-05-05 20:16:24 +02:00
Calixte Denizet	451091b89b	Fix integration test in the windows bot	2021-05-05 19:05:08 +02:00
Calixte Denizet	3f29892d63	[JS] Fix several issues found in pdf in #13269 - app.alert and few other function can use an object as parameter ({cMsg: ...}); - support app.alert with a question and a yes/no answer; - update field siblings when one is changed in an action; - stop calculation if calculate is set to false in the middle of calculations; - get a boolean for checkboxes when they've been set through annotationStorage instead of a string.	2021-05-04 19:21:51 +02:00
Calixte Denizet	549aae6c3d	JS -- add support for page property in field	2021-05-03 15:46:29 +02:00
Jonas Jenwald	77b258440b	Move some constants and helper functions `from src/core/fonts.js` and into their own file - `FontFlags`, is used in both `src/core/fonts.js` and `src/core/evaluator.js`. - `getFontType`, same as the above. - `MacStandardGlyphOrdering`, is a fairly large data-structure and `src/core/fonts.js` is already a very large file. - `recoverGlyphName`, a dependency of `type1FontGlyphMapping`; please see below. - `SEAC_ANALYSIS_ENABLED`, is used by both `Type1Font`, `CFFFont`, and unit-tests; please see below. - `type1FontGlyphMapping`, is used by both `Type1Font` and `CFFFont` which a later patch will move to their own files.	2021-05-02 21:00:29 +02:00
Jonas Jenwald	6912bb5e0a	Move the `IdentityToUnicodeMap`/`ToUnicodeMap` from `src/core/fonts.js` and into its own file	2021-05-02 21:00:29 +02:00
Jonas Jenwald	883ce5d120	Fix highlighting of search results when the `textLayer` contains `br`-elements (PR 13257 follow-up, issue 13323) Apparently we need to layout `br`-elements in the same exact way as the regular `span`-elements which contain the text-content.	2021-05-02 15:36:01 +02:00
Tim van der Meij	f6f335173d	Merge pull request #13303 from Snuffleupagus/BaseStream Add an abstract base-class, which all the various Stream implementations inherit from	2021-05-01 19:13:36 +02:00
calixteman	af4dc55019	[api-minor] Fix the way to chunk the strings (#13257 ) - Improve chunking in order to fix some bugs where the spaces aren't here: * track the last position where a glyph has been drawn; * when a new glyph (first glyph in a chunk) is added then compare its position with the last saved one and add a space or break: - there are multiple ways to move the glyphs and to avoid to have to deal with all the different possibilities it's a way easier to just compare positions; - and so there is now one function (i.e. "compareWithLastPosition") where all the job is done. - Add some breaks in order to get lines; - Remove the multiple whites spaces: * some spaces were filled with several whites spaces and so it makes harder to find some sequences of words using the search tool; * other pdf readers replace spaces by one white space. Update src/core/evaluator.js Co-authored-by: Jonas Jenwald <jonas.jenwald@gmail.com> Co-authored-by: Jonas Jenwald <jonas.jenwald@gmail.com>	2021-04-30 14:41:13 +02:00
Jonas Jenwald	66d9d83dcb	Move the `PredictorStream` from `src/core/stream.js` and into its own file	2021-04-28 10:16:51 +02:00
Brendan Dahl	d10da907da	Fix position of highlighted all text. (#13306 ) Adds a new integration test to ensure we don't regress this again.	2021-04-28 10:15:31 +02:00
Tim van der Meij	60ab15427f	Implement rendering polyline/polygon annotations without appearance stream	2021-04-27 19:02:20 +02:00
Jonas Jenwald	6f4394fcd8	Support `InkAnnotation`s without appearance streams (issue 13298) (#13301 ) For now, we keep things purposely simple by using straight lines (rather than curves); please see https://www.adobe.com/content/dam/acom/en/devnet/acrobat/pdfs/PDF32000_2008.pdf#G11.2096579	2021-04-27 11:49:03 +02:00
Tim van der Meij	da0e7ea969	Merge pull request #13272 from calixteman/issue13271 Update all the text widgets having the same name with the same value	2021-04-23 21:08:54 +02:00
Jonas Jenwald	57a1ea840f	Ensure that `saveDocument` works if there's no /ID-entry in the PDF document (issue 13279) (#13280 ) First of all, while it should be very unlikely that the /ID-entry is an indirect object, note how we're using `Dict.get` when parsing it e.g. in `PDFDocument.fingerprint`. Hence we definitely should be consistent here, since if the /ID-entry is an indirect object the existing code in `src/core/writer.js` would already fail. Secondly, to fix the referenced issue, we also need to check that the /ID-entry actually is an Array before attempting to access its contents in `src/core/writer.js`. Drive-by change: In the `xrefInfo` object passed to the `incrementalUpdate` function, re-name the `encrypt` property to `encryptRef` since its data is fetched using `Dict.getRaw` (given the names of the other properties fetched similarly).	2021-04-22 12:08:56 +02:00
Jonas Jenwald	7b8d2495ca	Convert the font-test `ttx` helper function to use the Fetch API By replacing `XMLHttpRequest` with a `fetch` call, the helper function can be modernized to use async/await instead. Note that the headers doesn't seem necessary to set now, since: - The Fetch API provides a method for accessing the response as text, which renders the "Content-type" header unnecessary. - According to https://developer.mozilla.org/en-US/docs/Glossary/Forbidden_header_name, the "Content-length" header isn't necessary.	2021-04-20 23:44:15 +02:00
Calixte Denizet	e868ab0051	Update all the text widgets having the same name with the same value	2021-04-20 20:03:19 +02:00
Jonas Jenwald	3d55b2b10e	Replace `done` callbacks in the font-tests with async/await instead	2021-04-19 13:26:39 +02:00
Tim van der Meij	d42f3d0bfe	Convert done callbacks to async/await in `test/unit/evaluator_spec.js`	2021-04-18 14:20:54 +02:00
Tim van der Meij	f4237d3a09	Convert done callbacks to async/await in `test/unit/annotation_spec.js`	2021-04-17 19:59:18 +02:00
Tim van der Meij	c2f3a71eca	Convert done callbacks to async/await in `test/unit/api_spec.js`	2021-04-17 17:52:23 +02:00
Jonas Jenwald	f560fe6875	A couple of small scripting/XFA-related tweaks in the worker-code - Use `PDFManager.ensureDoc`, rather than `PDFManager.ensure`, in a couple of spots in the code. If there exists a short-hand format, we should obviously use it whenever possible. - Fix a unit-test helper, to account for the previous changes. (Also, converts a function to be `async` instead.) - Add one more exists-check in `PDFDocument.loadXfaFonts`, which I missed to suggest in PR 13146, to prevent any possible errors if the method is ever called in a situation where it shouldn't be. Also, print a warning if the actual font-loading fails since that could help future debugging. (Finally, reduce overall indentation in the loop.) - Slightly unrelated, but make a small tweak of a comment in `src/core/fonts.js` to reduce possible confusion.	2021-04-17 10:34:22 +02:00
Brendan Dahl	ac3fa1e3d7	Merge pull request #13146 from calixteman/xfa_fonts XFA -- Load fonts permanently from the pdf	2021-04-16 12:55:12 -07:00

... 6 7 8 9 10 ...

2711 Commits