- Use `PDFManager.ensureDoc`, rather than `PDFManager.ensure`, in a couple of spots in the code. When a short-hand format exists, we should obviously use it whenever possible (see the sketch after this list).
- Fix a unit-test helper to account for the previous changes. (Also, convert a function to be `async` instead.)
- Add one more exists-check in `PDFDocument.loadXfaFonts`, which I forgot to suggest in PR 13146, to prevent any possible errors if the method is ever called in a situation where it shouldn't be.
Also, print a warning if the actual font-loading fails, since that could help future debugging. (Finally, reduce overall indentation in the loop.)
- Slightly unrelated, but make a small tweak of a comment in `src/core/fonts.js` to reduce possible confusion.
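For context, a minimal sketch of the relation between the two methods, assuming the usual `BasePdfManager` pattern; the method bodies are simplified and not the actual implementation:

```js
class BasePdfManager {
  ensure(obj, prop, args) {
    // Loads any missing data and then resolves with `obj[prop]` (or with the
    // result of calling it with `args`). Heavily simplified here.
    return Promise.resolve(obj[prop]);
  }

  ensureDoc(prop, args) {
    // Short-hand: always targets the PDFDocument instance.
    return this.ensure(this.pdfDocument, prop, args);
  }
}

// Hence `pdfManager.ensure(pdfManager.pdfDocument, "numPages")` can simply be
// written as `pdfManager.ensureDoc("numPages")`.
```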
- Different fonts can be used in XFA, and some of them are embedded in the PDF.
- Load all the fonts in window.document.
Update src/core/document.js
Co-authored-by: Jonas Jenwald <jonas.jenwald@gmail.com>
Update src/core/worker.js
Co-authored-by: Jonas Jenwald <jonas.jenwald@gmail.com>
There is no asynchronous code involved here, so we can get rid of all done callbacks and simply rely on the fact that if the function call ends without a failed assertion, the test passed.
With the current implementation of `PDFDocument.hasJSActions`, in the worker-thread, we're not actually handling not-yet-loaded data correctly. This can thus fail in *two* different ways:
- The `PDFDocument.fieldObjects` getter (and its helper method), while it may *return* a Promise, still fetches all of its data synchronously and it can thus throw a `MissingDataException` during parsing.
- The `Catalog.jsActions` getter, which is completely synchronous, can obviously throw a `MissingDataException` during parsing.
If either of these cases occurs currently, the `PDFDocumentProxy.hasJSActions` method in the API can either return a *rejected* Promise (which it never should) or possibly "hang" and never resolve.
*Please note:* While I've not *yet* seen this error in an actual PDF document, it can happen during loading if you're unlucky enough with e.g. the structure of the PDF document and/or the download speed offered by the server.
This patch is thus based on code-inspection *and* on manually throwing a `MissingDataException` on the first access of `Catalog.jsActions` to simulate this situation.
Finally, this patch adds a couple of *API* unit-tests for this (since none existed).
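To illustrate the direction of the fix, a sketch (not the exact patch) of how the worker-thread code can route these accesses through the `pdfManager` "ensure" helpers, which catch `MissingDataException` and wait for the missing data rather than letting the error escape; the final field-check below is purely illustrative:

```js
async function hasJSActions(pdfManager) {
  const jsActions = await pdfManager.ensureCatalog("jsActions");
  if (jsActions) {
    return true;
  }
  // The `fieldObjects` getter returns a Promise itself (see above), hence the
  // extra await below.
  const fieldObjects = await pdfManager.ensureDoc("fieldObjects");
  const fields = await fieldObjects;
  return fields !== null; // Illustrative; the real check inspects the fields.
}
```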
This is first of all consistent with existing API-methods, where we return `null` when the data in question doesn't exist. Secondly, it should also be (slightly) more efficient since there's less dummy-data that we need to transfer between threads.
Finally, this prevents us from adding an empty/unnecessary span to *every* single page even in documents without any structure tree data.
The done callbacks are an outdated mechanism to signal Jasmine that a
unit test is done, mostly in cases where a unit test needed to wait for
an asynchronous operation to complete before doing its assertions.
Nowadays a much better mechanism is in place for that, namely simply
passing an asynchronous function to Jasmine, so we don't need callbacks
anymore (which require more code and may be more difficult to reason
about).
In these particular cases though the done callbacks never had any real
use since nothing asynchronous happens in these places. Synchronous
functions don't need to use done callbacks since Jasmine simply knows
it's done when the function reaches its normal end, so we can safely get
rid of these callbacks. The telltale sign is if the done callback is
used unconditionally at the end of the function.
This is all done in an effort to over time get rid of all callbacks in
the unit test code.
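As a simple sketch of the conversion (the test body and `parseSomething` are made up):

```js
// Before: the done callback is invoked unconditionally at the end, which is
// the telltale sign that nothing asynchronous is happening.
it("parses the input", function (done) {
  const result = parseSomething("input");
  expect(result).toEqual("expected");
  done();
});

// After: a plain function suffices; Jasmine knows the test is done when the
// function returns without a failed assertion.
it("parses the input", function () {
  const result = parseSomething("input");
  expect(result).toEqual("expected");
});
```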
- but don't validate them for now;
- Firefox will display a bar to warn that the signature validation is not supported (see https://bugzilla.mozilla.org/show_bug.cgi?id=854315)
- almost all (all?) PDF readers display signatures;
- validation is done in Edge, but for now it's behind a pref.
When a PDF is "marked" we now generate a separate DOM that represents
the structure tree from the PDF. This DOM is inserted into the <canvas>
element and allows screen readers to walk the tree and have more
information about headings, images, links, etc. To link the structure
tree DOM (which is empty) to the text layer, aria-owns is used. This
required modifying the text layer creation so that marked items are
now tracked.
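Roughly, the linking works like the following sketch; the ids, roles and element structure are illustrative, not the actual implementation:

```js
// The structure tree DOM is (visually) empty; each node delegates its content
// to the marked text layer span that actually holds the text, via aria-owns.
const structTreeRoot = document.createElement("span");

const heading = document.createElement("span");
heading.setAttribute("role", "heading");
heading.setAttribute("aria-level", "1");
heading.setAttribute("aria-owns", "page2_mcid0"); // id of a text layer span
structTreeRoot.append(heading);
// structTreeRoot is then inserted into the <canvas> element, as described above.
```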
Currently the `fontName`-property contains an actual /Name-instance, which is a problem given that its fallback value is an empty string; see ca7f546828/src/core/default_appearance.js (L35)
The reason that this is a problem can be seen in ca7f546828/src/core/primitives.js (L30-L34), since an empty string short-circuits the cache. Essentially, in PDF documents, a /Name-instance cannot be empty and the way that the `DefaultAppearanceEvaluator` does things is unfortunately not entirely correct.
Hence the `fontName`-property is changed to instead contain a string, rather than a /Name-instance, which simplifies the code overall.
*Please note:* I'm tagging this patch with "[api-minor]", since PR 12831 is included in the current pre-release (although we're not using the `fontName`-property in the display-layer).
Currently only URL-strings are officially supported by `getDocument`, however at this point in time I cannot really see any compelling reason to not support `URL`-objects as well.
Most likely the reason that we don't already support `URL`-objects, in `getDocument`, is that historically `URL` wasn't fully implemented across browsers and our old polyfill wasn't perfect; see https://developer.mozilla.org/en-US/docs/Web/API/URL/URL#browser_compatibility
*Please note:* Because of how the `url` parameter is currently handled, there's actually *some* cases where passing a `URL`-object to `getDocument` already works. That, in my opinion, provides additional motivation for supporting `URL`-objects officially, since it makes the API more consistent.
The following is an attempt to summarize the *current* situation, based on the actual code rather than the JSDocs:
- `getDocument("url string")` works and is documented.[1]
- `getDocument({ url: "url string", })` works and is documented.[1]
- `getDocument(new URL(...))` throws immediately, since no supported parameters are found.
- `getDocument({ url: new URL(...), })` actually works even though it's not documented.[1] Originally, when data was fetched on the worker-thread, this would likely have thrown since `URL` isn't clonable.[2]
- `getDocument({ url: { abc: 123, }, })`, or some similarly meaningless input, will be "accepted" by `getDocument` and then throw a `MissingPDFException` when attempting to fetch the bogus data.
With the changes in this patch, not only is `URL`-objects now officially supported and documented when calling `getDocument`, but we'll also do a much better job at actually validating any URL-data passed to `getDocument` (and instead fail early).
---
[1] In *browsers*, we create a valid URL, thus indirectly validating the input. In Node.js environments, on the other hand, no validation is done since obtaining a baseUrl is more difficult (and PDF.js is primarily written for browsers anyway).
[2] https://developer.mozilla.org/en-US/docs/Web/API/Web_Workers_API/Structured_clone_algorithm#supported_types
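A usage sketch (the URL is a placeholder):

```js
const url = new URL("https://example.com/file.pdf");

// Both of these now work, and invalid URL-data fails early:
const loadingTask = pdfjsLib.getDocument(url);
// const loadingTask = pdfjsLib.getDocument({ url });

const pdfDocument = await loadingTask.promise;
```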
- implement a few positioning properties: position, width, height, anchor;
- implement font element;
- implement fill element (used by font) and its children (linear, radial, ...);
- the font property is inherited from the ancestor container (see https://www.pdfa.org/wp-content/uploads/2020/07/XFA-3_3.pdf#page=43), so let CSS handle that;
- in order to reduce the number of properties to set, only set non-default properties and put the defaults in CSS (see the sketch after this list);
- set a background to some containers to be able to see them (will be removed in a future commit).
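The "only set non-default properties" point can be pictured with the following sketch; the property names, default values and helper are made up for illustration:

```js
const template = { hAlign: "center", vAlign: "top" }; // example values

function setIfNonDefault(style, name, value, defaultValue) {
  if (value !== defaultValue) {
    style[name] = value;
  }
}

const style = Object.create(null);
setIfNonDefault(style, "textAlign", template.hAlign, "left"); // set (non-default)
setIfNonDefault(style, "verticalAlign", template.vAlign, "top"); // skipped (default)
// The "left"/"top" defaults are declared once in the CSS rules, instead of
// being repeated on every generated element.
```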
- add an option to enable XFA rendering if any;
- for now, keep the canvas layer: it could be useful for implementing XFAF forms (an embedded PDF in an XML stream for the background and an XFA form for the foreground);
- UI elements in the template DOM are pretty close to their HTML counterparts, so we generate a fake HTML DOM from the template one:
  - it makes it easier to translate template properties to HTML ones;
  - it makes the creation of the HTML elements in the main thread faster.
It seems reasonable to place this alongside the *similar* `getFilenameFromUrl` helper function. This way, with the changes in the next patch, we also avoid having to expose the `isDataScheme` function in the API itself and we instead expose `getPdfFilenameFromUrl` in the API (which feels overall more appropriate).
A significant portion of the code-base has now been converted to use `let`/`const`, rather than `var`, hence it should be possible to simply enable the ESLint `no-var` rule globally.
This way we can ensure that new code won't accidentally use `var`, and it also removes the need to manually enable the rule in various folders.
Obviously it makes sense to continue the efforts to replace `var`, but that should probably happen on a file and/or folder basis.
Please note that this patch excludes the following code:
- The `extensions/` folder, since that seemed easiest for now (and I don't know exactly what the support situation is for the Chromium-extension).
- The entire `external/` folder is ignored, since most of it is currently excluded from linting.
For the code that isn't imported from elsewhere (and should be ignored), we should probably (at some point) bring the code up to the same linting/formatting standard as the rest of the code-base.
- Various files in the `test/` folder are ignored, as necessary, since the way that a lot of this code is loaded will require some care (or perhaps larger re-factoring) when removing `var` usage.
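A sketch of the relevant ESLint configuration bits; the excluded paths below are only examples of the kind of exceptions listed above, not the exact configuration:

```js
// .eslintrc.js
module.exports = {
  rules: {
    "no-var": "error", // enabled globally
  },
  overrides: [
    {
      // Folders/files where `var` is still tolerated for now.
      files: ["extensions/**", "test/some-legacy-files/**"],
      rules: {
        "no-var": "off",
      },
    },
  ],
};
```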
- strokeColor corresponds to borderColor;
- support fillColor and textColor;
- support colors on the different annotations;
- fix typo in aforms (+test).
Rather than converting the `AnnotationStorage`-data to an Object, before sending it to the worker-thread, we should be able to simply send the internal `Map` directly.
The "structured clone algorithm" doesn't have a problem with `Map`s, however the `LoopbackPort` used when workers are *disabled* (e.g. in Node.js environments) didn't use to support them. With PR 12997 having lifted that restriction, we should now be able to simply send the `AnnotationStorage`-data as-is rather than having to iterate through it to first create an Object.
*Please note:* The changes in `src/core/annotation.js` could have been a lot more compact if we were able to use optional chaining in the `src/core` folder. Unfortunately that's still not possible, since SystemJS is being used in the development viewer (i.e. `gulp server`) and fixing that is *still* blocked by [bug 1247687](https://bugzilla.mozilla.org/show_bug.cgi?id=1247687).
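A sketch of why this works: the structured clone algorithm handles `Map`s natively, so (with workers enabled, or with the updated `LoopbackPort`) the data can cross the thread boundary as-is. The keys/values below are made up:

```js
const worker = new Worker("pdf.worker.js");

const annotationStorage = new Map([
  ["13R", { value: "hello" }],
  ["16R", { value: true }],
]);
worker.postMessage({ annotationStorage });

// On the worker side the data arrives as a real Map again:
//   onmessage = ({ data }) => {
//     data.annotationStorage instanceof Map; // true
//   };
```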
With the previous patch this function is now *only* accessed on the worker-thread, hence it's no longer necessary to include it in the *built* `pdf.js` file.
With the previous patch this functionality is now *only* accessed on the worker-thread, hence it's no longer necessary to include it in the *built* `pdf.js` file.
The only reason, as far as I can tell, for parsing the Metadata on the main-thread is how it was originally implemented. When Metadata support was first implemented, it utilized the [`DOMParser`](https://developer.mozilla.org/en-US/docs/Web/API/DOMParser) which isn't available in workers.
Today, with the custom XML-parser being used, that's no longer an issue and it seems reasonable to move the Metadata parsing to the worker-thread[1], since that's where all parsing should happen (for performance reasons).
Based on these changes, we'll be able to reduce the now unnecessary duplication of the XML-parser (and related code) in both of the *built* `pdf.js`/`pdf.worker.js` files.
Finally, this patch changes the `_repair` method to use "Array + join" rather than string concatenation.
---
[1] This needed the previous patch, to enable sending of `Map`s between threads with workers disabled.
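For reference, the "Array + join" pattern mentioned for the `_repair` method, sketched with made-up data:

```js
const chunks = ["<x:xmpmeta>", "<rdf:RDF>", "</rdf:RDF>", "</x:xmpmeta>"];

// Instead of repeated string concatenation (`data += chunk`), collect the
// pieces in an Array and join them once at the end.
const buffer = [];
for (const chunk of chunks) {
  buffer.push(chunk);
}
const data = buffer.join("");
```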
* don't set a value in annotationStorage by default:
  - having an undefined value when the annotation is rendered for saving/printing means that nothing has changed, so the normal appearance is used;
  - aims to fix https://bugzilla.mozilla.org/show_bug.cgi?id=1681687
* change the way the font size is computed when it's null in the DA:
  - make fontSize proportional to the line height;
  - in the multiline case, take the number of lines of entered text into account to adapt the font size.
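A sketch of the font-size idea; the constant and names are assumptions for illustration, not the actual implementation:

```js
const LINE_FACTOR = 1.35; // assumed ratio between line height and font size

function computeFontSize(totalHeight, numberOfLines = 1) {
  // Split the available height over the number of lines, then derive a font
  // size that is proportional to that line height.
  const lineHeight = totalHeight / numberOfLines;
  return lineHeight / LINE_FACTOR;
}

computeFontSize(40);    // single-line field
computeFontSize(40, 4); // multiline field with 4 lines of entered text
```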
- the parser is based on a class extending XMLParserBase;
- it handles XML namespaces:
  * each namespace is associated with a builder;
  * a builder builds nodes belonging to its namespace;
  * when a node is inserted into its parent, namespace compatibility is checked (if required);
- to avoid name collisions between XML names and object properties, use `Symbol`s (see the sketch below).
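A minimal sketch of the `Symbol` idea (the names are illustrative):

```js
const $children = Symbol("children");
const $namespaceId = Symbol("namespaceId");

class XFAObject {
  constructor(nsId) {
    this[$namespaceId] = nsId;
    this[$children] = [];
    // Child elements are attached under their XML names (e.g. this.font,
    // this.fill, even this.children), none of which can clash with the
    // Symbol-keyed internal properties above.
  }
}
```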
Note how, in the `if (this.stateManager.stateStack.length !== 0) {` branch, we're attempting to access the not yet defined variable[1] `args`. If this code-path is ever hit, an Error will be thrown and parsing will thus be aborted immediately (likely leading to e.g. rendering bugs).
Note that I found this purely by accident, since I happened to glance at the LGTM report. However, I've since found that the error is also present during the unit-test[2] and with this patch we're actually testing the *intended* thing here.
As part of fixing this, and to avoid re-introducing a similar bug in the future, we'll now instead always reset `args.length` *before* attempting to read the next operator.
Also, we can use the existing `EvaluatorPreprocessor.savedStatesDepth` getter to simplify the save/restore detection a tiny bit.
---
[1] The ESLint rule `no-use-before-define` would have helped catch this problem, but unfortunately we cannot enable that without quite a bit of refactoring all over the code-base.
[2] The unit-test was updated such that it would fail in the `master`-branch.
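The resulting pattern, sketched; `preprocessor` stands in for the `EvaluatorPreprocessor` and is stubbed here so the sketch is self-contained, and the actual code differs in its details:

```js
const preprocessor = {
  ops: [{ fn: "showText", args: ["Hello"] }, { fn: "restore", args: [] }],
  read(operation) {
    const next = this.ops.shift();
    if (!next) {
      return false;
    }
    operation.fn = next.fn;
    operation.args.push(...next.args);
    return true;
  },
};

const operation = { fn: undefined, args: [] };
while (true) {
  operation.args.length = 0; // always reset *before* reading the next operator
  if (!preprocessor.read(operation)) {
    break;
  }
  // ...handle operation.fn / operation.args...
}
```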
- For wrapped scrolling, we unfortunately need to do a fair bit of parsing of the *current* page layout. Compared to e.g. the spread-modes, where we can easily tell how the pages are laid out, with wrapped scrolling we cannot tell without actually checking. In particular documents with varying page sizes require some care, since we need to check that all pages on the "row" of the current page are visible and that there aren't any "holes" present. Otherwise, in the general case, there's a risk that we'd skip over pages if we'd simply always advance to the previous/next "row" in wrapped scrolling.
- For horizontal scrolling, this patch simply maintains the current behaviour of advancing *one* page at a time. The reason for this is to prevent inconsistent behaviour for the next and previous cases, since those cannot be handled identically. For the next-case, it'd obviously be simple to advance to the first not completely visible page. However for the previous-case, we'd only be able to go back *one* page since it's not possible to (easily) determine the page layout of non-visible pages (documents with varying page sizes being a particular issue).
- For vertical scrolling, this patch maintains the current behaviour by default. When spread-modes are being used, we'll now attempt to advance to the next *spread*, rather than just the next page, whenever possible. To prevent skipping over a page, this two-page advance will only apply when both pages of the current spread are visible (to avoid breaking documents with varying page sizes) and when the second page in the current spread is fully visible *horizontally* (to handle larger zoom values).
In order to reduce the performance impact of these changes, note that the previous/next-functionality will only call `getVisibleElements` for the scroll/spread-modes where that's necessary and that "normal" vertical scrolling is thus unaffected by these changes.
To support these changes, the `getVisibleElements` helper function will now also include the `widthPercent` in addition to the existing `percent` property.
The `PDFViewer._updateHelper` method is changed slightly w.r.t. updating the `currentPageNumber` for the non-vertical/spread modes, i.e. won't affect "normal" vertical scrolling, since that helped simplify the overall calculation of the page advance.
Finally, these new `BaseViewer` methods also allow (some) simplification of previous/next-page functionality in various viewer components.
*Please note:* There's one thing that this patch does not attempt to change, namely disabling of the previous/next toolbarButtons respectively the firstPage/lastPage secondaryToolbarButtons. The reason for this is that doing so would add quite a bit of complexity in general, and if for some reason `BaseViewer._getPageAdvance` would get things wrong we could end up incorrectly disabling the buttons. Hence it seemed overall safer to *not* touch this, and accept that the buttons won't be `disabled` despite in some edge-cases no further scrolling being possible.
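To make the spread-advance condition described above a bit more concrete, an illustrative sketch; the object shape mirrors what `getVisibleElements` returns (with the existing `percent` and the new `widthPercent` properties), but the surrounding logic is heavily simplified:

```js
function canAdvanceByTwoPages(firstView, secondView) {
  if (!firstView || !secondView) {
    return false; // Only one page of the current spread exists/is visible.
  }
  return (
    firstView.percent > 0 &&
    secondView.percent > 0 && // both pages of the spread are visible, ...
    secondView.widthPercent === 100 // ...and the second one fully so, horizontally
  );
}
```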
* Add a parser to get font data from the default appearance
- pdfium & poppler also use a special parser to get this information.
* Update src/core/default_appearance.js
Co-authored-by: Jonas Jenwald <jonas.jenwald@gmail.com>
Co-authored-by: Jonas Jenwald <jonas.jenwald@gmail.com>
This patch is a rebased *and* refactored version of PR 9448, such that it applies cleanly given that `PDFFindController` has changed since that PR was opened; obviously keeping the original author information intact.
This patch will thus ensure that e.g. fractions, and other things that we normalize before searching, will still be highlighted correctly in the textLayer.
Furthermore, this patch also adds basic unit-tests for this functionality.
*Note:* The `[api-minor]` tag is added, since third-party implementations of the `PDFFindController` must now always use the `pageMatchesLength` property to get accurate length information (see the `web/text_layer_builder.js` changes).
Co-authored-by: Ross Johnson <ross@mazira.com>
Co-authored-by: Jonas Jenwald <jonas.jenwald@gmail.com>
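A sketch of the `pageMatchesLength` usage that the note above refers to; `findController` and `pageIndex` are assumed to be available, and `highlightRange` is a hypothetical helper:

```js
const matches = findController.pageMatches[pageIndex]; // match start positions
const matchesLength = findController.pageMatchesLength[pageIndex];

for (let i = 0; i < matches.length; i++) {
  const begin = matches[i];
  // With normalization, the matched text can differ in length from the query,
  // so `begin + query.length` is no longer accurate.
  const end = begin + matchesLength[i];
  highlightRange(begin, end);
}
```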
There's a built-in ESLint rule, see `sort-imports`, to ensure that all `import`-statements are sorted alphabetically, since that often helps with readability.
Unfortunately there's no corresponding rule to sort `export`-statements alphabetically, however there's an ESLint plugin which does this; please see https://www.npmjs.com/package/eslint-plugin-sort-exports
The only downside here is that it's not automatically fixable, but the re-ordering is a one-time "cost" and the plugin will help maintain a *consistent* ordering of `export`-statements in the future.
*Note:* To reduce the possibility of introducing any errors here, the re-ordering was done by simply selecting the relevant lines and then using the built-in sort-functionality of my editor.
Given that the API will now, after PR 12039, automatically pick the correct factories to use depending on the environment (browser vs. Node.js), we can utilize that in the unit-tests as well. This way we don't have to manually repeat the same initialization code in *multiple* unit-tests.
*Note:* The *official* PDF.js API is defined in `src/pdf.js`, hence the new exports in `src/display/api.js` will not affect that.
Also, updates the unit-test `FileReaderFactory` helpers similarly.
*Drive-by change:* Fix the `CMapReaderFactory` usage in the annotation unit-tests, since the cache should only contain raw data and not a Promise. While this obviously works as-is, having unit-tests that "abuse" the intended data format can easily lead to unnecessary failures if changes are made to the relevant `src/core/` code.
This follows the same principle as the `once` option that exists in the native `addEventListener` method, and will thus automatically remove an `EventBus` listener when it's invoked; see https://developer.mozilla.org/en-US/docs/Web/API/EventTarget/addEventListener#Parameters
Finally, this patch also tweaks some of the existing `EventBus`-code to use modern features such as optional chaining and logical assignment operators.
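Usage sketch, assuming an `EventBus` instance (the event name is just an example):

```js
eventBus.on(
  "pagerendered",
  evt => {
    console.log(`Page ${evt.pageNumber} has been rendered.`);
  },
  { once: true } // the listener is removed automatically after its first call
);
```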
* the goal is to execute actions like Open or OpenAction
* can be tested with issue6106.pdf (auto-print)
* once #12701 is merged, we can add page actions
* move set/clear|Timeout/Interval and crackURL code in pdf.js
* remove the "backdoor" in the proxy (used to dispatch event) and so return the dispatch function in the initializer
* remove listeners if an error occured during sandbox initialization
* add support for alert and prompt in the sandbox
* add a function to eval in the global scope
This simplifies not just this code, but the unit-tests as well, and should be sufficient as far as I can tell.
Note also that currently, in the *built* `pdf.sandbox.js` file, there's even a line reading `testMode = testMode && false;` because of an accidentally flipped pre-processor statement.
Finally, in the `scripting_spec.js` unit-test, define `sandboxBundleSrc` at the top of the file to make it easier to find and/or change when necessary.
Given that we already include the "Content-Disposition"-header filename, when it exists, it shouldn't hurt to also include the information from the "Content-Length"-header.
For PDF documents opened via a URL, which should be a very common way for the PDF.js library to be used, this will[1] thus provide a way of getting the PDF filesize without having to wait for the `getDownloadInfo`-promise to resolve[2].
With these API improvements, we can also simplify the filesize handling in the `PDFDocumentProperties` class.
---
[1] Assuming that the server is correctly configured, of course.
[2] Since that's not *guaranteed* to happen in general, with e.g. `disableAutoFetch = true` set.
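A usage sketch; note that the exact property name on the `getMetadata()` result is an assumption here, and `pdfDocument` is assumed to be a loaded `PDFDocumentProxy`:

```js
const { contentLength } = await pdfDocument.getMetadata();
if (contentLength) {
  console.log(`File size, from the Content-Length header: ${contentLength}`);
} else {
  // Fall back to waiting for the entire document to download.
  const { length } = await pdfDocument.getDownloadInfo();
  console.log(`File size: ${length}`);
}
```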