pdf.js

Author	SHA1	Message	Date
Tim van der Meij	8ce24744f2	Merge pull request #9769 from Snuffleupagus/node-unittest-rm-console-errors Reduce the amount of errors logged, primarily in Node.js/Travis, when running the unit-tests	2018-06-03 19:50:42 +02:00
Rob Wu	0e4e79169b	Fall back to ISO-8859-1 in content_disposition.js Updates content_disposition.js to include `9b789d9b3b`	2018-06-03 16:17:28 +02:00
Rob Wu	e992480baa	Fix multibyte decoding in content_disposition.js I made some mistakes when trying to make the content_disposition.js compatible with non-modern browsers (IE/Edge). Notably, text decoding was usually skipped because of the inverted logical check at the top of `textdecode`. I verified that this new version works as expected, as follows: 1. Visit `55c71eb44e/test/` and get test-content-disposition.js also get test-content-disposition.node.js if using Node.js, or get test-content-disposition.html if you use a browser. 2. Modify `test-content-disposition.node.js` (or the HTML file) and change `../extension/content-disposition.js` to `PDFJS-content_disposition.js` 3. Copy the `getFilenameFromContentDispositionHeader` function from `content_disposition.js` (i.e. the file without the trailing exports) and save it as `PDFJS-content_disposition.js`. 4. Run the tests (`node test-content-disposition.node.js` or by opening `test-content-disposition.html` in a browser). 5. Confirm that there are no failures: "Finished all tests (0 failures)" The code has a best-efforts fallback for Microsoft Edge, which lacks the TextDecoder API. The fallback only supports the common UTF-8 encoding. To simulate this in a test, modify `PDFJS-content_disposition.js` and deliberately throw an error before `new TextDecoder`. There will be two failures because we don't want to include too much code to support text decoding for non-UTF-8 encodings in Edge ``` test-content-disposition.js:265 Assertion failed: Input: attachment; filename=ISO-8859-1''%c3%a4 Expected: "Ã¤" Actual : "ä" test-content-disposition.js:268 Assertion failed: Input: attachment; filename=ISO-8859-1''%e2%82%ac Expected: "â‚¬" Actual : "€" ```	2018-06-03 15:28:22 +02:00
Jonas Jenwald	ef081a0531	Ensure that the `WorkerTransport._passwordCapability` is always rejected, even when errors are thrown in `PDFDocumentLoadingTask.onPassword` callback Please note that while the current code works, both in the viewer and the unit-tests, it can leave the `WorkerTransport._passwordCapability` Promise in a pending state. In the `PasswordRequest` handler, in src/display/api.js, we're returning the Promise from a `capability` object (rather than just a "plain" Promise). While an error thrown anywhere within this handler was fortunately enough to propagate it to the Worker side, it won't cause the Promise (in `WorkerTransport._passwordCapability`) to actually be rejected. Finally note that while we're now catching errors in the `PasswordRequest` handler, those errors are still propagated to the Worker side via the (now) rejected Promise and the existing `return this._passwordCapability.promise;` line. This prevents warnings about uncaught Promises, with messages such as "Error: Worker was destroyed during onPassword callback", when running the unit-tests both in browsers and in Node.js/Travis.	2018-06-03 00:28:40 +02:00
Jonas Jenwald	0ecc22cb04	Attempt to provide better default values for the `disableFontFace`/`nativeImageDecoderSupport` API options in Node.js This should provide a better out-of-the-box experience when using PDF.js in a Node.js environment, since it's missing native support for both `@font-face` and `Image`. Please note that this change only affects the default values, hence it's still possible for an API consumer to override those values when calling `getDocument`. Also, prevents "ReferenceError: document is not defined" errors, when running the unit-tests in Node.js/Travis.	2018-06-03 00:28:37 +02:00
Tim van der Meij	36af85db92	Merge pull request #9740 from pedrotp/replace-get-getArray Use Dict.getArray, instead of Dict.get, when getting the 'Size' in constructSampled in src/core/function.js	2018-06-02 19:50:09 +02:00
pedrotp	a190d21dd7	Use Dict.getArray, instead of Dict.get, when getting the 'Size' in constructSampled in src/core/function.js (PR 7295 follow-up)	2018-06-02 11:16:05 -04:00
Jonas Jenwald	83ff7d9de9	Simplify the DNL (Define Number of Lines) marker warning in `JpegImage.parse`	2018-05-30 22:40:11 +02:00
Jonas Jenwald	620f65488b	Ignore the rest of the image when encountering an EOI (End of Image) marker while parsing Scan data (issue 9679)	2018-05-30 22:40:11 +02:00
Tim van der Meij	c5d5d29b03	Merge pull request #9756 from Snuffleupagus/Type1Parser-rm-makeSubStream Remove usage of `makeSubStream` from `Type1Parser.extractFontProgram` in src/core/type1_parser.js (issue 9735)	2018-05-28 23:34:57 +02:00
Jonas Jenwald	f68f60099e	Remove usage of `makeSubStream` from `Type1Parser.extractFontProgram` in src/core/type1_parser.js (issue 9735) This avoids the initialization of, potentially thousands of, unnecessary `Stream` objects, by getting the required number of bytes directly instead. Given the special behaviour, when `length === 0`, of the `getBytes`/`skip` methods, it's also necessary to handle that particular case to prevent errors when encountering empty CharStrings.	2018-05-28 14:32:20 +02:00
Mukul Mishra	949c3e9417	Add abort functionality in fetch stream	2018-05-22 12:46:59 +05:30
Jani Pehkonen	fe2cf2f73f	SVG clip intersections and operators	2018-04-17 19:20:29 +03:00
Brendan Dahl	2dc4af525d	Merge pull request #9659 from yurydelendik/rm-createFromIR Remove createFromIR from PDFFunctionFactory	2018-04-12 14:22:43 -07:00
Yury Delendik	20085aaa5e	Remove createFromIR from PDFFunctionFactory; forgive invalid Dict values.	2018-04-10 18:49:31 -05:00
Jani Pehkonen	8ea505545a	Use FDSelect and FDArray when converting CFF CID font to paths	2018-04-10 16:44:42 +03:00
Brendan Dahl	e8cf7fd512	Merge pull request #9624 from wojtekmaj/no-warning-on-dependency-operator Prevent warning on unimplemented operator thrown for OPS.dependency	2018-04-03 10:55:29 -07:00
Wojciech Maj	acc0a0fe95	Prevent warning on unimplemented operator thrown for OPS.dependency	2018-04-02 14:29:34 +02:00
Wojciech Maj	ea2850e9a7	Fix typos	2018-04-01 23:20:41 +02:00
Tim van der Meij	8887a09e8f	Merge pull request #9588 from swftvsn/patch-1 Improve node.js support	2018-04-01 12:26:39 +02:00
Jonas Jenwald	8b09f7c34e	Clean-up `getMainThreadWorkerMessageHandler` for non-PRODUCTION mode This is a final piece of clean-up of code that I recently wrote, after which I'm done :-) When the `getMainThreadWorkerMessageHandler` function was added, in PR 9385, it did so by basically introducing a `web/app.js` dependency in `src/display/api.js` through the `window.pdfjsNonProductionPdfWorker` property[1]. Even though this is limited to non-`PRODUCTION` mode, i.e. `gulp server`, it still seems unfortunate to have that sort of viewer dependency in the API code itself. With the new, much nicer and shorter, names introduced in PR 9565 we can remove this non-`PRODUCTION` hack and just use `window.pdfjsWorker` in both the viewer and the API regardless of the build mode. --- [1] It didn't seem correct to piggy-back on the `window.pdfjsDistBuildPdfWorker` property in non-`PRODUCTION` mode.	2018-03-29 11:03:47 +02:00
Tim van der Meij	5c1a16ba6e	Merge pull request #9586 from Snuffleupagus/pageSize-api-rotate Ensure that `PDFPageProxy.pageSizeInches` handles non-default /Rotate entries correctly	2018-03-25 18:03:32 +02:00
Jonas Jenwald	d547936827	Ensure that `PDFPageProxy.pageSizeInches` handles non-default /Rotate entries correctly Without this patch, the pageSize will be incorrectly reported for some PDF files. --- Move pageSizeInches to ui_utils	2018-03-25 16:48:29 +02:00
Tim van der Meij	115fbc47fe	Merge pull request #9594 from Snuffleupagus/pageLabel-validation Add stricter validation in `Catalog.readPageLabels`	2018-03-24 19:40:49 +01:00
Brendan Dahl	24f766b14d	Merge pull request #9573 from yurydelendik/xml_parser New XML parser	2018-03-21 17:00:00 -07:00
Jonas Jenwald	374d074f6e	Add stricter validation in `Catalog.readPageLabels` The current PageLabel dictionary validation code won't catch some (unlikely) forms of corruption. For example: a `Type`/`S` entry being `null`/`0`/empty string, a `P`/`St` entry being `null`/`0`. Please note: I'm not aware of any bugs caused by the old code, but I've had this patch sitting locally for some time and figured it couldn't hurt to submit it.	2018-03-21 14:36:05 +01:00
swftvsn	c20426efef	Improve node.js support This change fixes "Unhandled rejection ReferenceError: HTMLElement is not defined" issue that is discussed in more detail in #8489.	2018-03-21 13:43:53 +02:00
Brendan Dahl	63c7aee112	Merge pull request #9565 from brendandahl/new-name Rename the globals to shorter names.	2018-03-20 13:49:04 -07:00
Yury Delendik	655c8d34d0	New XML parser	2018-03-19 20:51:41 -05:00
Jonas Jenwald	e0ae157582	[api-minor] Fix various issues related to the pageSize information The `getPageSizeInches` method was implemented on `PDFDocumentProxy`, which seems conceptually wrong since the size property isn't global to the document but rather specific to each page. Hence the method is moved into `PDFPageProxy`, as `get pageSizeInches` instead to address this. Despite the fact that new API functionality was implemented, no unit-tests were added. To prevent issues later on, we should always ensure that new functionality has at least some test-coverage; something that this patch also takes care of. The new `PDFDocumentProperties._parsePageSize` method seemed unnecessary convoluted. Furthermore, in the "no data provided"-case it even returned incorrect data (an array, rather than the expected object). Finally, the fallback strings didn't actually agree with the `en-US` locale. This inconsistency doesn't look too great, and it's thus addressed here as well.	2018-03-18 09:10:19 +01:00
Brendan Dahl	01bff1a81d	Rename the globals to shorter names. pdfjsDistBuildPdf=pdfjsLib pdfjsDistWebPdfViewer=pdfjsViewer pdfjsDistBuildPdfWorker=pdfjsWorker	2018-03-16 11:08:56 -07:00
Jonas Jenwald	d431ae069d	Attempt to handle corrupt PDF documents that inline Page dictionaries in a Kids array (issue 9540) According to the specification, see https://www.adobe.com/content/dam/acom/en/devnet/acrobat/pdfs/PDF32000_2008.pdf#G6.1942297, the contents of a Kids array should be indirect objects.	2018-03-12 14:13:23 +01:00
Tim van der Meij	6662985a20	Merge pull request #9541 from Rob--W/crx-fetch-chrome61plus [CRX] Disable fetch in Chrome 60-	2018-03-10 00:15:47 +01:00
Tim van der Meij	fded22c618	Merge pull request #9509 from timvandermeij/document Implement a single `getInheritableProperty` utility function	2018-03-08 22:40:27 +01:00
Yury Delendik	e0fb18a339	Merge pull request #9508 from pal03377/file-info-page-size Add paper size to document information/properties	2018-03-08 11:57:38 -06:00
Rob Wu	db428004f4	[CRX] Disable fetch in Chrome 60- Chrome 60 and earlier does not include credentials (cookies) in requests made with fetch, regardless of extension permissions. This was fixed in 61.0.3138.0 by `2e231cf052` This patch disables the fetch backend in all affected Chrome versions. The browser detection is done by checking for a change that coincides with the release of Chrome 61. Test case: 1. Copy the `isChromeWithFetchCredentials` function from the patch. 2. Run it in the JS console of Chrome and verify the return value. Verified results: - 49.0.2623.75 - false (earliest supported version by us) - 60.0.3112.90 - false (last major version affected by bug) - 61.0.3163.100 - true (first major version without bug) - 65.0.3325.146 - true (current stable) Test case 2: 1. Build the extension (`gulp chromium`) and load it in Chrome. 2. Open the developer tools, and open any PDF file. 3. In the "Network tab" of the developer tools, look at "request type". In Chrome 60-: Should be "xhr" In Chrome 61+: Should be "fetch"	2018-03-08 18:27:30 +01:00
palsch	8558c5b1d9	Add page size to the document properties dialog	2018-03-08 18:23:47 +01:00
Tim van der Meij	f308d73d40	Implement a single `getInheritableProperty` utility function This function combines the logic of two separate methods into one. The loop limit is also a good thing to have for the calls in `src/core/annotation.js`. Moreover, since this is important functionality, a set of unit tests and documentation is added.	2018-03-03 19:19:39 +01:00
Tim van der Meij	4e5eb59a33	Remove the `getPageProp` method in `src/core/document.js` It's only used in two places in the class and those callsites can directly get the information from the dictionary, which is more readable and avoids an additional method call.	2018-03-03 14:57:42 +01:00
Jonas Jenwald	b8606abbc1	[api-major] Completely remove the global `PDFJS` object	2018-03-01 18:13:27 +01:00
Jonas Jenwald	4b4fcecf70	Ensure that we only pass in the necessary parameters when initializing `PDFDataTransportStream`/`PDFNetworkStream` in `src/display/api.js` With options being moved from the global `PDFJS` object and into `getDocument`, a side-effect is that we're now passing in a fair number of useless parameters to the various transport/network streams. Even though this doesn't currently cause any problems, it nonetheless seem like a good idea to explicitly provide the parameters that are actually necessary.	2018-03-01 18:11:17 +01:00
Jonas Jenwald	212553840f	Move the `pdfBug` option from the global `PDFJS` object and into `getDocument` instead Also removes the now unused `getDefaultSetting` helper function.	2018-03-01 18:11:17 +01:00
Jonas Jenwald	1d03ad0060	Move the `disableCreateObjectURL` option from the global `PDFJS` object and into `getDocument` instead	2018-03-01 18:11:17 +01:00
Jonas Jenwald	05c05bdef5	Move the `disableStream` option from the global `PDFJS` object and into `getDocument` instead	2018-03-01 18:11:16 +01:00
Jonas Jenwald	b69abf1111	Move the `disableRange` option from the global `PDFJS` object and into `getDocument` instead	2018-03-01 18:11:16 +01:00
Jonas Jenwald	69d7191034	Move the `disableAutoFetch` option from the global `PDFJS` object and into `getDocument` instead One additional complication with removing this option from the global `PDFJS` object, is that the viewer currently needs to check `disableAutoFetch` in a couple of places. To address this I'm thus proposing adding a getter in `PDFDocumentProxy`, to allow checking the actually used values for a particular `getDocument` invocation.	2018-03-01 18:11:16 +01:00
Jonas Jenwald	c7c583583b	Move the `disableFontFace` option from the global `PDFJS` object and into `getDocument` instead	2018-03-01 18:11:16 +01:00
Jonas Jenwald	f3900c4e57	Move the `isEvalSupported` option from the global `PDFJS` object and into `getDocument` instead	2018-03-01 18:11:16 +01:00
Jonas Jenwald	3c2fbdffe6	Move the `cMapUrl` and `cMapPacked` options from the global `PDFJS` object and into `getDocument` instead	2018-03-01 18:11:16 +01:00
Jonas Jenwald	b674409397	Move the `maxImageSize` option from the global `PDFJS` object and into `getDocument` instead	2018-03-01 18:11:16 +01:00

1 2 3 4 5 ...

3230 Commits