pdf.js

Author	SHA1	Message	Date
Jonas Jenwald	f297e4d17c	[api-minor] Add a parameter to `PDFPageProxy_getTextContent` that controls whether `PartialEvaluator_getTextContent` will attempt to combine same line text items From the discussion in issue 7445, it seems that there may be cases where an API consumer would want to get the text content as is, without combined text items.	2016-07-19 13:38:57 +02:00
Jonas Jenwald	72c1df726e	Add a `getAttachments` unit-test for a PDF file that actually contains attachments	2016-07-02 13:13:30 +02:00
Tim van der Meij	f97d52182a	Merge pull request #7341 from Snuffleupagus/getDestinationHash-Array [api-minor] Improve handling of links that are using explicit destination arrays	2016-06-09 00:29:10 +02:00
Jonas Jenwald	6260fc09a3	Attempt to recover valid `format 3` FDSelect data from broken CFF fonts (bug 1146106) According to the CFF specification, see http://partners.adobe.com/public/developer/en/font/5176.CFF.pdf#G3.46884, for `format 3` FDSelect data: "The first range must have a ‘first’ GID of 0". Since the PDF file (attached in the bug) violates that part of the specification, this patch tries to recover valid FDSelect data to prevent OTS from rejecting the font. Fixes https://bugzilla.mozilla.org/show_bug.cgi?id=1146106.	2016-06-06 18:20:52 +02:00
Jonas Jenwald	98fe094d18	Let non-viewable Popup Annotations inherit the parent's Annotation Flags if the parent is viewable Fixes http://www.pdf-archive.com/2013/09/30/file2/file2.pdf. Note how it's not possible to show the various Popup Annotations in the above document. To fix that, this patch lets the Popup inherit the flags of the parent, in the special case where the parent is `viewable` and the Popup is not. In general, I don't think that a Popup must have the same flags set as the parent. However, it seems very strange to have a `viewable` parent annotation, and then not being able to view the Popup. Annoyingly the PDF specification doesn't, as far as I can find, mention anything about how this case should be handled, but this patch seem consistent with the actual behaviour in Adobe Reader.	2016-05-25 23:00:26 +02:00
Brendan Dahl	b86610ffdb	Merge pull request #7300 from Snuffleupagus/bug-1068432 Prevent adding invalid values in `CFFDict_setByKey` (bug 1068432)	2016-05-24 12:12:38 -07:00
Jonas Jenwald	b354682dd6	[api-minor] Let `LinkAnnotation`/`PDFLinkService_getDestinationHash` return a stringified version of the destination array for explicit destinations Currently for explicit destinations, compared to named destinations, we manually try to build a hash that often times is a quite poor representation of the actual destination. (Currently this only, kind of, works for `\XYZ` destinations.) For PDF files using explicit destinations, this can make it difficult/impossible to obtain a link to a specific section of the document through the URL. Note that in practice most PDF files, especially newer ones, use named destinations and these are thus unnaffected by this patch. This patch also fixes an existing issue in `PDFLinkService_getDestinationHash`, where a named destination consisting of only a number would not be handled correctly. With the added, and already existing, type checks in place for destinations, I really don't think that this patch exposes any "sensitive" internal destination code not already accessible through normal hash parameters. Please note: Just trying to improve the algorithm that generates the hash is unfortunately not possible in general, since there are a number of cases where it will simply never work well. - First of all, note that `getDestinationHash` currently relies on the `_pagesRefCache`, hence it's possible that the hash returned is empty during e.g. ranged/streamed loading of a PDF file. - Second of all, the currently computed hash is actually dependent on the document rotation. With named destinations, the fetched internal destination array is rotational invariant (as it should be), but this will not hold in general for the hash. We can easily avoid this issue by using a stringified destination array. - Third of all, note that according to the PDF specification[1], `GoToR` destinations may actually contain explicit destination arrays. Since we cannot really construct a hash in `annotation.js`, we currently have no good way to support those. Even though this case seems very rare in practice (I've not actually seen such a PDF file), it's in the specification, and this patch allows us to support that for "free". --- [1] http://www.adobe.com/content/dam/Adobe/en/devnet/acrobat/pdfs/PDF32000_2008.pdf#G11.1951685	2016-05-21 14:14:07 +02:00
Jonas Jenwald	01ab15a6f1	[api-minor] Let `Catalog_getPageIndex` check that the `Ref` actually points to a /Page dictionary Currently the `getPageIndex` method will happily return `0`, even if the `Ref` parameter doesn't actually point to a proper /Page dictionary. Having the API trust that the consumer is doing the right thing seems error-prone, hence this patch which adds a check for this case. Given that the `Catalog_getPageIndex` method isn't used in any hot part of the codebase, this extra check shouldn't be a problem. (Note: in the standard viewer, it is only ever used from `PDFLinkService_navigateTo` if a destination needs to be resolved during document loading, which isn't common enough to be an issue IMHO.)	2016-05-21 14:13:41 +02:00
Tim van der Meij	db46829ef7	Merge pull request #7316 from timvandermeij/remove-unused Remove unused variables	2016-05-21 14:07:33 +02:00
Jonas Jenwald	c5c5a2a71f	Add basic unit-tests for unicode.js Re: issue 7261.	2016-05-19 19:45:45 +02:00
Tim van der Meij	6a7012aaca	Remove unused variables These have been found using `gulp lint` in combination with the `unused: true` parameter for JSHint. Unfortunately there are too many false positives to enable this feature, but now that most globals have been removed because of the conversion to UMD the results are much more useful than before.	2016-05-11 16:11:13 +02:00
Jonas Jenwald	c9b6de3b16	Prevent adding invalid values in `CFFDict_setByKey` (bug 1068432) In the font in question, there are a couple of `topDict` entries that have invalid values (`0xF 0xF`, i.e. just eof markers without any actual numbers). This causes the `parseFloatOperand` function, inside `CFFParser_parseDict`, to return `NaN`. Currently we pass this broken font onto the browser, which OTS unsurprisingly rejects. Fixes https://bugzilla.mozilla.org/show_bug.cgi?id=1068432.	2016-05-07 21:09:58 +02:00
Jonas Jenwald	29c4a604af	Split the font_spec.js unit-tests into cff_parser_spec.js and type1_parser_spec.js Re: issue 7261. Given the we have `gulp fonttest`, which tests the `fonts.js` functionality at a higher level, and that we have a lot of font specific reference tests, I'm not convinced that we also need unit-tests for it.	2016-05-03 09:37:36 +02:00
Yury Delendik	7fd3db9977	Adds EventBus.	2016-04-28 06:57:24 -05:00
Jonas Jenwald	b4a17323b6	Move `isDict` unit-tests from util_spec.js to primitives_spec.js This patch moves the unit-test to the correct file, since the `isDict` function was moved PR 6683.	2016-04-16 20:32:46 +02:00
Jonas Jenwald	4523ae0b91	Add a couple of `CipherTransformFactory` unit-tests to check that blank passwords are correctly rejected	2016-04-16 20:24:55 +02:00
Jonas Jenwald	171f908b89	Add a couple of `LinkAnnotation` unit-tests We currently don't have any unit-tests for `LinkAnnotation`s, so it seemed a good idea to add a few. These tests are taken from various actual PDF files.	2016-04-15 22:59:08 +02:00
Prakash Palanisamy	a25c29d98d	Remove `combineUrl` and replace it with `new URL`.	2016-04-15 21:33:10 +05:30
Yury Delendik	6282ec24d1	Merge pull request #7172 from yurydelendik/umd-web Introduces UMD headers to the web/ folder.	2016-04-13 10:23:23 -05:00
Yury Delendik	006e8fb59d	Introduces UMD headers to the web/ folder.	2016-04-13 10:09:48 -05:00
Yury Delendik	879340d741	Removes hijack describe() hack from unit tests.	2016-04-11 07:37:35 -05:00
Yury Delendik	44c63bca28	Merge pull request #7175 from Snuffleupagus/issue-6905-font_spec Use `beforeAll`/`afterAll` in font_spec.js (issue 6905)	2016-04-11 07:20:45 -05:00
Yury Delendik	070f2d32ad	Merge pull request #7171 from Snuffleupagus/remove-new-Name/Cmd Remove the remaining usages of `new {Name,Cmd}` in favor of `{Name,Cmd}.get`	2016-04-11 07:17:30 -05:00
Jonas Jenwald	c4e21c93a2	Use `beforeAll`/`afterAll` in font_spec.js (issue 6905) This patch fixes the only remaining point in issue 6905.	2016-04-10 16:09:11 +02:00
Jonas Jenwald	b0ce83b372	Use `beforeAll`/`afterAll` in `CipherTransformFactory` in crypto_spec.js (issue 6905 This patch also adds/improves utility functions for checking if the passwords are correct/incorrect, and replaces `string2binary` with `stringToBytes`. Finally the patch does away with the `DictMock`, in favour of using actual `Dict`s. Re: issue 6905.	2016-04-10 13:20:21 +02:00
Jonas Jenwald	f59c3a0644	Remove the remaining usages of `new {Name,Cmd}` in favor of `{Name,Cmd}.get` Using `new {Name,Cmd}` should be avoided, since it creates a new object on every call, whereas `{Name,Cmd}.get` uses caches to only create one object regardless of how many times they are called. Most of these are found in the unit-tests, where increased memory usage probably doesn't matter very much. But it still seems good to get rid of those cases, since no part of the codebase ought to advertise that usage. Given the small size of the patch, I'm also tweaking a few comments and class names.	2016-04-08 12:14:05 +02:00
Jonas Jenwald	c6c5b8fab8	Use `beforeAll`/`afterAll` in `isExternalLinkTargetSet` in dom_utils_spec.js (issue 6905) Re: issue 6905.	2016-04-07 14:00:40 +02:00
Jonas Jenwald	ef551e8266	Extract `Type1Parser` from fonts.js	2016-04-01 23:38:53 +02:00
Jonas Jenwald	b961e1d21b	Extract `CFFParser` from fonts.js (issue 6777)	2016-04-01 22:32:39 +02:00
Jonas Jenwald	7163e1eff3	Faster unit-tests by using `beforeAll`/`afterAll` in api_spec.js In the API unit-tests, we're currently loading the `basicapi.pdf` before every sub-test in `PDFDocument` and `Page`, which slows down the unit-tests quite a bit. Locally this patch reduces the run time for `gulp unittest` by at least 40% for me.	2016-03-30 15:32:01 +02:00
Jonas Jenwald	ac772017b6	Add unit-tests for destionations in /Names (NameTree) dictionaries where all entries are indirect objects Re: issue 6204 and PR 6208.	2016-03-29 17:55:05 +02:00
Yury Delendik	0a700fa29d	Updates Jasmine version.	2016-03-29 09:34:13 -05:00
Yury Delendik	a8e5912cb1	Moves shared/global to display/global	2016-03-23 19:24:37 -05:00
Yury Delendik	bda5e6235e	Removes global PDFJS usage from the src/core/.	2016-03-23 19:24:37 -05:00
Manas	f6d28ca323	Refactors CMapFactory.create to make it async	2016-03-21 23:08:19 +05:30
Yury Delendik	22341c0761	Merge pull request #6879 from yurydelendik/streams Makes PDF data reading Streams API friendly.	2016-03-01 09:10:52 -06:00
Jonas Jenwald	41efb92d3a	Merge pull request #6988 from timvandermeij/fileattachment-annotation Implement support for FileAttachment annotations	2016-02-24 12:58:06 +01:00
Tim van der Meij	0351c7eff4	Move the `getFileName` helper function to the core This is required to be able to use it in the annotation display code, where we now apply it to sanitize the filename of the FileAttachment annotation. The PDF file from https://bugzilla.mozilla.org/show_bug.cgi?id=1230933 has shown that some PDF generators include the path of the file rather than the filename, which causes filenames with weird initial characters. PDF viewers handle this differently (for example Foxit Reader just replaces forward slashes with spaces), but we think it's better to only show the filename as intended. Additionally we add unit tests for the `getFilenameFromUrl` helper function.	2016-02-23 22:49:53 +01:00
Tim van der Meij	10902fd882	Implement unit and reference testing for FileAttachment annotations	2016-02-23 22:49:53 +01:00
Yury Delendik	0d591719d9	Makes PDF data reading Streams API friendly.	2016-02-18 13:17:53 -06:00
Jonas Jenwald	7cf9de2c17	[api-minor] Change `getOutline` to actually return the RGB color of outline items Currently the `C` entry in an outline item is returned as is, which is neither particularly useful nor what the API documentation claims. This patch also adds unit-tests for both the color handling, and the `F` entry (bold/italic flags).	2016-02-15 13:41:22 +01:00
Jonas Jenwald	98db068079	Reduce the overall indentation level in `Catalog_readDocumentOutline`, by using early returns, in order to improve readability	2016-02-14 11:38:43 +01:00
Tim van der Meij	5bcf4c1895	Destroy workers when they are no longer needed in the unit tests	2016-01-29 12:23:17 +01:00
Jonas Jenwald	1140a34f5c	[api-minor] Change `getPageLabels` to always return the pageLabels, even if they are identical to standard page numbering	2016-01-27 13:36:03 +01:00
Jonas Jenwald	85cf90643f	[api-minor] Add support for PageLabels in the API	2016-01-19 22:49:04 +01:00
Jonas Jenwald	0030a82dc3	[api-minor] Add support for URLs in the document outline Re: issue 5089. (Note that since there are other outline features that we currently don't support, e.g. bold/italic text and custom colours, I thus think we can keep the referenced issue open.)	2016-01-19 21:36:27 +01:00
Tim van der Meij	4399d01169	Merge pull request #6834 from Snuffleupagus/issue-6832 Strip `null` (\x00) characters from the URLs in LinkAnnotations (issue 6832)	2016-01-05 23:59:25 +01:00
Brendan Dahl	eb7c36beb6	Add validation for callsubr and callgsubr for type 2 charstrings.	2016-01-05 09:54:25 -08:00
Jonas Jenwald	97c10e9c08	Strip `null` (\x00) characters from the URLs in LinkAnnotations (issue 6832) Apparently some PDF files can have annotations with `URI` entries ending with `null` characters, thus breaking the links. To handle this edge-case of bad PDFs, this patch moves the already existing utility function from `ui_utils.js` into `util.js`, in order to fix those URLs. Fixes 6832.	2016-01-04 21:55:20 +01:00
Yury Delendik	85e95d34ed	Use RequireJS in the viewer, examples and tests.	2015-12-29 09:20:52 -06:00

1 2 3 4 5 ...

292 Commits