pdf.js

Author	SHA1	Message	Date
Calixte Denizet	6db9cefaaf	[Annotation] Replace use of id by data-element-id to have the correct id	2022-10-19 23:36:28 +02:00
Jonas Jenwald	3c046c0a21	Extend `getSupplementalGlyphMapForCalibri` with some umlauts (issue 15594)	2022-10-19 17:49:40 +02:00
Jonas Jenwald	bc13a277ce	Relax the /Pages dictionary /Count check for corrupt documents (issue 9105) After PR 14311, and follow-up patches, we no longer require that the /Count entry (in the /Pages dictionary) is either present or even valid in order to parse/render a PDF document. Hence it seems strange to keep this requirement for corrupt PDF documents, when trying to find a usable `trailer` in the `XRef.indexObjects` method.	2022-10-19 12:28:25 +02:00
Jonas Jenwald	de99f99a01	Fallback and try a previous generation if all else fails in `XRef.indexObjects` (issue 15577) When we fail to find a usable PDF document `trailer` and there were errors during parsing, try to fall back to a previous generation as a last resort when fetching uncompressed references. Please note: This will not affect "normal" PDF documents, with valid /XRef data, and even most corrupt documents should be completely unaffected by these changes.	2022-10-18 20:24:01 +02:00
Calixte Denizet	556513a6e7	Use all the current transform as key when caching some image for masks used with pattern fill (bug 1795263, #15573)	2022-10-14 14:37:58 +02:00
Jonas Jenwald	858d941ff8	Take the /CIDToGIDMap into account when getting the glyph mapping for CFF fonts (issue 15559) Please note: I don't really know what I'm doing here, however the patch appears to fix the referenced issue when comparing the rendering with Adobe Reader (with the caveat that I don't speak the language in question).	2022-10-13 10:02:25 +02:00
Jonas Jenwald	081e897588	Ensure that `Page.getOperatorList` handles Annotation parsing errors correctly (issue 15557) Fixes a regression from PR 15246, sorry about that! The return value of all `Annotation.getOperatorList` methods was changed in PR 15246, however I missed updating the error code-path in `Page.getOperatorList` which thus breaks all operatorList-parsing for pages with corrupt Annotations.	2022-10-10 09:48:01 +02:00
Jonas Jenwald	ce66fefbff	[api-minor] Add partial support for the "GoToE" action (issue 8844) Please note: The referenced issue is the only mention that I can find, in either GitHub or Bugzilla, of "GoToE" actions. Hence why I've purposely settled for a very simple, and partial, "GoToE" implementation to avoid complicating things initially.[1] In particular, this patch only supports "GoToE" actions that reference the /EmbeddedFiles-dict in the PDF document. See https://web.archive.org/web/20220309040754if_/https://www.adobe.com/content/dam/acom/en/devnet/pdf/pdfs/PDF32000_2008.pdf#G11.2048909 --- [1] Usually I always prefer having real-world test-cases to work with, whenever I'm implementing new features.	2022-10-06 10:33:07 +02:00
Jonas Jenwald	c87f90102c	Add more non-standard ligatures in the `glyphlist.js` file (issue 15516) Note that this PR only adds the "underscore"-variant of actually existing ligatures, however the referenced PDF document also uses a couple of non-standard ones (e.g. `ft`, `Th`, and `fh`) that we cannot easily support without larger changes (since they don't have official Unicode-entries). Given that it's clearly the PDF document, and its fonts, that are the culprit here it's not entirely clear to me that we actually want to attempt a larger refactoring/rewriting of the `glyphlist.js` code, assuming it's even generally possible. Especially when this patch alone already improves our copy-paste behaviour when compared to both Adobe Reader and PDFium, and that this is only the second time this sort of bug has been reported.	2022-09-27 16:31:51 +02:00
Jonas Jenwald	f1b0dc6f04	Tweak the heuristic that handles JPEG images with a wildly incorrect SOF (Start of Frame) `scanLines` parameter (issue 15492)	2022-09-22 14:09:04 +02:00
Calixte Denizet	198e9a3db1	Initialize values in the path bounding box before flushing the operator list (bug 1791583) OperatorList.addOp can trigger a flush if it's required, hence the values passed to it must be correctly initialized in order to avoid some wrong values in the renderer. Because of that a clip path was considered as empty, nothing was clipped, hence the wrong rendering in bug 1791583.	2022-09-20 20:01:54 +02:00
Jonas Jenwald	7a19def34c	Extend `getSupplementalGlyphMapForCalibri` with more entries (issue 15443)	2022-09-15 22:19:16 +02:00
Jonas Jenwald	2f2ecad8fd	Extend `getGlyphMapForStandardFonts` with some quote-entries (issue 15441)	2022-09-15 11:37:20 +02:00
Jonas Jenwald	947d390421	Fallback to a standard font when a Type1 font program is empty (issue 15292) Please note: This is only a, hopefully generally helpful, work-around rather than a proper solution to issue 15292. There's something that's "special" about the Type1 fonts in the referenced PDF document, since we don't manage to find any actual font programs and thus cannot render anything. Given that it shouldn't make sense for a Type1 font program to ever be empty, since that means that there's no glyph-data to render, we simply fall back to a standard font to at least try and render something in these rare cases.	2022-09-05 12:07:19 +02:00
Jonas Jenwald	9578152ae4	Merge pull request #15392 from Snuffleupagus/issue-15352 Don't allow `adjustToUnicode` to extend a built-in /ToUnicode map (issue 15352)	2022-09-04 15:12:10 +02:00
Calixte Denizet	6c6f6fb2b8	Don't replace cr by a white space when the last char on the line is an ideographic char	2022-09-04 14:21:05 +02:00
Jonas Jenwald	12d60e0acf	Don't allow `adjustToUnicode` to extend a built-in /ToUnicode map (issue 15352) Given that the change in PR 13393 was slightly speculative, given the lack of test-cases, let's just revert part of that to fix the referenced issue. Based on a quick look at old issues and existing test-cases, it seems that most (if not all) PDF documents that benefit from using the font-data in this way lack any /ToUnicode maps which should mean that they're unaffected by these changes.	2022-09-03 23:11:42 +02:00
Jonas Jenwald	cc4baa2fe9	[api-minor] Add basic support for the `SetOCGState` action (issue 15372) Note that this patch implements the `SetOCGState`-handling in `PDFLinkService`, rather than as a new method in `OptionalContentConfig`[1], since this action is nothing but a series of `setVisibility`-calls and that it seems quite uncommon in real-world PDF documents. The new functionality also required some tweaks in the `PDFLayerViewer`, to ensure that the `layersView` in the sidebar is updated correctly when the optional-content visibility changes from "outside" of `PDFLayerViewer`. --- [1] We can obviously move this code into `OptionalContentConfig` instead, if deemed necessary, but for an initial implementation I figured that doing it this way might be acceptable.	2022-09-01 17:34:24 +02:00
Jonas Jenwald	216b86a082	[api-minor] Support Named-actions in the outline (issue 15367) Apparently this is implemented in e.g. Adobe Reader, and the specification does support it, however it cannot be commonly used in real-world PDF documents since it took over ten years for this feature to be requested.	2022-08-30 18:47:45 +02:00
Calixte Denizet	c06c5f7cbd	[Annotations] charLimit === 0 means unlimited (bug 1782564) Changing the charLimit in JS had no impact, so this patch aims to fix that and add an integration test for it.	2022-08-19 11:28:28 +02:00
Calixte Denizet	f316300113	[Annotations] Add some aria-owns in the text layer to link to annotations (bug 1780375) This patch doesn't structurally change the text layer: it just adds some aria-owns attributes to some spans. The aria-owns attribute expects an element id, which is why the patch adds back an id on the element rendering an annotation, but this id is built using crypto.randomUUID to avoid any potential issues with the hash in the url. The elements in the annotation layer are moved into the DOM in order to have them in the same "order" as they visually are. The overall goal is to help screen readers present the annotations to the user as they visually are and as they come in the text flow. It is clearly not perfect, but it should improve readability for some people with visual disabilities.	2022-08-12 14:35:26 +02:00
Calixte Denizet	04f78c935c	Fix OTS issue with empty index (#15289)	2022-08-08 22:56:26 +02:00
Jonas Jenwald	899fc29eef	Always set a border-radius for RadioButton annotations (issue 15262)	2022-08-02 13:58:20 +02:00
Calixte Denizet	d092a85b6c	Fix wrong order of arguments when calling the CipherTransform ctor (bug 1782186)	2022-07-29 12:46:45 +02:00
Jonas Jenwald	fc018ea9ea	Support images with /Filter-entries that contain Arrays (issue 15220) This patch "borrows" the code found in the `Parser.makeInlineImage`-method, to ensure that JBIG2 and JPX images can be rendered correctly.	2022-07-25 08:41:37 +02:00
Jonas Jenwald	60bd9580e2	Ignore invalid /CIDToGIDMap-entries when parsing fonts (issue 15139) In the referenced PDF document the fonts have /CIDToGIDMap-entries that cannot be loaded. Hence, only when `ignoreErrors` is set, we'll now ignore these corrupt /CIDToGIDMap-entries and fall back to simply assuming that no such data is available. Given that this is clearly a case of a corrupt PDF document, there's no guarantee that this will "fix" things in the general case since a /CIDToGIDMap may be required in order for some composite fonts to render correctly. However, attempting to render something is surely better than skipping a font altogether.	2022-07-20 11:58:44 +02:00
calixteman	2b6a67c5d0	Merge pull request #15153 from calixteman/1778692 [Annotation] A push button can have no action (bug 1778692)	2022-07-08 21:06:53 +02:00
Calixte Denizet	8f26ba5487	[Annotation] A push button can have no action (bug 1778692)	2022-07-08 15:39:56 +02:00
Jonas Jenwald	c2f7942aea	Ensure that the /Resources-entry is actually a dictionary (issue 15150) Prevent issues in corrupt PDF documents, if the /Resources-entry is not of the correct and expected type.	2022-07-08 12:43:43 +02:00
Jonas Jenwald	79cfc548fc	Improve text-selection for Type3 fonts with bogus /FontBBox-entries (issue 14999) This extends PR 13461, by also building a fallback bounding box for Type3 fonts that contain a much too small /FontBBox-entry. Please note: While this patch improves things overall, copy-and-pasting still doesn't work perfectly for this document. In particular the lowercase letter "c" cannot be selected/copied, however this can be reproduced in both Adobe Reader and PDFium (in Google Chrome) too, which is caused by a lack of proper /ToUnicode-data in the PDF document.	2022-07-05 14:27:14 +02:00
Calixte Denizet	a334a21a1d	[JS] Update siblings when a field is updated after a calculation (#15092)	2022-06-24 14:23:06 +02:00
Calixte Denizet	cdc58b7a52	Rotate annotations based on the MK::R value (bug 1675139) - it aims to fix: https://bugzilla.mozilla.org/show_bug.cgi?id=1675139; - an annotation can be rotated (counterclockwise); - the rotation can be set using JS.	2022-06-21 17:57:26 +02:00
Calixte Denizet	7e3941da9d	[JS] Hide field borders and buttons (#15053) - Since the border belongs to the section containing the HTML counterpart of an annotation, this section must be hidden when a JS action requires it; - it wasn't possible to hide a button using JS.	2022-06-17 17:36:38 +02:00
Jonas Jenwald	64cce1269e	Add basic support for non-embedded ArialUnicodeMS fonts (issue 15044) This appears to be a Microsoft-specific version of the regular Arial font, hence we simply map this to Helvetica in the same way that we treat many other Arial-named fonts.	2022-06-15 10:37:20 +02:00
Jonas Jenwald	2dca14028d	Extend `getGlyphMapForStandardFonts` with some Hebrew entries (issue 15033) This only adds the minimum entries required in order to render the referenced document correctly, rather than trying to support "all" Hebrew glyphs, to ensure that all lines in `getGlyphMapForStandardFonts` are covered by tests.	2022-06-13 10:08:39 +02:00
Jonas Jenwald	3d244cb6a8	Render PopupAnnotations even if they have missing or empty /Rect-entries (issue 15012, PR 14439 follow-up) This only applies to corrupt PDF documents, where Annotations are missing the required /Rect-entry. Rendering PopupAnnotations unconditionally shouldn't be a problem, since we're not using a `BaseSVGFactory`-instance in that case.	2022-06-09 15:10:54 +02:00
Calixte Denizet	2dd0c861bf	Outline fields which are required (bug 1724918) - it aims to fix https://bugzilla.mozilla.org/show_bug.cgi?id=1724918; - it applies for both Acroform and XFA.	2022-06-07 17:02:11 +02:00
Calixte Denizet	96d0d22d66	Reset all the canvas states after rendering each annotation (#14105) - each annotation must be rendered independently of the others. So after each annotation has been rendered, the canvas states are reset in order to have something clean to render the next one.	2022-06-07 14:59:02 +02:00
Jonas Jenwald	59dd4ea2b0	Lookup image-data correctly in `paintImageMaskXObjectGroup` (issue 14990) This fixes a regression from PR 14754. We didn't look up the image-data correctly, with the result that we tried to render some ImageMasks using a string rather than the intended TypedArray. To make matters worse, this code-path was apparently not properly covered by existing test-cases.	2022-06-05 12:39:23 +02:00
Calixte Denizet	66b513fc00	[Annotations] Show buttons even if they've no actions - it's a regression from PR #14247: - before the PR, the button was rendered on the canvas whatever its status was; - after the PR, the button image has been moved to another canvas, so when the button is not renderable (because it has no actions) the image is not added to the HTML element. - the buttons in the pdf in bug 1737260 or in the pdf in #14308 were not visible - make the button always renderable but don't add the link element if it's useless.	2022-05-28 23:50:50 +02:00
Calixte Denizet	9d82106d20	Set the text fields font size based on their height - right now we're using the font size from the pdf itself but we use another font in the annotation layer. So this size doesn't really make sense and leads to bad rendering (see pdf in #14928); - use a sans-serif font for the fields containing text (fix issue #14736); - remove useless padding in text-based fields (fix issue #14301); - text fields allow/disallow scrollbars (see bit 24 in Ff entry), so use this value to hide/show scrollbars in the annotation layer.	2022-05-28 18:00:39 +02:00
Calixte Denizet	c7afce4210	Support Hangul syllables when searching some text (bug 1771477) - it aims to fix https://bugzilla.mozilla.org/show_bug.cgi?id=1771477; - Hangul contains some syllables which are decomposed when using NFD, hence the text must be correctly shifted in case it contains some of them.	2022-05-28 16:50:03 +02:00
Jonas Jenwald	5a2899c57e	Skip bogus `d1` operators in Type3-glyphs (issue 14953) In the `src/display/canvas.js` code the `d1` operator will be used to set the clipping region, and it obviously cannot be empty since that prevents the Type3-glyph from rendering. Also, the patch removes an outdated comment; refer to PR 12718.	2022-05-24 12:20:31 +02:00
Calixte Denizet	9407adc416	[JS] Format all the fields if any when the document is open (bug 1766987) - it aims to fix https://bugzilla.mozilla.org/show_bug.cgi?id=1766987.	2022-05-22 15:50:42 +02:00
Calixte Denizet	60498c67e4	Display background when printing or saving a text widget (issue #14928)	2022-05-19 16:41:54 +02:00
Jonas Jenwald	5a774b7ed3	Adjust the heuristics for handling of incomplete path operators (issue 14917) This limits the heuristics for handling of incomplete path operators, see PR 9838, to only apply to sequences of such operators. In practice a couple of invalid path operators are (hopefully) unlikely to completely break rendering, whereas a sequence of them will easily lead to fairly chaotic rendering artifacts.	2022-05-15 11:24:39 +02:00
Jonas Jenwald	6e7e9d83d8	Add support for TrueType format 12 `cmap`s (issue 14881) This is, as far as I can tell, the first case we've seen of a format 12 `cmap`. Please see https://developer.apple.com/fonts/TrueType-Reference-Manual/RM06/Chap6cmap.html	2022-05-06 11:11:38 +02:00
Jonas Jenwald	8135d7ccf6	Merge pull request #14869 from calixteman/14862 [JS] Fix few bugs present in the pdf for issue #14862	2022-05-03 18:31:31 +02:00
Calixte Denizet	094ff38da0	[JS] Fix few bugs present in the pdf for issue #14862 - since the resetForm function resets a field value, a calculateNow is consequently triggered. But the calculate callback can itself call resetForm, hence an infinite recursive loop. So basically, prevent calculateNow from being triggered by itself. - in Firefox, the letters entered in some fields were duplicated: "AaBb" instead of "AB". It was mainly because beforeInput was triggering a Keystroke which was itself triggering an input value update and then the input event was triggered. So in order to avoid that, beforeInput calls preventDefault and then it's up to the JS to handle the event. - fields have a property valueAsString which returns the value as a string. In the implementation it was wrongly used to store the formatted value of a field (2€ when the user entered 2). So this patch implements valueAsString correctly. - non-rendered fields can be updated using JS, but when they are, they must take some properties in the annotationStorage. It was implemented for field values, but it wasn't for display, colors, ... - it fixes #14862 and #14705.	2022-05-03 15:48:44 +02:00
Jonas Jenwald	df5a4fd0a7	Support encoded dest-strings in /GoTo destination dictionaries (issue 14864) Interestingly enough this appears to be the very first case of encoded dest-strings, in /GoTo destination dictionaries, that we've actually come across. What's really fascinating is that it's less than a week after issue 14847, given that these issues are somewhat similar.	2022-05-02 10:14:32 +02:00

1253 Commits