pdf.js

Author	SHA1	Message	Date
Jonas Jenwald	12d60e0acf	Don't allow `adjustToUnicode` to extend a built-in /ToUnicode map (issue 15352) Given that the change in PR 13393 was slightly speculative, given the lack of test-cases, let's just revert part of that to fix the referenced issue. Based on a quick look at old issues and existing test-cases, it seems that most (if not all) PDF documents that benefit from using the font-data in this way lack any /ToUnicode maps which should mean that they're unaffected by these changes.	2022-09-03 23:11:42 +02:00
Jonas Jenwald	571ce13dd6	[api-major] Remove the `enhanceTextSelection` functionality (PR 15145 follow-up) For the `gulp mozcentral` command, this reduces the size of the built `pdf.js` file by `> 10` kB.	2022-08-28 15:04:47 +02:00
Calixte Denizet	04f78c935c	Fix OTS issue with empty index (#15289 )	2022-08-08 22:56:26 +02:00
Jonas Jenwald	899fc29eef	Always set a border-radius for RadioButton annotations (issue 15262)	2022-08-02 13:58:20 +02:00
Calixte Denizet	d092a85b6c	Fix wrong order of arguments when calling the CipherTransform ctor (bug 1782186)	2022-07-29 12:46:45 +02:00
Jonas Jenwald	2cad5cf45b	Set `opacity` in the reference tests (PR 15219 follow-up) Without these changes in the manifest, the affected test-cases fail to render correctly.	2022-07-28 14:35:09 +02:00
Jonas Jenwald	fc018ea9ea	Support images with /Filter-entries that contain Arrays (issue 15220) This patch "borrows" the code found in the `Parser.makeInlineImage`-method, to ensure that JBIG2 and JPX images can be rendered correctly.	2022-07-25 08:41:37 +02:00
Jonas Jenwald	60bd9580e2	Ignore invalid /CIDToGIDMap-entries when parsing fonts (issue 15139) In the referenced PDF document the fonts have /CIDToGIDMap-entries that cannot be loaded. Hence, only when `ignoreErrors` is set, we'll now ignore these corrupt /CIDToGIDMap-entries and fallback to simply assume that no such data is available. Given that this is clearly a case of a corrupt PDF document, there's no guarantee that this will "fix" things in the general case since a /CIDToGIDMap may be required in order for some composite fonts to render correctly. However, attempting to render something is surely better than skipping a font altogether.	2022-07-20 11:58:44 +02:00
Calixte Denizet	8f26ba5487	[Annotation] A push button can have no action (bug 1778692)	2022-07-08 15:39:56 +02:00
Jonas Jenwald	79cfc548fc	Improve text-selection for Type3 fonts with bogus /FontBBox-entries (issue 14999) This extends PR 13461, by also building a fallback bounding box for Type3 fonts that contain a much too small /FontBBox-entry. Please note: While this patch improves things overall, copy-and-pasting still doesn't work perfectly for this document. In particular the lowercase letter "c" cannot be selected/copied, however this can be reproduced in both Adobe Reader and PDFium (in Google Chrome) too, which is caused by a lack of proper /ToUnicode-data in the PDF document.	2022-07-05 14:27:14 +02:00
calixteman	23fcdabb37	Merge pull request #15088 from calixteman/editor_rotation Support rotating editor layer	2022-06-25 16:18:07 +02:00
Calixte Denizet	0c420f5135	Support rotating editor layer - As in the annotation layer, use percent instead of pixels as unit; - handle the rotation of the editor layer in allowing editing when rotation angle is not zero; - the different editors are rotated counterclockwise in order to be usable when the main page is itself rotated; - add support for saving/printing rotated editors.	2022-06-24 20:02:32 +02:00
Jonas Jenwald	c48dc251e0	Add (basic) support for Optional Content in Annotations Given that Annotations can also have an `OC`-entry, we need to take that into account when generating their operatorLists. Note that in order to simplify the patch the `getOperatorList`-methods, for the Annotation-classes, were converted to be `async`.	2022-06-24 15:19:56 +02:00
Calixte Denizet	e49d039853	Correctly order added annotations when saving or printing - the annotations must be rendered in the same order as the chronological one. - fix a bug in document.js which avoids to read a saved pdf correctly in Acrobat: there is no need to reset the xref state: it's done in worker.js once everything has been saved.	2022-06-23 17:39:12 +02:00
Calixte Denizet	f27c8c4471	[Editor] Add support for printing newly added Ink annotations	2022-06-21 18:21:49 +02:00
Calixte Denizet	cdc58b7a52	Rotate annotations based on the MK::R value (bug 1675139) - it aims to fix: https://bugzilla.mozilla.org/show_bug.cgi?id=1675139; - An annotation can be rotated (counterclockwise); - the rotation can be set in using JS.	2022-06-21 17:57:26 +02:00
Jonas Jenwald	64cce1269e	Add basic support for non-embedded ArialUnicodeMS fonts (issue 15044) This appears to be a Microsoft-specific version of the regular Arial font, hence we simply map this to Helvetica in the same way that we treat many other Arial-named fonts.	2022-06-15 10:37:20 +02:00
Jonas Jenwald	2dca14028d	Extend `getGlyphMapForStandardFonts` with some Hebrew entries (issue 15033) This only adds the minimum entries required in order to render the referenced document correctly, rather than trying to support "all" Hebrew glyphs, to ensure that all lines in `getGlyphMapForStandardFonts` are covered by tests.	2022-06-13 10:08:39 +02:00
Jonas Jenwald	3d244cb6a8	Render PopupAnnotations even if they have missing or empty /Rect-entries (issue 15012, PR 14439 follow-up) This only applies to corrupt PDF documents, where Annotations are missing the required /Rect-entry. Rendering PopupAnnotations unconditionally shouldn't be a problem, since we're not using a `BaseSVGFactory`-instance in that case.	2022-06-09 15:10:54 +02:00
Calixte Denizet	2dd0c861bf	Outline fields which are required (bug 1724918) - it aims to fix https://bugzilla.mozilla.org/show_bug.cgi?id=1724918; - it applies for both Acroform and XFA.	2022-06-07 17:02:11 +02:00
Calixte Denizet	96d0d22d66	Reset all the canvas states after rendering each annotations (#14105 ) - each annotation must be rendered independently of the others. So after having rendered each annotation, the canvas states are reset in order to have something clean to render the next one.	2022-06-07 14:59:02 +02:00
Jonas Jenwald	59dd4ea2b0	Lookup image-data correctly in `paintImageMaskXObjectGroup` (issue 14990) This fixes a regression from PR 14754. We didn't lookup the image-data correctly, with the result that we tried to render some ImageMasks using a string rather than the intended TypedArray. To make matters worse, this code-path was apparently not properly covered by existing test-cases.	2022-06-05 12:39:23 +02:00
Calixte Denizet	66b513fc00	[Annotations] Show buttons even if they've no actions - it's a regression from PR #14247: - before the PR, the button was rendered on the canvas whatever its status was; - after the PR, the button image has been moved in an other canvas so when the button is not renderable (because it has no actions) then the image is not added the HTML element. - the buttons in the pdf in bug 1737260 or in the pdf in #14308 were not visible - make the button always renderable but don't add the link element if it's useless.	2022-05-28 23:50:50 +02:00
Calixte Denizet	9d82106d20	Set the text fields font size based on their height - right now we're using the font size from the pdf itself but we use an other font in the annotation layer. So this size doesn't really make sense and leads to bad rendering (see pdf in #14928); - use a sans-serif font for the fields containing text (fix issue #14736); - remove useless padding in text-based fields (fix issue #14301); - text fields allow/disallow scrolling bars (see bit 24 in Ff entry), so use this value to hide/show scrollbars in annotation layer.	2022-05-28 18:00:39 +02:00
Jonas Jenwald	5a2899c57e	Skip bogus `d1` operators in Type3-glyphs (issue 14953) In the `src/display/canvas.js` code the `d1` operator will be used to set the clipping region, and it obviously cannot be empty since that prevents the Type3-glyph from rendering. Also, the patch removes an outdated comment; refer to PR 12718.	2022-05-24 12:20:31 +02:00
Calixte Denizet	9407adc416	[JS] Format all the fields if any when the document is open (bug 1766987) - it aims to fix https://bugzilla.mozilla.org/show_bug.cgi?id=1766987.	2022-05-22 15:50:42 +02:00
Calixte Denizet	60498c67e4	Display background when printing or saving a text widget (issue #14928 )	2022-05-19 16:41:54 +02:00
Jonas Jenwald	5a774b7ed3	Adjust the heuristics for handling of incomplete path operators (issue 14917) This limits the heuristics for handling of incomplete path operators, see PR 9838, to only apply to sequences of such operators. In practice a couple of invalid path operators are (hopefully) unlikely to completely break rendering, whereas a sequence of them will easily lead to fairly chaotic rendering artifacts.	2022-05-15 11:24:39 +02:00
Jonas Jenwald	6e7e9d83d8	Add support for TrueType format 12 `cmap`s (issue 14881) This is, as far as I can tell, the first case we've seen of a format 12 `cmap`. Please see https://developer.apple.com/fonts/TrueType-Reference-Manual/RM06/Chap6cmap.html	2022-05-06 11:11:38 +02:00
Calixte Denizet	c8afd6ce8c	[api-minor] Improve pdf reading in high contrast mode - Use Canvas & CanvasText color when they don't have their default value as background and foreground colors. - The colors used to draw (stroke/fill) in a pdf are replaced by the bg/fg ones according to their luminance.	2022-05-05 16:34:51 +02:00
Jonas Jenwald	df5a4fd0a7	Support encoded dest-strings in /GoTo destination dictionaries (issue 14864) Interestingly enough this appears to be the very first case of encoded dest-strings, in /GoTo destination dictionaries, that we've actually come across. What's really fascinating is that it's less than a week after issue 14847, given that these issues are somewhat similar.	2022-05-02 10:14:32 +02:00
Calixte Denizet	624d8a8e3e	Use integer coordinates when drawing images (bug 1264608, issue #3351 ) - it aims to fix https://bugzilla.mozilla.org/show_bug.cgi?id=1264608; - it's only a partial fix for #3351; - some tiled images have some spurious white lines between the tiles. When the current transform is applyed the corners of an image can have some non-integer coordinates leading to some extra transparency added to handle that. So with this patch the current transform is applied on the point and on the dimensions in order to have at the end only integer values.	2022-04-29 16:01:34 +02:00
Tim van der Meij	752dee5caa	Merge pull request #14825 from Snuffleupagus/issue-14824 Ensure that worker-thread image caching doesn't break optional content (issue 14824)	2022-04-23 13:19:56 +02:00
Tim van der Meij	f9e54d9226	Merge pull request #14823 from Snuffleupagus/issue-14821 Ignore invalid /Encoding-entries when parsing fonts (issue 14821)	2022-04-23 13:19:26 +02:00
Jonas Jenwald	6c229dffb1	Ensure that worker-thread image caching doesn't break optional content (issue 14824) Currently we only insert optionalContent-data into the operatorList the first time that an image is parsed, which will (in hindsight) obviously cause problems for cached images. Hence we also need to insert the optionalContent-data in the various worker-thread image caches, such that it can be accessed in the fast-paths that are used to skip re-parsing of images. In order to reduce the amount of repeated code, this patch also adds a new `OperatorList`-method that takes care of inserting the necessary data in the operatorList.	2022-04-22 14:49:16 +02:00
Jonas Jenwald	e723da7261	Ignore invalid /Encoding-entries when parsing fonts (issue 14821) In the referenced PDF document the fonts have /Encoding-entries that are Streams (containing completely bogus data), which are thus obviously not valid here. Hence, only when `ignoreErrors` is set, we'll now ignore these corrupt /Encoding-entries and fallback to the existing code to try and infer a usable encoding. Given that this is clearly a case of corrupt PDF documents, there's no guarantee that this will "fix" all such cases, however it's the best that we do here and shouldn't really be worse than ignoring an entire font.	2022-04-22 11:49:03 +02:00
Jonas Jenwald	39d1bdde09	Ignore non-Stream /SMask-entries when parsing images (issue 14814) This is similar to the pre-existing check used in the /Mask-case below, to handle corrupt PDF documents that include non-Stream /SMask-entries in images; please refer to the PDF specification: https://web.archive.org/web/20220309040754if_/https://www.adobe.com/content/dam/acom/en/devnet/pdf/pdfs/PDF32000_2008.pdf#page=216 Please note: Adobe Reader also fails to render the image on the second page, and displays an error message.	2022-04-21 12:14:08 +02:00
Jonas Jenwald	5bc7339c1b	Add support for the /Catalog Base-URI when resolving URLs (issue 14802) As far as I can tell, this is actually the very first time that we've seen a PDF document with a Base-URI specified in the /Catalog; please refer to the specification: https://web.archive.org/web/20220309040754if_/https://www.adobe.com/content/dam/acom/en/devnet/pdf/pdfs/PDF32000_2008.pdf#G11.2097122 To simplify the overall implementation, this new parameter is accessed via the existing `BasePdfManager.docBaseUrl`-getter and will thus override any user-specified `docBaseUrl` API-parameter.	2022-04-19 17:14:52 +02:00
Calixte Denizet	3d74d2c6cb	Don't clip when the clip path is empty (issue #12306 )	2022-04-18 10:33:44 +02:00
Calixte Denizet	f62d961dfe	Improve performances with image masks (bug 857031) - it's the second part of the fix for https://bugzilla.mozilla.org/show_bug.cgi?id=857031; - some image masks can be used several times but at different positions; - an image need to be pre-process before to be rendered: * rescale it; * use the fill color/pattern. - the two operations above are time consuming so we can cache the generated canvas; - the cache key is based on the current transform matrix (without the translation part) and the current fill color when it isn't a pattern. - the rendering of the pdf in the above bug is really faster than without this patch.	2022-04-16 20:48:39 +02:00
Calixte Denizet	040fcae5ab	Improve performance with image masks (bug 857031) - it aims to partially fix performance issue reported: https://bugzilla.mozilla.org/show_bug.cgi?id=857031; - the idea is too avoid to use byte arrays but use ImageBitmap which are a way faster to draw: * an ImageBitmap is Transferable which means that it can be built in the worker instead of in the main thread: - this is achieved in using an OffscreenCanvas when it's available, there is a bug to enable them for pdf.js: https://bugzilla.mozilla.org/show_bug.cgi?id=1763330; - or in using createImageBitmap: in Firefox a task is sent to the main thread to build the bitmap so it's slightly slower than using an OffscreenCanvas. * it's transfered from the worker to the main thread by "reference"; * the byte buffers used to create the image data have a very short lifetime and ergo the memory used is globally less than before. - Use the localImageCache for the mask; - Fix the pdf issue4436r.pdf: it was expected to have a binary stream for the image; - Move the singlePixel trick from operator_list to image: this way we can use this trick even if it isn't in a set as defined in operator_list.	2022-04-09 18:26:26 +02:00
Calixte Denizet	0b597304c1	[Annotations] Some annotations can have their values stored in the xfa:datasets - it aims to fix #14685; - add a basic object to get values from the parsed datasets; - these annotations don't have an appearance so we must create one when printing or saving.	2022-04-01 10:28:04 +02:00
Calixte Denizet	ad3fb71a02	[Annotations] Add support for printing/saving choice list with multiple selections - it aims to fix issue #12189.	2022-03-29 18:59:44 +02:00
Calixte Denizet	18e79e3c0b	[text selection] Add the whitespaces present in the pdf in the text chunk - it aims to fix issue #14627; - the basic idea of the recent text refactoring was to only consider the rendered visible whitespaces. But sometimes, the heuristics aren't correct and although some whitespaces are in the text stream they weren't in the text chunks because they were too small. Hence we added some exceptions, for example, we always add a whitespace when it is between two non-whitespace chars but only when in the same Tj. So basically, this patch removes the constraint to have the chars in the same Tj (in using a circular buffer to save the two last chars) but don't add a space when the visible space is really too small (hence `NOT_A_SPACE_FACTOR`).	2022-03-27 14:34:56 +02:00
Jonas Jenwald	1a7921dbf0	Compute the loca table `endOffset`, of the "first" glyph, correctly (issue 14618) When there are multiple empty glyphs at the start of the data, ensure that the "first" glyph gets a correct `endOffset` to avoid skipping it during parsing in the `sanitizeGlyph` function.	2022-03-03 14:22:45 +01:00
Brendan Dahl	85ff7b117e	Merge pull request #14536 from calixteman/thin_line Fix some issues with lineWidth < 1 after transform (bug 1753075, bug 1743245, bug 1710019)	2022-03-02 09:46:15 -08:00
Jeff Muizelaar	9b9609a6d8	[JPEG 2000] Add support for resetContextProbabilities (bug 1731483)	2022-02-26 13:05:23 +01:00
Calixte Denizet	46369e4aa5	Fix some issues with lineWidth < 1 after transform (bug 1753075, bug 1743245, bug 1710019) - it aims to fix: - https://bugzilla.mozilla.org/show_bug.cgi?id=1753075; - https://bugzilla.mozilla.org/show_bug.cgi?id=1743245; - https://bugzilla.mozilla.org/show_bug.cgi?id=1710019; - issue #13211; - issue #14521. - previously we were trying to adjust lineWidth to have something correct after the current transform is applied but this approach was not correct because finally the pixel is rescaled with the same factors in both directions. And sometimes those factors must be different (see bug 1753075). - So the idea of this patch is to apply a scale matrix to the current transform just before setting lineWidth and stroking. This scale matrix is computed in order to ensure that after transform, a pixel will have its two thickness greater than 1.	2022-02-25 18:37:34 +01:00
Jonas Jenwald	889b761f22	Merge pull request #14545 from brendandahl/output-scale Generate test images at different output scales.	2022-02-24 21:56:54 +01:00
Brendan Dahl	f5c3abb8f7	Generate test images at different output scales. This will default to generating test images at the device pixel ratio of the machine the tests are created on unless the test explicitly defines and output scale using the `outputScale` setting. This makes the test look visually like they would on the machine they are running on. It also allows us to test different output scales.	2022-02-24 11:27:41 -08:00
Jonas Jenwald	530af48b8e	Merge pull request #14569 from brendandahl/smask-state Fix canvas state getting out of sync from smasks. (bug 1755507)	2022-02-18 19:35:58 +01:00
Brendan Dahl	7def6d12c8	Fix canvas state getting out of sync from smasks. (bug 1755507) Soft masks can be enabled/disabled at anytime and at different points in the save/restore stack. This can lead to the amount of save/restores becoming unbalanced across the two canvases. Instead of save/restoring on the temporary canvas change it so we only track state on the main (suspended canvas). I was also getting an out balance stack from patterns, so I've also fixed that and added a warning that will at least show up on chrome. It would be nice to add this so Firefox at some point too. Fixes #11328, #14297 and bug 1755507	2022-02-17 17:38:32 -08:00
Calixte Denizet	18f4e560ae	[Search] Some matches were incorrectly shifted because of some '-\n' - it aims to fix #14562; - 'X-\n' were not correctly positioned; - when X is a diacritic (e.g. in "sä-\n", which is decomposed into "sa¨-\n") we must handle both things: - diacritics on the one hand; - "-\n" on the other hand.	2022-02-14 10:12:33 +01:00
Calixte Denizet	18e3a98c2b	[api-minor] Don't add in the text content the chars which are out-of-page (bug 1755201) - it aims to fix https://bugzilla.mozilla.org/show_bug.cgi?id=1755201; - if the glyph position is not within the view then skip it.	2022-02-13 21:07:11 +01:00
Brendan Dahl	f8b2a99ddc	Merge pull request #14543 from Snuffleupagus/bug-1753983 Let `Lexer.getNumber` treat a single minus sign as zero (bug 1753983)	2022-02-09 14:06:35 -08:00
Jonas Jenwald	60efae96fd	Update the file used with the `xfa_bug1720182` test-case The file used in this test-case is identical to, i.e. the md5 entry perfectly matches, the file used with the `xfa_bug1716380` test-case. While it's obviously fine to use the same PDF document in different reference-tests, note how we e.g. have both `eq` and `text` tests for one document, we should always avoid adding duplicate files in the `test/pdfs/` folder.	2022-02-08 14:23:41 +01:00
Jonas Jenwald	64f3dbeb48	Let `Lexer.getNumber` treat a single minus sign as zero (bug 1753983) This appears to be consistent with the behaviour in both Adobe Reader and PDFium (in Google Chrome); this is essentially the same approach as used for a single decimal point in PR 9827.	2022-02-07 17:09:47 +01:00
Jonas Jenwald	e13353cf21	Remove the `xfa_bug1716838` browser-test since it's a duplicate The md5 entry perfectly matches the `xfa_bug1717668_1` test-case, which means that we unnecessarily test the same exact document twice.	2022-02-01 12:05:19 +01:00
Jonas Jenwald	ef1676678f	Remove the `MBTA-pretax-form-July2012` browser-test since it's a duplicate The md5 entry perfectly matches the `xfa_bug1718521_2` test-case, which means that we unnecessarily test the same exact document twice.	2022-01-31 12:35:26 +01:00
Calixte Denizet	ae842e1c3a	[api-minor] Annotations - Adjust the font size in text field in considering the total width (bug 1721335) - it aims to fix #14502 and bug 1721335; - Acrobat and Pdfium do the same; - it'll avoid to have truncated data when printed; - change the factor to compute font size in using field height: lineHeight = 1.35*fontSize - this is the value used by Acrobat. - in order to not have truncated strings on the bottom, add few basic metrics for standard fonts.	2022-01-30 15:53:31 +01:00
calixteman	838909f8c1	Merge pull request #14491 from quaoaris/lines-rendered-too-thick fix for lines (stroke) are rendered too thick (Bug 1743245)	2022-01-27 18:46:26 +01:00
Calixte Denizet	3a7004ca25	Take into account all rotations before comparing glyph positions - it aims to fix #14497; - previously, only rotations with an angle 0, 90, 180 or 270 were taken into account; - so generalize to any angle but keep the fast path for 0, 90, ... because they're likely more common than anything else.	2022-01-26 17:19:00 +01:00
quaoaris	3f77d80f31	fix for lines (stroke) are rendered too thick (Bug 1743245) This commit fixes Bug 1743245 (Grided PDF file lines rendered too thick) which was created by a fix for #12868 . The lineWidth was set to round(1 * this._combinedScaleFactor) when the pixel is drawn as a parallelorgam with a height <1. This fix changes this to floor(1*this._combinedScaleFactor) . This change shows a visual result comparable to Chrome and Acrobat. Regarding the last PR 3 statements in canvas.js are affected and will change with this commit (stroke and paintChar). renaming the reference files to naming comvention	2022-01-25 10:27:30 +01:00
Calixte Denizet	e1d3a3b414	Remove the invisible format marks from the text chunks - it aims to fix issue #9186.	2022-01-24 13:47:24 +01:00
Tim van der Meij	23b6fde9fc	Merge pull request #14464 from Snuffleupagus/issue-14462 Support Type1 font files with incomplete /CharStrings definitions (issue 14462)	2022-01-19 20:38:46 +01:00
Calixte Denizet	74f25d2755	Font renderer - get int8 instead of uint8 in composite glyphes (bug 1749563) - it aims to fix https://bugzilla.mozilla.org/show_bug.cgi?id=1749563; - use some helper functions to get (u\|i)int** values in buffer: it helps to have a clearer code; - in composite glyphes the translations values with a transformations are signed so consequently get some int8 instead of uint8; - add few TODOs.	2022-01-18 22:06:23 +01:00
Jonas Jenwald	a13ae5d97d	Support Type1 font files with incomplete /CharStrings definitions (issue 14462) Please refer to https://www.pdfa.org/norm-refs/Type1Fonts.pdf#page=15 for the expected format for the /CharStrings entries. In the referenced PDF document the /CharStrings are missing the expected end-token, which causes us to swallow the start of the next glyph name.	2022-01-17 18:55:22 +01:00
Tim van der Meij	922dac035c	Merge pull request #14448 from Snuffleupagus/Type3-circular-refs Prevent circular references in Type3 fonts	2022-01-15 14:11:47 +01:00
Jonas Jenwald	4c55563574	Add an additional test-case for circular references in Type3 fonts The PDF document in this patch already worked without the previous patch, but I wanted to improve our test-coverage for the Type3-parsing. The attached PDF document was also found in https://github.com/pdf-association/safedocs/tree/main/Miscellaneous%20Targeted%20Test%20PDFs	2022-01-13 17:59:57 +01:00
Jonas Jenwald	53d4ee7990	Prevent circular references in Type3 fonts In corrupt PDF documents Type3 fonts may introduce circular dependencies, thus resulting in the affected font(s) never loading and parsing/rendering never completing. Note that I've not seen any real-world examples of this kind of font corruption, but the attached PDF document was rather found in https://github.com/pdf-association/safedocs/tree/main/Miscellaneous%20Targeted%20Test%20PDFs Please note: That repository contains a number of reduced test-cases that are specifically intended to test interoperability (between PDF viewer) and parsing/rendering for various kinds of strange/corrupt PDF documents. Some of the test-cases found there may thus not make sense to try and "fix" upfront, in my opinion, unless the problems are also found in real-world PDF documents.	2022-01-13 17:58:37 +01:00
Jonas Jenwald	08d88a0235	Ignore Annotations with empty /Rect-entries in the display-layer (issue 14438) This prevents the `BaseSVGFactory.create`-method from throwing, and thus preventing any remaining Annotations (on the page) from rendering in corrupt documents.	2022-01-11 13:54:35 +01:00
Calixte Denizet	6cdae5ac4d	Use positive dimensions for text chunks in the text layer (issue #14415 ).	2022-01-05 10:49:56 +01:00
KouWakai	98158b67a3	Handle non-integer Annotation border widths correctly (issue 14203) The existing code appears to be wrong, since according to the PDF specification the border width of an Annotation only has to be a number and not specifically an integer. Please see: - https://www.adobe.com/content/dam/acom/en/devnet/pdf/pdfs/PDF32000_2008.pdf#page=392 - https://www.adobe.com/content/dam/acom/en/devnet/pdf/pdfs/PDF32000_2008.pdf#G11.2096210 - https://www.adobe.com/content/dam/acom/en/devnet/pdf/pdfs/PDF32000_2008.pdf#G6.1965562	2021-12-24 22:10:19 +09:00
Jonas Jenwald	909f012fb8	Add a (linked) test-case for issue 8022 Given that [bug 1336591](https://bugzilla.mozilla.org/show_bug.cgi?id=1336591) was just closed as fixed, thus fixing issue 8022 in Firefox, let's add a test-case to enable us to catch any future regressions either in PDF.js or in browsers themselves.	2021-12-06 15:27:40 +01:00
Jonas Jenwald	ca82e1832f	Add a (linked) test-case for issue 8019 Given that [bug 1336572](https://bugzilla.mozilla.org/show_bug.cgi?id=1336572) was just closed as fixed, thus fixing issue 8019 in Firefox[1], let's add a test-case to enable us to catch any future regressions either in PDF.js or in browsers themselves. --- [1] It also seems to be working in Google Chrome, although I'm having a slightly difficult time deciphering exactly what configurations were affected when looking through issue 8019.	2021-12-04 08:56:04 +01:00
Calixte Denizet	31e13515f5	XFA - Draw arcs correctly - it aims to fix #14315; - take into account the startAngle to compute the coordinates of the final point.	2021-11-27 19:30:12 +01:00
Jonas Jenwald	afcc99a86d	When parsing corrupt documents without any trailer-dictionary, fallback to the "top"-dictionary (issue 14269) There's obviously no guarantee that this will work in general, if the document is sufficiently corrupt, but it should hopefully be better than just throwing `InvalidPDFException` as currently happens. Please note that, as is often the case with corrupt documents, it's somewhat difficult to know if we're rendering the document "correctly" with this patch[1]. In this case even Adobe Reader cannot open the document, which is always a good sign that it's really corrupt, however we're at least able to render something with this patch. --- [1] Whatever "correct" even means when dealing with corrupt PDF documents, where often times different PDF viewers won't agree completely.	2021-11-13 13:21:38 +01:00
Jonas Jenwald	28fb3975eb	Merge pull request #14266 from calixteman/bug931481 Don't consider space as real space when there is an extra spacing (bug 931481)	2021-11-12 21:42:32 +01:00
Calixte Denizet	a88ff34eb7	Don't consider space as real space when there is an extra spacing (bug 931481) - it aims to fix https://bugzilla.mozilla.org/show_bug.cgi?id=931481; - real space chars are pushed in the chunk but when there is an extra spacing, the next char position must be compared with the previous one; - for example, an extra spacing can cancel a space so visually there are no space.	2021-11-12 18:53:48 +01:00
Calixte Denizet	5b7e1f5232	XFA - Avoid an exception when looking for a font in a parent node - it aims to fix issue https://github.com/mozilla/pdf.js/issues/14150; - a parent can be null in case the root has been reached, so just add a check.	2021-11-12 16:27:08 +01:00
Jonas Jenwald	ea1c348c67	Always prefer abbreviated keys, over full ones, when doing any dictionary lookups (issue 14256) Note that issue 14256 was specifically about inline images, please refer to: - https://www.adobe.com/content/dam/acom/en/devnet/pdf/pdfs/PDF32000_2008.pdf#G7.1852045 - https://www.pdfa.org/safedocs-unearths-pdf-inline-image-issue/ - https://pdf-issues.pdfa.org/32000-2-2020/clause08.html#H8.9.7 However, during review of the initial PR in https://github.com/mozilla/pdf.js/pull/14257#issuecomment-964469710, it was suggested that we instead do this unconditionally for all dictionary lookups. In addition to re-ordering the existing call-sites in the `src/core`-code, and adding non-PRODUCTION/TESTING asserts to catch future errors, for consistency a number of existing `if`/`switch`-blocks were re-factored to also check the abbreviated keys first.	2021-11-10 11:56:18 +01:00
calixteman	4bb9de4b00	Merge pull request #14239 from calixteman/1739502 XFA - Fix a breakBefore issue when target is a contentArea and startNew is 1 (bug 1739502)	2021-11-08 03:14:42 -08:00
Brendan Dahl	8161d3f29d	Don't double apply a group xobject's bbox. In `beginGroup` we create a new canvas that is the size of the bounding box and we translate it to the offset. This means we don't need to also apply the bounding box during `paintFormXObjectBegin`. This improves #6961 quite a bit, but it still is missing the indention in the ruler.	2021-11-05 15:40:58 -07:00
Calixte Denizet	a08763f4aa	XFA - Fix a breakBefore issue when target is a contentArea and startNew is 1 (bug 1739502) - it aims to fix https://bugzilla.mozilla.org/show_bug.cgi?id=1739502; - when the target area was the current content area, everything was pushed in it instead of creating a new one (and consequently a new pageArea is created). - the pdf shows an alignment issue on page 4: - the hAlign is "center" but the subform was the width of its parent, so compute the real width of the subform with tb layout; - there is an extra empty page at the end of the pdf: - there is a subform with some hidden elements which are not rendered for now (since there is no plugged JS engine it isn't possible to draw them in changing their visibility). - so in case a subform is empty and has no real dimensions (at least one is 0), we just consider it as empty.	2021-11-05 18:59:55 +01:00
calixteman	e136afbabc	Merge pull request #14218 from janekotovich/subform_min_0 XFA subform with occur min=0 and no bound data displaying.	2021-11-05 04:12:34 -07:00
Jonas Jenwald	8222d6530b	Merge pull request #14232 from brendandahl/show-text-pattern Use correct matrix for patterns with showText.	2021-11-05 10:04:56 +01:00
Brendan Dahl	1c7048399b	Use correct matrix for patterns with showText. We were incorrectly using the transform in the pattern before it had been adjusted causing the pattern to be misplaced relative to the page. Fixes: ShowText-ShadingPattern.pdf (already in corpus) Fixes: #8111 Fixes: #9243	2021-11-04 16:57:36 -07:00
Jane-Kotovich	56b502391c	XFA subform with occur min=0 and no bound data displaying Subfrom nomin displays even though it's subform is set to <occur max=-1 min=0> If we look through specs of XFA 3.3 : https://www.pdfa.org/norm-refs/XFA-3_3.pdf - The min attribute is used when processing a form that contains data. Regardless of the data at least this number of instances is included. It is permissible to set this value to zero, in which case the container is entirely excluded if there is no data for it. However, in our case it doesn't happen, because we let our empty dataNode get through. Though by setting a clause: - eliminate unmatched data with occur min=0 we are checking our empty data and sending it to uselessNode array where at the end it gets removed;	2021-11-04 20:22:05 +10:00
Jonas Jenwald	5f77d3719b	Tweak the Bidi-detection heuristics for very short RTL strings (issue 11656) Very short strings can narrowly miss the existing Bidi-detection threshold, leading to incorrect text-selection and copying behaviour. In my testing, neither Adobe Reader or PDFium seem to handle copying "correctly" for this document. Hence it's not entirely clear to me that we actually want to fix this, since tweaking these heuristics can obviously cause regressions elsewhere (and our test coverage for RTL-text isn't exactly great).	2021-11-03 20:31:57 +01:00
Jonas Jenwald	8edec018fe	Add a RTL-text reference test (issue 10301) It seems that issue 10301 was fixed by PR 13424, by combining the spans, however given that we don't have a lot of test coverage for RTL-text I figured that adding a simple reference test wouldn't hurt (rather than just closing the issue as WORKSFORME).	2021-10-31 16:55:11 +01:00
Jonas Jenwald	8c70258065	Merge pull request #14182 from calixteman/richtext Support rich content in markup annotation	2021-10-31 14:41:56 +01:00
Calixte Denizet	cf8dc750d6	Support rich content in markup annotation - use the xfa parser but in the xhtml namespace.	2021-10-31 13:44:51 +01:00
Tim van der Meij	ec1633c33c	Merge pull request #14201 from Snuffleupagus/bug-1219400 Use the correct border-style for Annotations, when a dash array is specified (bug 1219400)	2021-10-30 12:39:46 +02:00
calixteman	2c0bbaf208	Merge pull request #14153 from catherinemds/xfa-link Fix XFA links (bug 1735738)	2021-10-29 11:06:00 -07:00
Catherine	db0b3cda8b	XFA - Fix xfaLink class to make links work (bug 1735738) There were some links not working in some XFA files,I realized that the anchor tag that contains the link has an inline display and couldn't receive any height, solved this by adding a "position: absolute". Tested with two different files in Firefox Nightly and Chrome and now all links are working perfectly fine. Added reftest to avoid future regressions	2021-10-29 11:39:33 -04:00
Jonas Jenwald	884caf602e	Use the correct border-style for Annotations, when a dash array is specified (bug 1219400) Even though we cannot use the dash array in the display layer, at least ensure that we use the correct border-style.	2021-10-27 13:20:21 +02:00
Jonas Jenwald	aa1b78684f	Handle ranges that "overflow" the last byte in `CMap.mapBfRange` (bug 1627427)	2021-10-24 13:48:38 +02:00
Jonas Jenwald	52372b9378	Merge pull request #14175 from brendandahl/smask-v2 Use a new method for handling soft masks.	2021-10-23 09:27:18 +02:00
Brendan Dahl	2d1f9ff7a3	Use a new method for handling soft masks. The old method of handling soft masks had a number of issues where the temporary drawing canvas and the suspended main canvas could get out of sync (e.g. mismatched save/restores or clip state) or we could end up compositing at the wrong time. A good example of things getting out sync is the reduced test case in #9017. To fix this I've changed two big things: 1) Duplicate all the needed graphics state from the temporary canvas to the suspended main canvas. This ensure the canvases stay in sync so that when we switch back to the main canvas the graphics state stack is the same (e.g. transforms, clip paths). 2) Immediately composite after each drawing operation. This ensures that if there's an active clip region that we'll still be able to composite the correct portions of the canvas. Note: This solution could be avoided by using getImageData and putImageData since those ignore clipping region, but this is very very slow. Note2: I also think the old way of only compositing at the end of the soft mask is incorrect and can lead to wrong colors if drawing over the same region, but in practice this doesn't seem to matter much. Fixes: #5781 Fixes: #5853 Fixes: #7267 Fixes: #7891 Fixes: #8403 Fixes: #8624 Fixes: #12798 Fixes: #13891 Fixes: #9017 (reduced test case) Fixes: https://bugzilla.mozilla.org/show_bug.cgi?id=1703683	2021-10-22 13:41:21 -07:00
Jonas Jenwald	68e6622c57	Ignore Square/Circle-annnotations with a zero borderWidth when creating a fallback appearance stream (issue 14164) Trying to render these Annotation-types, when the borderWidth is `0`, causes a "hairline" border to appear. If these Annotations included an appearance stream, as they are supposed to, this wouldn't have happened and the simplest solution here seem to be to just ignore these particular Annotations.	2021-10-19 15:27:42 +02:00

1 2 3 4 5 ...

1235 Commits