Sakurai/pdf.js - pdf.js - Gitea on kemo

Sakurai/pdf.js

Author	SHA1	Message	Date
Jonas Jenwald	088ce6c009	Add a unit-test to check that `ProblematicCharRanges` contains valid entries When adding new entries to `ProblematicCharRanges`, you have to be careful to not make any mistakes since that could cause glyph mapping issues. Currently the existing reference tests should probably help catch any errors, but based on experience I think that having a unit-test which specifically checks `ProblematicCharRanges` would be both helpful and timesaving when modifying/reviewing changes to this code. Hence this patch which adds a function (and unit-test) that is used to validate the entries in `ProblematicCharRanges`, and also checks that we don't accidentally add more character ranges than the Private Use Area can actually contain. The way that the validation code, and thus the unit-test, is implemented also means that we have an easy way to tell how much of the Private Use Area is potentially utilized by re-mapped characters.	2016-08-27 11:56:00 +02:00
Tim van der Meij	10f9f11ec4	Merge pull request #7490 from Snuffleupagus/issue-7426 Don't map glyphs to the Lepcha Unicode block (issue 7426)	2016-07-21 14:39:19 +02:00
Jonas Jenwald	64783c8b6e	Don't map glyphs to the Lepcha Unicode block (issue 7426) In the PDF file in the issue, some of the glyphs end up being mapped to the Lepcha Unicode block; see https://en.wikipedia.org/wiki/Lepcha_(Unicode_block). This didn't use to matter, but after HarfBuzz updates that improved support for Lepcha fonts, in particular https://bugzilla.mozilla.org/show_bug.cgi?id=1249861, some glyphs are now moved horizontally. To avoid that, this patch adds the Lepcha block to the list of Unicode ranges that we skip when building the glyph mapping. Fixes 7426.	2016-07-17 16:53:36 +02:00
klemens	6f03f62327	trivial spelling fixes	2016-07-17 14:33:41 +02:00
Jonas Jenwald	51e46fa1a7	Change the `warn` to `info` in `recoverGlyphName` to reduce the console spam After PR 7441, where `recoverGlyphName` is used a lot more than before, many PDF files will generate a lot of warnings the console. For normal usage, compared to debugging/development, this is probably more annoying than helpful.	2016-07-09 12:08:41 +02:00
Brendan Dahl	1f3f4a8dd7	Merge pull request #7441 from Snuffleupagus/issue-7439 Fallback to attempt to recover standard glyph names when amending the `charCodeToGlyphId` with entries from the `differences` array in `type1FontGlyphMapping` (issue 7439)	2016-07-06 13:02:21 -07:00
Brendan Dahl	e2e657e44f	Merge pull request #7390 from Snuffleupagus/issue-7180 Add upper-case `I` as a possible space replacement fallback in `Font.spaceWidth` to improve text-selection (issue 7180)	2016-06-29 15:11:19 -07:00
Jonas Jenwald	7866109af9	Fallback to attempt to recover standard glyph names when amending the `charCodeToGlyphId` with entries from the `differences` array in `type1FontGlyphMapping` (issue 7439) Fixes 7439.	2016-06-25 14:54:34 +02:00
Jonas Jenwald	c1ca268ef3	Skip mapping of glyphs to Unicode "Ideographic space" (issue 7416) Fixes 7416, which is an IE specific issue.	2016-06-22 08:58:00 +02:00
Jonas Jenwald	6a0b047bfa	Add upper-case `I` as a possible space replacement fallback in `Font.spaceWidth` to improve text-selection (issue 7180) In fonts with only upper-case glyphs, that are also missing a space glyph, `get spaceWidth` won't be able to return anything useful. By adding upper-case `I` as a fallback, we can thus improve text-selection in some PDF files. Note that locally, the patch causes slight movement in a few existing `text` tests, but in my opinion this actually looks like slight improvements. Fixes 7180.	2016-06-07 22:55:25 +02:00
Jonas Jenwald	a36a946976	Move the `isSpace` utility function from core/parser.js to shared/util.js Currently the `isSpace` utility function is a member of `Lexer`, which seems suboptimal, given that it's placed in `core/parser.js`. In practice, this means that in a number of `core/.js` files we thus have an otherwise* completely unnecessary dependency on `core/parser.js` for a one-line function. Instead, this patch moves `isSpace` into `shared/util.js` which seems more appropriate for this kind of utility function. Not to mention that since all the affected `core/*.js` files already depends on `shared/util.js`, this doesn't incur any more file dependencies.	2016-06-06 09:11:33 +02:00
Yury Delendik	32ce369d88	Fixes some static analysis warnings and recommendations * Useless conditional * Superfluous trailing arguments * Useless assignment to local variable * Misspelled identifier * JSDoc tag for non-existent parameter	2016-05-02 17:34:58 -05:00
Yury Delendik	118b71925c	Forces UMD header to have relative path and extension for CommonJS.	2016-04-02 11:10:36 -05:00
Jonas Jenwald	ef551e8266	Extract `Type1Parser` from fonts.js	2016-04-01 23:38:53 +02:00
Jonas Jenwald	b961e1d21b	Extract `CFFParser` from fonts.js (issue 6777)	2016-04-01 22:32:39 +02:00
Brendan Dahl	13d440df61	Merge pull request #7078 from Snuffleupagus/refactor-toFontChar-without-file Refactor the building of `toFontChar` for non-embedded fonts	2016-03-31 10:43:11 -07:00
Jonas Jenwald	05cf709f8e	Parse Type1 font files to determine the various `Length{n}` properties, instead of trusting the PDF file (issue 5686, issue 3928) Fixes 5686. Fixes 3928.	2016-03-31 11:08:12 +02:00
Jonas Jenwald	c40df8a393	Make `Type1Font` more class-like, by adding closure Note: Ignoring whitespace should simplify reviewing a great deal.	2016-03-31 11:00:27 +02:00
Brendan Dahl	df7afcf004	Merge pull request #7053 from yurydelendik/rm-pdfjs-core Removes global PDFJS usage from the src/core/.	2016-03-25 13:19:43 -07:00
Yury Delendik	bda5e6235e	Removes global PDFJS usage from the src/core/.	2016-03-23 19:24:37 -05:00
Jonas Jenwald	d78fae0181	Ensure that TrueType font tables have `uint32` checksums According to "The table directory" under https://developer.apple.com/fonts/TrueType-Reference-Manual/RM06/Chap6.html#Directory, TrueType font tables should have `uint32` checksums. This is something that I noticed, and was initially confused about, while debugging a TrueType issue. As far as I can tell, the current (`int32`) checksums we use doesn't cause any issues in practice. However, I do think that this should be addressed to agree with the specification, and to reduce possible confusion when reading the font code.	2016-03-22 13:40:50 +01:00
Manas	f6d28ca323	Refactors CMapFactory.create to make it async	2016-03-21 23:08:19 +05:30
Jonas Jenwald	cd2bd057ab	Refactor the building of `toFontChar` for non-embedded fonts Currently there's a lot of duplicate code for non-embedded `toFontChar`, which this patch simplifies by extracting the code into a helper function instead.	2016-03-10 21:25:39 +01:00
Jonas Jenwald	dfe9015a43	Convert `uniXXXX` glyph names to proper ones when building the `charCodeToGlyphId` map for TrueType fonts (bug 1132849, issue 6893, issue 6894) This patch adds a `getUnicodeForGlyph` helper function, which is used to recover Unicode values for non-standard glyph names. Some PDF generators, e.g. Scribus PDF, use improper `uniXXXX` glyph names which breaks the glyph mapping. We can avoid this by converting them to "standard" glyph names instead. Fixes https://bugzilla.mozilla.org/show_bug.cgi?id=1132849. Fixes 6893. Fixes 6894.	2016-03-09 19:37:15 +01:00
Preetham Mysore	be1e12dbcb	Fix for descent calculation while reading font hhea headers	2016-03-03 08:51:41 -05:00
Jonas Jenwald	8402c79171	Merge pull request #7050 from brendandahl/issue4402 For CIDFontType2 use CID as glyph ID when missing CID to GID map.	2016-03-02 10:11:42 +01:00
Brendan Dahl	a6acf74b54	Merge pull request #7023 from brendandahl/issue6721 Only draw glyphs on canvas if they are in the font or the font file is missing.	2016-03-01 18:03:37 -08:00
Brendan Dahl	6e1d131384	For CIDFontType2 use CID as glyph ID when missing CID to GID map.	2016-03-01 17:05:33 -08:00
Brendan Dahl	ff87f3fb86	Only draw glyphs on canvas if they are in the font or the font file is missing.	2016-03-01 13:24:58 -08:00
Jonas Jenwald	505f15f221	Avoid accidentally getting the entire font file in `readNameTable` (issue 7020) In the PDF file in question, some of the 'name' table entries have `record.length === 0`. This becomes problematic in the non-unicode case, since `font.getBytes(0)` will fetch the entire stream. Given that OTS rejects 'name' entries larger than `2^16`, this thus explain the sanitizer errors. Fixes 7020.	2016-03-01 21:59:49 +01:00
Tim van der Meij	02b161d432	Merge pull request #6933 from brendandahl/faster-decrypt Make type 1 font program decryption faster.	2016-02-09 23:41:22 +01:00
Brendan Dahl	02331f6e33	Make type 1 font program decryption faster. Discard the values first so we don't have to slice the array.	2016-01-29 11:10:30 -08:00
Yury Delendik	2edf2792dc	Replaces literal {} created lookup tables with Object.create	2016-01-28 12:18:38 -06:00
Yury Delendik	55a201d92d	Lazify NormalizedUnicodes	2016-01-28 11:56:42 -06:00
Yury Delendik	d0738d7e24	Lazify stdFontMap, serifFonts, GlyphMapForStandardFonts	2016-01-28 11:51:54 -06:00
Yury Delendik	1a9a665adf	Refactor Encodings	2016-01-28 11:32:59 -06:00
Yury Delendik	4ef20de429	Lazify GlyphsUnicode.	2016-01-28 11:32:59 -06:00
Yury Delendik	0aa373cdf3	Merge pull request #6891 from Snuffleupagus/issue-6889 Map missing glyphs to the `notdef` glyph for TrueType (3, 1) fonts regardless if the 'post' table is defined or not (issue 6889)	2016-01-20 13:14:47 -06:00
Jonas Jenwald	4855d4cc9f	Map missing glyphs to the `notdef` glyph for TrueType (3, 1) fonts regardless if the 'post' table is defined or not (issue 6889)	2016-01-17 22:58:00 +01:00
Jonas Jenwald	d52495a9c8	[TrueType] Recover from a missing "glyf" table by replacing it with dummy data, utilizing the existing code in `sanitizeGlyphLocations` It seems to be fairly common for OCR software to include incomplete TrueType fonts, notable missing the "glyf" table, in PDF files. Since we currently reject such fonts, the result is that text-selection/copying is broken. This patch contains a suggested approach to try and use these kind of broken fonts, by using existing code in `sanitizeGlyphLocations` to replace a missing "glyf" table with dummy data. Fixes 4684. Fixes 6007. Fixes 6829.	2016-01-15 21:44:59 +01:00
Jonas Jenwald	896e390285	Check that CIDFontType0 fonts does not actually contain OpenType font files (issue 6782) This patch follows a similar idea as PR 5756. The patch is based on the nice debugging done by Brendan in the referenced issue 6782. A better way to handle this, and similar issues, would probably be to completely ignore what the PDF file claims about font type/subtype, and just check the actual data. But until that kind of rewrite happens, this patch should help. Fixes 6782.	2016-01-06 02:19:02 +01:00
Brendan Dahl	eb7c36beb6	Add validation for callsubr and callgsubr for type 2 charstrings.	2016-01-05 09:54:25 -08:00
Yury Delendik	6b60c8f4db	Adds UMD headers to core, display and shared files.	2015-12-15 13:24:39 -06:00
Jonas Jenwald	ee0d522187	Use `adjustWidths` for TrueType fonts if we handle them as OpenType (issue 5027, issue 5084, issue 6556, bug 1204903) In `Font_checkAndRepair` we can decide that a font isn't TrueType, and instead parse it as CFF. In that case it's quite possible that the `fontMatrix` will be changed, and without calling `adjustWidths` we're failing to update the glyph widths correctly. Fixes 5027. Fixes 5084. Fixes 6556. Fixes https://bugzilla.mozilla.org/show_bug.cgi?id=1204903.	2015-12-08 00:49:22 +01:00
Jonas Jenwald	4810b7b8fc	Fix the `charCodeOf` method in `IdentityToUnicodeMap` in order to prevent text selection from breaking After PR 6590, `font.spaceWidth` is now called in more cases than before (in `PartialEvaluator_getTextContent`), which exposed an underlying issue with `IdentityToUnicodeMap_charCodeOf` throwing an error. This breaks text-selection in some PDF files found in the wild, hence this patch replaces the `error` with an actual function instead (modelled after `IdentityCMap_charCodeOf`).	2015-12-05 13:15:55 +01:00
Brendan Dahl	87762afec4	Remove glyph id's outside the range of valid glyphs. OTS does not like invalid glyph ids in a camp table.	2015-12-03 11:53:06 -08:00
Manas	a2ba1b8189	Uses editorconfig to maintain consistent coding styles Removes the following as they unnecessary /* -- Mode: Java; tab-width: 2; indent-tabs-mode: nil; c-basic-offset: 2 -- / / vim: set shiftwidth=2 tabstop=2 autoindent cindent expandtab: */	2015-11-14 07:32:18 +05:30
Jonas Jenwald	ff64ef0243	Prevent `readCmapTable` from failing if the `cmap` is missing in TrueType fonts Fixes http://arrow.dit.ie/cgi/viewcontent.cgi?article=1000&context=aaschadpoth#page=3.	2015-11-08 16:48:37 +01:00
Yury Delendik	cc5bc18728	Fixes incorrect PDF file font metrics.	2015-11-06 14:47:10 -06:00
Yury Delendik	fa46b73c47	Better spacing in text layer.	2015-11-02 08:54:15 -06:00

1 2 3 4 5