Sakurai/pdf.js - pdf.js - Gitea on kemo

Sakurai/pdf.js

Author	SHA1	Message	Date
Jonas Jenwald	50c2856097	Move `EOF`/`isEOF` from core/parser.js to core/primitives.js Given the nature of `EOF` and `isEOF`, it seems to me that they really ought to be placed in `core/primitives.js` instead. In general, it doesn't seem great to have to depend on the entire `core/parser.js` file for such simple primitives/helper functions. In particular, while `core/ps_parser.js` is completely separate from `core/parser.js` with regards to its function, it still depends on the latter for just one primitive. Note that compared to e.g. PR 7389, this will not reduce the number of dependencies for `core/ps_parser`, however the new dependency IMHO makes more sense.	2017-01-27 13:37:48 +01:00
Jonas Jenwald	90d19de935	Catch errors and continue parsing in `parseCMap` (issue 7492) After PR 7039, the PDF file in issue 7492 no longer renders at all, but note that text selection wasn't working correctly previously. The problem with the PDF file in issue 7492 is that the `cMap`, in the `toUnicode` entry in the font, contains an invalid name: ``` /CMapName /-usr-share-fonts-truetype-Panton-Panton Family-Fontfabric - Panton.otf,000-UTF16 def ``` When we parse that line, things obviously break because there are spaces present in the wrong places. To avoid that issue, the patch simply lets `parseCMap` continue when errors are encountered, to try and recover usable data. Note that by not aborting immediatly when an error is encountered, we are also able to fix the text selection. Obviously, it could be argued that we should just immediatly reject a corrupt `cMap`. But given that they usually are correct, it seems that trying to recover as much data as possible from corrupt one can only be a good thing for both glyph mapping and text selection. Fixes 7492.	2016-07-18 16:39:56 +02:00
Yury Delendik	bda5e6235e	Removes global PDFJS usage from the src/core/.	2016-03-23 19:24:37 -05:00
Manas	f6d28ca323	Refactors CMapFactory.create to make it async	2016-03-21 23:08:19 +05:30
Yury Delendik	6b60c8f4db	Adds UMD headers to core, display and shared files.	2015-12-15 13:24:39 -06:00
Manas	a2ba1b8189	Uses editorconfig to maintain consistent coding styles Removes the following as they unnecessary /* -- Mode: Java; tab-width: 2; indent-tabs-mode: nil; c-basic-offset: 2 -- / / vim: set shiftwidth=2 tabstop=2 autoindent cindent expandtab: */	2015-11-14 07:32:18 +05:30
Jonas Jenwald	8d831449ab	Right-size the `map` array in PartialEvaluator_readToUnicode We can avoid a lot of intermediate resizings, by directly allocating the required number of elements for the `map` array.	2015-09-24 13:08:53 +02:00
Jonas Jenwald	7c7d05e7a3	Attempt to infer if a CMap file actually contains just a standard `Identity-H`/`Identity-V` map	2015-04-25 11:28:33 +02:00
Jonas Jenwald	9ef0d0b878	Fix the error handling for CMaps that fail to load	2014-08-14 16:29:10 +02:00
Nicholas Nethercote	61e6b576d4	Avoid an allocation in readCharCode(). readCharCode() returns two values, and currently allocates a length-2 array on every call to do so. This change makes it instead us a passed-in object which can be reused. This tiny change reduces the total JS allocations done for the document in Mozilla bug 992125 by 4.2%.	2014-08-12 16:12:58 -07:00
Yury Delendik	57860149e9	Merge pull request #5135 from nnethercote/identity-cmap-proper Make IdentityCMaps more compact.	2014-08-06 09:11:08 -05:00
Yury Delendik	682b93ac9e	Fixes lint errors	2014-08-05 21:55:59 -05:00
Yury Delendik	46a9a35ddc	Merge pull request #5071 from nnethercote/font-savings Optimize a font-heavy document	2014-08-05 18:57:46 -05:00
Nicholas Nethercote	51055e5836	Make IdentityCMaps more compact. IdentityCMap uses an array to represent a 16-bit unsigned identity function. This is very space-inefficient, and some files cause multiple IdentityCMaps to be instantiated (e.g. the one from #4580 has 74). This patch make the representation implicit. When loading the PDF from issue #4580, this change reduces peak RSS from ~370 to ~280 MiB. It also improves overall speed on that PDF by ~30%, going from 522 ms to 366 ms.	2014-08-05 03:01:39 -07:00
Nicholas Nethercote	adf58ed687	Represent cid chars using integers, not strings. cid chars are 16-bit unsigned integers. Currently we convert them to single-char strings when inserting them into the CMap, and then convert them back to integers when extracting them from the CMap. This patch changes CMap so that cid chars stay in integer format throughout, saving both time and space. When loading the PDF from issue #4580, this change reduces peak RSS from ~600 to ~370 MiB. It also improves overall speed on that PDF by ~26%, going from 724 ms to 533 ms.	2014-08-01 02:35:17 -07:00
Nicholas Nethercote	28687bca75	Optimize CMap.prototype.forEach(). This change avoids the element stringification caused by for..in for the vast majority of CMaps. When loading the PDF from issue #4580, this change reduces peak RSS from ~650 to ~600 MiB, and improves overall speed by ~20%, from 902 ms to 713 ms. Other CMap-heavy documents will also see improvements.	2014-07-30 06:28:47 -07:00
Nicholas Nethercote	b86daed29d	Make CMap.map quasi-private. This makes it easier for the representation to be improved.	2014-07-30 06:26:35 -07:00
Nicholas Nethercote	501446ccc4	Optimize common cases in hexToStr(). This avoids the creation of over two million array objects when viewing http://www.dynacw.co.jp/Portals/3/fontsamplepdf/sample_4942546800828.pdf, and reduces load time from 76 to 73 ms.	2014-07-22 23:26:03 -07:00
Jonas Jenwald	04975acceb	Prevent CMapFactory.create from failing by passing the necessary parameters from PartialEvaluator_readToUnicode (issue 5010)	2014-06-27 00:46:16 +02:00
Jonas Jenwald	d1c71ab7ad	Prevent adding undefined array entries to CMap.map in mapRangeToArray (issue 4875)	2014-06-02 14:29:54 +02:00
Tim van der Meij	df91acf239	Fixes lint warning W004 in src/core	2014-04-11 00:41:08 +02:00
Yury Delendik	69efd9cb96	CMaps binary packing	2014-03-14 16:46:35 -05:00
Brendan Dahl	b5b94a4af3	Use built in CMaps and unify the glyph mapping.	2014-02-11 10:27:09 -08:00
Brendan Dahl	f32e65b19f	Read multi-byte character codes based on codespace ranges.	2013-09-25 10:32:04 -07:00