Commit Graph

93 Commits

Author SHA1 Message Date
c1fd18f355 PdfTextExtractor: Fix text size calculation 2016-09-07 09:06:20 +02:00
c0a8de2617 Merge branch 'master' of https://github.com/Kableado/VAR.PdfTools 2016-09-06 17:42:55 +02:00
4d92f144f8 PdfParser: Parse inline images 2016-09-06 17:42:13 +02:00
c388e9daae Fixes on project files to be compatible with Monodevelop 2016-07-04 07:15:09 +02:00
29e49546fa Bump version 1.1 1_1 2016-06-27 12:44:43 +02:00
d46f8d2abe PdfTextExtractor: New method HasText to determine if there is some text in the document. 2016-06-27 12:20:24 +02:00
7d9b7981a8 PdfTextExtractor: Fix a typo 2016-06-27 09:45:28 +02:00
9b2310ea96 Create README.md 2016-06-27 08:37:59 +02:00
58c09c1110 PdfDocument: Move filter processing to PdfFilters. 2016-06-25 14:52:21 +02:00
241b68bd0a PdfTextExtractor: Allow exact match searching text. Used for column and field search. 2016-06-24 17:45:28 +02:00
85d998a8d3 PdfTextExtractor: Better column and field extraction heuristics 2016-06-24 17:37:19 +02:00
da908d0f36 PdfTextExtractor: Group PdfTextElement when there is no position change 2016-06-24 15:36:00 +02:00
ed03166f6f PdfFont: Keep font name 2016-06-24 15:35:08 +02:00
17001d6c8a Add support to .Net 3.5 and VisualStudio 2010 2016-06-23 08:51:37 +02:00
35ec8b478f ParseNumber: Use InvariantCulture to avoid errors on double type number conversions 2016-06-23 08:41:50 +02:00
83b69e3f49 PdfParser: Get decimal separator on creation 1_0 2016-06-21 18:04:21 +02:00
8f12b13a9f PdfExtractor: GetColumn and GetField for easy data exploration 2016-06-21 17:35:35 +02:00
5e96ee22d8 Robust stream parsing 2016-06-21 16:33:23 +02:00
b1eecbf902 Standard 14 Fonts 2016-06-21 15:24:19 +02:00
baea8aabc2 PdfTextExtractor: Calculate PdfTextElement height 2016-06-21 15:19:39 +02:00
44f6cb3d0c PdfTextExtractor: Keep basic graphics state for correct matrix calculations 2016-06-21 15:17:31 +02:00
c41c541149 ParseObject: Fix "xref" skipping 2016-06-21 01:31:26 +02:00
75582de5e4 PdfDocumentPage: Fix resource inheritance 2016-06-21 01:31:06 +02:00
22e640a02c FrmPdfInfo: Show page changes 2016-06-20 18:35:27 +02:00
bbafe91037 PdfTextElement: Extract text elements with coordinates and size 2016-06-20 18:23:45 +02:00
d568cbd74b PdfFont: Character widths 2016-06-20 17:10:29 +02:00
c4dc8bf9c3 Fix "xref" skipping 2016-06-20 14:26:44 +02:00
63b50a8198 Parse Fonts ToUnicode 2016-06-20 11:40:15 +02:00
6e8f58e2e1 Fixes on hexstring parsing of ParseString 2016-06-20 11:39:10 +02:00
3b0bdf8a66 PdfDocumentPage: Parse content actions 2016-06-20 01:15:22 +02:00
e3b1f9c2b2 PdfParser: Fixes on ParseToken, ParseNumber, ParseNumberOrReference and ParseString 2016-06-20 01:14:06 +02:00
dff6910b4d Placeholders for the rest of the stream filters 2016-06-19 15:29:37 +02:00
95d748be48 FrmPdfInfo: Changes to ease information showing 2016-06-19 15:29:05 +02:00
8ffa5db9f1 Extract Pages information 2016-06-19 15:22:42 +02:00
5a68559cc2 Resolve all indirect object references 2016-06-19 13:08:52 +02:00
e0ec1c0f5c Move utility methods from PdfStream class to PdfDictionary. 2016-06-19 03:49:05 +02:00
142241e791 Parse ObjectStreams 2016-06-19 03:29:59 +02:00
af5644758c Apply FlateDecode filter to streams 2016-06-18 13:31:39 +02:00
0f575df7ce FrmPdfInfo: Fix alignment of buttons 2016-06-18 13:31:06 +02:00
a5b5b97e7b ParseObject: Fix stream parsing, detecting "endstream" when it is unavailable on the dictionary (forward references or missing) 2016-06-18 02:42:15 +02:00
c9370080b1 Fixes on ParseNull 2016-06-18 01:25:27 +02:00
1dcf5f6f4c Fixes on ParseString, ParseName and ParseObject 2016-06-17 22:48:57 +02:00
973bfe3fe2 Basic PDF file structure parsing 2016-06-17 22:24:08 +02:00