You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pdfbox.apache.org by "Nico Prenzel (JIRA)" <ji...@apache.org> on 2019/01/17 11:21:00 UTC
[jira] [Comment Edited] (PDFBOX-4426) Not parsable pdf document
[ https://issues.apache.org/jira/browse/PDFBOX-4426?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16744908#comment-16744908 ]
Nico Prenzel edited comment on PDFBOX-4426 at 1/17/19 11:20 AM:
----------------------------------------------------------------
The pdf document seems to be produced/created with
Microsoft® Word 2010
macOS Version 10.14.2 (Build 18C54) Quartz PDFContext
Foxit and Adobe Reader are capable to display it.
See attached file for the area around the offset:
Thanks. !PDFBox Bug - PDFBOX-4426.png!
With PDFDebugger in eclipse debugger mode i've changed the value to an COSInteger and the document is shown. So it seems to be "only" that part is strange.
was (Author: nico.prenzel):
The pdf document seems to be produced/created with
Microsoft® Word 2010
macOS Version 10.14.2 (Build 18C54) Quartz PDFContext
Foxit and Adobe Reader are capable to display it.
See attached file for the area around the offset:
Thanks. !PDFBox Bug - PDFBOX-4426.png!
> Not parsable pdf document
> -------------------------
>
> Key: PDFBOX-4426
> URL: https://issues.apache.org/jira/browse/PDFBOX-4426
> Project: PDFBox
> Issue Type: Bug
> Components: Parsing
> Affects Versions: 2.0.13
> Reporter: Nico Prenzel
> Priority: Minor
> Attachments: PDFBox Bug - PDFBOX-4426.png
>
>
> I've got another not parsable pdf document from our customers.
> Unfortunately, i'am not allowed to post the pdf document, this time.
> Pherhaps the stacktrace is sufficient to fix the parsing...
> IOException expected number, actual=COSFloat\{18446744073177568688} at offset 693140
> org.apache.pdfbox.pdfparser.BaseParser parseCOSDictionaryValue: 166
> org.apache.pdfbox.pdfparser.BaseParser parseCOSDictionaryNameValuePair: 279
> org.apache.pdfbox.pdfparser.BaseParser parseCOSDictionary: 212
> org.apache.pdfbox.pdfparser.BaseParser parseDirObject: 864
> org.apache.pdfbox.pdfparser.COSParser parseFileObject: 904
> org.apache.pdfbox.pdfparser.COSParser parseObjectDynamically: 873
> org.apache.pdfbox.pdfparser.COSParser parseObjectDynamically: 793
> org.apache.pdfbox.pdfparser.COSParser parseDictObjects: 753
> org.apache.pdfbox.pdfparser.PDFParser initialParse: 187
> org.apache.pdfbox.pdfparser.PDFParser parse: 226
> org.apache.pdfbox.pdmodel.PDDocument load: 1200
> org.apache.pdfbox.pdmodel.PDDocument load: 1097
> vlh.Tools.PDF.PDFBoxUtil$1 run: 148
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@pdfbox.apache.org
For additional commands, e-mail: dev-help@pdfbox.apache.org