You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pdfbox.apache.org by "Nico Prenzel (JIRA)" <ji...@apache.org> on 2019/01/17 11:21:00 UTC

[jira] [Comment Edited] (PDFBOX-4426) Not parsable pdf document

    [ https://issues.apache.org/jira/browse/PDFBOX-4426?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16744908#comment-16744908 ] 

Nico Prenzel edited comment on PDFBOX-4426 at 1/17/19 11:20 AM:
----------------------------------------------------------------

The pdf document seems to be produced/created with

Microsoft® Word 2010
 macOS Version 10.14.2 (Build 18C54) Quartz PDFContext

Foxit and Adobe Reader are capable to display it.

See attached file for the area around the offset:

Thanks. !PDFBox Bug - PDFBOX-4426.png!

 

With PDFDebugger in eclipse debugger mode i've changed the value to an COSInteger and the document is shown. So it seems to be "only" that part is strange.


was (Author: nico.prenzel):
The pdf document seems to be produced/created with

Microsoft® Word 2010
macOS Version 10.14.2 (Build 18C54) Quartz PDFContext

Foxit and Adobe Reader are capable to display it.

See attached file for the area around the offset:

Thanks. !PDFBox Bug - PDFBOX-4426.png!

> Not parsable pdf document
> -------------------------
>
>                 Key: PDFBOX-4426
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-4426
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Parsing
>    Affects Versions: 2.0.13
>            Reporter: Nico Prenzel
>            Priority: Minor
>         Attachments: PDFBox Bug - PDFBOX-4426.png
>
>
> I've got another not parsable pdf document from our customers.
> Unfortunately, i'am not allowed to post the pdf document, this time.
> Pherhaps the stacktrace is sufficient to fix the parsing...
> IOException expected number, actual=COSFloat\{18446744073177568688} at offset 693140
> org.apache.pdfbox.pdfparser.BaseParser parseCOSDictionaryValue: 166
> org.apache.pdfbox.pdfparser.BaseParser parseCOSDictionaryNameValuePair: 279
> org.apache.pdfbox.pdfparser.BaseParser parseCOSDictionary: 212
> org.apache.pdfbox.pdfparser.BaseParser parseDirObject: 864
> org.apache.pdfbox.pdfparser.COSParser parseFileObject: 904
> org.apache.pdfbox.pdfparser.COSParser parseObjectDynamically: 873
> org.apache.pdfbox.pdfparser.COSParser parseObjectDynamically: 793
> org.apache.pdfbox.pdfparser.COSParser parseDictObjects: 753
> org.apache.pdfbox.pdfparser.PDFParser initialParse: 187
> org.apache.pdfbox.pdfparser.PDFParser parse: 226
> org.apache.pdfbox.pdmodel.PDDocument load: 1200
> org.apache.pdfbox.pdmodel.PDDocument load: 1097
> vlh.Tools.PDF.PDFBoxUtil$1 run: 148



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@pdfbox.apache.org
For additional commands, e-mail: dev-help@pdfbox.apache.org