You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pdfbox.apache.org by "Tilman Hausherr (JIRA)" <ji...@apache.org> on 2019/02/19 04:44:00 UTC

[jira] [Comment Edited] (PDFBOX-4467) java.io.IOException: expected number, actual=COSFloat{18446744073226773920} at offset 11398

    [ https://issues.apache.org/jira/browse/PDFBOX-4467?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16771559#comment-16771559 ] 

Tilman Hausherr edited comment on PDFBOX-4467 at 2/19/19 4:43 AM:
------------------------------------------------------------------

The "=" is part of the exception message.

The file has a lot of entries like this in the structure tree:

/P 42 0 R /Pg 18446744073226773584 0 R

"18446744073226773584" is not an object number. These are much shorter. Something went wrong with the software that created / manipulated this file.

What you could do is to use NOTEPAD++ to search for "/StructTreeRoot 39 0 R", then overwrite that one with blanks. However you'll lose the tagging information, so the form won't be filled by vision impaired people.

Or get the original form here: https://www.irs.gov/pub/irs-pdf/fw4.pdf


was (Author: tilman):
The file has a lot of entries like this in the structure tree:

/P 42 0 R /Pg 18446744073226773584 0 R

"18446744073226773584" is not an object number. These are much shorter. Something went wrong with the software that created / manipulated this file.

What you could do is to use NOTEPAD++ to search for "/StructTreeRoot 39 0 R", then overwrite that one with blanks. However you'll lose the tagging information, so the form won't be filled by vision impaired people.

Or get the original form here: https://www.irs.gov/pub/irs-pdf/fw4.pdf

> java.io.IOException: expected number, actual=COSFloat{18446744073226773920} at offset 11398
> -------------------------------------------------------------------------------------------
>
>                 Key: PDFBOX-4467
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-4467
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Parsing
>    Affects Versions: 2.0.13
>            Reporter: steve neo
>            Priority: Major
>         Attachments: pdnekz1gvl7.pdf
>
>
> My PDF has an IOException when using PDDocument.load().  In my system, it reports "actual\u003COSFloat\{18..}".  Note that "=" is an unicode character. I am not very sure if this the root cause or that equal sign shouldn't be there. 
> expected number, actual\u003dCOSFloat\{18446744073226773920} at offset 11398"
>  
> The whole exception stack is:
> java.io.IOException: expected number, actual=COSFloat\{18446744073226773920} at offset 11398
>  at org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionaryValue(BaseParser.java:166)
>  at org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionaryNameValuePair(BaseParser.java:279)
>  at org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionary(BaseParser.java:212)
>  at org.apache.pdfbox.pdfparser.BaseParser.parseDirObject(BaseParser.java:864)
>  at org.apache.pdfbox.pdfparser.COSParser.parseFileObject(COSParser.java:904)
>  at org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:873)
>  at org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:793)
>  at org.apache.pdfbox.pdfparser.COSParser.parseDictObjects(COSParser.java:753)
>  at org.apache.pdfbox.pdfparser.PDFParser.initialParse(PDFParser.java:187)
>  at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:226)
>  at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:1068)
>  at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:1024)
>  at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:972)



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@pdfbox.apache.org
For additional commands, e-mail: dev-help@pdfbox.apache.org