You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pdfbox.apache.org by "Andreas Lehmkühler (JIRA)" <ji...@apache.org> on 2017/03/12 18:34:04 UTC

[jira] [Comment Edited] (PDFBOX-3629) "expected number, actual=COSString" on a valid document

    [ https://issues.apache.org/jira/browse/PDFBOX-3629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15906629#comment-15906629 ] 

Andreas Lehmkühler edited comment on PDFBOX-3629 at 3/12/17 6:34 PM:
---------------------------------------------------------------------

the attached pdf is malformed, some of the data was erased and/or replaced with null bytes. As a first step I've fixed an issue with {{BaseParser#parseCOSDictionaryValue}} as it triggers a wrong exception. Now the real exception is thrown.


was (Author: lehmi):
he attached pdf is malformed, some of the data was erased and/or replaced with null bytes. As a first step I've fixed an issue with {{BaseParser#parseCOSDictionaryValue}} as it triggers a wrong exception. Now the real exception is thrown.

> "expected number, actual=COSString" on a valid document
> -------------------------------------------------------
>
>                 Key: PDFBOX-3629
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-3629
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Parsing
>    Affects Versions: 2.0.3
>         Environment: Windows 7 x64, JVM 1.8.0_101
>            Reporter: Seva Alekseyev
>         Attachments: Book#4 - O'Reilly - JavaScript The Definitive Guide 2ed.pdf
>
>
> On the attached document, which opens in Adobe Reader, PDDocument.load() throws an error:
> java.io.IOException: expected number, actual=COSString{file:///C|/Oreilly Unix etc/O'Reilly Reference Library/web/cgi/index.html} at offset 845803
> 	at org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionaryValue(BaseParser.java:165)
> 	at org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionaryNameValuePair(BaseParser.java:277)
> 	at org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionary(BaseParser.java:210)
> 	at org.apache.pdfbox.pdfparser.BaseParser.parseDirObject(BaseParser.java:885)
> 	at org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionaryValue(BaseParser.java:153)
> 	at org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionaryNameValuePair(BaseParser.java:277)
> 	at org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionary(BaseParser.java:210)
> 	at org.apache.pdfbox.pdfparser.BaseParser.parseDirObject(BaseParser.java:885)
> 	at org.apache.pdfbox.pdfparser.COSParser.parseFileObject(COSParser.java:772)
> 	at org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:741)
> 	at org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:672)
> 	at org.apache.pdfbox.pdfparser.COSParser.parseDictObjects(COSParser.java:632)
> 	at org.apache.pdfbox.pdfparser.PDFParser.initialParse(PDFParser.java:217)
> 	at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:252)
> 	at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:966)
> 	at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:922)
> 	at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:870)



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@pdfbox.apache.org
For additional commands, e-mail: dev-help@pdfbox.apache.org