Posted to dev@pdfbox.apache.org by "Andreas Lehmkühler (JIRA)" <ji...@apache.org> on 2017/10/03 20:29:00 UTC

[jira] [Resolved] (PDFBOX-3947) ArrayIndexOutOfBoundsException in bfSearchForObjStreams

     [ https://issues.apache.org/jira/browse/PDFBOX-3947?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Andreas Lehmkühler resolved PDFBOX-3947.
----------------------------------------
       Resolution: Fixed
    Fix Version/s: 3.0.0
                   2.0.8

One of the object streams is broken (it has a lot of null bytes in the middle), and the brute-force search stumbled over it. After skipping the broken stream, the PDF renders fine, at least the parts of it that could be repaired.
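
For illustration only, here is a minimal sketch of the general idea of skipping a broken object stream instead of failing on it. This is not the actual PDFBox change: the class ObjStreamHeaderSketch and the method parseHeader are hypothetical names, and the sketch assumes the decoded stream header is a whitespace-separated list of "object number, offset" pairs and that the expected number of objects is already known to the caller.

```java
import java.nio.charset.StandardCharsets;
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical helper, not part of PDFBox.
final class ObjStreamHeaderSketch {

    /**
     * Parses "objNr offset objNr offset ..." from a decoded object stream header.
     * Returns null when the header is damaged, so the caller can skip the stream
     * rather than run past the end of the token array.
     */
    static Map<Long, Integer> parseHeader(byte[] header, int expectedObjects) {
        String text = new String(header, StandardCharsets.ISO_8859_1).trim();
        String[] numbers = text.split("\\s+");
        // Guard against truncated headers before indexing into the array.
        if (expectedObjects < 0 || numbers.length < 2L * expectedObjects) {
            return null;
        }
        Map<Long, Integer> offsets = new LinkedHashMap<>();
        try {
            for (int i = 0; i < expectedObjects; i++) {
                long objNr = Long.parseLong(numbers[2 * i]);
                int offset = Integer.parseInt(numbers[2 * i + 1]);
                offsets.put(objNr, offset);
            }
        } catch (NumberFormatException e) {
            // Null bytes or other garbage where a number was expected.
            return null;
        }
        return offsets;
    }

    public static void main(String[] args) {
        byte[] good = "10 0 11 25".getBytes(StandardCharsets.ISO_8859_1);
        byte[] broken = "10 0 \0\0\0 11".getBytes(StandardCharsets.ISO_8859_1);
        System.out.println(parseHeader(good, 2));   // prints {10=0, 11=25}
        System.out.println(parseHeader(broken, 2)); // prints null
    }
}
```

A caller in a brute-force scan would treat a null result as "skip this stream and continue with the next one", which matches the behaviour described above: the damaged stream is ignored and the rest of the file is still recovered.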

> ArrayIndexOutOfBoundsException in bfSearchForObjStreams
> -------------------------------------------------------
>
>                 Key: PDFBOX-3947
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-3947
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Parsing
>    Affects Versions: 2.0.8
>            Reporter: Tilman Hausherr
>            Assignee: Andreas Lehmkühler
>              Labels: regression
>             Fix For: 2.0.8, 3.0.0
>
>         Attachments: 670064.pdf
>
>
> {code}
> java.lang.ArrayIndexOutOfBoundsException: 184
>     org.apache.pdfbox.pdfparser.COSParser.bfSearchForObjStreams(COSParser.java:1766)
>     org.apache.pdfbox.pdfparser.COSParser.bfSearchForObjects(COSParser.java:1529)
>     org.apache.pdfbox.pdfparser.COSParser.getBFCOSObjectOffsets(COSParser.java:1445)
>     org.apache.pdfbox.pdfparser.COSParser.checkXrefOffsets(COSParser.java:1379)
>     org.apache.pdfbox.pdfparser.COSParser.parseXref(COSParser.java:368)
>     org.apache.pdfbox.pdfparser.COSParser.retrieveTrailer(COSParser.java:214)
>     org.apache.pdfbox.pdfparser.PDFParser.initialParse(PDFParser.java:189)
>     org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:239)
>     org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:1017)
>     org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:955)
>     org.apache.pdfbox.debugger.PDFDebugger.parseDocument(PDFDebugger.java:1408)
>     org.apache.pdfbox.debugger.PDFDebugger.readPDFFile(PDFDebugger.java:1326)
>     org.apache.pdfbox.debugger.PDFDebugger.readPDFFile(PDFDebugger.java:1309)
>     org.apache.pdfbox.debugger.PDFDebugger.main(PDFDebugger.java:275)
>     org.apache.pdfbox.tools.PDFBox.main(PDFBox.java:72)
> {code}
> This worked with 2.0.7. It's the only file from the recent test by Tim.


