You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pdfbox.apache.org by "Arjohn Kampman (JIRA)" <ji...@apache.org> on 2013/10/28 13:36:34 UTC

[jira] [Updated] (PDFBOX-1607) StringIndexOutOfBoundsException in PDFParser

     [ https://issues.apache.org/jira/browse/PDFBOX-1607?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Arjohn Kampman updated PDFBOX-1607:
-----------------------------------

    Attachment: pdfbox-1607-fix.patch

> StringIndexOutOfBoundsException in PDFParser
> --------------------------------------------
>
>                 Key: PDFBOX-1607
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-1607
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Parsing
>    Affects Versions: 1.8.1
>         Environment: Windows 7, JRE 1.7.0_15-b03
>            Reporter: Alex Alishevskikh
>         Attachments: pdfbox-1607-fix.patch, pdf-govdocs-036902.pdf, pdf-govdocs-107566.pdf
>
>
> I have few test files parsed fine in PDFBox 1.7.1 but not in 1.8.1:
> java.lang.StringIndexOutOfBoundsException: String index out of range: 2047
>      at java.lang.AbstractStringBuilder.deleteCharAt(AbstractStringBuilder.java:762)
>      at java.lang.StringBuilder.deleteCharAt(StringBuilder.java:258)
>      at org.apache.pdfbox.pdfparser.BaseParser.parseCOSHexString(BaseParser.java:1000)
>      at org.apache.pdfbox.pdfparser.BaseParser.parseCOSString(BaseParser.java:808)
>      at org.apache.pdfbox.pdfparser.BaseParser.parseDirObject(BaseParser.java:1241)
>      at org.apache.pdfbox.pdfparser.PDFParser.parseObject(PDFParser.java:558)
>      at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:188)



--
This message was sent by Atlassian JIRA
(v6.1#6144)