You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pdfbox.apache.org by "Andreas Lehmkühler (JIRA)" <ji...@apache.org> on 2013/03/09 15:25:13 UTC

[jira] [Resolved] (PDFBOX-1535) Extract text from PDF cause Nullpointer Exception in PDFStreamEngine.processEncodedText Method

     [ https://issues.apache.org/jira/browse/PDFBOX-1535?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Andreas Lehmkühler resolved PDFBOX-1535.
----------------------------------------

       Resolution: Fixed
    Fix Version/s: 1.8.0
         Assignee: Andreas Lehmkühler

I don't now which change resolves this issue, but it works well using the current trunk.

Set to resolved
                
> Extract text from PDF cause Nullpointer Exception in PDFStreamEngine.processEncodedText Method
> ----------------------------------------------------------------------------------------------
>
>                 Key: PDFBOX-1535
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-1535
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Text extraction
>    Affects Versions: 1.7.1
>         Environment: jdk 1.7_17
>            Reporter: Alex
>            Assignee: Andreas Lehmkühler
>            Priority: Critical
>             Fix For: 1.8.0
>
>         Attachments: 1.pdf, PDFBOX1535-1.txt
>
>
> The xpdfbin-win-3.03 -> pdftotext.exe works fine with this pdf File.
> Tried pdfbox Version 1.2.1 too, but same error.
> [org.apache.pdfbox.util.PDFStreamEngine] java.lang.NullPointerException
> java.lang.NullPointerException
> 	at org.apache.pdfbox.util.PDFStreamEngine.processEncodedText(PDFStreamEngine.java:357)
> 	at org.apache.pdfbox.util.operator.ShowTextGlyph.process(ShowTextGlyph.java:62)
> 	at org.apache.pdfbox.util.PDFStreamEngine.processOperator(PDFStreamEngine.java:556)
> 	at org.apache.pdfbox.util.PDFStreamEngine.processSubStream(PDFStreamEngine.java:270)
> 	at org.apache.pdfbox.util.PDFStreamEngine.processSubStream(PDFStreamEngine.java:237)
> 	at org.apache.pdfbox.util.PDFStreamEngine.processStream(PDFStreamEngine.java:217)
> 	at org.apache.pdfbox.util.PDFTextStripper.processPage(PDFTextStripper.java:448)
> 	at org.apache.pdfbox.util.PDFTextStripper.processPages(PDFTextStripper.java:372)
> 	at org.apache.pdfbox.util.PDFTextStripper.writeText(PDFTextStripper.java:328)

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira