You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pdfbox.apache.org by "Michael McCandless (JIRA)" <ji...@apache.org> on 2012/05/23 12:16:08 UTC

[jira] [Commented] (PDFBOX-1320) NPE in extractEmbeddedDocuments

    [ https://issues.apache.org/jira/browse/PDFBOX-1320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13281500#comment-13281500 ] 

Michael McCandless commented on PDFBOX-1320:
--------------------------------------------

Good catch Sumuli!  We can also null-check the return from the getNames() method.

Tika's PDFParser has moved back to Tika sources (thanks Jukka!) ... I'll fix this there.

But, separately, we should also fix ExtractText to null check the call to embeddedFiles.getNames()....
                
> NPE in extractEmbeddedDocuments
> -------------------------------
>
>                 Key: PDFBOX-1320
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-1320
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Parsing
>    Affects Versions: 1.7.0
>         Environment: pdfbox 1.7.0 (current trunk)
>            Reporter: Samuli Saarinen
>         Attachments: PDNameTreeNode.java.patch
>
>
> While parsing a pdf document the following exception is thrown:
> java.lang.NullPointerException
> 	at org.apache.pdfbox.tika.PDFParser.extractEmbeddedDocuments(PDFParser.java:155)
> 	at org.apache.pdfbox.tika.PDFParser.parse(PDFParser.java:133)
> 	at test.TikaParse.main(TikaParse.java:27)
> The document I'm trying to parse is probably confidential so I cannot attach it until (or if) I get clearence.  

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira