You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pdfbox.apache.org by "Maruan Sahyoun (JIRA)" <ji...@apache.org> on 2013/04/06 17:35:15 UTC

[jira] [Closed] (PDFBOX-230) NullPointerException when extracting text

     [ https://issues.apache.org/jira/browse/PDFBOX-230?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Maruan Sahyoun closed PDFBOX-230.
---------------------------------

    Resolution: Cannot Reproduce

unable to reproduce as the pdf file in question is no longer available. 
                
> NullPointerException when extracting text
> -----------------------------------------
>
>                 Key: PDFBOX-230
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-230
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Text extraction
>            Priority: Minor
>
> [imported from SourceForge]
> http://sourceforge.net/tracker/index.php?group_id=78314&atid=552832&aid=1630272
> Originally submitted by bowzie on 2007-01-07 20:17.
> I got the following exception when stripping a PDF 
>  
> java.lang.NullPointerException 
> at org.pdfbox.pdmodel.PDPageNode.getAllKids(PDPageNode.java:194) 
> at org.pdfbox.pdmodel.PDPageNode.getAllKids(PDPageNode.java:182) 
> at org.pdfbox.pdmodel.PDDocumentCatalog.getAllPages(PDDocumentCatalog.java:162) 
> at org.pdfbox.util.PDFTextStripper.writeText(PDFTextStripper.java:220) 
> at org.pdfbox.util.PDFTextStripper.getText(PDFTextStripper.java:140) 
> ... 
>  
> The PDF is a bit odd in that it appears to have no text, pictures or anything else of any use, but it says it has 8 pages. I can mail it if you want. 
> PDF sent separately.
> [comment on SourceForge]
> Originally sent by jalonsogaleon.
> Logged In: YES 
> user_id=1854910
> Originator: NO
> I have the same problem when extracting text using pdfbox0.7.4
> java.lang.NullPointerException.atorg.pdfbox.pdmodel.PDPageNode.getAllKids(PDPageNode.java:194)
> I can mail pdf if you want
> regards
> [comment on SourceForge]
> Originally sent by brzrkr.
> Logged In: YES 
> user_id=1489602
> Originator: NO
> This happens so often in our app that we installed a NPE exception handler that scans the stack looking for getAllKids, and maps it into an IOException.
> [comment on SourceForge]
> Originally sent by nobody.
> Logged In: NO 
> I am getting the same error. However, it appears to happen only on a Windows machine. Same code is working fine on Linux. I wonder if this has to do anything with the CR/LF differences on the platform.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira