You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pdfbox.apache.org by "James Wilson (JIRA)" <ji...@apache.org> on 2009/01/20 17:03:03 UTC

[jira] Commented: (PDFBOX-361) NullPointerException in PDPageNode.getAllKids

    [ https://issues.apache.org/jira/browse/PDFBOX-361?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12665450#action_12665450 ] 

James Wilson commented on PDFBOX-361:
-------------------------------------

I have fixed the parser for us.  I'm not sure if my fix is the best it can be or not.  Infact, I'm sure it's not.  We no longer have this problem though.  :)   When a pdf is created in pdf spec 1.5 or greater  there is a possibility that it (the pdf) consists of "cross reference streams" and is "linearized".  When both of these are true there will be 2 "EOF" tags.  After the 1st "EOF" tag will be a long list of "obj" tags until you get the real "EOF" tag.

James


> NullPointerException in PDPageNode.getAllKids
> ---------------------------------------------
>
>                 Key: PDFBOX-361
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-361
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Parsing
>            Reporter: Jukka Zitting
>
> [Issue from SourceForge]
> http://sourceforge.net/tracker/index.php?func=detail&aid=2008371&group_id=78314&atid=552832
> The parser cannot seem to find the Pages object in files created with
> Acrobat Pro 9. A sample file is attached.
> public static void main(String[] argv) throws Exception {
> String name = "./test.pdf";
> PDDocument doc = PDDocument.load(name);
> doc.close();
> PDPageNode root = doc.getDocumentCatalog().getPages();
> ArrayList<PDPage> pages = new ArrayList<PDPage>();
> root.getAllKids(pages);
> System.out.println("pages.size() == "+pages.size());
> }
> Exception in thread "main" java.lang.NullPointerException
> at org.pdfbox.pdmodel.PDPageNode.getAllKids(PDPageNode.java:194)
> at org.pdfbox.pdmodel.PDPageNode.getAllKids(PDPageNode.java:182)
> http://sourceforge.net/tracker/download.php?group_id=78314&atid=552832&file_id=283367&aid=2008371
> [Comment on SourceForge]
> Date: 2008-07-02 00:57
> Sender: foundart
> Logged In: YES 
> user_id=1693709
> Originator: YES
> This happens with the latest code from CVS and also in older versions.
> [Comment on SourceForge]
> Date: 2008-07-14 17:25
> Sender: orthello
> Logged In: YES 
> user_id=853566
> Originator: NO
> We are experiencing the same problem.  Offending pdf available if any of
> you need it (jwilson@nmcourt.fed.us).  Looks like pdfbox does not support
> some new feature introduced in Acrobat 9.
> [Comment on SourceForge]
> Date: 2008-07-14 23:20
> Sender: foundart
> Logged In: YES 
> user_id=1693709
> Originator: YES
> In Acrobat 8, the default was to generate PDFs following version 1.4 of
> the PDF specification.  In Acrobat 9, the default is to to generate PDFs
> following version 1.5 of the PDF specification.  PDF1.5 has objects known
> as cross-reference streams and it turns out that PDFBox does not parse them
> correctly.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.