You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pdfbox.apache.org by "Andreas Lehmkühler (Jira)" <ji...@apache.org> on 2020/07/14 15:58:00 UTC

[jira] [Commented] (PDFBOX-4915) "Page tree root must be a dictionary" on PDDocument.load

    [ https://issues.apache.org/jira/browse/PDFBOX-4915?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17157474#comment-17157474 ] 

Andreas Lehmkühler commented on PDFBOX-4915:
--------------------------------------------

Just to avoid misunderstandings, you're explicitly mention that you are loading the pdf from a stream, does it work using a file?

 

The parser is complaining about the pages dictionary, your screen shot show the root dictionary which references the pages dictionary. You should check the object 1 1 which can by found by searching for "1 1 obj".  I guess that dictionary is malformed or maybe the reference to that object is somehow broken.

 

Is there a way to get a hand on the file in question?

> "Page tree root must be a dictionary" on PDDocument.load
> --------------------------------------------------------
>
>                 Key: PDFBOX-4915
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-4915
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Parsing
>    Affects Versions: 2.0.19
>            Reporter: Gauthier Roebroeck
>            Priority: Minor
>         Attachments: Screenshot 2020-07-14 at 20.19.40.png
>
>
> Hi,
> i have a PDF file that throws the following exception:
> {{java.io.IOException: Page tree root must be a dictionaryjava.io.IOException: Page tree root must be a dictionary at org.apache.pdfbox.pdfparser.PDFParser.initialParse(PDFParser.java:198) ~[pdfbox-2.0.19.jar:2.0.19] at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:226) ~[pdfbox-2.0.19.jar:2.0.19] at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:1222) ~[pdfbox-2.0.19.jar:2.0.19] at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:1122) ~[pdfbox-2.0.19.jar:2.0.19]}}
> This happens when loading the document from an InputStream.
> The document can be opened properly using Preview on Mac.
>  
> I have checked the PDF structure (even though i don't know it very well), from what i can see it could be because the /Pages is not the first element under the /Root.
>  
> !Screenshot 2020-07-14 at 20.19.40.png!



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@pdfbox.apache.org
For additional commands, e-mail: dev-help@pdfbox.apache.org