You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pdfbox.apache.org by "Andreas Lehmkühler (Jira)" <ji...@apache.org> on 2020/07/14 15:58:00 UTC
[jira] [Commented] (PDFBOX-4915) "Page tree root must be a
dictionary" on PDDocument.load
[ https://issues.apache.org/jira/browse/PDFBOX-4915?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17157474#comment-17157474 ]
Andreas Lehmkühler commented on PDFBOX-4915:
--------------------------------------------
Just to avoid misunderstandings, you're explicitly mention that you are loading the pdf from a stream, does it work using a file?
The parser is complaining about the pages dictionary, your screen shot show the root dictionary which references the pages dictionary. You should check the object 1 1 which can by found by searching for "1 1 obj". I guess that dictionary is malformed or maybe the reference to that object is somehow broken.
Is there a way to get a hand on the file in question?
> "Page tree root must be a dictionary" on PDDocument.load
> --------------------------------------------------------
>
> Key: PDFBOX-4915
> URL: https://issues.apache.org/jira/browse/PDFBOX-4915
> Project: PDFBox
> Issue Type: Bug
> Components: Parsing
> Affects Versions: 2.0.19
> Reporter: Gauthier Roebroeck
> Priority: Minor
> Attachments: Screenshot 2020-07-14 at 20.19.40.png
>
>
> Hi,
> i have a PDF file that throws the following exception:
> {{java.io.IOException: Page tree root must be a dictionaryjava.io.IOException: Page tree root must be a dictionary at org.apache.pdfbox.pdfparser.PDFParser.initialParse(PDFParser.java:198) ~[pdfbox-2.0.19.jar:2.0.19] at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:226) ~[pdfbox-2.0.19.jar:2.0.19] at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:1222) ~[pdfbox-2.0.19.jar:2.0.19] at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:1122) ~[pdfbox-2.0.19.jar:2.0.19]}}
> This happens when loading the document from an InputStream.
> The document can be opened properly using Preview on Mac.
>
> I have checked the PDF structure (even though i don't know it very well), from what i can see it could be because the /Pages is not the first element under the /Root.
>
> !Screenshot 2020-07-14 at 20.19.40.png!
--
This message was sent by Atlassian Jira
(v8.3.4#803005)
---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@pdfbox.apache.org
For additional commands, e-mail: dev-help@pdfbox.apache.org