You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pdfbox.apache.org by "Johannes Wirkkala Westlund (Jira)" <ji...@apache.org> on 2022/03/30 14:17:00 UTC
[jira] [Updated] (PDFBOX-5405) "Page tree root must be a dictionary" when attempting to parse pdf
[ https://issues.apache.org/jira/browse/PDFBOX-5405?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Johannes Wirkkala Westlund updated PDFBOX-5405:
-----------------------------------------------
Description:
Hi,
I have a PDF file that throws the following error when I try to parse it:
{code:java}
Caused by: java.io.IOException: Page tree root must be a dictionary
at org.apache.pdfbox.pdfparser.PDFParser.initialParse(PDFParser.java:198)
at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:226)
at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:1228)
at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:1202)
at org.apache.tika.parser.pdf.PDFParser.getPDDocument(PDFParser.java:191)
at org.apache.tika.parser.pdf.PDFParser.parse(PDFParser.java:149)
at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:289)
... 5 more {code}
I have attached the file in question with this issue.
Might be related to https://issues.apache.org/jira/browse/PDFBOX-4915
was:
Hi,
I have a PDF file that throws the following error when I try to parse it:
{{
}}
{code:java}
Caused by: java.io.IOException: Page tree root must be a dictionary
at org.apache.pdfbox.pdfparser.PDFParser.initialParse(PDFParser.java:198)
at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:226)
at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:1228)
at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:1202)
at org.apache.tika.parser.pdf.PDFParser.getPDDocument(PDFParser.java:191)
at org.apache.tika.parser.pdf.PDFParser.parse(PDFParser.java:149)
at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:289)
... 5 more {code}
{{}}
I have attached the file in question with this issue.[^Grafiska riktlinjer, fordon LRV.pdf]
> "Page tree root must be a dictionary" when attempting to parse pdf
> -------------------------------------------------------------------
>
> Key: PDFBOX-5405
> URL: https://issues.apache.org/jira/browse/PDFBOX-5405
> Project: PDFBox
> Issue Type: Bug
> Affects Versions: 2.0.25
> Reporter: Johannes Wirkkala Westlund
> Priority: Minor
> Attachments: Grafiska riktlinjer, fordon LRV.pdf
>
>
> Hi,
> I have a PDF file that throws the following error when I try to parse it:
> {code:java}
> Caused by: java.io.IOException: Page tree root must be a dictionary
> at org.apache.pdfbox.pdfparser.PDFParser.initialParse(PDFParser.java:198)
> at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:226)
> at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:1228)
> at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:1202)
> at org.apache.tika.parser.pdf.PDFParser.getPDDocument(PDFParser.java:191)
> at org.apache.tika.parser.pdf.PDFParser.parse(PDFParser.java:149)
> at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:289)
> ... 5 more {code}
> I have attached the file in question with this issue.
> Might be related to https://issues.apache.org/jira/browse/PDFBOX-4915
--
This message was sent by Atlassian Jira
(v8.20.1#820001)
---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@pdfbox.apache.org
For additional commands, e-mail: dev-help@pdfbox.apache.org