You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pdfbox.apache.org by "Tilman Hausherr (JIRA)" <ji...@apache.org> on 2018/01/05 17:15:00 UTC

[jira] [Commented] (PDFBOX-4049) IllegalArgumentException: root cannot be null

    [ https://issues.apache.org/jira/browse/PDFBOX-4049?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16313466#comment-16313466 ] 

Tilman Hausherr commented on PDFBOX-4049:
-----------------------------------------

Your PDF starts with 
{code}
----------------------------766440766244535832321017
Content-Disposition: form-data; name="fileData"; filename="514bf860-d583-494c-bb0b-68e8167bba53.pdf"
Content-Type: application/pdf
{code}
and ends with
{code}
----------------------------766440766244535832321017--
{code}
which indicates a bad MIME extracting tool. Did you receive this file from a client or did you extract it yourself? A coworker (great guy) wrote such a tool in perl and cursed a lot. Years later (he had quit) I rewrote such a tool myself in java with additional features and I quickly understood his pain.

I'm asking because if you wrote that tool yourself, or if you have influence on whoever wrote it, the whole problem would be moot.

> IllegalArgumentException: root cannot be null
> ---------------------------------------------
>
>                 Key: PDFBOX-4049
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-4049
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Parsing
>    Affects Versions: 2.0.8
>         Environment: Windows 10
>            Reporter: savan patel
>              Labels: regression
>         Attachments: 372d5dd7-d4b8-41b2-9f50-80c1353aee59.pdf
>
>
> I got a pdf,,, in which pdfbox gives errors while parsing it.
> {code}
> Exception in thread "main" java.lang.IllegalArgumentException: root cannot be null
>         at org.apache.pdfbox.pdmodel.PDPageTree.<init>(PDPageTree.java:75)
>         at org.apache.pdfbox.pdmodel.PDDocumentCatalog.getPages(PDDocumentCatalog.java:129)
>         at org.apache.pdfbox.pdmodel.PDDocument.getNumberOfPages(PDDocument.java:1411)
> {code}
> This did not happen with 2.0.7.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@pdfbox.apache.org
For additional commands, e-mail: dev-help@pdfbox.apache.org