You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pdfbox.apache.org by "Tilman Hausherr (JIRA)" <ji...@apache.org> on 2017/10/28 10:42:00 UTC
[jira] [Updated] (PDFBOX-3950) NPE in PageIterator.enqueueKids
[ https://issues.apache.org/jira/browse/PDFBOX-3950?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Tilman Hausherr updated PDFBOX-3950:
------------------------------------
Attachment: 6MNJMPJVZMZRSTE5A4ENHP3F5SIOG27T.pdf
I'm being lenient on the missing /gs name too because it gives us an advantage in text extraction, see page 4 of attached file 6MNJMPJVZMZRSTE5A4ENHP3F5SIOG27T.pdf. I'll have to adjust the test too and fix an NPE in font processing that happens with the original file (where my fix won't improve anything).
> NPE in PageIterator.enqueueKids
> -------------------------------
>
> Key: PDFBOX-3950
> URL: https://issues.apache.org/jira/browse/PDFBOX-3950
> Project: PDFBox
> Issue Type: Bug
> Components: Parsing
> Affects Versions: 2.0.8
> Reporter: Tilman Hausherr
> Assignee: Andreas Lehmkühler
> Labels: regression
> Fix For: 2.0.8, 3.0.0
>
> Attachments: 23EGDHXSBBYQLKYOKGZUOVYVNE675PRD.pdf, 6MNJMPJVZMZRSTE5A4ENHP3F5SIOG27T.pdf
>
>
> {code}
> Exception in thread "main" java.lang.NullPointerException
> at java.util.ArrayDeque.addLast(ArrayDeque.java:244)
> at java.util.ArrayDeque.add(ArrayDeque.java:418)
> at org.apache.pdfbox.pdmodel.PDPageTree$PageIterator.enqueueKids(PDPageTree.java:178)
> at org.apache.pdfbox.pdmodel.PDPageTree$PageIterator.enqueueKids(PDPageTree.java:173)
> at org.apache.pdfbox.pdmodel.PDPageTree$PageIterator.<init>(PDPageTree.java:159)
> at org.apache.pdfbox.pdmodel.PDPageTree$PageIterator.<init>(PDPageTree.java:153)
> at org.apache.pdfbox.pdmodel.PDPageTree.iterator(PDPageTree.java:123)
> at org.apache.pdfbox.text.PDFTextStripper.processPages(PDFTextStripper.java:282)
> {code}
> This worked in 2.0.7. There are about 200 occurences of this exception in the tests by Tim.
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@pdfbox.apache.org
For additional commands, e-mail: dev-help@pdfbox.apache.org