You are viewing a plain text version of this content. The canonical link for it is here.

Posted to dev@pdfbox.apache.org by "Robert Bartlett-Schneider (JIRA)" <ji...@apache.org> on 2013/04/03 23:43:15 UTC

[jira] [Updated] (PDFBOX-1557) NonSequentialPDFParser incorrectly parsing document info

     [ https://issues.apache.org/jira/browse/PDFBOX-1557?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Robert Bartlett-Schneider updated PDFBOX-1557:
----------------------------------------------

    Attachment: TestParsers.java
                aa.pdf
    
> NonSequentialPDFParser incorrectly parsing document info
> --------------------------------------------------------
>
>                 Key: PDFBOX-1557
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-1557
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Parsing
>    Affects Versions: 1.8.0
>         Environment: Mac OS X 10.6.8, Eclipse Version: Juno Service Release 2 (Build id: 20130225-0426), Java SE 6 (1.6.0)
>            Reporter: Robert Bartlett-Schneider
>            Priority: Minor
>         Attachments: aa.pdf, TestParsers.java
>
>
> When using the NonSequentialPDFParser, the PDDocumentInformation returned by getDocumentInformation() seems to contain all null entries, which does not occur when using the standard PDFParser. I have a large batch of PDF files which have random and strange issues that cause them to occasionally fail with the standard parser, so I was experimenting with the NonSequential parser and came across this issue.
> I'll attempt to attach some test code & a test PDF file for which I can replicate the issue.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira