You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pdfbox.apache.org by "Robert Bartlett-Schneider (JIRA)" <ji...@apache.org> on 2013/04/03 23:41:16 UTC

[jira] [Created] (PDFBOX-1557) NonSequentialPDFParser incorrectly parsing document info

Robert Bartlett-Schneider created PDFBOX-1557:
-------------------------------------------------

             Summary: NonSequentialPDFParser incorrectly parsing document info
                 Key: PDFBOX-1557
                 URL: https://issues.apache.org/jira/browse/PDFBOX-1557
             Project: PDFBox
          Issue Type: Bug
          Components: Parsing
    Affects Versions: 1.8.0
         Environment: Mac OS X 10.6.8, Eclipse Version: Juno Service Release 2 (Build id: 20130225-0426), Java SE 6 (1.6.0)
            Reporter: Robert Bartlett-Schneider
            Priority: Minor


When using the NonSequentialPDFParser, the PDDocumentInformation returned by getDocumentInformation() seems to contain all null entries, which does not occur when using the standard PDFParser. I have a large batch of PDF files which have random and strange issues that cause them to occasionally fail with the standard parser, so I was experimenting with the NonSequential parser and came across this issue.

I'll attempt to attach some test code & a test PDF file for which I can replicate the issue.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira