You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pdfbox.apache.org by "John Hewson (JIRA)" <ji...@apache.org> on 2014/08/04 20:22:14 UTC

[jira] [Updated] (PDFBOX-1792) Different metadata with NonSequentialPDFParser

     [ https://issues.apache.org/jira/browse/PDFBOX-1792?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

John Hewson updated PDFBOX-1792:
--------------------------------

    Summary: Different metadata with NonSequentialPDFParser  (was: Different metadata extracted with NonSequentialPDFParser vs classic parser on some documents)

> Different metadata with NonSequentialPDFParser
> ----------------------------------------------
>
>                 Key: PDFBOX-1792
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-1792
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Parsing
>    Affects Versions: 1.8.3
>            Reporter: Tim Allison
>            Priority: Minor
>         Attachments: PDFBOX-1792.tar.gz, testPDF_acroForm2.pdf
>
>
> The traditional parser is able to extract metadata from a test document from TIKA-738.  The NonSequentialPDFParser is not able to extract metadata from that file.  Another file from the Tika test suite has metadata that can be extracted by the NonSequentialPDFParser but not by classic. 



--
This message was sent by Atlassian JIRA
(v6.2#6252)