You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Jeremy Anderson (Updated) (JIRA)" <ji...@apache.org> on 2011/12/12 20:35:30 UTC

[jira] [Updated] (TIKA-810) Upgrade to PDFbox 1.7.0 as available

     [ https://issues.apache.org/jira/browse/TIKA-810?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jeremy Anderson updated TIKA-810:
---------------------------------

    Attachment: pdfbox-1.7.0.diff

Upgraded to 1.7.0 in revision 1213227 as of 2011-12-12.

Change is to TestCase where annotation text extraction is now off by default in PDFBox. (Appeared to be on in 1.6.0 release but no longer is in 1.7.0 daily)

Note, a proper fix may be required to change the Tika PDF Parser to turn on annotation extraction by default and then modify the test case appropriately.  Or to submit a fix in PDF box to have 1.7.0 behave the same as 1.6.0.
                
> Upgrade to PDFbox 1.7.0 as available
> ------------------------------------
>
>                 Key: TIKA-810
>                 URL: https://issues.apache.org/jira/browse/TIKA-810
>             Project: Tika
>          Issue Type: Improvement
>          Components: parser
>    Affects Versions: 1.0
>            Reporter: Jeremy Anderson
>            Priority: Minor
>         Attachments: pdfbox-1.7.0.diff
>
>
> This isssue is to track upgrading the PDFbox dependency 1.7.0 Final once it's available, and the daily build before then

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira