You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2008/12/07 20:23:44 UTC

[jira] Updated: (TIKA-152) Support for Office XML files

     [ https://issues.apache.org/jira/browse/TIKA-152?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jukka Zitting updated TIKA-152:
-------------------------------

    Fix Version/s: 0.3

I upgraded the POI dependency to 3.5-beta4.

Note that if we want to use the new Office XML support in POI 3.5 we probably also need to add some of the extra XML dependencies. Any NOTICE and LICENSE changes related to POI 3.5 and potential other dependencies should be reviewed before our next release.

There's a problem with a GPLv3 file being included in the HDGF part of POI that we use for text extraction from Visio diagrams. I filed a bug for that (see https://issues.apache.org/bugzilla/show_bug.cgi?id=46361) and I think we need to find some resolution to the issue before our next release.

> Support for Office XML files
> ----------------------------
>
>                 Key: TIKA-152
>                 URL: https://issues.apache.org/jira/browse/TIKA-152
>             Project: Tika
>          Issue Type: New Feature
>          Components: parser
>            Reporter: Jukka Zitting
>             Fix For: 0.3
>
>
> Apache POI has recently released the first betas of their support for Office XML file formats. We should use that in Tika.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.