You are viewing a plain text version of this content. The canonical link for it is here.

Posted to dev@tika.apache.org by "Jukka Zitting (Resolved) (JIRA)" <ji...@apache.org> on 2011/10/07 11:00:30 UTC

[jira] [Resolved] (TIKA-433) Tika + Hadoop

     [ https://issues.apache.org/jira/browse/TIKA-433?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jukka Zitting resolved TIKA-433.
--------------------------------

    Resolution: Won't Fix

Resolving as Won't Fix as discussed above.
                
> Tika + Hadoop
> -------------
>
>                 Key: TIKA-433
>                 URL: https://issues.apache.org/jira/browse/TIKA-433
>             Project: Tika
>          Issue Type: New Feature
>          Components: general
>            Reporter: Grant Ingersoll
>            Priority: Minor
>
> Would be great to have a Tika contrib that took in an HDFS location with "rich" documents on it and an output format (or output processor) and converted the docs to XHTML or Solr or whatever.  Seems like it should be pretty straightforward to do on the Hadoop side of things.  Only tricky part, I suppose, is the output format and how to make that pluggable.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira