Posted to commits@oodt.apache.org by "Radu Manole (JIRA)" <ji...@apache.org> on 2015/04/17 23:21:59 UTC

[jira] [Commented] (OODT-829) Implement an Apache Tika based Server Side Extractor

    [ https://issues.apache.org/jira/browse/OODT-829?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14500703#comment-14500703 ] 

Radu Manole commented on OODT-829:
----------------------------------

Hi Chris,
I looked through the existing extractors, and I think that to implement the Tika extractor I need to create a class that extends AbstractFilemgrMetExtractor. This class will use the Tika parsing API (https://tika.apache.org/1.7/parser.html) to extract as much metadata as possible.
I will start implementing the extractor. Please let me know if I have missed anything.
Thanks.

> Implement an Apache Tika based Server Side Extractor
> ----------------------------------------------------
>
>                 Key: OODT-829
>                 URL: https://issues.apache.org/jira/browse/OODT-829
>             Project: OODT
>          Issue Type: Bug
>          Components: file manager
>            Reporter: Chris A. Mattmann
>            Assignee: Chris A. Mattmann
>             Fix For: 0.9
>
>
> With an Apache Tika based server side extractor, we could extract metadata and science data on the server side in the File Manager.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)