Posted to commits@oodt.apache.org by "Radu Manole (JIRA)" <ji...@apache.org> on 2015/04/17 23:21:59 UTC
[jira] [Commented] (OODT-829) Implement an Apache Tika based Server Side Extractor
[ https://issues.apache.org/jira/browse/OODT-829?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14500703#comment-14500703 ]
Radu Manole commented on OODT-829:
----------------------------------
Hi Chris,
I looked through the existing extractors, and I think that to implement the Tika extractor I need to create a class that extends AbstractFilemgrMetExtractor. This class will use the Tika parsing API (https://tika.apache.org/1.7/parser.html) to extract as much metadata as possible.
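To make "as much metadata as possible" concrete, here is a rough, stdlib-only sketch of the copy step, not the actual extractor. Everything in it is an assumption for illustration: the class name TikaMetSketch, the method mergeAll, and the plain Map types, which stand in for Tika's Metadata (whose names() and getValues(String) calls would supply the pairs) and for the File Manager's product metadata. The merge policy (keep existing values, append new ones, skip duplicates) is also only a guess at what we want.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical sketch: Map types stand in for the Tika and OODT metadata classes.
final class TikaMetSketch {

    /**
     * Returns a new map holding the product metadata plus every key and value
     * reported by the parser. Existing values are kept; new values are
     * appended unless they are already present. The inputs are not modified.
     */
    static Map<String, List<String>> mergeAll(Map<String, List<String>> productMet,
                                              Map<String, String[]> parsed) {
        Map<String, List<String>> merged = new LinkedHashMap<>();
        // Start from a copy of what the product already carries.
        for (Map.Entry<String, List<String>> e : productMet.entrySet()) {
            merged.put(e.getKey(), new ArrayList<>(e.getValue()));
        }
        // Add every parsed key, with all of its values.
        for (Map.Entry<String, String[]> e : parsed.entrySet()) {
            List<String> values = merged.computeIfAbsent(e.getKey(), k -> new ArrayList<>());
            for (String v : e.getValue()) {
                if (!values.contains(v)) {
                    values.add(v);
                }
            }
        }
        return merged;
    }

    public static void main(String[] args) {
        Map<String, List<String>> productMet = new LinkedHashMap<>();
        productMet.put("Filename", Arrays.asList("report.pdf"));

        Map<String, String[]> parsed = new LinkedHashMap<>();
        parsed.put("Content-Type", new String[] {"application/pdf"});
        parsed.put("Author", new String[] {"A", "B"});

        System.out.println(mergeAll(productMet, parsed));
    }
}
```

In the real class the parsed map would be replaced by the Metadata object that the Tika parser fills in, and the result would go into the metadata the extractor returns.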
I will start implementing the extractor. Could you please tell me if I have missed anything?
Thanks.
> Implement an Apache Tika based Server Side Extractor
> ----------------------------------------------------
>
> Key: OODT-829
> URL: https://issues.apache.org/jira/browse/OODT-829
> Project: OODT
> Issue Type: Bug
> Components: file manager
> Reporter: Chris A. Mattmann
> Assignee: Chris A. Mattmann
> Fix For: 0.9
>
>
> With an Apache Tika based server side extractor, we could extract metadata and science data on the server side in the File Manager.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)