Posted to dev@tika.apache.org by "Andrea (JIRA)" <ji...@apache.org> on 2015/09/22 14:51:04 UTC

[jira] [Created] (TIKA-1740) RecursiveParserWrapper returning ContentHandler-s

Andrea created TIKA-1740:
----------------------------

             Summary: RecursiveParserWrapper returning ContentHandler-s
                 Key: TIKA-1740
                 URL: https://issues.apache.org/jira/browse/TIKA-1740
             Project: Tika
          Issue Type: Wish
          Components: core, parser
            Reporter: Andrea


I would like to build a custom object from a parsing result. With a plain parser this is easy: I pass in a custom ContentHandler that acts as a "transformer". How can I achieve the same result with a RecursiveParserWrapper? There I can only set a ContentHandlerFactory, and the wrapper just calls toString() on each handler and stores the result as a metadata value. Would it be possible to get the entire ContentHandler object for each subfile instead of the result of toString()? It would also need a flag to disable production of the TIKA_CONTENT metadata.
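
To make the request concrete, here is a rough sketch of a workaround that might be possible today: a factory that keeps a reference to every handler it creates, so the handler objects can be read back after parsing. It assumes the wrapper asks the factory for a fresh handler for each document, which I have not verified, and it does not guarantee that the handlers come back in the same order as the metadata list. HandlerFactory is a hypothetical local stand-in for Tika's ContentHandlerFactory so that the example compiles without Tika; WordCountHandler and RecordingHandlerFactory are illustrative names, not Tika classes.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;

import org.xml.sax.ContentHandler;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical stand-in for Tika's ContentHandlerFactory, declared locally
// so that this sketch compiles without Tika on the classpath.
interface HandlerFactory {
    ContentHandler getNewContentHandler();
}

// Example "transformer" handler: it builds a custom result (a word count)
// from the SAX character events instead of a plain string.
class WordCountHandler extends DefaultHandler {
    private int words;
    private boolean inWord;

    @Override
    public void characters(char[] ch, int start, int length) {
        for (int i = start; i < start + length; i++) {
            boolean whitespace = Character.isWhitespace(ch[i]);
            if (!whitespace && !inWord) {
                words++; // a new word starts at this character
            }
            inWord = !whitespace;
        }
    }

    int getWords() {
        return words;
    }
}

// Factory that remembers every handler it hands out, in creation order,
// so the caller can retrieve the handler objects once parsing is done.
class RecordingHandlerFactory implements HandlerFactory {
    private final List<WordCountHandler> handlers = new ArrayList<>();

    @Override
    public ContentHandler getNewContentHandler() {
        WordCountHandler handler = new WordCountHandler();
        handlers.add(handler);
        return handler;
    }

    List<WordCountHandler> getHandlers() {
        return Collections.unmodifiableList(handlers);
    }
}
```

Even if this works, the wrapper would still call toString() on each handler and store the result as TIKA_CONTENT, which is why the flag mentioned above would still be needed.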



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)