Posted to dev@tika.apache.org by "Andrea (JIRA)" <ji...@apache.org> on 2015/09/22 14:51:04 UTC
[jira] [Created] (TIKA-1740) RecursiveParserWrapper returning ContentHandler-s
Andrea created TIKA-1740:
----------------------------
Summary: RecursiveParserWrapper returning ContentHandler-s
Key: TIKA-1740
URL: https://issues.apache.org/jira/browse/TIKA-1740
Project: Tika
Issue Type: Wish
Components: core, parser
Reporter: Andrea
I would like to build a mechanism that allows a custom object to be built from a parsing result. This is easy to do with a custom ContentHandler acting as a "transformer", but how can I achieve the same result using a RecursiveParserWrapper? In that case I can only set a ContentHandlerFactory, and the parser just calls the handler's toString method and stores the result as a metadata value. Could something be added to return the entire ContentHandler object for each subfile instead of the result of the toString method? Of course, a flag to disable production of the TIKA_CONTENT metadata would also be needed.
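To illustrate the idea, here is a minimal sketch of one possible workaround: a factory that keeps a reference to every handler it hands out, so the handler objects can be inspected after parsing. It uses only the JDK's SAX classes. HandlerFactory is a hypothetical stand-in for Tika's ContentHandlerFactory, simulateParse is a hypothetical stand-in for the wrapper's parse loop, and WordCountHandler is just an example "transformer". It also assumes the wrapper asks the factory for one new handler per (sub)document, which I have not verified.

```java
import java.util.ArrayList;
import java.util.List;
import org.xml.sax.ContentHandler;
import org.xml.sax.helpers.DefaultHandler;

public class HandlerCollectingSketch {

    /** Hypothetical stand-in for Tika's ContentHandlerFactory (not the real interface). */
    interface HandlerFactory {
        ContentHandler getNewContentHandler();
    }

    /** Example "transformer" handler: builds a word count instead of a string. */
    static class WordCountHandler extends DefaultHandler {
        private final StringBuilder text = new StringBuilder();

        @Override
        public void characters(char[] ch, int start, int length) {
            text.append(ch, start, length);
        }

        int wordCount() {
            String trimmed = text.toString().trim();
            return trimmed.isEmpty() ? 0 : trimmed.split("\\s+").length;
        }
    }

    /** Factory that remembers every handler it creates, in creation order. */
    static class CollectingFactory implements HandlerFactory {
        private final List<WordCountHandler> handlers = new ArrayList<>();

        @Override
        public ContentHandler getNewContentHandler() {
            WordCountHandler handler = new WordCountHandler();
            handlers.add(handler);
            return handler;
        }

        List<WordCountHandler> handlers() {
            return handlers;
        }
    }

    /** Hypothetical stand-in for a recursive parser: one new handler per (sub)document. */
    static void simulateParse(HandlerFactory factory, String... documents) throws Exception {
        for (String doc : documents) {
            ContentHandler handler = factory.getNewContentHandler();
            char[] chars = doc.toCharArray();
            handler.startDocument();
            handler.characters(chars, 0, chars.length);
            handler.endDocument();
        }
    }

    public static void main(String[] args) throws Exception {
        CollectingFactory factory = new CollectingFactory();
        simulateParse(factory, "container text", "first attachment body here");
        // The handler objects themselves are available, not only their toString output.
        for (WordCountHandler handler : factory.handlers()) {
            System.out.println(handler.wordCount());
        }
    }
}
```

This only sidesteps the limitation from the outside: the wrapper would still call toString on each handler and store the result as TIKA_CONTENT, so the flag mentioned above would still be needed.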
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)