Posted to dev@tika.apache.org by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/06/07 19:17:00 UTC

[jira] [Created] (TIKA-2663) Allow nested decorations for the default parser

Tim Allison created TIKA-2663:
---------------------------------

             Summary: Allow nested decorations for the default parser
                 Key: TIKA-2663
                 URL: https://issues.apache.org/jira/browse/TIKA-2663
             Project: Tika
          Issue Type: Task
            Reporter: Tim Allison


I'm not sure what the cleanest solution is, but it would be nice to specify decorations on the AutoDetectParser somehow in tika-config.xml.

For example, I want the AutoDetectParser, but wrapped in a ForkParser.  Or, I want the AutoDetectParser wrapped in a DigestingParser, then in a RecursiveParserWrapper, then in a ForkParser.

These types of decorations feel fundamentally different to me from our current decorations, which focus on child parsers.  I've done some really ugly things to get this functionality for tika-app and tika-batch, and it would be useful to clean this up.  Any ideas?
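
To make the requested nesting order concrete, here is a minimal, self-contained sketch of how an ordered list of decoration names (as might be read from a config file) could be folded around a base parser. Everything in it is hypothetical: the Parser interface, BaseParser, NamedWrapper, and decorate are stand-ins invented for illustration and are not Tika classes or Tika's configuration mechanism. The wrappers only record their names, so the sketch shows nesting order and nothing about the real behavior of ForkParser, DigestingParser, or RecursiveParserWrapper.

```java
import java.util.Arrays;
import java.util.List;

public class NestedDecorationSketch {

    // Hypothetical stand-in for a parser; NOT org.apache.tika.parser.Parser.
    public interface Parser {
        String describe();
    }

    // Stand-in for the innermost parser (the AutoDetectParser in the text).
    public static final class BaseParser implements Parser {
        @Override
        public String describe() {
            return "AutoDetectParser";
        }
    }

    // Hypothetical decorator that only records its name around the wrapped parser.
    public static final class NamedWrapper implements Parser {
        private final String name;
        private final Parser wrapped;

        public NamedWrapper(String name, Parser wrapped) {
            this.name = name;
            this.wrapped = wrapped;
        }

        @Override
        public String describe() {
            return name + "(" + wrapped.describe() + ")";
        }
    }

    // Applies decorations in list order: the first entry ends up innermost,
    // the last entry ends up outermost.
    public static Parser decorate(Parser base, List<String> decorations) {
        Parser current = base;
        for (String name : decorations) {
            current = new NamedWrapper(name, current);
        }
        return current;
    }

    public static void main(String[] args) {
        Parser parser = decorate(
                new BaseParser(),
                Arrays.asList("DigestingParser", "RecursiveParserWrapper", "ForkParser"));
        // prints "ForkParser(RecursiveParserWrapper(DigestingParser(AutoDetectParser)))"
        System.out.println(parser.describe());
    }
}
```

The design point is that these decorations wrap the top-level parser as a whole, in a declared order, rather than being attached to individual child parsers.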



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)