You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2009/06/02 03:25:07 UTC
[jira] Created: (TIKA-238) Better handling of delegating parser
implementations
Better handling of delegating parser implementations
----------------------------------------------------
Key: TIKA-238
URL: https://issues.apache.org/jira/browse/TIKA-238
Project: Tika
Issue Type: Improvement
Components: config, parser
Reporter: Jukka Zitting
Assignee: Jukka Zitting
Priority: Minor
Fix For: 0.4
Delegating Parser implementations (i.e. ones that call another Parser instance to process some parts of the input document) like the PackageParser subclasses currently simply instantiate a new AutoDetectParser() whenever a more specific delegate parser has not been specified. This is troublesome as it 1) makes customization more difficult and 2) performs poorly since a new AutoDetectParser (with a new freshly parsed copy of the default configuration) is currently instantiated whenever such a delegating parser is used.
Instead of working around the issue with things like keeping a static copy of the pre-parsed default configuration, I'd like to solve this issue by introducing a new DelegatingParser abstraction for this and hooking it to the configuration mechanism.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
[jira] Resolved: (TIKA-238) Better handling of delegating parser
implementations
Posted by "Jukka Zitting (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/TIKA-238?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Jukka Zitting resolved TIKA-238.
--------------------------------
Resolution: Fixed
Implemented in revisions 780897 and 780899.
> Better handling of delegating parser implementations
> ----------------------------------------------------
>
> Key: TIKA-238
> URL: https://issues.apache.org/jira/browse/TIKA-238
> Project: Tika
> Issue Type: Improvement
> Components: config, parser
> Reporter: Jukka Zitting
> Assignee: Jukka Zitting
> Priority: Minor
> Fix For: 0.4
>
>
> Delegating Parser implementations (i.e. ones that call another Parser instance to process some parts of the input document) like the PackageParser subclasses currently simply instantiate a new AutoDetectParser() whenever a more specific delegate parser has not been specified. This is troublesome as it 1) makes customization more difficult and 2) performs poorly since a new AutoDetectParser (with a new freshly parsed copy of the default configuration) is currently instantiated whenever such a delegating parser is used.
> Instead of working around the issue with things like keeping a static copy of the pre-parsed default configuration, I'd like to solve this issue by introducing a new DelegatingParser abstraction for this and hooking it to the configuration mechanism.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.