Posted to dev@tika.apache.org by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2008/04/11 16:26:05 UTC

[jira] Created: (TIKA-139) Add a composite parser

Add a composite parser
----------------------

                 Key: TIKA-139
                 URL: https://issues.apache.org/jira/browse/TIKA-139
             Project: Tika
          Issue Type: New Feature
          Components: parser
            Reporter: Jukka Zitting
            Assignee: Jukka Zitting
            Priority: Minor


There should be a generic parser class that delegates parsing calls to underlying component parsers based on the incoming document type metadata.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (TIKA-139) Add a composite parser

Posted by "Jukka Zitting (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/TIKA-139?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12588405#action_12588405 ] 

Jukka Zitting commented on TIKA-139:
------------------------------------

> How is this different from the AutoDetectParser?

Separation of concerns. AutoDetectParser would both 1) auto-detect the content type, and 2) use the type to select an underlying parser to which the parse call is delegated.

The CompositeParser as added in revision 647181 takes care of the latter concern. AutoDetectParser now extends CompositeParser and just adds the auto-detection functionality.
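
For readers following the thread, the split described above can be sketched roughly as follows. This is a minimal illustration, not the code from revision 647181: the names SimpleParser, SimpleCompositeParser and SimpleAutoDetectParser, the String return value, the plain Map used for metadata, and the toy magic-byte detection are all assumptions made to keep the example self-contained.

```java
import java.nio.charset.StandardCharsets;
import java.util.HashMap;
import java.util.Map;

// Hypothetical, simplified parser contract; NOT the real Tika Parser interface.
interface SimpleParser {
    String parse(byte[] content, Map<String, String> metadata);
}

// Concern 2 only: pick a component parser from the "Content-Type" metadata
// and delegate the parse call to it.
class SimpleCompositeParser implements SimpleParser {
    private final Map<String, SimpleParser> parsers = new HashMap<>();
    private final SimpleParser fallback;

    SimpleCompositeParser(SimpleParser fallback) {
        this.fallback = fallback;
    }

    void register(String contentType, SimpleParser parser) {
        parsers.put(contentType, parser);
    }

    @Override
    public String parse(byte[] content, Map<String, String> metadata) {
        String type = metadata.get("Content-Type");
        // Unknown or missing types go to the fallback parser.
        SimpleParser parser = parsers.getOrDefault(type, fallback);
        return parser.parse(content, metadata);
    }
}

// Concern 1 layered on top: detect the type, record it in the metadata,
// then reuse the inherited delegation logic unchanged.
class SimpleAutoDetectParser extends SimpleCompositeParser {
    SimpleAutoDetectParser(SimpleParser fallback) {
        super(fallback);
    }

    @Override
    public String parse(byte[] content, Map<String, String> metadata) {
        metadata.put("Content-Type", detect(content));
        return super.parse(content, metadata);
    }

    // Toy detection based on leading bytes; an assumption for this sketch only.
    static String detect(byte[] content) {
        int length = Math.min(content.length, 5);
        String head = new String(content, 0, length, StandardCharsets.US_ASCII);
        if (head.startsWith("%PDF-")) {
            return "application/pdf";
        }
        if (head.startsWith("<?xml")) {
            return "application/xml";
        }
        return "text/plain";
    }
}

class CompositeParserDemo {
    public static void main(String[] args) {
        SimpleAutoDetectParser parser =
                new SimpleAutoDetectParser((c, m) -> "fallback parser");
        parser.register("application/pdf", (c, m) -> "pdf parser");

        Map<String, String> metadata = new HashMap<>();
        byte[] doc = "%PDF-1.4 ...".getBytes(StandardCharsets.US_ASCII);
        System.out.println(parser.parse(doc, metadata));   // prints "pdf parser"
        System.out.println(metadata.get("Content-Type"));  // prints "application/pdf"
    }
}
```

In this sketch a caller that already knows the content type can use the composite class directly and skip detection, which is the practical benefit of keeping the two concerns apart.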


[jira] Commented: (TIKA-139) Add a composite parser

Posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/TIKA-139?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12588036#action_12588036 ] 

Chris A. Mattmann commented on TIKA-139:
----------------------------------------

Hey Jukka:

How is this different from the AutoDetectParser?

Thanks!

Cheers,
 Chris


[jira] Resolved: (TIKA-139) Add a composite parser

Posted by "Jukka Zitting (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/TIKA-139?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jukka Zitting resolved TIKA-139.
--------------------------------

       Resolution: Fixed
    Fix Version/s: 0.2-incubating

Implemented in revision 647181.


[jira] Commented: (TIKA-139) Add a composite parser

Posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/TIKA-139?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12588460#action_12588460 ] 

Chris A. Mattmann commented on TIKA-139:
----------------------------------------

Hey Jukka:

+1, AWESOME! Yep, totally makes sense.

Thanks!

Cheers, 
 Chris

