Posted to dev@tika.apache.org by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2008/03/09 16:29:46 UTC

[jira] Resolved: (TIKA-126) Add Parser.parse(InputStream, Metadata) for metadata extraction

     [ https://issues.apache.org/jira/browse/TIKA-126?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jukka Zitting resolved TIKA-126.
--------------------------------

    Resolution: Fixed

Implemented in revision 635259.

> Add Parser.parse(InputStream, Metadata) for metadata extraction
> ---------------------------------------------------------------
>
>                 Key: TIKA-126
>                 URL: https://issues.apache.org/jira/browse/TIKA-126
>             Project: Tika
>          Issue Type: New Feature
>          Components: parser
>            Reporter: Jukka Zitting
>            Assignee: Jukka Zitting
>             Fix For: 0.2-incubating
>
>
> In some cases a client is just interested in the parsed metadata and not the extracted text content. It is easy to ignore the text content by just passing a dummy DefaultHandler to the existing parse() method, but many parsers could avoid a lot of work if they knew in advance that the text content is not needed.
> Thus I want to add a parse(InputStream, Metadata) signature to the Parser interface. I'll also add an AbstractParser base class with a trivial implementation of that method:
>     public abstract class AbstractParser implements Parser {
>         public void parse(InputStream stream, Metadata metadata)
>                 throws IOException, SAXException, TikaException {
>             parse(stream, new DefaultHandler(), metadata);
>         }
>     }
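
To illustrate the proposal above, here is a minimal, self-contained sketch of how a parser might benefit from the metadata-only signature. It does not use the real Tika classes: SimpleMetadata, SketchParser, SketchAbstractParser and TitleLineParser are hypothetical stand-ins invented for this example, and the toy "format" (first line is the title, the rest is body text) is an assumption made purely for illustration. Only DefaultHandler and ContentHandler come from the standard SAX API.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.util.HashMap;
import java.util.Map;

import org.xml.sax.ContentHandler;
import org.xml.sax.SAXException;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical stand-in for a metadata container: a plain name/value map.
class SimpleMetadata {
    private final Map<String, String> values = new HashMap<>();
    void set(String name, String value) { values.put(name, value); }
    String get(String name) { return values.get(name); }
}

// Hypothetical stand-in for a parser interface with the proposed overload.
interface SketchParser {
    void parse(InputStream stream, ContentHandler handler, SimpleMetadata metadata)
            throws IOException, SAXException;

    void parse(InputStream stream, SimpleMetadata metadata)
            throws IOException, SAXException;
}

// Base class with the trivial default: delegate with a no-op handler.
abstract class SketchAbstractParser implements SketchParser {
    @Override
    public void parse(InputStream stream, SimpleMetadata metadata)
            throws IOException, SAXException {
        parse(stream, new DefaultHandler(), metadata);
    }
}

// Toy parser (assumed format): the first line is the title, the rest is body text.
class TitleLineParser extends SketchAbstractParser {
    int bodyCharsEmitted = 0; // tracks how much text-content work was done

    @Override
    public void parse(InputStream stream, ContentHandler handler, SimpleMetadata metadata)
            throws IOException, SAXException {
        String text = new String(stream.readAllBytes(), StandardCharsets.UTF_8);
        int newline = text.indexOf('\n');
        metadata.set("title", newline < 0 ? text : text.substring(0, newline));
        char[] body = (newline < 0 ? "" : text.substring(newline + 1)).toCharArray();
        handler.startDocument();
        handler.characters(body, 0, body.length);
        handler.endDocument();
        bodyCharsEmitted += body.length;
    }

    // Metadata-only override: stop reading at the end of the first line
    // and never touch the body text.
    @Override
    public void parse(InputStream stream, SimpleMetadata metadata) throws IOException {
        ByteArrayOutputStream firstLine = new ByteArrayOutputStream();
        int b;
        while ((b = stream.read()) != -1 && b != '\n') {
            firstLine.write(b);
        }
        metadata.set("title", new String(firstLine.toByteArray(), StandardCharsets.UTF_8));
    }
}
```

Parsers that cannot do anything smarter simply inherit the default from the base class and behave exactly as if the caller had passed a dummy DefaultHandler, while a parser like the toy one here can skip the text-extraction work entirely.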
