Posted to dev@tika.apache.org by "Rida Benjelloun (JIRA)" <ji...@apache.org> on 2008/01/13 06:02:34 UTC

[jira] Resolved: (TIKA-112) XMLParser improvement

     [ https://issues.apache.org/jira/browse/TIKA-112?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Rida Benjelloun resolved TIKA-112.
----------------------------------

    Resolution: Fixed

> XMLParser improvement
> ---------------------
>
>                 Key: TIKA-112
>                 URL: https://issues.apache.org/jira/browse/TIKA-112
>             Project: Tika
>          Issue Type: Improvement
>          Components: parser
>    Affects Versions: 0.2-incubating
>            Reporter: Rida Benjelloun
>            Assignee: Rida Benjelloun
>            Priority: Minor
>             Fix For: 0.2-incubating
>
>
> - Replace XMLParser with XMLParserUtils.
> - Create class DcXMLParser, which extends XMLParserUtils and implements Parser. This class enables parsing of Dublin Core metadata.
> - Add the method setXMLParserNameSpaceContext() to XMLParserUtils.
> - Improve OpenOfficeParser so that it extracts document content from office:body.
> - Make OpenOfficeParser extend XMLParserUtils.
> - Modify tika-config to use DcXMLParser instead of XMLParser.
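
For readers unfamiliar with why a namespace context matters for the Dublin Core and office:body extraction listed above, here is a minimal sketch using only the JDK's javax.xml.xpath API. It is not the TIKA-112 patch: the class DcXPathSketch, the helper MapNamespaceContext, and the method evaluate() are hypothetical names chosen for illustration, and how XMLParserUtils actually wires in its namespace context is not shown in this message.

```java
import java.io.StringReader;
import java.util.Iterator;
import java.util.List;
import java.util.Map;
import javax.xml.XMLConstants;
import javax.xml.namespace.NamespaceContext;
import javax.xml.xpath.XPath;
import javax.xml.xpath.XPathExpressionException;
import javax.xml.xpath.XPathFactory;
import org.xml.sax.InputSource;

// Hypothetical illustration class; not part of Tika.
public class DcXPathSketch {

    // Minimal prefix-to-URI lookup so XPath can resolve "dc:" and "office:".
    static final class MapNamespaceContext implements NamespaceContext {
        private final Map<String, String> prefixToUri;

        MapNamespaceContext(Map<String, String> prefixToUri) {
            this.prefixToUri = prefixToUri;
        }

        @Override
        public String getNamespaceURI(String prefix) {
            return prefixToUri.getOrDefault(prefix, XMLConstants.NULL_NS_URI);
        }

        @Override
        public String getPrefix(String uri) {
            for (Map.Entry<String, String> e : prefixToUri.entrySet()) {
                if (e.getValue().equals(uri)) {
                    return e.getKey();
                }
            }
            return null;
        }

        @Override
        public Iterator<String> getPrefixes(String uri) {
            String prefix = getPrefix(uri);
            return (prefix == null ? List.<String>of() : List.of(prefix)).iterator();
        }
    }

    // Evaluates an XPath expression against an XML string and returns the
    // string value of the first matching node (empty string if none match).
    static String evaluate(String xml, String expression) throws XPathExpressionException {
        XPath xpath = XPathFactory.newInstance().newXPath();
        xpath.setNamespaceContext(new MapNamespaceContext(Map.of(
                "dc", "http://purl.org/dc/elements/1.1/",
                "office", "urn:oasis:names:tc:opendocument:xmlns:office:1.0")));
        return xpath.evaluate(expression, new InputSource(new StringReader(xml)));
    }
}
```

With this in place, evaluate(xml, "//dc:title") would return the text of a Dublin Core title element, and evaluate(xml, "//office:body") would return the concatenated text under an OpenDocument body. Without a registered namespace context, prefixed expressions like these cannot be resolved.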

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.