You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Rida Benjelloun (JIRA)" <ji...@apache.org> on 2008/01/13 06:02:34 UTC
[jira] Closed: (TIKA-112) XMLParser improvement
[ https://issues.apache.org/jira/browse/TIKA-112?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Rida Benjelloun closed TIKA-112.
--------------------------------
> XMLParser improvement
> ---------------------
>
> Key: TIKA-112
> URL: https://issues.apache.org/jira/browse/TIKA-112
> Project: Tika
> Issue Type: Improvement
> Components: parser
> Affects Versions: 0.2-incubating
> Reporter: Rida Benjelloun
> Assignee: Rida Benjelloun
> Priority: Minor
> Fix For: 0.2-incubating
>
>
> - Replace XMLParser by XMLParserUtils
> - Create Class DcXMLParser that extends XMLParserUtils and implements Parser. This class allows DublinCore metadata parsing
> - Add method setXMLParserNameSpaceContext() in XMLParserUtils.
> - Improvement of OpenOfficeParser to extract document content from office:body.
> - OpenOfficeParser extends XMLParserUtils
> - Modification to tika-config to use DcXMLParser instead of XMLParser
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.