You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Ingo Renner (JIRA)" <ji...@apache.org> on 2010/02/24 17:54:28 UTC
[jira] Commented: (TIKA-365) Extract more OpenDocument metadata
[ https://issues.apache.org/jira/browse/TIKA-365?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12837878#action_12837878 ]
Ingo Renner commented on TIKA-365:
----------------------------------
Somehow I only get a very limited set of meta data on the command line (trunk export from a few minutes ago):
java -jar tika-app-0.7-SNAPSHOT.jar -m ~/tika-0.7/trunk/tika-parsers/src/test/resources/test-documents/testOpenOffice2.odf
Content-Length: 10977
Content-Type: application/zip
resourceName: testOpenOffice2.odf
Is that a known issue / limitation or is there something wrong on my end? (OS X 10.6.2)
> Extract more OpenDocument metadata
> ----------------------------------
>
> Key: TIKA-365
> URL: https://issues.apache.org/jira/browse/TIKA-365
> Project: Tika
> Issue Type: Improvement
> Components: metadata
> Affects Versions: 0.6
> Reporter: Nick Burch
> Assignee: Jukka Zitting
> Priority: Minor
> Fix For: 0.7
>
> Attachments: oo-metadata.patch, testOpenOffice2.odf
>
>
> The attached patch adds support for a few more kinds of OpenDocument metadata. These are added to the metadata object much like the existing ones.
> There's also support for user defined metadata support. (Custom Metadata is stored in lines like <meta:user-defined meta:name="Info 1">Text 1</meta:user-defined>). There's a new MetadataHandler, AttributeDependantMetadataHandler, which can use the value of an attribute on the node to decide what to call the metadata when done with the node.
> Also included are several more tests for the OpenDocument parser, and one more test file to go with this.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.