You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Ingo Renner (JIRA)" <ji...@apache.org> on 2010/02/24 17:54:28 UTC

[jira] Commented: (TIKA-365) Extract more OpenDocument metadata

    [ https://issues.apache.org/jira/browse/TIKA-365?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12837878#action_12837878 ] 

Ingo Renner commented on TIKA-365:
----------------------------------

Somehow I only get a very limited set of meta data on the command line (trunk export from a few minutes ago):

java -jar tika-app-0.7-SNAPSHOT.jar -m ~/tika-0.7/trunk/tika-parsers/src/test/resources/test-documents/testOpenOffice2.odf 
Content-Length: 10977
Content-Type: application/zip
resourceName: testOpenOffice2.odf

Is that a known issue / limitation or is there something wrong on my end? (OS X 10.6.2)


> Extract more OpenDocument metadata
> ----------------------------------
>
>                 Key: TIKA-365
>                 URL: https://issues.apache.org/jira/browse/TIKA-365
>             Project: Tika
>          Issue Type: Improvement
>          Components: metadata
>    Affects Versions: 0.6
>            Reporter: Nick Burch
>            Assignee: Jukka Zitting
>            Priority: Minor
>             Fix For: 0.7
>
>         Attachments: oo-metadata.patch, testOpenOffice2.odf
>
>
> The attached patch adds support for a few more kinds of OpenDocument metadata. These are added to the metadata object much like the existing ones.
> There's also support for  user defined metadata support. (Custom Metadata is stored in lines like <meta:user-defined meta:name="Info 1">Text 1</meta:user-defined>). There's a new MetadataHandler, AttributeDependantMetadataHandler, which can use the value of an attribute on the node to decide what to call the metadata when done with the node.
> Also included are several more tests for the OpenDocument parser, and one more test file to go with this.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.