You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Nick Burch (Created) (JIRA)" <ji...@apache.org> on 2011/10/29 04:19:32 UTC

[jira] [Created] (TIKA-764) OpenDocumentMetaParser should use common metadata keys for document statistics

OpenDocumentMetaParser should use common metadata keys for document statistics
------------------------------------------------------------------------------

                 Key: TIKA-764
                 URL: https://issues.apache.org/jira/browse/TIKA-764
             Project: Tika
          Issue Type: Improvement
          Components: parser
    Affects Versions: 0.10
            Reporter: Nick Burch
            Assignee: Nick Burch
            Priority: Minor
             Fix For: 1.0


The OpenDocumentMetaParser currently outputs a number of document statistics with its own Metadata keys, rather than using the standard ones defined on the Metadata class. It should be updated to output the common ones, and once people have updated then remove the current custom ones

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Resolved] (TIKA-764) OpenDocumentMetaParser should use common metadata keys for document statistics

Posted by "Jukka Zitting (Resolved) (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/TIKA-764?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jukka Zitting resolved TIKA-764.
--------------------------------

    Resolution: Fixed
    
> OpenDocumentMetaParser should use common metadata keys for document statistics
> ------------------------------------------------------------------------------
>
>                 Key: TIKA-764
>                 URL: https://issues.apache.org/jira/browse/TIKA-764
>             Project: Tika
>          Issue Type: Improvement
>          Components: parser
>    Affects Versions: 0.10
>            Reporter: Nick Burch
>            Assignee: Nick Burch
>            Priority: Minor
>             Fix For: 1.0
>
>
> The OpenDocumentMetaParser currently outputs a number of document statistics with its own Metadata keys, rather than using the standard ones defined on the Metadata class. It should be updated to output the common ones, and once people have updated then remove the current custom ones

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Updated] (TIKA-764) OpenDocumentMetaParser should use common metadata keys for document statistics

Posted by "Jukka Zitting (Updated) (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/TIKA-764?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jukka Zitting updated TIKA-764:
-------------------------------


Resolving for 1.0 as suggested by Nick on dev@:

{quote}
We should maybe split it and resolve the first part. The ODF parser now outputs the correct keys for the part we have keys for, along with the "wrong" (non-standard ones) for backwards compatibility. The 2nd step is to add a few extra common keys for the stats that ODF has that aren't covered, then remove the non standard keys
{quote}

See followup in TIKA-770.
                
> OpenDocumentMetaParser should use common metadata keys for document statistics
> ------------------------------------------------------------------------------
>
>                 Key: TIKA-764
>                 URL: https://issues.apache.org/jira/browse/TIKA-764
>             Project: Tika
>          Issue Type: Improvement
>          Components: parser
>    Affects Versions: 0.10
>            Reporter: Nick Burch
>            Assignee: Nick Burch
>            Priority: Minor
>             Fix For: 1.0
>
>
> The OpenDocumentMetaParser currently outputs a number of document statistics with its own Metadata keys, rather than using the standard ones defined on the Metadata class. It should be updated to output the common ones, and once people have updated then remove the current custom ones

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Commented] (TIKA-764) OpenDocumentMetaParser should use common metadata keys for document statistics

Posted by "Nick Burch (Commented) (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/TIKA-764?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13139056#comment-13139056 ] 

Nick Burch commented on TIKA-764:
---------------------------------

Parser updated in r1190736 to output the common keys, along with the pre-existing custom ones for backwards compatibility. At some point we'll want to remove the non standard ones

There are a couple of fields we don't have common keys for, we should add these in and update to use them too
                
> OpenDocumentMetaParser should use common metadata keys for document statistics
> ------------------------------------------------------------------------------
>
>                 Key: TIKA-764
>                 URL: https://issues.apache.org/jira/browse/TIKA-764
>             Project: Tika
>          Issue Type: Improvement
>          Components: parser
>    Affects Versions: 0.10
>            Reporter: Nick Burch
>            Assignee: Nick Burch
>            Priority: Minor
>             Fix For: 1.0
>
>
> The OpenDocumentMetaParser currently outputs a number of document statistics with its own Metadata keys, rather than using the standard ones defined on the Metadata class. It should be updated to output the common ones, and once people have updated then remove the current custom ones

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira