You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Staffan Olsson (JIRA)" <ji...@apache.org> on 2010/07/30 08:48:16 UTC

[jira] Created: (TIKA-472) Extract image title, description and author

Extract image title, description and author
-------------------------------------------

                 Key: TIKA-472
                 URL: https://issues.apache.org/jira/browse/TIKA-472
             Project: Tika
          Issue Type: Improvement
          Components: parser
    Affects Versions: 0.7
            Reporter: Staffan Olsson


Currently the JPEG+TIFF metadata extractor reads embedded keywords but not comments. There are XMP/IPTC fields that map nicely to the metadata fields AUTHOR, TITLE and DESCRIPTION. This is related to TIKA-314 and the XMP discussion in TIKA-442.

The attached patch adds parsing of these fields as set in XMP applications such as Photoshop: Object Name (Title), Caption (Description) and By-Line (Author).

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (TIKA-472) Extract image title, description and author

Posted by "Staffan Olsson (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/TIKA-472?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Staffan Olsson updated TIKA-472:
--------------------------------

    Attachment: tika-parsers_jpegcomments.patch
                testJPEG_commented.jpg

> Extract image title, description and author
> -------------------------------------------
>
>                 Key: TIKA-472
>                 URL: https://issues.apache.org/jira/browse/TIKA-472
>             Project: Tika
>          Issue Type: Improvement
>          Components: parser
>    Affects Versions: 0.7
>            Reporter: Staffan Olsson
>         Attachments: testJPEG_commented.jpg, tika-parsers_jpegcomments.patch
>
>
> Currently the JPEG+TIFF metadata extractor reads embedded keywords but not comments. There are XMP/IPTC fields that map nicely to the metadata fields AUTHOR, TITLE and DESCRIPTION. This is related to TIKA-314 and the XMP discussion in TIKA-442.
> The attached patch adds parsing of these fields as set in XMP applications such as Photoshop: Object Name (Title), Caption (Description) and By-Line (Author).

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Resolved: (TIKA-472) Extract image title, description and author

Posted by "Nick Burch (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/TIKA-472?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Nick Burch resolved TIKA-472.
-----------------------------

         Assignee: Nick Burch
    Fix Version/s: 0.8
       Resolution: Fixed

Thanks for the patch! Applied in r981072.

> Extract image title, description and author
> -------------------------------------------
>
>                 Key: TIKA-472
>                 URL: https://issues.apache.org/jira/browse/TIKA-472
>             Project: Tika
>          Issue Type: Improvement
>          Components: parser
>    Affects Versions: 0.7
>            Reporter: Staffan Olsson
>            Assignee: Nick Burch
>             Fix For: 0.8
>
>         Attachments: testJPEG_commented.jpg, tika-parsers_jpegcomments.patch
>
>
> Currently the JPEG+TIFF metadata extractor reads embedded keywords but not comments. There are XMP/IPTC fields that map nicely to the metadata fields AUTHOR, TITLE and DESCRIPTION. This is related to TIKA-314 and the XMP discussion in TIKA-442.
> The attached patch adds parsing of these fields as set in XMP applications such as Photoshop: Object Name (Title), Caption (Description) and By-Line (Author).

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.