You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Staffan Olsson (JIRA)" <ji...@apache.org> on 2010/07/30 08:48:16 UTC
[jira] Created: (TIKA-472) Extract image title, description and
author
Extract image title, description and author
-------------------------------------------
Key: TIKA-472
URL: https://issues.apache.org/jira/browse/TIKA-472
Project: Tika
Issue Type: Improvement
Components: parser
Affects Versions: 0.7
Reporter: Staffan Olsson
Currently the JPEG+TIFF metadata extractor reads embedded keywords but not comments. There are XMP/IPTC fields that map nicely to the metadata fields AUTHOR, TITLE and DESCRIPTION. This is related to TIKA-314 and the XMP discussion in TIKA-442.
The attached patch adds parsing of these fields as set in XMP applications such as Photoshop: Object Name (Title), Caption (Description) and By-Line (Author).
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
[jira] Updated: (TIKA-472) Extract image title, description and
author
Posted by "Staffan Olsson (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/TIKA-472?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Staffan Olsson updated TIKA-472:
--------------------------------
Attachment: tika-parsers_jpegcomments.patch
testJPEG_commented.jpg
> Extract image title, description and author
> -------------------------------------------
>
> Key: TIKA-472
> URL: https://issues.apache.org/jira/browse/TIKA-472
> Project: Tika
> Issue Type: Improvement
> Components: parser
> Affects Versions: 0.7
> Reporter: Staffan Olsson
> Attachments: testJPEG_commented.jpg, tika-parsers_jpegcomments.patch
>
>
> Currently the JPEG+TIFF metadata extractor reads embedded keywords but not comments. There are XMP/IPTC fields that map nicely to the metadata fields AUTHOR, TITLE and DESCRIPTION. This is related to TIKA-314 and the XMP discussion in TIKA-442.
> The attached patch adds parsing of these fields as set in XMP applications such as Photoshop: Object Name (Title), Caption (Description) and By-Line (Author).
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
[jira] Resolved: (TIKA-472) Extract image title, description and
author
Posted by "Nick Burch (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/TIKA-472?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Nick Burch resolved TIKA-472.
-----------------------------
Assignee: Nick Burch
Fix Version/s: 0.8
Resolution: Fixed
Thanks for the patch! Applied in r981072.
> Extract image title, description and author
> -------------------------------------------
>
> Key: TIKA-472
> URL: https://issues.apache.org/jira/browse/TIKA-472
> Project: Tika
> Issue Type: Improvement
> Components: parser
> Affects Versions: 0.7
> Reporter: Staffan Olsson
> Assignee: Nick Burch
> Fix For: 0.8
>
> Attachments: testJPEG_commented.jpg, tika-parsers_jpegcomments.patch
>
>
> Currently the JPEG+TIFF metadata extractor reads embedded keywords but not comments. There are XMP/IPTC fields that map nicely to the metadata fields AUTHOR, TITLE and DESCRIPTION. This is related to TIKA-314 and the XMP discussion in TIKA-442.
> The attached patch adds parsing of these fields as set in XMP applications such as Photoshop: Object Name (Title), Caption (Description) and By-Line (Author).
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.