Posted to dev@tika.apache.org by "Nick Burch (JIRA)" <ji...@apache.org> on 2010/09/03 15:46:33 UTC

[jira] Created: (TIKA-504) Support XMP metadata keys for more of the common EXIF tags

Support XMP metadata keys for more of the common EXIF tags
----------------------------------------------------------

                 Key: TIKA-504
                 URL: https://issues.apache.org/jira/browse/TIKA-504
             Project: Tika
          Issue Type: Improvement
          Components: parser
    Affects Versions: 0.7
            Reporter: Nick Burch
            Assignee: Nick Burch
            Priority: Minor
             Fix For: 1.0


Following the work on TIKA-442, we now have some XMP-inspired image metadata keys and typed properties. This currently covers only 4 core Exif tags.

There are several other common tags that we should probably handle in the same way. The plan would be to define more entries in the TIFF metadata area, using the same definitions as in XMP, and update the Exif parser to additionally map onto these. (The raw Exif tags would continue to be output too, for compatibility.)

The tags I propose to support are:
* exif:ExposureTime
* exif:FNumber
* exif:Flash
* exif:FocalLength
* exif:IsoSpeedRatings
* exif:Manufacturer
* exif:Model
* exif:Software
* exif:Orientation
* exif:XResolution
* exif:YResolution
* exif:ResolutionUnit

Plus one extra date, which we don't really seem to have a suitable existing key for, but maybe should be made more generic?
* exif:DateTimeOriginal

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (TIKA-504) Support XMP metadata keys for more of the common EXIF tags

Posted by "Staffan Olsson (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/TIKA-504?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12907391#action_12907391 ] 

Staffan Olsson commented on TIKA-504:
-------------------------------------

A question on the implementation:
Should Tika really fall back to a string for fields that are expected to have a non-string type? For example:
               Object length = directory.getObject(ExifDirectory.TAG_ORIENTATION);
               if(length instanceof Integer) {
                  metadata.set(Metadata.ORIENTATION, Integer.toString( ((Integer)length).intValue() ));
               } else {
                  metadata.set(Metadata.ORIENTATION, directory.getString(ExifDirectory.TAG_ORIENTATION));
               }
Isn't there a risk that this produces type errors for the application that uses Tika? I think it would be better if the value is skipped when it can't be parsed to the expected type.
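The skip-on-mismatch behaviour suggested above could look roughly like the following. This is only a sketch, not Tika code: a plain Map stands in for the Metadata object, and the class name, method name and key string are assumptions made up for illustration.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical helper, not part of Tika: a plain Map stands in for Metadata.
class TypedOrientationSketch {
    // Key name assumed for illustration only.
    static final String ORIENTATION_KEY = "tiff:Orientation";

    /**
     * Stores the orientation only when the raw value is an Integer.
     * Returns true if the value was stored, false if it was skipped.
     */
    static boolean setOrientationIfInteger(Map<String, String> metadata, Object rawValue) {
        if (rawValue instanceof Integer) {
            metadata.put(ORIENTATION_KEY, Integer.toString((Integer) rawValue));
            return true;
        }
        // Unexpected type (or null): skip it rather than store a string
        // that a consumer might later fail to parse as an integer.
        return false;
    }

    public static void main(String[] args) {
        Map<String, String> metadata = new HashMap<>();
        System.out.println(setOrientationIfInteger(metadata, 6));
        System.out.println(setOrientationIfInteger(metadata, "Top, left side"));
        System.out.println(metadata);
    }
}
```

With this approach an application reading the key can rely on it being parseable as an integer whenever it is present, at the cost of losing values the parser could only read as text.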



[jira] Commented: (TIKA-504) Support XMP metadata keys for more of the common EXIF tags

Posted by "Staffan Olsson (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/TIKA-504?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12907561#action_12907561 ] 

Staffan Olsson commented on TIKA-504:
-------------------------------------

An alternative would be to have a secondary field with the same name but with "_str" (or something similar) appended.
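As a rough sketch of that alternative, the typed key would only ever hold integer values, and anything else would go under a separate key with the suffix added. Again a plain Map stands in for Metadata, and the class and method names are hypothetical, not Tika API.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical helper, not part of Tika: a plain Map stands in for Metadata.
class SecondaryStringFieldSketch {
    // Suffix taken from the suggestion above.
    static final String STRING_SUFFIX = "_str";

    /**
     * Puts Integer values under the typed key, and any other non-null
     * value under key + "_str", so the typed key never holds free text.
     */
    static void setIntegerOrSecondary(Map<String, String> metadata, String key, Object rawValue) {
        if (rawValue == null) {
            return; // nothing to record
        }
        if (rawValue instanceof Integer) {
            metadata.put(key, rawValue.toString());
        } else {
            metadata.put(key + STRING_SUFFIX, rawValue.toString());
        }
    }

    public static void main(String[] args) {
        Map<String, String> metadata = new HashMap<>();
        setIntegerOrSecondary(metadata, "tiff:Orientation", "Top, left side");
        System.out.println(metadata);
    }
}
```

This keeps the unparsed value available to applications that want it, without weakening the type guarantee on the main key.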



[jira] Resolved: (TIKA-504) Support XMP metadata keys for more of the common EXIF tags

Posted by "Nick Burch (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/TIKA-504?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Nick Burch resolved TIKA-504.
-----------------------------

    Fix Version/s: 0.8
                       (was: 1.0)
       Resolution: Fixed



[jira] Commented: (TIKA-504) Support XMP metadata keys for more of the common EXIF tags

Posted by "Nick Burch (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/TIKA-504?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12905963#action_12905963 ] 

Nick Burch commented on TIKA-504:
---------------------------------

Support added in r992368 and r992375.

All of these are now handled as strongly typed properties, based on their XMP types and keys.
