You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Jukka Zitting (JIRA)" <ji...@apache.org> on 2010/04/12 16:42:41 UTC

[jira] Created: (TIKA-404) Media-type handling depends on the locale

Media-type handling depends on the locale
-----------------------------------------

                 Key: TIKA-404
                 URL: https://issues.apache.org/jira/browse/TIKA-404
             Project: Tika
          Issue Type: Bug
          Components: mime
    Affects Versions: 0.7
            Reporter: Jukka Zitting
            Assignee: Jukka Zitting
            Priority: Critical
             Fix For: 0.8


As reported by Elif T. Kus on tika-user@, the media type handling code in Tika is locale-dependant. The MediaType class uses the String.toLowerCase() to normalize media type names, which is a good idea in general since types are supposed to be case insensitive, but unfortunately the toLowerCase() method returns different results based on the default locale of the platform.

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: https://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] Resolved: (TIKA-404) Media-type handling depends on the locale

Posted by "Jukka Zitting (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/TIKA-404?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jukka Zitting resolved TIKA-404.
--------------------------------

    Resolution: Fixed

Fixed in revision 933255.

> Media-type handling depends on the locale
> -----------------------------------------
>
>                 Key: TIKA-404
>                 URL: https://issues.apache.org/jira/browse/TIKA-404
>             Project: Tika
>          Issue Type: Bug
>          Components: mime
>    Affects Versions: 0.7
>            Reporter: Jukka Zitting
>            Assignee: Jukka Zitting
>            Priority: Critical
>             Fix For: 0.8
>
>
> As reported by Elif T. Kus on tika-user@, the media type handling code in Tika is locale-dependant. The MediaType class uses the String.toLowerCase() to normalize media type names, which is a good idea in general since types are supposed to be case insensitive, but unfortunately the toLowerCase() method returns different results based on the default locale of the platform.

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: https://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira