Posted to dev@parquet.apache.org by "Gabor Szadovszky (JIRA)" <ji...@apache.org> on 2018/04/21 12:40:01 UTC

[jira] [Updated] (PARQUET-430) Change to use Locale parameterized version of String.toUpperCase()/toLowerCase

     [ https://issues.apache.org/jira/browse/PARQUET-430?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Gabor Szadovszky updated PARQUET-430:
-------------------------------------
    Fix Version/s: 1.8.2

> Change to use Locale parameterized version of String.toUpperCase()/toLowerCase
> ------------------------------------------------------------------------------
>
>                 Key: PARQUET-430
>                 URL: https://issues.apache.org/jira/browse/PARQUET-430
>             Project: Parquet
>          Issue Type: Bug
>          Components: parquet-mr
>    Affects Versions: 1.8.0, 1.8.1
>            Reporter: Liwei Lin(Inactive)
>            Assignee: Liwei Lin(Inactive)
>            Priority: Minor
>             Fix For: 1.9.0, 1.8.2
>
>
> A String is being converted to upper or lower case using the platform's default locale. This may result in improper conversions when used with international characters.
> For instance, "TITLE".toLowerCase() in a Turkish locale returns "tıtle", where 'ı' (without a dot) is the LATIN SMALL LETTER DOTLESS I character. To obtain correct results for locale-insensitive strings, we should use toLowerCase(Locale.ENGLISH).
> For more information on this, please see:
> - http://stackoverflow.com/questions/11063102/using-locales-with-javas-tolowercase-and-touppercase
> - http://lotusnotus.com/lotusnotus_en.nsf/dx/dotless-i-tolowercase-and-touppercase-functions-use-responsibly.htm
> - http://java.sys-con.com/node/46241
> This ticket proposes to change our use of String.toUpperCase()/toLowerCase() to String.toUpperCase(Locale.ENGLISH)/toLowerCase(Locale.ENGLISH)
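
The behaviour described in the ticket can be reproduced with a minimal sketch like the one below. It passes the Turkish locale explicitly instead of changing the JVM default, so it behaves the same on any machine. The class LocaleCaseDemo and the helper normalize are hypothetical names chosen for illustration; they are not part of parquet-mr.

```java
import java.util.Locale;

// Hypothetical demo class, not taken from the parquet-mr code base.
class LocaleCaseDemo {

    // Hypothetical helper: lower-cases an identifier independently of the
    // platform's default locale, as the ticket proposes.
    static String normalize(String name) {
        return name.toLowerCase(Locale.ENGLISH);
    }

    public static void main(String[] args) {
        // Stand-in for a JVM whose default locale is Turkish.
        Locale turkish = Locale.forLanguageTag("tr-TR");

        // Under Turkish casing rules 'I' maps to U+0131 (dotless i).
        String viaTurkish = "TITLE".toLowerCase(turkish);
        String viaEnglish = normalize("TITLE");

        System.out.println(viaTurkish.equals("title")); // prints false
        System.out.println(viaEnglish.equals("title")); // prints true
    }
}
```

The same reasoning applies in the other direction: "title".toUpperCase() under a Turkish locale yields a dotted capital İ (U+0130), so a comparison against "TITLE" fails as well.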


