You are viewing a plain text version of this content. The canonical link for it is here.

Posted to issues@commons.apache.org by "Gary D. Gregory (Jira)" <ji...@apache.org> on 2022/04/03 18:45:00 UTC

[jira] [Commented] (LANG-1655) Fixed an issue where characters were separated in the NFD system

    [ https://issues.apache.org/jira/browse/LANG-1655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17516572#comment-17516572 ] 

Gary D. Gregory commented on LANG-1655:
---------------------------------------

I added and disabled this test: org.apache.commons.lang3.StringUtilsTrimStripTest.testStripAccents_Korean()

Feel free to provide a PR on GitHub with a fix ;)

 

> Fixed an issue where characters were separated in the NFD system
> ----------------------------------------------------------------
>
>                 Key: LANG-1655
>                 URL: https://issues.apache.org/jira/browse/LANG-1655
>             Project: Commons Lang
>          Issue Type: Bug
>          Components: General
>            Reporter: Donghwi Min
>            Priority: Minor
>          Time Spent: 0.5h
>  Remaining Estimate: 0h
>
> String origin = "잊지마 넌 흐린 어둠사이 왼손으로 그린 별 하나";
> *String.stripAccents(origin)* is expected to return *origin*,
> but it returns this.
> "ㅇㅣㅈㅈㅣㅁㅏ ㄴㅓㄴ ㅎㅡㄹㅣㄴ ㅇㅓㄷㅜㅁ ㅅㅏㅇㅣ ㅇㅗㅣㄴㅅㅗㄴㅇㅡㄹㅗ ㄱㅡㄹㅣㄴ ㅂㅕㄹ ㅎㅏ ㄴㅏ"
> The above string is a separate consonant and vowel of korean.
> This issue happens on NFD systems. (linux, windows)
>  



--
This message was sent by Atlassian Jira
(v8.20.1#820001)