Posted to dev@commons.apache.org by Gary Gregory <gg...@seagullsw.com> on 2004/05/24 02:37:19 UTC

[codec] Soundex issue with accented characters.

http://nagoya.apache.org/bugzilla/show_bug.cgi?id=29080

Currently, "ö" or "é" in a String causes Soundex to throw an ArrayIndexOutOfBoundsException.
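
For illustration, here is a minimal sketch of how this kind of failure can arise. It assumes the encoder looks up each letter in a 26-entry table indexed by the character's offset from 'A'; the class and method names are hypothetical and this is not the actual [codec] source.

```java
// Hypothetical sketch, not the actual Commons Codec source.
class SoundexLookupSketch {
    // Soundex digits for 'A'..'Z'; '0' marks letters that carry no code.
    static final char[] MAPPING = "01230120022455012623010202".toCharArray();

    // Assumed lookup: index the table by the letter's offset from 'A'.
    // Any character outside 'A'..'Z' produces an index outside the table.
    static char map(char ch) {
        return MAPPING[Character.toUpperCase(ch) - 'A'];
    }

    public static void main(String[] args) {
        System.out.println(map('b')); // prints 1
        try {
            map('\u00f6'); // o-umlaut: upper-cases to U+00D6, far past 'Z'
        } catch (ArrayIndexOutOfBoundsException e) {
            System.out.println("out of bounds for o-umlaut");
        }
    }
}
```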

We can either:

(1) Throw a better Exception, like IllegalArgumentException: Only 'plain' letters are allowed.

Or:

(2) Ignore unmapped characters. This would work for "ö" and "é", since vowels are ignored anyway, but it could produce bad encoding values for other characters like "ç", which would be dropped instead of being encoded as a consonant.
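
The two options could be sketched roughly as follows. This assumes the same kind of 'A'..'Z' lookup table as the standard US English Soundex mapping; the class name, method names, and exception message are hypothetical and only meant to make the trade-off concrete.

```java
// Hypothetical sketch of options (1) and (2), not a proposed patch.
class UnmappedCharSketch {
    // Soundex digits for 'A'..'Z'; '0' marks letters that carry no code.
    static final char[] MAPPING = "01230120022455012623010202".toCharArray();

    static boolean isMapped(char upper) {
        return upper >= 'A' && upper <= 'Z';
    }

    // Option (1): reject anything outside 'A'..'Z' with a clearer exception.
    static char mapStrict(char ch) {
        char upper = Character.toUpperCase(ch);
        if (!isMapped(upper)) {
            throw new IllegalArgumentException("The character is not mapped: " + ch);
        }
        return MAPPING[upper - 'A'];
    }

    // Option (2): treat unmapped characters like vowels, i.e. code '0'.
    static char mapLenient(char ch) {
        char upper = Character.toUpperCase(ch);
        return isMapped(upper) ? MAPPING[upper - 'A'] : '0';
    }
}
```

With option (2), mapLenient('\u00e7') (c-cedilla) returns '0', whereas a plain 'c' maps to '2'. That is the "bad encoding value" concern: the accented consonant silently disappears from the code.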

AFAIK, there is no API that lets you ask whether a character is a vowel.

Thoughts?

Gary

