You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@commons.apache.org by "Benedikt Ritter (JIRA)" <ji...@apache.org> on 2013/12/20 15:36:36 UTC
[jira] [Updated] (LANG-936) StringUtils.getLevenshteinDistance with
too big of a threshold returns wrong result
[ https://issues.apache.org/jira/browse/LANG-936?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Benedikt Ritter updated LANG-936:
---------------------------------
Fix Version/s: Patch Needed
> StringUtils.getLevenshteinDistance with too big of a threshold returns wrong result
> -----------------------------------------------------------------------------------
>
> Key: LANG-936
> URL: https://issues.apache.org/jira/browse/LANG-936
> Project: Commons Lang
> Issue Type: Bug
> Components: lang.*
> Affects Versions: 3.1
> Reporter: Yaniv Kunda
> Priority: Minor
> Fix For: Patch Needed
>
>
> StringUtils.getLevenshteinDistance(CharSequence s, CharSequence t, int threshold) specifies:
> {quote}
> {{Find the Levenshtein distance between two Strings if it's _+*less than or equal to*+_ a given threshold.}}
> {quote}
> When passing a threshold > *Integer.MAX_VALUE - max(s.length(), t.length())* the method always returns -1.
> The simplest use case is passing *Integer.MAX_VALUE* (a common practice if one would want to find the min/max LD of a string to several other strings in an iterative fashion.
> The code should be fixed to consider the threshold in relation to the source/target lengths, or alternatively the javadoc should be fixed to pronounce the current limit.
--
This message was sent by Atlassian JIRA
(v6.1.4#6159)