You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@commons.apache.org by "Pascal Schumacher (JIRA)" <ji...@apache.org> on 2016/06/05 15:38:59 UTC
[jira] [Updated] (LANG-1199) Fix implementation of
StringUtils.getJaroWinklerDistance()
[ https://issues.apache.org/jira/browse/LANG-1199?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Pascal Schumacher updated LANG-1199:
------------------------------------
Summary: Fix implementation of StringUtils.getJaroWinklerDistance() (was: Incorrect implementation of StringUtils.getJaroWinklerDistance())
> Fix implementation of StringUtils.getJaroWinklerDistance()
> ----------------------------------------------------------
>
> Key: LANG-1199
> URL: https://issues.apache.org/jira/browse/LANG-1199
> Project: Commons Lang
> Issue Type: Bug
> Components: lang.*
> Affects Versions: 3.4
> Reporter: M. Steiger
>
> The current implementation of StringUtils.getJaroWinklerDistance() does not compute the correct result in some cases. See #LANG-944 for the initial code contribution.
> StringUtils.getJaroWinklerDistance("Haus Ingeborg", "Ingeborg Esser") == 0.0
> This is due to the incorrect computation of common characters, which causes the algorithm to exit prematurely.
> In contrast, the implementation in Lucene gives ~0.63, which is about right.
> JaroWinklerDistance d = new JaroWinklerDistance();
> getDistance("Haus Ingeborg", "Ingeborg Esser");
> See https://lucene.apache.org/core/3_0_3/api/contrib-spellchecker/org/apache/lucene/search/spell/JaroWinklerDistance.html
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)