Posted to issues@commons.apache.org by "Pascal Schumacher (JIRA)" <ji...@apache.org> on 2016/06/05 15:38:59 UTC

[jira] [Updated] (LANG-1199) Fix implementation of StringUtils.getJaroWinklerDistance()

     [ https://issues.apache.org/jira/browse/LANG-1199?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Pascal Schumacher updated LANG-1199:
------------------------------------
    Summary: Fix implementation of StringUtils.getJaroWinklerDistance()  (was: Incorrect implementation of StringUtils.getJaroWinklerDistance())

> Fix implementation of StringUtils.getJaroWinklerDistance()
> ----------------------------------------------------------
>
>                 Key: LANG-1199
>                 URL: https://issues.apache.org/jira/browse/LANG-1199
>             Project: Commons Lang
>          Issue Type: Bug
>          Components: lang.*
>    Affects Versions: 3.4
>            Reporter: M. Steiger
>
> The current implementation of StringUtils.getJaroWinklerDistance() does not compute the correct result in some cases. See #LANG-944 for the initial code contribution.
> StringUtils.getJaroWinklerDistance("Haus Ingeborg", "Ingeborg Esser") == 0.0
> This is due to the incorrect computation of common characters, which causes the algorithm to exit prematurely.
> In contrast, the implementation in Lucene gives ~0.63, which is about right.
>     JaroWinklerDistance d = new JaroWinklerDistance();
>     d.getDistance("Haus Ingeborg", "Ingeborg Esser");
> See https://lucene.apache.org/core/3_0_3/api/contrib-spellchecker/org/apache/lucene/search/spell/JaroWinklerDistance.html
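
For reference, a minimal sketch of the textbook Jaro-Winkler computation, to illustrate the common-character step the report refers to. This is neither the Commons Lang code nor the Lucene code. The class name JaroWinklerSketch, the constants (boost threshold 0.7, prefix scale 0.1, prefix cap 4) and the handling of empty input are assumptions chosen for illustration.

```java
/** Illustrative only: a hypothetical helper, not part of Commons Lang or Lucene. */
public final class JaroWinklerSketch {

    // Conventional Winkler constants; assumed here, not taken from the issue.
    private static final double BOOST_THRESHOLD = 0.7;
    private static final double PREFIX_SCALE = 0.1;
    private static final int MAX_PREFIX = 4;

    private JaroWinklerSketch() {
    }

    public static double similarity(CharSequence a, CharSequence b) {
        if (a.length() == 0 && b.length() == 0) {
            return 1.0; // assumption: two empty inputs count as identical
        }
        CharSequence shorter = a.length() <= b.length() ? a : b;
        CharSequence longer = a.length() <= b.length() ? b : a;

        // A character only counts as common if it occurs within this distance.
        int window = Math.max(longer.length() / 2 - 1, 0);
        boolean[] shortMatched = new boolean[shorter.length()];
        boolean[] longMatched = new boolean[longer.length()];
        int matches = 0;
        for (int i = 0; i < shorter.length(); i++) {
            int lo = Math.max(0, i - window);
            int hi = Math.min(longer.length() - 1, i + window);
            for (int j = lo; j <= hi; j++) {
                // Each character of the longer string may be matched only once.
                if (!longMatched[j] && shorter.charAt(i) == longer.charAt(j)) {
                    shortMatched[i] = true;
                    longMatched[j] = true;
                    matches++;
                    break;
                }
            }
        }
        if (matches == 0) {
            return 0.0;
        }

        // Walk both match sequences in order and count positions that differ.
        int outOfOrder = 0;
        int j = 0;
        for (int i = 0; i < shorter.length(); i++) {
            if (!shortMatched[i]) {
                continue;
            }
            while (!longMatched[j]) {
                j++;
            }
            if (shorter.charAt(i) != longer.charAt(j)) {
                outOfOrder++;
            }
            j++;
        }
        int transpositions = outOfOrder / 2; // integer halving

        double m = matches;
        double jaro = (m / shorter.length() + m / longer.length()
                + (m - transpositions) / m) / 3.0;
        if (jaro < BOOST_THRESHOLD) {
            return jaro; // no prefix boost for weak matches
        }

        int prefix = 0;
        int limit = Math.min(MAX_PREFIX, shorter.length());
        while (prefix < limit && a.charAt(prefix) == b.charAt(prefix)) {
            prefix++;
        }
        return jaro + prefix * PREFIX_SCALE * (1.0 - jaro);
    }
}
```

Under these assumptions, JaroWinklerSketch.similarity("Haus Ingeborg", "Ingeborg Esser") finds nine common characters (" Ingeborg") and should come out near 0.63, in line with the Lucene figure quoted above, rather than 0.0.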


