You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@stanbol.apache.org by "Rupert Westenthaler (JIRA)" <ji...@apache.org> on 2014/03/27 09:12:17 UTC

[jira] [Created] (STANBOL-1303) Geonames LocationEnhancementEngine confidence values are not in the range [0..1]]

Rupert Westenthaler created STANBOL-1303:
--------------------------------------------

             Summary: Geonames LocationEnhancementEngine confidence values are not in the range [0..1]]
                 Key: STANBOL-1303
                 URL: https://issues.apache.org/jira/browse/STANBOL-1303
             Project: Stanbol
          Issue Type: Bug
    Affects Versions: 0.12.0, 1.0.0
            Reporter: Rupert Westenthaler
            Assignee: Rupert Westenthaler
             Fix For: 1.0.0, 0.12.1


The Geonames.org service changed the value range of provided scores from [0..100] to [0..inv]. Because of that the engine does no longer report fise:confidence values in the range of [0..1].

Looking at the reported numbers one can assume that they do represent the relative confidence (similar as Solr scores).

For the normalization to [0..1] one could 
1. normalize relative to the result with the highest score
2. use the levenshtein distance between the mention in the text with the best matching label.

Until this gets fixed the unit tests for the engine will be deactivated.



--
This message was sent by Atlassian JIRA
(v6.2#6252)