You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@stanbol.apache.org by "Rupert Westenthaler (JIRA)" <ji...@apache.org> on 2014/03/27 09:12:17 UTC
[jira] [Created] (STANBOL-1303) Geonames LocationEnhancementEngine
confidence values are not in the range [0..1]]
Rupert Westenthaler created STANBOL-1303:
--------------------------------------------
Summary: Geonames LocationEnhancementEngine confidence values are not in the range [0..1]]
Key: STANBOL-1303
URL: https://issues.apache.org/jira/browse/STANBOL-1303
Project: Stanbol
Issue Type: Bug
Affects Versions: 0.12.0, 1.0.0
Reporter: Rupert Westenthaler
Assignee: Rupert Westenthaler
Fix For: 1.0.0, 0.12.1
The Geonames.org service changed the value range of provided scores from [0..100] to [0..inv]. Because of that the engine does no longer report fise:confidence values in the range of [0..1].
Looking at the reported numbers one can assume that they do represent the relative confidence (similar as Solr scores).
For the normalization to [0..1] one could
1. normalize relative to the result with the highest score
2. use the levenshtein distance between the mention in the text with the best matching label.
Until this gets fixed the unit tests for the engine will be deactivated.
--
This message was sent by Atlassian JIRA
(v6.2#6252)