You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@stanbol.apache.org by rw...@apache.org on 2013/11/19 15:33:09 UTC

svn commit: r1543439 - /stanbol/site/trunk/content/docs/trunk/components/enhancer/engines/lucenefstlinking.mdtext

Author: rwesten
Date: Tue Nov 19 14:33:09 2013
New Revision: 1543439

URL: http://svn.apache.org/r1543439
Log:
STANBOL-1211: added documentation for the Minimum Chunk Match Score option for the FST linking engine

Modified:
    stanbol/site/trunk/content/docs/trunk/components/enhancer/engines/lucenefstlinking.mdtext

Modified: stanbol/site/trunk/content/docs/trunk/components/enhancer/engines/lucenefstlinking.mdtext
URL: http://svn.apache.org/viewvc/stanbol/site/trunk/content/docs/trunk/components/enhancer/engines/lucenefstlinking.mdtext?rev=1543439&r1=1543438&r2=1543439&view=diff
==============================================================================
--- stanbol/site/trunk/content/docs/trunk/components/enhancer/engines/lucenefstlinking.mdtext (original)
+++ stanbol/site/trunk/content/docs/trunk/components/enhancer/engines/lucenefstlinking.mdtext Tue Nov 19 14:33:09 2013
@@ -154,7 +154,8 @@ The Entity Linking Configuration of this
 * __Redirect Field__ _(enhancer.engines.linking.redirectField)_</s>: Note implemented. __NOTE__ This might not be possible to efficiently implement. When those redirects need already be considered when building the FST models.
 * <s>__Use EntityRankings__ _(enhancer.engines.linking.useEntityRankings)_</s>: This configuration gets __IGNORED__. EntityRanking based sorting is enabled as soon as the _Entity Ranking Field_ is configured.
 * <s>__Lemma based Matching__ _(enhancer.engines.linking.lemmaMatching)_</s>: Not Yet implemented
-* <s>__Min Match Score__ _(enhancer.engines.linking.minMatchScore)_</s>: Not Yet Implemented. The FST linking engine is based on the Lucene Analyzer chains configured for the _index_ and _store_ field of the FST configuration. Only if Tokens do match after the Analyzers where applied a Entity is suggested.
+* __Min Match Score__ _(enhancer.engines.linking.minMatchScore)_: The FST linking engine uses [Levenshtein distance](http://en.wikipedia.org/wiki/Levenshtein_distance) between the mention in the text and the best matching label of an Entity. It only adds suggestions if the match is greater as the configured value. _NOTE_ that this might filter suggestions of the FST for several reasons but typical reasons are stemming on short labels as well as case insensitive analyzers combined with case sensitive matching.
+* __Minimum Chunk Match Score__ _(enhancer.engines.linking.minChunkMatchScore)_: Tags provided by FST linking are reduced if they do match less as the configured percentage of tokens in a chunk. Implemented as `TagClusterReducer`.
 
 In addition the following properties are __IGNORED__ as they are not relevant for the FST Linking Engine: