You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@stanbol.apache.org by "Rupert Westenthaler (JIRA)" <ji...@apache.org> on 2013/10/17 14:45:42 UTC

[jira] [Resolved] (STANBOL-1123) Label Token matching should consider tokens that are marked as "consumed"

     [ https://issues.apache.org/jira/browse/STANBOL-1123?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Rupert Westenthaler resolved STANBOL-1123.
------------------------------------------

    Resolution: Fixed

fixed with http://svn.apache.org/r1496359 (as part of STANBOL-1114)

> Label Token matching should consider tokens that are marked as "consumed"
> -------------------------------------------------------------------------
>
>                 Key: STANBOL-1123
>                 URL: https://issues.apache.org/jira/browse/STANBOL-1123
>             Project: Stanbol
>          Issue Type: Sub-task
>          Components: Enhancement Engines
>            Reporter: Rupert Westenthaler
>            Assignee: Rupert Westenthaler
>
> Tokens marked as "consumed" should be considered while matching Labels of Entities with the processed Text.
> Marking Tokens as "consumed" aims to reduce the number or required vocabulary lookups. However considering those while matching does not hurt performance while it dose increase the quality of the linking process.
> Allowing so will bring improvements especially for very long noun phrases, where an initial query (typically by using the first to nouns) might not suggest the best matching Entity. Person mentions like "{role} {given} {given} {family}" are typical examples for such cases.



--
This message was sent by Atlassian JIRA
(v6.1#6144)