Posted to commits@stanbol.apache.org by "Rupert Westenthaler (JIRA)" <ji...@apache.org> on 2014/01/22 07:31:20 UTC
[jira] [Created] (STANBOL-1264) EntityLinking engines should consider Chunks with NER annotations
Rupert Westenthaler created STANBOL-1264:
--------------------------------------------
Summary: EntityLinking engines should consider Chunks with NER annotations
Key: STANBOL-1264
URL: https://issues.apache.org/jira/browse/STANBOL-1264
Project: Stanbol
Issue Type: Improvement
Components: Enhancement Engines
Affects Versions: 0.12.0
Reporter: Rupert Westenthaler
Assignee: Rupert Westenthaler
Detected Named Entities are represented in the AnalyzedText content part by a Chunk with a NER_ANNOTATION. In addition, NLP engines may (or may not) also add a PHRASE_ANNOTATION for the same Chunk. However, the EntityLinking engines currently consider only Chunks with a PHRASE_ANNOTATION when looking for processable Chunks. As a result, they ignore Named Entities whenever the NER engine does not also provide a PHRASE_ANNOTATION.
Because Chunks with a NER_ANNOTATION are especially useful for Entity Linking, this issue changes the behavior so that such Chunks are marked as processable whenever Nouns are included as a processable phrase type in the TextProcessingConfig of the EntityLinkingEngine.
This change depends to some degree on STANBOL-1262, because without the 'old' processing of Chunks it would result in unintended merging of Noun Phrases and NER Chunks.
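
The rule described above for marking Chunks as processable could look roughly like the following sketch. It is not Stanbol code: `PhraseCategory`, `Chunk`, and `isProcessable` are hypothetical stand-ins that only model the decision, and the set of processable phrase types stands in for whatever the TextProcessingConfig actually exposes.

```java
import java.util.EnumSet;
import java.util.Set;

public class ChunkProcessableSketch {

    /** Hypothetical phrase categories; stand-ins, not Stanbol's actual types. */
    enum PhraseCategory { NOUN_PHRASE, VERB_PHRASE, OTHER }

    /** Hypothetical, simplified view of a Chunk and the annotations it carries. */
    static final class Chunk {
        final PhraseCategory phraseAnnotation; // null when no PHRASE_ANNOTATION is present
        final boolean hasNerAnnotation;        // true when a NER_ANNOTATION is present

        Chunk(PhraseCategory phraseAnnotation, boolean hasNerAnnotation) {
            this.phraseAnnotation = phraseAnnotation;
            this.hasNerAnnotation = hasNerAnnotation;
        }
    }

    /**
     * Decides whether a Chunk should be processed (assumed logic, derived from
     * the issue description only).
     */
    static boolean isProcessable(Chunk chunk, Set<PhraseCategory> processablePhraseTypes) {
        // Existing behavior: a PHRASE_ANNOTATION of a configured phrase type.
        if (chunk.phraseAnnotation != null
                && processablePhraseTypes.contains(chunk.phraseAnnotation)) {
            return true;
        }
        // Proposed behavior: a NER_ANNOTATION is enough, provided that nouns
        // are configured as a processable phrase type.
        return chunk.hasNerAnnotation
                && processablePhraseTypes.contains(PhraseCategory.NOUN_PHRASE);
    }

    public static void main(String[] args) {
        Set<PhraseCategory> nounsOnly = EnumSet.of(PhraseCategory.NOUN_PHRASE);
        Chunk nerWithoutPhrase = new Chunk(null, true);

        // A NER Chunk without a PHRASE_ANNOTATION is now picked up.
        System.out.println(isProcessable(nerWithoutPhrase, nounsOnly));
    }
}
```

In this sketch a NER Chunk is still skipped when nouns are not configured as processable, which matches the condition stated in the description.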
--
This message was sent by Atlassian JIRA
(v6.1.5#6160)