You are viewing a plain text version of this content. The canonical link for it is here.

Posted to commits@stanbol.apache.org by "Rupert Westenthaler (JIRA)" <ji...@apache.org> on 2014/01/22 07:31:20 UTC

[jira] [Created] (STANBOL-1264) EntityLinking engines should consider Chunks with NER annotations

Rupert Westenthaler created STANBOL-1264:
--------------------------------------------

             Summary: EntityLinking engines should consider Chunks with NER annotations
                 Key: STANBOL-1264
                 URL: https://issues.apache.org/jira/browse/STANBOL-1264
             Project: Stanbol
          Issue Type: Improvement
          Components: Enhancement Engines
    Affects Versions: 0.12.0
            Reporter: Rupert Westenthaler
            Assignee: Rupert Westenthaler


Detected Named Entities are represented in the AnalyzedText content part by a Chunk with a NER_ANNOTATION. In addition NLP engines may (or may not) also add a PHRASE_ANNOTATION for the same Chunk. However the EntityLinking engines currently only consider PHRASE_ANNOTATION when looking for processable Chunks. Because of that they will not consider Named Entities in cases where NER engines do not provide PHRASE_ANNOTATIONs.

Because especially chunks of NER_ANNOTATIONs are extremely useful for Entity Linking this issue will change the behavior so that Chunks with a NER_ANNOTATION are marked as processable in cases where Nouns are included as processable phrase type in the TextProcessingConfig of the EntityLinkingEngine.

This depends somehow on STANBOL-1262, as without the 'old' processing of Chunks this would result in unintended merging of Noun Phrases and NER chunks.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)