You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@opennlp.apache.org by "Mark Giaconia (JIRA)" <ji...@apache.org> on 2013/11/09 15:34:17 UTC

[jira] [Created] (OPENNLP-615) GeoEntityLinker should score toponyms based on surrounding context via a model

Mark Giaconia created OPENNLP-615:
-------------------------------------

             Summary: GeoEntityLinker should score toponyms based on surrounding context via a model
                 Key: OPENNLP-615
                 URL: https://issues.apache.org/jira/browse/OPENNLP-615
             Project: OpenNLP
          Issue Type: Sub-task
          Components: Entity Linker
    Affects Versions: 1.6.0
            Reporter: Mark Giaconia
            Assignee: Mark Giaconia


As per the concept in this paper http://www.jasonbaldridge.com/papers/speriosu-baldridge-acl2013.pdf
the GeoEntityLinker addon should allow a user to score toponyms based on a model. For instance, if the gazateer returns an ambiguous name associated to multiple countries, X and Y, then features should be generated from around the name, and those features should be used as a test set against a categorizer for the country returned and a score generated.
This functionality also implies the need for a rapid way to generate the models based on user defined data, because countries and location mentions have content that is highly variant. Also, this method will be configurable in the GeoEntityLinker.
The Sandbox contains a model-builder-prototype that I plan to use to generate the models based on user data and the countrycontext data that the GeoEntityLinker requires, which will make it easy to get started.



--
This message was sent by Atlassian JIRA
(v6.1#6144)