Posted to issues@opennlp.apache.org by "Mark Giaconia (JIRA)" <ji...@apache.org> on 2014/08/13 14:34:11 UTC

[jira] [Updated] (OPENNLP-706) GeoEntityLinker should handle hierarchical location names via improved indexing

     [ https://issues.apache.org/jira/browse/OPENNLP-706?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Mark Giaconia updated OPENNLP-706:
----------------------------------

    Attachment: entitylinker.properties

Here are the new property file entries that support the 8/13/2014 commit against this ticket. The main differences are:
opennlp.geoentitylinker.gaz.doublequote=false
opennlp.geoentitylinker.gaz.hierarchyfield=false
opennlp.geoentitylinker.gaz=C:\\temp\\gazetteers\\opennlp_geoentitylinker_gazetteer
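
A minimal sketch of reading these entries, assuming the file is parsed as a standard Java properties file with java.util.Properties. The class and method names are hypothetical and not part of OpenNLP. It mainly shows that the doubled backslashes in the gazetteer path collapse to single ones on load:

```java
import java.io.IOException;
import java.io.StringReader;
import java.io.UncheckedIOException;
import java.util.Properties;

// Hypothetical helper for illustration only; not an OpenNLP class.
public class GazPropsSketch {

    // Parses properties text the way java.util.Properties reads a file
    // and returns the value stored under the given key (null if absent).
    public static String read(String propertiesText, String key) {
        Properties props = new Properties();
        try {
            props.load(new StringReader(propertiesText));
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return props.getProperty(key);
    }

    public static void main(String[] args) {
        // Java source escaping doubles each backslash once more:
        // the text below is the file content "C:\\temp\\gazetteers\\...".
        String text =
              "opennlp.geoentitylinker.gaz.doublequote=false\n"
            + "opennlp.geoentitylinker.gaz.hierarchyfield=false\n"
            + "opennlp.geoentitylinker.gaz=C:\\\\temp\\\\gazetteers\\\\opennlp_geoentitylinker_gazetteer\n";

        boolean hierarchy = Boolean.parseBoolean(
            read(text, "opennlp.geoentitylinker.gaz.hierarchyfield"));
        System.out.println("hierarchyfield = " + hierarchy);
        System.out.println("gaz = " + read(text, "opennlp.geoentitylinker.gaz"));
    }
}
```

In a properties file the backslash is an escape character, so a Windows path needs `\\` for each separator (or forward slashes) to survive loading.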

If your names are hierarchical, set hierarchyfield=true. You may then want to lower the minimum score threshold via the property below, since there is no telling how much of the hierarchy field a multi-token name will match.

opennlp.geoentitylinker.gaz.lucenescore.min=.3
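
To illustrate why a lower threshold can help, here is a toy sketch that scores a name by the fraction of hierarchy-field tokens it covers. This is not Lucene's scoring formula and not the GeoEntityLinker implementation. The class name, the overlap measure, and the format of the hierarchy string are all assumptions made for illustration:

```java
import java.util.HashSet;
import java.util.Locale;
import java.util.Set;

// Hypothetical illustration; the real linker relies on Lucene scores.
public class HierarchyOverlapSketch {

    // Lowercases the text and splits it on anything that is not a letter or digit.
    public static Set<String> tokens(String text) {
        Set<String> out = new HashSet<>();
        for (String t : text.toLowerCase(Locale.ROOT).split("[^\\p{L}\\p{N}]+")) {
            if (!t.isEmpty()) {
                out.add(t);
            }
        }
        return out;
    }

    // Fraction of distinct hierarchy tokens that also occur in the name.
    public static double overlap(String name, String hierarchy) {
        Set<String> hierarchyTokens = tokens(hierarchy);
        if (hierarchyTokens.isEmpty()) {
            return 0.0;
        }
        Set<String> matched = tokens(name);
        matched.retainAll(hierarchyTokens);
        return (double) matched.size() / hierarchyTokens.size();
    }

    public static void main(String[] args) {
        // Assumed hierarchy field content for one gazetteer entry.
        String hierarchy = "Hartford, Connecticut, United States";
        double minScore = 0.3; // mirrors lucenescore.min=.3, on a toy scale

        double full = overlap("Hartford, Connecticut", hierarchy); // 2 of 4 tokens: 0.5
        double bare = overlap("Hartford", hierarchy);              // 1 of 4 tokens: 0.25

        System.out.println(full + " passes: " + (full >= minScore));
        System.out.println(bare + " passes: " + (bare >= minScore));
    }
}
```

Even a correct multi-token name covers only part of a longer hierarchy string, so its score can sit well below that of an exact match.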

> GeoEntityLinker should handle hierarchical location names via improved indexing
> -------------------------------------------------------------------------------
>
>                 Key: OPENNLP-706
>                 URL: https://issues.apache.org/jira/browse/OPENNLP-706
>             Project: OpenNLP
>          Issue Type: Improvement
>          Components: Entity Linker
>    Affects Versions: 1.6.0
>         Environment: java 7
>            Reporter: Mark Giaconia
>            Assignee: Mark Giaconia
>         Attachments: entitylinker.properties
>
>   Original Estimate: 672h
>  Remaining Estimate: 672h
>
> Currently the GeoEntityLinker (Geotagger) EntityLinker does not handle hierarchical location references such as "Berlin, Germany" or "Hartford, Connecticut". This can be fixed by creating a hierarchy field in the index and querying this field whenever a multi-token name is found.



--
This message was sent by Atlassian JIRA
(v6.2#6252)