Posted to issues@opennlp.apache.org by "Mark Giaconia (JIRA)" <ji...@apache.org> on 2014/08/13 14:34:11 UTC
[jira] [Updated] (OPENNLP-706) GeoEntityLinker should handle hierarchical location names via improved indexing
[ https://issues.apache.org/jira/browse/OPENNLP-706?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Mark Giaconia updated OPENNLP-706:
----------------------------------
Attachment: entitylinker.properties
Here are the new property file entries that support the 8/13/2014 commit against this ticket. The main differences are:
opennlp.geoentitylinker.gaz.doublequote=false
opennlp.geoentitylinker.gaz.hierarchyfield=false
opennlp.geoentitylinker.gaz=C:\\temp\\gazetteers\\opennlp_geoentitylinker_gazetteer
If your names are hierarchical, set hierarchyfield=true. You may then want to lower the minimum score threshold via the property below, since there is no telling how much of the hierarchy field a multi-token name will match.
opennlp.geoentitylinker.gaz.lucenescore.min=.3
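For anyone wiring these entries into their own code, here is a minimal sketch of how they could be read with java.util.Properties and used to decide which index field to query. It is not the GeoEntityLinker implementation: the class name, the helpers fieldFor and accept, and the field names "hierarchy" and "placename" are all assumptions made for illustration. Only the property keys come from the attached file.

```java
import java.io.IOException;
import java.io.StringReader;
import java.util.Properties;

public class GazetteerConfigSketch {

    // Parses property text the same way a .properties file would be parsed.
    static Properties load(String text) {
        Properties props = new Properties();
        try {
            props.load(new StringReader(text));
        } catch (IOException e) {
            throw new IllegalArgumentException(e);
        }
        return props;
    }

    // Hypothetical helper: pick the index field to search.
    // "hierarchy" and "placename" are made-up field names, not OpenNLP's.
    static String fieldFor(String name, boolean hierarchyEnabled) {
        // Treat whitespace and commas as token separators.
        boolean multiToken = name.trim().split("[\\s,]+").length > 1;
        return (hierarchyEnabled && multiToken) ? "hierarchy" : "placename";
    }

    // Hypothetical helper: keep a hit only if it reaches the minimum score.
    static boolean accept(double score, double minScore) {
        return score >= minScore;
    }

    public static void main(String[] args) {
        // In the properties format a doubled backslash is one literal
        // backslash, hence the four backslashes per separator in Java source.
        String text =
              "opennlp.geoentitylinker.gaz.hierarchyfield=true\n"
            + "opennlp.geoentitylinker.gaz.lucenescore.min=.3\n"
            + "opennlp.geoentitylinker.gaz=C:\\\\temp\\\\gazetteers\\\\opennlp_geoentitylinker_gazetteer\n";
        Properties props = load(text);

        boolean hierarchy = Boolean.parseBoolean(
            props.getProperty("opennlp.geoentitylinker.gaz.hierarchyfield", "false"));
        double minScore = Double.parseDouble(
            props.getProperty("opennlp.geoentitylinker.gaz.lucenescore.min", ".3"));

        System.out.println(props.getProperty("opennlp.geoentitylinker.gaz"));
        System.out.println(fieldFor("Berlin, Germany", hierarchy));
        System.out.println(fieldFor("Hartford", hierarchy));
        System.out.println(accept(0.35, minScore));
    }
}
```

The point of the sketch is that a single-token name still goes to the ordinary name field even when hierarchyfield=true, and that the threshold is compared against each hit's score, so a partial match on the hierarchy field needs a lower minimum to survive.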
> GeoEntityLinker should handle hierarchical location names via improved indexing
> -------------------------------------------------------------------------------
>
> Key: OPENNLP-706
> URL: https://issues.apache.org/jira/browse/OPENNLP-706
> Project: OpenNLP
> Issue Type: Improvement
> Components: Entity Linker
> Affects Versions: 1.6.0
> Environment: java 7
> Reporter: Mark Giaconia
> Assignee: Mark Giaconia
> Attachments: entitylinker.properties
>
> Original Estimate: 672h
> Remaining Estimate: 672h
>
> Currently the GeoEntityLinker (Geotagger) EntityLinker does not handle hierarchical location references such as "Berlin, Germany" or "Hartford, Connecticut". This can be fixed by creating a hierarchy field in the index and querying that field whenever a multi-token name is found.
--
This message was sent by Atlassian JIRA
(v6.2#6252)