Posted to issues@opennlp.apache.org by "Mark Giaconia (JIRA)" <ji...@apache.org> on 2013/05/22 14:54:20 UTC

[jira] [Commented] (OPENNLP-579) Framework to support Gazateer search in concert with NER for location entities.

    [ https://issues.apache.org/jira/browse/OPENNLP-579?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13664067#comment-13664067 ] 

Mark Giaconia commented on OPENNLP-579:
---------------------------------------

The SQL file in the attached zip contains sample data (about 600 cities). If you stand up a PostGIS instance and run the script, it will create the database and the tables and load the data. Then run the Example class in a debugger and it should get a hit on New York.
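
For anyone who wants to see the lookup step without standing up a database first, here is a minimal in-memory sketch. It is not the code from the attached zip: the names GazetteerSketch, Entry, Gazetteer, and InMemoryGazetteer are hypothetical, the coordinates are approximate illustrative values, and the candidate strings stand in for whatever a location name finder would return. A PostGIS-backed implementation would presumably run a query where this one scans a list.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

public class GazetteerSketch {

    /** One gazetteer row: a place name plus a point location (hypothetical shape). */
    static final class Entry {
        final String name;
        final double latitude;
        final double longitude;

        Entry(String name, double latitude, double longitude) {
            this.name = name;
            this.latitude = latitude;
            this.longitude = longitude;
        }

        @Override
        public String toString() {
            return name + " (" + latitude + ", " + longitude + ")";
        }
    }

    /** Hypothetical search contract; a database-backed class could implement it too. */
    interface Gazetteer {
        List<Entry> find(String placeName);
    }

    /** Stand-in for the database: a case-insensitive exact match over a list. */
    static final class InMemoryGazetteer implements Gazetteer {
        private final List<Entry> entries = new ArrayList<>();

        void add(Entry entry) {
            entries.add(entry);
        }

        @Override
        public List<Entry> find(String placeName) {
            String wanted = placeName.trim().toLowerCase(Locale.ROOT);
            List<Entry> hits = new ArrayList<>();
            for (Entry entry : entries) {
                if (entry.name.toLowerCase(Locale.ROOT).equals(wanted)) {
                    hits.add(entry);
                }
            }
            return hits;
        }
    }

    /** Two illustrative rows; the real sample data has about 600 cities. */
    static Gazetteer sampleGazetteer() {
        InMemoryGazetteer gazetteer = new InMemoryGazetteer();
        gazetteer.add(new Entry("New York", 40.7128, -74.0060));
        gazetteer.add(new Entry("Boston", 42.3601, -71.0589));
        return gazetteer;
    }

    public static void main(String[] args) {
        Gazetteer gazetteer = sampleGazetteer();
        // Stand-ins for location names detected in text by a name finder.
        String[] candidates = {"New York", "Atlantis"};
        for (String candidate : candidates) {
            System.out.println(candidate + " -> " + gazetteer.find(candidate));
        }
    }
}
```

Keeping the search behind a small interface like this means the name finder side does not need to know whether the hits come from a list, PostGIS, or some other store.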

As for other gazetteers, USGS has a large gazetteer that could be loaded into the Postgres database; it is available at http://geonames.usgs.gov/domestic/download_data.htm.

Good point about the GeoSpan integration. I was just trying to go for maximum encapsulation, but I am open to any ideas you have.
Thanks!
                
> Framework to support Gazateer search in concert with NER for location entities.
> -------------------------------------------------------------------------------
>
>                 Key: OPENNLP-579
>                 URL: https://issues.apache.org/jira/browse/OPENNLP-579
>             Project: OpenNLP
>          Issue Type: Wish
>          Components: Name Finder
>    Affects Versions: 1.6.0
>         Environment: Any
>            Reporter: Mark Giaconia
>            Priority: Minor
>              Labels: features
>             Fix For: 1.6.0
>
>         Attachments: geonamefinder.properties, geonamefind.zip
>
>   Original Estimate: 672h
>  Remaining Estimate: 672h
>
> An interface for defining a gazetteer and the methods to search it, an extended Span object, and a Namefinder that encapsulates a TokenNameFinder for locations. Commercial applications that do this are extremely expensive, and there are many free gazetteers one could use to create a solution with OpenNLP. The capability should provide a simple default implementation using the most popular open source geospatial database, PostGIS.
