You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@tika.apache.org by Apache Wiki <wi...@apache.org> on 2015/05/24 08:08:15 UTC

[Tika Wiki] Update of "GeoTopicParser" by ChrisMattmann

Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Tika Wiki" for change notification.

The "GeoTopicParser" page has been changed by ChrisMattmann:
https://wiki.apache.org/tika/GeoTopicParser

New page:
= GeoTopicParser =

The GeoTopicParser combines a Gazetteer (a lookup dictionary of names/places to latitudes, longitudes) and a Named Entity Recognition (NER) modeling technique that identifies names and places in text to provide a way to geo tag documents and text i.e., to identify places in the text, and then to look up the latitude/longitude pairs for those places.

GeoTopicParser uses [[http://lucene.apache.org/|Apache Lucene]] and [[http://opennlp.apache.org/|Apache OpenNLP]] to provide its capabilities.