Posted to dev@tika.apache.org by "Mattmann, Chris A (3980)" <ch...@jpl.nasa.gov> on 2015/11/18 15:42:17 UTC

Named Entity Recognition support in trunk

Hey Folks,

With the commit of TIKA-1787/GH-61 in trunk we now have full integration
of Named Entity Recognition with Stanford NER/NLP and Apache OpenNLP.
I'll also look into whether we can integrate NLTK. This is a *big
deal* since NER is something we’ve always wanted to pull into Tika.

Woot!

Cheers,
Chris

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Chief Architect
Instrument Software and Science Data Systems Section (398)
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 168-519, Mailstop: 168-527
Email: chris.a.mattmann@nasa.gov
WWW:  http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Adjunct Associate Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++




Re: Named Entity Recognition support in trunk

Posted by Tyler Palsulich <tp...@gmail.com>.
That's awesome! Great work.

Have we tried running any benchmarks?

Tyler