Posted to commits@tika.apache.org by Apache Wiki <wi...@apache.org> on 2016/02/28 21:24:29 UTC

[Tika Wiki] Update of "TikaAndNLTK" by ChrisMattmann

The "TikaAndNLTK" page has been changed by ChrisMattmann:
https://wiki.apache.org/tika/TikaAndNLTK?action=diff&rev1=6&rev2=7

- Describe TikaAndNLTK here.
+ Natural Language Toolkit (NLTK) with Tika.
  
+ NLTK is a Python library. To use its capabilities with Tika, first install the REST server endpoint written in Python for NLTK, which extracts named entities from the text passed to it.
+ 
+ It can be installed with:
+ 
+ {{{
+  pip install nltkrest
+ }}}
+ 
+ Alternatively, download the module from [[https://www.github.com/manalishah/NLTKRest|this site]] and install it with the following commands:
+ 
+ {{{
+  cd NLTKRest/nltkrest
+  python setup.py install
+ }}}
+ This builds the module and installs the nltk-server command-line tool.
+ 
+ Then start the server with:
+ 
+ {{{
+ nltk-server -v --port 8888
+ }}}
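
Once the server is running, a client can send it text over HTTP. The following is a minimal sketch, not the documented NLTKRest API: it assumes the server listens on localhost port 8888 (as started above), accepts raw text in a POST body at the path /nltk, and answers with JSON. The path, the content type, the JSON response, and the helper names build_request and extract_entities are all assumptions made for illustration; check the NLTKRest README for the actual interface.

```python
import json
import urllib.request

# Assumption: the server started above is reachable here. The "/nltk" path
# is a guess for illustration, not confirmed by this page.
NLTK_SERVER_URL = "http://localhost:8888/nltk"


def build_request(text, url=NLTK_SERVER_URL):
    """Build a POST request that carries the text as a UTF-8 body."""
    return urllib.request.Request(
        url,
        data=text.encode("utf-8"),
        headers={"Content-Type": "text/plain; charset=utf-8"},
        method="POST",
    )


def extract_entities(text, url=NLTK_SERVER_URL, timeout=10):
    """Send the text to the server and decode the reply.

    Assumes the reply body is JSON; adjust if the server returns another format.
    """
    request = build_request(text, url)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

With the server running, calling extract_entities("Alice moved from Paris to Berlin in 2015.") would return whatever structure the server produces. Keeping request construction separate from the network call makes it possible to inspect the request without a live server.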
+