Posted to commits@tika.apache.org by Apache Wiki <wi...@apache.org> on 2015/09/24 11:35:22 UTC

[Tika Wiki] Update of "cTAKESParser" by LewisJohnMcgibbney

Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Tika Wiki" for change notification.

The "cTAKESParser" page has been changed by LewisJohnMcgibbney:
https://wiki.apache.org/tika/cTAKESParser?action=diff&rev1=10&rev2=11

  The cTAKES parser can then be invoked from tika-app as follows:
  
  {{{
- java -classpath tika-app/target/tika-app-X.Y-SNAPSHOT.jar:${CTAKES_HOME}/desc:${CTAKES_HOME}/resources:${CTAKES_HOME}/lib/\*:$HOME/src/ctakes-config org.apache.tika.cli.TikaCLI --config=$HOME/src/ctakes-config/tika-config.xml -m Vose-2013-American_Journal_of_Hematology.pdf 
+ java -classpath ${TIKA_HOME}/tika-app/target/tika-app-X.Y-SNAPSHOT.jar:${CTAKES_HOME}/desc:${CTAKES_HOME}/resources:${CTAKES_HOME}/lib/\*:$HOME/src/ctakes-config org.apache.tika.cli.TikaCLI --config=$HOME/src/ctakes-config/tika-config.xml -m Vose-2013-American_Journal_of_Hematology.pdf 
  }}}
  
  This produces the following (after a good deal of log output):
@@ -228, +228 @@

  Save this script as `gen-server-classpath.sh`. Then, start Tika-server like so:
  
  {{{
- java -classpath tika-server/target/tika-server-1.10-SNAPSHOT.jar:${CTAKES_HOME}/desc:${CTAKES_HOME}/resources::./config:`gen-server-classpath.sh` org.apache.tika.server.TikaServerCli --config tika-config.xml
+ java -classpath ${TIKA_HOME}/tika-server/target/tika-server-1.10-SNAPSHOT.jar:${CTAKES_HOME}/desc:${CTAKES_HOME}/resources::./config:`./gen-server-classpath.sh` org.apache.tika.server.TikaServerCli --config tika-config.xml
  }}}
  
  With Tika server started, let's post that biomedical PDF file to it and see what happens!
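
The long `-classpath` arguments in the commands above are easy to mistype, so it may help to assemble them from named pieces. The following is only a sketch: the `join_classpath` helper is hypothetical (it is not part of Tika or cTAKES), and the default values for `TIKA_HOME` and `CTAKES_HOME` are assumed placeholders to be replaced with your own checkout locations. The script prints the resulting tika-app command instead of running it, so the classpath can be inspected first.

```shell
#!/bin/sh
# Hypothetical helper: join all arguments with ':' into one classpath string.
join_classpath() {
  joined=""
  for entry in "$@"; do
    if [ -z "$joined" ]; then
      joined="$entry"
    else
      joined="$joined:$entry"
    fi
  done
  printf '%s\n' "$joined"
}

# Assumed default locations; override them in your environment.
TIKA_HOME="${TIKA_HOME:-$HOME/src/tika}"
CTAKES_HOME="${CTAKES_HOME:-$HOME/src/ctakes}"

# Entries are quoted so that lib/* reaches the JVM as a literal wildcard
# instead of being expanded by the shell.
CP=$(join_classpath \
  "${TIKA_HOME}/tika-app/target/tika-app-X.Y-SNAPSHOT.jar" \
  "${CTAKES_HOME}/desc" \
  "${CTAKES_HOME}/resources" \
  "${CTAKES_HOME}/lib/*" \
  "$HOME/src/ctakes-config")

# Print the command for review; nothing is executed here.
printf 'java -classpath "%s" org.apache.tika.cli.TikaCLI --config=%s -m %s\n' \
  "$CP" \
  "$HOME/src/ctakes-config/tika-config.xml" \
  "Vose-2013-American_Journal_of_Hematology.pdf"
```

Quoting each entry also removes the need for the `\*` escape used in the original command. The same helper could be used for the tika-server classpath by swapping in the server jar and appending the output of `./gen-server-classpath.sh`.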