You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@tika.apache.org by Apache Wiki <wi...@apache.org> on 2016/03/29 00:38:37 UTC

[Tika Wiki] Update of "TikaAndNER" by anujshahx

Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Tika Wiki" for change notification.

The "TikaAndNER" page has been changed by anujshahx:
https://wiki.apache.org/tika/TikaAndNER?action=diff&rev1=5&rev2=6

Comment:
Changes for including the export TIKA_APP code line in Tika + RegexNER in action snippet

  echo "PHONE_NUMBER=((\+\d{1,2}\s?)?\(?\d{3}\)?[\s.-]?\d{3}[\s.-]?\d{4})" > org/apache/tika/parser/ner/regex/ner-regex.txt
  echo "EMAIL=([a-zA-Z0-9.!#$%&'*+/=?^_`{|}~-]+@[a-zA-Z0-9](?:[a-zA-Z0-9-]{0,61}[a-zA-Z0-9])?(?:\.[a-zA-Z0-9](?:[a-zA-Z0-9-]{0,61}[a-zA-Z0-9])?))" >> org/apache/tika/parser/ner/regex/ner-regex.txt
  
+ export TIKA_APP={your/path/to/tika-app}/target/tika-app-1.12-SNAPSHOT.jar
  
  java -Dner.impl.class=org.apache.tika.parser.ner.regex.RegexNERecogniser \
      -classpath $NER_RES:$TIKA_APP org.apache.tika.cli.TikaCLI \