Posted to commits@tika.apache.org by Apache Wiki <wi...@apache.org> on 2016/03/29 00:38:37 UTC
[Tika Wiki] Update of "TikaAndNER" by anujshahx
Dear Wiki user,
You have subscribed to a wiki page or wiki category on "Tika Wiki" for change notification.
The "TikaAndNER" page has been changed by anujshahx:
https://wiki.apache.org/tika/TikaAndNER?action=diff&rev1=5&rev2=6
Comment:
Add the export TIKA_APP line to the "Tika + RegexNER in action" snippet
echo "PHONE_NUMBER=((\+\d{1,2}\s?)?\(?\d{3}\)?[\s.-]?\d{3}[\s.-]?\d{4})" > org/apache/tika/parser/ner/regex/ner-regex.txt
echo 'EMAIL=([a-zA-Z0-9.!#$%&'\''*+/=?^_`{|}~-]+@[a-zA-Z0-9](?:[a-zA-Z0-9-]{0,61}[a-zA-Z0-9])?(?:\.[a-zA-Z0-9](?:[a-zA-Z0-9-]{0,61}[a-zA-Z0-9])?))' >> org/apache/tika/parser/ner/regex/ner-regex.txt
+ export TIKA_APP={your/path/to/tika-app}/target/tika-app-1.12-SNAPSHOT.jar
java -Dner.impl.class=org.apache.tika.parser.ner.regex.RegexNERecogniser \
-classpath $NER_RES:$TIKA_APP org.apache.tika.cli.TikaCLI \