You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/11/17 18:30:11 UTC
[jira] [Resolved] (TIKA-1787) Include Stanford Name Entity
Recognition in Tika
[ https://issues.apache.org/jira/browse/TIKA-1787?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Chris A. Mattmann resolved TIKA-1787.
-------------------------------------
Resolution: Fixed
Great work [~thammegowda] and [~Yueheng]!
Thamme - please take your docs below and add the to the wiki page. Thanks!
{noformat}
[mattmann-0420740:~/tmp/tika1.12] mattmann% svn commit -m "Fix for TIKA-1787: Include Stanford Name Entity Recognition in Tika contributed by Thamme Gowda N and Yueheng He this closes #61 this closes #62"
Sending .gitignore
Sending CHANGES.txt
Sending tika-parsers/pom.xml
Adding tika-parsers/src/main/java/org/apache/tika/parser/ner
Adding tika-parsers/src/main/java/org/apache/tika/parser/ner/NERecogniser.java
Adding tika-parsers/src/main/java/org/apache/tika/parser/ner/NamedEntityParser.java
Adding tika-parsers/src/main/java/org/apache/tika/parser/ner/corenlp
Adding tika-parsers/src/main/java/org/apache/tika/parser/ner/corenlp/CoreNLPNERecogniser.java
Adding tika-parsers/src/main/java/org/apache/tika/parser/ner/opennlp
Adding tika-parsers/src/main/java/org/apache/tika/parser/ner/opennlp/OpenNLPNERecogniser.java
Adding tika-parsers/src/main/java/org/apache/tika/parser/ner/opennlp/OpenNLPNameFinder.java
Adding tika-parsers/src/main/java/org/apache/tika/parser/ner/regex
Adding tika-parsers/src/main/java/org/apache/tika/parser/ner/regex/RegexNERecogniser.java
Adding tika-parsers/src/main/resources/org/apache/tika/parser/ner
Adding tika-parsers/src/main/resources/org/apache/tika/parser/ner/regex
Adding tika-parsers/src/main/resources/org/apache/tika/parser/ner/regex/ner-regex.txt
Adding tika-parsers/src/test/java/org/apache/tika/parser/ner
Adding tika-parsers/src/test/java/org/apache/tika/parser/ner/NamedEntityParserTest.java
Adding tika-parsers/src/test/java/org/apache/tika/parser/ner/regex
Adding tika-parsers/src/test/java/org/apache/tika/parser/ner/regex/RegexNERecogniserTest.java
Adding tika-parsers/src/test/resources/org/apache/tika/parser
Adding tika-parsers/src/test/resources/org/apache/tika/parser/ner
Adding tika-parsers/src/test/resources/org/apache/tika/parser/ner/opennlp
Adding tika-parsers/src/test/resources/org/apache/tika/parser/ner/opennlp/ModelGetter.groovy
Adding tika-parsers/src/test/resources/org/apache/tika/parser/ner/opennlp/get-models.sh
Adding tika-parsers/src/test/resources/org/apache/tika/parser/ner/regex
Adding tika-parsers/src/test/resources/org/apache/tika/parser/ner/regex/ner-regex.txt
Adding tika-parsers/src/test/resources/org/apache/tika/parser/ner/tika-config.xml
Transmitting file data ................
Committed revision 1714835.
[mattmann-0420740:~/tmp/tika1.12] mattmann%
{noformat}
> Include Stanford Name Entity Recognition in Tika
> ------------------------------------------------
>
> Key: TIKA-1787
> URL: https://issues.apache.org/jira/browse/TIKA-1787
> Project: Tika
> Issue Type: Improvement
> Components: mime, parser
> Affects Versions: 1.12
> Environment: Java 1.8, Mac OSX 10.11
> Reporter: Yueheng He
> Assignee: Chris A. Mattmann
> Labels: features, newbie, test
> Fix For: 1.12
>
> Original Estimate: 168h
> Remaining Estimate: 168h
>
> Using the Stanford Name Entity Recognition, Tika will be able to extract name entities like PERSON, ORGANIZATION, LOCATION, etc from the given text. The extracted name entities will be added to the metadata
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)