Posted to dev@tika.apache.org by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/11/17 18:30:11 UTC

[jira] [Resolved] (TIKA-1787) Include Stanford Name Entity Recognition in Tika

     [ https://issues.apache.org/jira/browse/TIKA-1787?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Chris A. Mattmann resolved TIKA-1787.
-------------------------------------
    Resolution: Fixed

Great work [~thammegowda] and [~Yueheng]!

Thamme - please take your docs below and add them to the wiki page. Thanks!

{noformat}
[mattmann-0420740:~/tmp/tika1.12] mattmann% svn commit -m "Fix for TIKA-1787: Include Stanford Name Entity Recognition in Tika contributed by Thamme Gowda N and Yueheng He this closes #61 this closes #62"
Sending        .gitignore
Sending        CHANGES.txt
Sending        tika-parsers/pom.xml
Adding         tika-parsers/src/main/java/org/apache/tika/parser/ner
Adding         tika-parsers/src/main/java/org/apache/tika/parser/ner/NERecogniser.java
Adding         tika-parsers/src/main/java/org/apache/tika/parser/ner/NamedEntityParser.java
Adding         tika-parsers/src/main/java/org/apache/tika/parser/ner/corenlp
Adding         tika-parsers/src/main/java/org/apache/tika/parser/ner/corenlp/CoreNLPNERecogniser.java
Adding         tika-parsers/src/main/java/org/apache/tika/parser/ner/opennlp
Adding         tika-parsers/src/main/java/org/apache/tika/parser/ner/opennlp/OpenNLPNERecogniser.java
Adding         tika-parsers/src/main/java/org/apache/tika/parser/ner/opennlp/OpenNLPNameFinder.java
Adding         tika-parsers/src/main/java/org/apache/tika/parser/ner/regex
Adding         tika-parsers/src/main/java/org/apache/tika/parser/ner/regex/RegexNERecogniser.java
Adding         tika-parsers/src/main/resources/org/apache/tika/parser/ner
Adding         tika-parsers/src/main/resources/org/apache/tika/parser/ner/regex
Adding         tika-parsers/src/main/resources/org/apache/tika/parser/ner/regex/ner-regex.txt
Adding         tika-parsers/src/test/java/org/apache/tika/parser/ner
Adding         tika-parsers/src/test/java/org/apache/tika/parser/ner/NamedEntityParserTest.java
Adding         tika-parsers/src/test/java/org/apache/tika/parser/ner/regex
Adding         tika-parsers/src/test/java/org/apache/tika/parser/ner/regex/RegexNERecogniserTest.java
Adding         tika-parsers/src/test/resources/org/apache/tika/parser
Adding         tika-parsers/src/test/resources/org/apache/tika/parser/ner
Adding         tika-parsers/src/test/resources/org/apache/tika/parser/ner/opennlp
Adding         tika-parsers/src/test/resources/org/apache/tika/parser/ner/opennlp/ModelGetter.groovy
Adding         tika-parsers/src/test/resources/org/apache/tika/parser/ner/opennlp/get-models.sh
Adding         tika-parsers/src/test/resources/org/apache/tika/parser/ner/regex
Adding         tika-parsers/src/test/resources/org/apache/tika/parser/ner/regex/ner-regex.txt
Adding         tika-parsers/src/test/resources/org/apache/tika/parser/ner/tika-config.xml
Transmitting file data ................
Committed revision 1714835.
[mattmann-0420740:~/tmp/tika1.12] mattmann% 
{noformat}

> Include Stanford Name Entity Recognition in Tika
> ------------------------------------------------
>
>                 Key: TIKA-1787
>                 URL: https://issues.apache.org/jira/browse/TIKA-1787
>             Project: Tika
>          Issue Type: Improvement
>          Components: mime, parser
>    Affects Versions: 1.12
>         Environment: Java 1.8, Mac OSX 10.11
>            Reporter: Yueheng He
>            Assignee: Chris A. Mattmann
>              Labels: features, newbie, test
>             Fix For: 1.12
>
>   Original Estimate: 168h
>  Remaining Estimate: 168h
>
> Using Stanford Named Entity Recognition, Tika will be able to extract named entities such as PERSON, ORGANIZATION, and LOCATION from the given text. The extracted named entities will be added to the metadata.
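
For readers who want a feel for the "extract entities, then add them to the metadata" flow described in the issue, here is a minimal sketch in plain Java. It is not the committed Tika code and does not call Stanford CoreNLP: it swaps in simple regular expressions (loosely in the spirit of the regex recogniser added in this commit) so that it runs without any models. The class name RegexNerSketch, the two patterns, the entity type names, and the "NER_" key prefix are all assumptions made for illustration.

```java
import java.util.LinkedHashMap;
import java.util.LinkedHashSet;
import java.util.Map;
import java.util.Set;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical, self-contained sketch; not the Tika NamedEntityParser API.
public class RegexNerSketch {

    // Illustrative patterns only; these are NOT the contents of ner-regex.txt.
    private static final Map<String, Pattern> PATTERNS = new LinkedHashMap<>();
    static {
        PATTERNS.put("PHONE_NUMBER", Pattern.compile("\\b\\d{3}-\\d{3}-\\d{4}\\b"));
        PATTERNS.put("EMAIL", Pattern.compile("\\b[\\w.+-]+@[\\w-]+(?:\\.[\\w-]+)+\\b"));
    }

    // Returns entity type -> distinct matched strings, in order of first appearance.
    public static Map<String, Set<String>> recognise(String text) {
        Map<String, Set<String>> entities = new LinkedHashMap<>();
        for (Map.Entry<String, Pattern> entry : PATTERNS.entrySet()) {
            Matcher matcher = entry.getValue().matcher(text);
            while (matcher.find()) {
                entities.computeIfAbsent(entry.getKey(), k -> new LinkedHashSet<>())
                        .add(matcher.group());
            }
        }
        return entities;
    }

    public static void main(String[] args) {
        String text = "Call 555-123-4567 or write to dev@tika.apache.org.";

        // Stand-in for document metadata: one multi-valued key per entity type.
        // The "NER_" prefix is an assumed naming convention for this sketch.
        Map<String, Set<String>> metadata = new LinkedHashMap<>();
        recognise(text).forEach((type, names) -> metadata.put("NER_" + type, names));

        System.out.println(metadata);
    }
}
```

A statistical recogniser such as the CoreNLP or OpenNLP one would replace recognise() with a model-backed call and report types like PERSON, ORGANIZATION, and LOCATION; the step of copying the per-type results into the metadata would stay the same.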



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)