Posted to dev@datafu.apache.org by "Matthew Hayes (JIRA)" <ji...@apache.org> on 2018/03/19 16:29:00 UTC
[jira] [Closed] (DATAFU-88) Port Stanford Core NLP Functionality to DataFu
[ https://issues.apache.org/jira/browse/DATAFU-88?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Matthew Hayes closed DATAFU-88.
-------------------------------
Resolution: Won't Do
> Port Stanford Core NLP Functionality to DataFu
> ----------------------------------------------
>
> Key: DATAFU-88
> URL: https://issues.apache.org/jira/browse/DATAFU-88
> Project: DataFu
> Issue Type: New Feature
> Reporter: Russell Jurney
> Priority: Major
> Labels: lemmatizer, nlp, pig, pig_udf, stanford, stemmer
> Original Estimate: 168h
> Remaining Estimate: 168h
>
> For starters I need the Stanford Core NLP stemmer and lemmatizer.
> It looks like I could add something generic that passes its arguments through to code like: props.put("annotators", "tokenize, ssplit, pos, lemma");
> Helpful example of lemmatizing at http://stackoverflow.com/questions/1578062/lemmatization-java
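
A minimal sketch of the "generic" idea in the description, assuming the annotator names arrive as plain string arguments. The class name AnnotatorProps and the method build are hypothetical and are not part of DataFu or Stanford CoreNLP; only the "annotators" property key and its comma-separated value come from the issue text. The sketch uses java.util.Properties alone, so it runs without CoreNLP on the classpath.

```java
import java.util.Properties;

// Hypothetical helper, not part of DataFu or Stanford CoreNLP.
class AnnotatorProps {

    // Joins the given annotator names into the comma-separated
    // "annotators" property quoted in the issue description.
    static Properties build(String... annotators) {
        Properties props = new Properties();
        props.put("annotators", String.join(", ", annotators));
        return props;
    }

    public static void main(String[] args) {
        Properties props = build("tokenize", "ssplit", "pos", "lemma");
        // Prints: tokenize, ssplit, pos, lemma
        System.out.println(props.getProperty("annotators"));
    }
}
```

With Stanford CoreNLP available, the resulting Properties object would presumably be passed to the StanfordCoreNLP constructor; how a Pig UDF would receive the annotator list is left open here, since the issue does not specify it.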
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)