Posted to issues@opennlp.apache.org by "Rodrigo Agerri (JIRA)" <ji...@apache.org> on 2016/02/18 22:25:18 UTC

[jira] [Resolved] (OPENNLP-760) probabilistic lemmatizer

     [ https://issues.apache.org/jira/browse/OPENNLP-760?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Rodrigo Agerri resolved OPENNLP-760.
------------------------------------
       Resolution: Fixed
    Fix Version/s: 1.6.1

Statistical lemmatizer added; the CLI, tests, and documentation remain to be done to make it fully usable.

> probabilistic lemmatizer
> ------------------------
>
>                 Key: OPENNLP-760
>                 URL: https://issues.apache.org/jira/browse/OPENNLP-760
>             Project: OpenNLP
>          Issue Type: New Feature
>          Components: Lemmatizer
>            Reporter: Rodrigo Agerri
>            Assignee: Rodrigo Agerri
>            Priority: Minor
>             Fix For: 1.6.1
>
>
> The current SimpleLemmatizer is dictionary-based. A probabilistic lemmatizer handles unknown words better and can be combined with dictionaries.
> The method we will implement here is based on: 
> Grzegorz Chrupała. 2008. Towards a Machine-Learning Architecture for Lexical Functional Grammar Parsing. PhD dissertation, Dublin City University. http://grzegorz.chrupala.me/papers/phd-single.pdf


