You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@opennlp.apache.org by "Joern Kottmann (JIRA)" <ji...@apache.org> on 2013/04/08 16:37:15 UTC

[jira] [Closed] (OPENNLP-557) Polish NLP

     [ https://issues.apache.org/jira/browse/OPENNLP-557?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Joern Kottmann closed OPENNLP-557.
----------------------------------

    Resolution: Won't Fix

The proposed patch is not suitable for inclusion. The patch relies on external services to perform the actual NLP tasks, but OpenNLP is a NLP library which has components for these task and can process Polish text after it has been trained for it already.

Please contribute patches which make the training easier (e.g. format support for a Polish corpus) or enhance the feature generation to deal with Polish.
                
> Polish NLP
> ----------
>
>                 Key: OPENNLP-557
>                 URL: https://issues.apache.org/jira/browse/OPENNLP-557
>             Project: OpenNLP
>          Issue Type: New Feature
>            Reporter: Tomek
>            Priority: Minor
>              Labels: patch
>
> Hi,
> recently we have developed some NLP tools for Polish language.
> We have implemented some OpenNLP interfaces (which we wanted to include in OpenNLP project):
> -Sentence detector
> -Tokenizer
> -Document Categorizer  (it needs to include in project tc.xml and cache.db files ,which are included in package)
> -Part-of-Speech Tagger
> -Chunker
> -Keyword Extractor
> download package: 
> https://dl.dropbox.com/u/4021344/polishNLP.7z
> package consist manual (manual/open_nlp_manual.html), javadoc, compiled java libraries , sources, cache.db and tc.xml files (used in document categorizer).
> Tomek

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira