You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@opennlp.apache.org by "Joern Kottmann (JIRA)" <ji...@apache.org> on 2017/01/12 10:59:52 UTC

[jira] [Created] (OPENNLP-934) Replace leipzig corpus data with wikinews

Joern Kottmann created OPENNLP-934:
--------------------------------------

             Summary: Replace leipzig corpus data with wikinews
                 Key: OPENNLP-934
                 URL: https://issues.apache.org/jira/browse/OPENNLP-934
             Project: OpenNLP
          Issue Type: Improvement
            Reporter: Joern Kottmann


Wikinews is available in many languages and licensed under cc-a 2.5 which is classified as class b license at Apache. It should be ok to include that for testing resources to ensure OpenNLP works properly.

This data can be used for testing existing models and it can be partly automatically annotated to test training of all our components.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)