Posted to users@opennlp.apache.org by anurag d <an...@gmail.com> on 2014/03/04 17:35:46 UTC
Which corpora have the current models been trained on?
Hi,
Could someone let me know which corpora the POSTagger and the Parser have
been trained on?
I searched online and through past messages in the discussion forum, but
couldn't find an answer.
From the models download page, http://opennlp.sourceforge.net/models-1.5/,
the chunker has been trained on the CoNLL-2000 shared task data, but the
corpora used for the other models are not stated.
Thanks,
Anurag