Posted to users@opennlp.apache.org by anurag d <an...@gmail.com> on 2014/03/04 17:35:46 UTC

Which corpora have the current models been trained on?

Hi,

Could someone let me know which corpora the POSTagger and the Parser have
been trained on?

I searched online and through past messages in the discussion forum, but
couldn't find an answer.

According to the models download page, http://opennlp.sourceforge.net/models-1.5/,
the chunker was trained on the CoNLL-2000 shared task data, but the corpora
used for the other models are not stated.

Thanks,
Anurag