You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@opennlp.apache.org by Ian Jackson <Ia...@trilliumsoftware.com> on 2013/05/21 17:06:56 UTC

English Training Corpus

As far as I can tell, only the results of the training are available for download. It appears that the only method to change the model is to replace the model with a new model.

What corpus was used as the source for training the English models?
Was the Reuters Corpus used [http://trec.nist.gov/data/reuters/reuters.html]? If so in theory an organization could sign the correct agreements and run the existing models to create something close to input models and make the desired changes.