You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@opennlp.apache.org by Jeff Zemerick <jz...@apache.org> on 2022/11/28 19:34:53 UTC

[ANNOUNCE] OpenNLP 2.1.0 released

The Apache OpenNLP team is pleased to announce the release of version 2.1.0
of Apache OpenNLP. The Apache OpenNLP library is a machine learning based
toolkit for the processing of natural language text. It supports the most
common NLP tasks, such as tokenization, sentence segmentation,
part-of-speech tagging, named entity extraction, chunking, and parsing.

The OpenNLP 2.1.0 binary and source distributions are available for
download from our download page: https://opennlp.apache.org/download.html
The OpenNLP library is distributed by Maven Central as well. See the Maven
Dependency page for more details:
http://opennlp.apache.org/maven-dependency.html

Changes in this version:

- Update language codes in documentation
- Enable optional GPU inference in ONNX Runtime configuration
- Allow for unlimited text length in document classification with ONNX
Runtime
- Fix alphaNumOpt in tokenizer example
- Training of MaxEnt model with large corpora fails with
java.io.UTFDataFormatException
- Make parameter names in the params file be not case-sensitive
- Upgrade JUnit to version 5

For a complete list of fixed bugs and improvements please see the
announcement page at https://opennlp.apache.org/news/release-210.html.

The Apache OpenNLP Team