You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@opennlp.apache.org by Jeff Zemerick <jz...@apache.org> on 2022/06/06 13:25:07 UTC

[ANNOUNCE] OpenNLP 2.0.0 released

The Apache OpenNLP team is pleased to announce the release of version 2.0.0
of Apache OpenNLP. The Apache OpenNLP library is a machine learning based
toolkit for the processing of natural language text. It supports the most
common NLP tasks, such as tokenization, sentence segmentation,
part-of-speech tagging, named entity extraction, chunking, and parsing.

The OpenNLP 2.0.0 binary and source distributions are available for
download from our download page: https://opennlp.apache.org/download.html
The OpenNLP library is distributed by Maven Central as well. See the Maven
Dependency page for more details:
http://opennlp.apache.org/maven-dependency.html

Changes in this version:

- Now built using Java 11
- Supports model inference using the ONNX Runtime
- Adds MASC format support
- Made NameSample overlap exception more helpful
- Tokenizers can now output a new line token
- Adding missing charset to DictionaryLemmatizer
- Updated documentation to fix training API sample code
- Fixed build issues with Java 17
- Adds ability to download models from within Apache OpenNLP

For a complete list of fixed bugs and improvements please see the
RELEASE_NOTES file included in the distribution.

The Apache OpenNLP Team

Re: [ANNOUNCE] OpenNLP 2.0.0 released

Posted by Jeff Zemerick <jz...@apache.org>.
Hi Markus,

Thanks for asking about this. I think the problem was that the issues in
JIRA were not assigned to the "Fix Version" of 2.0.0 at the time of the RC
creation and it caused that file to end up empty. I have made a note in our
release process to make sure the  versions are set appropriately earlier in
the release process now.

Here's a link to the changes in 2.0:

https://issues.apache.org/jira/browse/OPENNLP-1368?jql=project%20%3D%20OPENNLP%20AND%20fixVersion%20%3D%202.0.0%20ORDER%20BY%20created%20DESC

Thanks,
Jeff


On Mon, Jun 13, 2022 at 7:13 AM Markus Jelsma <ma...@openindex.io>
wrote:

> Thanks Jeff!
>
> But i could not find the RELEASE_NOTES anywhere in source or binary
> distribution. The issuesFixed HTML file in the binary distribution is
> empty. Am i looking in the wrong direction?
>
> Thanks
>
>
> Op ma 6 jun. 2022 om 15:25 schreef Jeff Zemerick <jz...@apache.org>:
>
> > The Apache OpenNLP team is pleased to announce the release of version
> 2.0.0
> > of Apache OpenNLP. The Apache OpenNLP library is a machine learning based
> > toolkit for the processing of natural language text. It supports the most
> > common NLP tasks, such as tokenization, sentence segmentation,
> > part-of-speech tagging, named entity extraction, chunking, and parsing.
> >
> > The OpenNLP 2.0.0 binary and source distributions are available for
> > download from our download page:
> https://opennlp.apache.org/download.html
> > The OpenNLP library is distributed by Maven Central as well. See the
> Maven
> > Dependency page for more details:
> > http://opennlp.apache.org/maven-dependency.html
> >
> > Changes in this version:
> >
> > - Now built using Java 11
> > - Supports model inference using the ONNX Runtime
> > - Adds MASC format support
> > - Made NameSample overlap exception more helpful
> > - Tokenizers can now output a new line token
> > - Adding missing charset to DictionaryLemmatizer
> > - Updated documentation to fix training API sample code
> > - Fixed build issues with Java 17
> > - Adds ability to download models from within Apache OpenNLP
> >
> > For a complete list of fixed bugs and improvements please see the
> > RELEASE_NOTES file included in the distribution.
> >
> > The Apache OpenNLP Team
> >
>

Re: [ANNOUNCE] OpenNLP 2.0.0 released

Posted by Markus Jelsma <ma...@openindex.io>.
Thanks Jeff!

But i could not find the RELEASE_NOTES anywhere in source or binary
distribution. The issuesFixed HTML file in the binary distribution is
empty. Am i looking in the wrong direction?

Thanks


Op ma 6 jun. 2022 om 15:25 schreef Jeff Zemerick <jz...@apache.org>:

> The Apache OpenNLP team is pleased to announce the release of version 2.0.0
> of Apache OpenNLP. The Apache OpenNLP library is a machine learning based
> toolkit for the processing of natural language text. It supports the most
> common NLP tasks, such as tokenization, sentence segmentation,
> part-of-speech tagging, named entity extraction, chunking, and parsing.
>
> The OpenNLP 2.0.0 binary and source distributions are available for
> download from our download page: https://opennlp.apache.org/download.html
> The OpenNLP library is distributed by Maven Central as well. See the Maven
> Dependency page for more details:
> http://opennlp.apache.org/maven-dependency.html
>
> Changes in this version:
>
> - Now built using Java 11
> - Supports model inference using the ONNX Runtime
> - Adds MASC format support
> - Made NameSample overlap exception more helpful
> - Tokenizers can now output a new line token
> - Adding missing charset to DictionaryLemmatizer
> - Updated documentation to fix training API sample code
> - Fixed build issues with Java 17
> - Adds ability to download models from within Apache OpenNLP
>
> For a complete list of fixed bugs and improvements please see the
> RELEASE_NOTES file included in the distribution.
>
> The Apache OpenNLP Team
>

Re: [ANNOUNCE] OpenNLP 2.0.0 released

Posted by Markus Jelsma <ma...@openindex.io>.
Thanks Jeff!

But i could not find the RELEASE_NOTES anywhere in source or binary
distribution. The issuesFixed HTML file in the binary distribution is
empty. Am i looking in the wrong direction?

Thanks


Op ma 6 jun. 2022 om 15:25 schreef Jeff Zemerick <jz...@apache.org>:

> The Apache OpenNLP team is pleased to announce the release of version 2.0.0
> of Apache OpenNLP. The Apache OpenNLP library is a machine learning based
> toolkit for the processing of natural language text. It supports the most
> common NLP tasks, such as tokenization, sentence segmentation,
> part-of-speech tagging, named entity extraction, chunking, and parsing.
>
> The OpenNLP 2.0.0 binary and source distributions are available for
> download from our download page: https://opennlp.apache.org/download.html
> The OpenNLP library is distributed by Maven Central as well. See the Maven
> Dependency page for more details:
> http://opennlp.apache.org/maven-dependency.html
>
> Changes in this version:
>
> - Now built using Java 11
> - Supports model inference using the ONNX Runtime
> - Adds MASC format support
> - Made NameSample overlap exception more helpful
> - Tokenizers can now output a new line token
> - Adding missing charset to DictionaryLemmatizer
> - Updated documentation to fix training API sample code
> - Fixed build issues with Java 17
> - Adds ability to download models from within Apache OpenNLP
>
> For a complete list of fixed bugs and improvements please see the
> RELEASE_NOTES file included in the distribution.
>
> The Apache OpenNLP Team
>