Posted to dev@opennlp.apache.org by Jeff Zemerick <jz...@apache.org> on 2022/04/05 14:34:07 UTC

OpenNLP 2.0 release discussion

Hi all,

I would like to propose an OpenNLP 2.0 release for the following reasons:

- There are a few significant changes: a Java 11 build, support for ONNX
models, and automatic model downloading.
- User activity has been somewhat low and a 2.0 release might help bring
attention to these new features.
- 1.x has been around for 10+ years. :)
- Other reasons?

Thoughts? Concerns?

Thanks,
Jeff


Our current master branch has the following changes:

Bug
[OPENNLP-1353] - DictionaryLemmatizer missing charset

Improvement
[OPENNLP-565] - Add MASC format support
[OPENNLP-1185] - Tokenizers should be able to output a new line token
[OPENNLP-1306] - NameSample overlap exception not helpful

Task
[OPENNLP-1318] - Add ability to download models from within OpenNLP
[OPENNLP-1351] - Support ONNX models
[OPENNLP-1354] - Change build to use Java 11
[OPENNLP-1355] - Document ONNX capability introduced in OPENNLP-1351
[OPENNLP-1356] - Document the ONNX implementations
[OPENNLP-1359] - Build fails with Java 17
[OPENNLP-1364] - Move setKeepNewLines to the Tokenizer class

Documentation
[OPENNLP-1319] - The Training API code is outdated in Manual

Re: OpenNLP 2.0 release discussion

Posted by Jeff Zemerick <jz...@apache.org>.
I will start the prep work for a 2.0 RC (unless anyone else would like to)
in a few days.

Thanks,
Jeff

On Wed, Apr 6, 2022 at 3:01 AM Tommaso Teofili <to...@gmail.com>
wrote:

> +1
>
> Tommaso
>
> On Wed, 6 Apr 2022 at 04:00, Bruno P. Kinoshita
> <br...@yahoo.com.br.invalid> wrote:
>
> >  +1 Jeff, thanks!
> >
> > Bruno

Re: OpenNLP 2.0 release discussion

Posted by Tommaso Teofili <to...@gmail.com>.
+1

Tommaso

On Wed, 6 Apr 2022 at 04:00, Bruno P. Kinoshita
<br...@yahoo.com.br.invalid> wrote:

>  +1 Jeff, thanks!
>
> Bruno

Re: OpenNLP 2.0 release discussion

Posted by "Bruno P. Kinoshita" <br...@yahoo.com.br.INVALID>.
+1 Jeff, thanks!

Bruno
