You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@opennlp.apache.org by Jörn Kottmann <ko...@gmail.com> on 2011/09/29 16:48:12 UTC

OpenNLP 1.5.2 RC 2 ready for testing

Hi all,

our second release candidate is ready for testing.

It can be downloaded from here:
http://people.apache.org/~joern/releases/opennlp-1.5.2-incubating/rc2/

To use it in a maven build set the version for opennlp-tools or opennlp-uima
to 1.5.2, and for opennlp-maxent to 3.0.2, and add this URL to your 
settings.xml
file:
https://repository.apache.org/content/repositories/orgapacheopennlp-006/

The RC 1 staging repository is dropped now.

The current test plan can be found here:
https://cwiki.apache.org/OPENNLP/testplan152.html

The release plan can be found here:
https://cwiki.apache.org/OPENNLP/releaseplanandtasks152.html

The RC contains the following improvements compared to RC 1:
- OPENNLP-286    Write a test for the POSDictionary to test the case 
sensitive/insensitive flag
- OPENNLP-288    Loading a tag dictionary in the old format does not 
work with case insensitive
- OPENNLP-297    Add more descriptive documentation on the Span class
- OPENNLP-301    UIMA descriptors still use old IBM Watson name space
- OPENNLP-298    Span.contains(index) reports a value equal to end as 
being part of the span.
- OPENNLP-294    Name Finder Evaluator does not clear adaptive data
- OPENNLP-295    Sentence Detectors sent span array and probability 
array does not match
- OPENNLP-293    Deprecate Dictionary constructor that takes InputStream 
and case sensitivity as argument
- OPENNLP-292    Remove CL argument specifying if abbreviation 
dictionary is case sensitive
                              from SentenceDetector and Tokenizer 
training tools
- OPENNLP-291    DetailedF listener not working for TokenNameFinder
- OPENNLP-289    TokenNameFinderEvaluatorTool and ChunkerEvaluatorTool 
are not printing usage correctly
- OPENNLP-241    Model validations is only performed if instatiated from 
Input Stream.
- OPENNLP-245    Improve Dictionary Testing
- OPENNLP-258    Refactor cross validation and training code to always 
use the new Training Parameters object
- OPENNLP-200    Addition of prepositional phrase attachment dataset and 
unit test for it
- OPENNLP-280    Leipzig corpora links seem to be broken
- OPENNLP-279    CONLL03 documation link to LDC should be a html link
- OPENNLP-278    Link to Arvores Deitadas in documentation is broken

Most of these issues should not affect the performance of our taggers, 
an exception is OPENNLP-294 which
now clears the adaptive data of the name finder correctly during evaluation.

Jörn