You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@opennlp.apache.org by Jörn Kottmann <ko...@gmail.com> on 2011/09/29 16:48:12 UTC
OpenNLP 1.5.2 RC 2 ready for testing
Hi all,
our second release candidate is ready for testing.
It can be downloaded from here:
http://people.apache.org/~joern/releases/opennlp-1.5.2-incubating/rc2/
To use it in a maven build set the version for opennlp-tools or opennlp-uima
to 1.5.2, and for opennlp-maxent to 3.0.2, and add this URL to your
settings.xml
file:
https://repository.apache.org/content/repositories/orgapacheopennlp-006/
The RC 1 staging repository is dropped now.
The current test plan can be found here:
https://cwiki.apache.org/OPENNLP/testplan152.html
The release plan can be found here:
https://cwiki.apache.org/OPENNLP/releaseplanandtasks152.html
The RC contains the following improvements compared to RC 1:
- OPENNLP-286 Write a test for the POSDictionary to test the case
sensitive/insensitive flag
- OPENNLP-288 Loading a tag dictionary in the old format does not
work with case insensitive
- OPENNLP-297 Add more descriptive documentation on the Span class
- OPENNLP-301 UIMA descriptors still use old IBM Watson name space
- OPENNLP-298 Span.contains(index) reports a value equal to end as
being part of the span.
- OPENNLP-294 Name Finder Evaluator does not clear adaptive data
- OPENNLP-295 Sentence Detectors sent span array and probability
array does not match
- OPENNLP-293 Deprecate Dictionary constructor that takes InputStream
and case sensitivity as argument
- OPENNLP-292 Remove CL argument specifying if abbreviation
dictionary is case sensitive
from SentenceDetector and Tokenizer
training tools
- OPENNLP-291 DetailedF listener not working for TokenNameFinder
- OPENNLP-289 TokenNameFinderEvaluatorTool and ChunkerEvaluatorTool
are not printing usage correctly
- OPENNLP-241 Model validations is only performed if instatiated from
Input Stream.
- OPENNLP-245 Improve Dictionary Testing
- OPENNLP-258 Refactor cross validation and training code to always
use the new Training Parameters object
- OPENNLP-200 Addition of prepositional phrase attachment dataset and
unit test for it
- OPENNLP-280 Leipzig corpora links seem to be broken
- OPENNLP-279 CONLL03 documation link to LDC should be a html link
- OPENNLP-278 Link to Arvores Deitadas in documentation is broken
Most of these issues should not affect the performance of our taggers,
an exception is OPENNLP-294 which
now clears the adaptive data of the name finder correctly during evaluation.
Jörn