You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@uima.apache.org by Jerry Cwiklik <cw...@apache.org> on 2019/04/26 15:37:27 UTC

[ANNOUNCE] Apache UIMA-DUCC 3.0.0 released

The Apache UIMA team is pleased to announce the release of

Apache UIMA-DUCC version 3.0.0

You can download it from here:

http://uima.apache.org/downloads.cgi

DUCC stands for Distributed UIMA Cluster Computing. DUCC is a cluster
management system providing tooling, management, and scheduling
facilities to automate the scale-out of applications written to the UIMA
framework. Core UIMA provides a generalized framework for applications
that process unstructured information such as human language, but does 
not provide a
scale-out mechanism. UIMA-AS provides a scale-out mechanism to
distribute UIMA pipelines over a cluster of computing resources, but
does not provide job or cluster management of the resources.

DUCC defines a formal job model that closely maps to a standard UIMA
pipeline. Around this job model DUCC provides cluster management
services to automate the scale-out of UIMA pipelines over computing
clusters. It is primarily intended to be used for research and
development activities where multiple users need to efficiently share
cluster resources for a wide variety of computational activities.
Because DUCC is built for UIMA-based analytics from the ground up it
automatically makes available such details as what annotators are
currently initializing as well as the timing breakdown for each
primitive annotator in a pipeline.

The Unstructured Information Management Architecture (UIMA) is a
component framework supporting development, discovery, composition, and
deployment of multi-modal analytics tasked with the analysis of
unstructured information.

More information about DUCC is here:
https://uima.apache.org/d/uima-ducc-3.0.0/duccbook.html

This release contains number of improvements and bug fixes. Notable 
updates in this release include:

- Support for UIMA v2 and v3
- Created new pull service that can be run with or without DUCC
- Enable DUCC to run without shared file system
- Add new DUCC stop options, including quiesce
- Add "CASes processed" to annotator performance metrics
- Upgraded to Cassandra Server v.3.11.3, Cassandra Driver v.3.6.0, Jetty 
v.9.4.14.v20181114, guava v.18.0, joda v.2.4, commons.lang v.3.1, 
commons.math v.3.2, netty v.4.0.44, snappy v.1.1.1.7
- Other bug fixes

For a complete list of bugs and improvements included in this release 
please see
https://issues.apache.org/jira/browse/UIMA-6019?jql=project%20%3D%20UIMA%20AND%20fixVersion%20%3D%20%223.0.0-Ducc%22%20

Please send feedback via the Apache UIMA project mailing lists.

-- Jerry Cwiklik, for the Apache UIMA development team