You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@uima.apache.org by Jerry Cwiklik <cw...@apache.org> on 2018/03/22 14:20:42 UTC

[ANNOUNCE] Apache UIMA DUCC 2.2.2 released

The Apache UIMA team is pleased to announce the release of the Apache 
UIMA-DUCC version 2.2.2.

The Unstructured Information Management Architecture (UIMA) is a 
component framework supporting development, discovery, composition, and 
deployment of multi-modal analytics tasked with the analysis of 
unstructured information.

Apache UIMA is an Apache licensed open source implementation of the UIMA 
specification which is being developed by a technical committee within 
OASIS, a standards organization. The implementation comprises an SDK and 
tooling for composing and running analytic components written in Java and
C++, with some support for Perl, Python and TCL.

DUCC stands for Distributed UIMA Cluster Computing. DUCC is a cluster 
management system providing tooling,management, and scheduling 
facilities to automate the scale-out of applications written to the UIMA 
framework. Core UIMA provides a generalized framework for applications 
that process
unstructured information such as human language, but does not provide a 
scale-out mechanism. UIMA-AS provides a scale-out mechanism to 
distribute UIMA pipelines over a cluster of computing resources, but 
does not provide job or cluster management of the resources.

DUCC defines a formal job model that closely maps to a standard UIMA 
pipeline. Around this job model DUCC provides cluster management 
services to automate the scale-out of UIMA pipelines over computing 
clusters. It is primarily intended to be used for research and 
development activities where multiple users need to efficiently share 
cluster resources for a wide variety of computational activities. 
Because DUCC is built for UIMA-based analytics from the ground up it 
automatically makes available such details as what annotators are 
currently initializing as well as the timing breakdown for each 
primitive annotator in a pipeline.

More information about DUCC is here: 
https://uima.apache.org/d/uima-ducc-2.2.2/duccbook.html

This release contains a number of improvements and bug fixes. Notable 
updates in this release include:

- Upgraded UIMA (2.10.2), UIMA-AS (2.10.2), ActiveMQ (5.15.2), Spring 
Framework (4.3.9), Camel (2.19.3), XStream (1.4.10)
- Requires java 8 for builds and runtime. User jobs and services must 
use java 8.
- Fixed rogue process detector to prevent truncation of user names
- Reduced size of publications between the PM and Agents
- Improved Agent's detection of defunct processes
- Improved logging of system events ducc_watcher improvements
- Job & managed reservation specs now saved in DB
- DUCC Web Server (WS) queries DB too often
- DUCC Web Server (WS) should use Resource Manager (RM) calculated value 
for machine's memory available

For a complete list of bugs and improvements included in this release
please see 
https://issues.apache.org/jira/browse/UIMA-5746?jql=project%20%3D%20UIMA%20AND%20fixVersion%20%3D%20%222.2.2-Ducc%22%20

To download DUCC please visit: https://uima.apache.org/downloads.cgi

-- Jerry Cwiklik, for the Apache UIMA development team