You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@clerezza.apache.org by to...@apache.org on 2011/09/07 16:01:46 UTC

svn commit: r1166189 - /incubator/clerezza/site/trunk/graph.nt

Author: tommaso
Date: Wed Sep  7 14:01:46 2011
New Revision: 1166189

URL: http://svn.apache.org/viewvc?rev=1166189&view=rev
Log:
fixing description of uima.utils

Modified:
    incubator/clerezza/site/trunk/graph.nt

Modified: incubator/clerezza/site/trunk/graph.nt
URL: http://svn.apache.org/viewvc/incubator/clerezza/site/trunk/graph.nt?rev=1166189&r1=1166188&r2=1166189&view=diff
==============================================================================
--- incubator/clerezza/site/trunk/graph.nt (original)
+++ incubator/clerezza/site/trunk/graph.nt Wed Sep  7 14:01:46 2011
@@ -2,7 +2,7 @@
 <http://localhost:8080/> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://clerezza.org/2009/09/hierarchy#Collection> .
 <http://localhost:8080/clerezza-uima/index-content-el/0> <http://discobits.org/ontology#infoBit> "<h3 xmlns=\"http://www.w3.org/1999/xhtml\">Introduction to Clerezza-UIMA integration</h3>\n<a xmlns=\"http://www.w3.org/1999/xhtml\" href=\"http://uima.apache.org\" target=\"_blank\">UIMA</a> is an <a xmlns=\"http://www.w3.org/1999/xhtml\" href=\"http://www.oasis-open.org/committees/tc_home.php?wg_abbrev=uima\" target=\"_blank\">OASIS</a> standard that allows the definition of analysis pipelines to manage unstructured information and extract structures and semantics around given data.<br xmlns=\"http://www.w3.org/1999/xhtml\"></br>\nThe Clerezza-UIMA integration brings the power of UIMA inside Clerezza providing reuse of existing UIMA components, definition of new ones in a linked data oriented system.<br xmlns=\"http://www.w3.org/1999/xhtml\"></br>\nA basic mechanism for mapping UIMA CAS (Common Analysis Structure) to an RDF graph has been defined together with the ability of s
 toring such an object in one of the Clerezza supported triple stores.<br xmlns=\"http://www.w3.org/1999/xhtml\"></br>\nClerezza runs inside an OSGi environment while UIMA is not fully OSGi compliant as is, so this integration work takes care also of the OSGi adaption.\n"^^<http://www.w3.org/1999/02/22-rdf-syntax-ns#XMLLiteral> .
 <http://localhost:8080/clerezza-uima/index-content-el/0> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://discobits.org/ontology#XHTMLInfoDiscoBit> .
-<http://localhost:8080/clerezza-uima/index-content-el/1> <http://discobits.org/ontology#infoBit> "<h3 xmlns=\"http://www.w3.org/1999/xhtml\">Clerezza-UIMA modules</h3>\n<ul xmlns=\"http://www.w3.org/1999/xhtml\">\n  <li><b>uima.ontologies</b><div>an ontology, and the generated Java source code, for defining the UIMA CAS model classes.</div></li>\n  <li><b>uima.utils</b><div>base module which allows the usage of UIMA inside Clerezza.<br></br> It defines the way UIMA framework classes are instantiated and initialized with the Clerezza OSGi environment with an extension classloader which collects classloaders containing the registered UIMA analysis components. To make it possible to create a UIMA pipeline from a bundle the bundle needs to register any UIMA analysis component in the extension classloader, this can be done using a specialized OSGi Activator defined in this module.<br></br> The uima.utils module also allows to cache any previously initialized analysis engine, the 
 execution of previously defined (a UIMA pipeline based on external services of OpenCalais and AlchemyAPI is already implemented) and custom UIMA pipelies. Also there are utility methods to retrieve UIMA annotations from the CAS model and to decorate an existing graph node with the information extracted by UIMA.</div></li>\n  <li><b>uima.metadata-generator</b><div>this module contains an implementation of a Clerezza metadata generator which generates meta data about specified data sent as a sequence of bytes, analyzing the resource media type with Apache Tika and then extracting tags, concepts, language and other entities with uima.utils external services based UIMA pipeline.</div></li>\n  <li><b>uima.casconsumer</b><div>a CAS Consumer in UIMA is an analysis component which is responsible of consuming the annotations and feature structures contained in a CAS (or the CAS itself) in some way. The ClerezzaCASConsuemr contained in this module can map information contained in a CA
 S to a (RDF) graph and eventually store it inside a triple store. The mapping strategy can be configured and extended; the current implementations count a default mapping implementation based on the basic uima.utils mapping strategy and a mapping based on <a href=\"http://code.google.com/p/annotation-ontology/\" target=\"_blank\">Annotation Ontology</a>.</div></li>\n  <li><b>uima.concept-tagging</b><div>this module provides a UIMA enabled version of the base Clerezza concept tagger which is able to automatically annotate a node with concept tags. Also another service to automatically enhance an external resource (given the URI), write it in the triple store with the Clerezza CASConsumer and return an RDF version of the annotated resource.</div></li>\n</ul>"^^<http://www.w3.org/1999/02/22-rdf-syntax-ns#XMLLiteral> .
+<http://localhost:8080/clerezza-uima/index-content-el/1> <http://discobits.org/ontology#infoBit> "<h3 xmlns=\"http://www.w3.org/1999/xhtml\">Clerezza-UIMA modules</h3>\n<ul xmlns=\"http://www.w3.org/1999/xhtml\">\n  <li><b>uima.ontologies</b><div>an ontology, and the generated Java source code, for defining the UIMA CAS model classes.</div></li>\n  <li><b>uima.utils</b><div>base module which allows the usage of UIMA inside Clerezza.<br></br> It defines the way UIMA framework classes are instantiated and initialized with the Clerezza OSGi environment with an extension classloader which collects classloaders containing the registered UIMA analysis components. To make it possible to create a UIMA pipeline from a bundle the bundle needs to register any UIMA analysis component in the extension classloader, this can be done using a specialized OSGi Activator defined in this module.<br></br> The uima.utils module also allows to cache any previously initialized analysis engine, the 
 execution of previously defined (a UIMA pipeline based on external services of OpenCalais and AlchemyAPI is already implemented) and custom UIMA pipelies. The module provides  utility methods for retrieving UIMA annotations from the CAS model and decorating existing graph node with the information extracted by UIMA..</div></li>\n  <li><b>uima.metadata-generator</b><div>this module contains an implementation of a Clerezza metadata generator which generates meta data about specified data sent as a sequence of bytes, analyzing the resource media type with Apache Tika and then extracting tags, concepts, language and other entities with uima.utils external services based UIMA pipeline.</div></li>\n  <li><b>uima.casconsumer</b><div>a CAS Consumer in UIMA is an analysis component which is responsible of consuming the annotations and feature structures contained in a CAS (or the CAS itself) in some way. The ClerezzaCASConsuemr contained in this module can map information contained i
 n a CAS to a (RDF) graph and eventually store it inside a triple store. The mapping strategy can be configured and extended; the current implementations count a default mapping implementation based on the basic uima.utils mapping strategy and a mapping based on <a href=\"http://code.google.com/p/annotation-ontology/\" target=\"_blank\">Annotation Ontology</a>.</div></li>\n  <li><b>uima.concept-tagging</b><div>this module provides a UIMA enabled version of the base Clerezza concept tagger which is able to automatically annotate a node with concept tags. Also another service to automatically enhance an external resource (given the URI), write it in the triple store with the Clerezza CASConsumer and return an RDF version of the annotated resource.</div></li>\n</ul>"^^<http://www.w3.org/1999/02/22-rdf-syntax-ns#XMLLiteral> .
 <http://localhost:8080/clerezza-uima/index-content-el/1> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://discobits.org/ontology#XHTMLInfoDiscoBit> .
 <http://localhost:8080/clerezza-uima/index-content-el/2> <http://discobits.org/ontology#infoBit> "<h3 xmlns=\"http://www.w3.org/1999/xhtml\">Getting started</h3>"^^<http://www.w3.org/1999/02/22-rdf-syntax-ns#XMLLiteral> .
 <http://localhost:8080/clerezza-uima/index-content-el/2> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://discobits.org/ontology#OrderedContent> .