You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@stanbol.apache.org by rw...@apache.org on 2013/03/20 16:24:28 UTC

svn commit: r1458883 - in /stanbol/site/trunk/content/docs/trunk/components/enhancer: engines/list.mdtext nlp/index.mdtext

Author: rwesten
Date: Wed Mar 20 15:24:28 2013
New Revision: 1458883

URL: http://svn.apache.org/r1458883
Log:
Added documentation for the TextAnnotation new Model Enine (STANBOL-953) as well as the Kuromoji NLP engine for Japanese (STANBOL-980)

Modified:
    stanbol/site/trunk/content/docs/trunk/components/enhancer/engines/list.mdtext
    stanbol/site/trunk/content/docs/trunk/components/enhancer/nlp/index.mdtext

Modified: stanbol/site/trunk/content/docs/trunk/components/enhancer/engines/list.mdtext
URL: http://svn.apache.org/viewvc/stanbol/site/trunk/content/docs/trunk/components/enhancer/engines/list.mdtext?rev=1458883&r1=1458882&r2=1458883&view=diff
==============================================================================
--- stanbol/site/trunk/content/docs/trunk/components/enhancer/engines/list.mdtext (original)
+++ stanbol/site/trunk/content/docs/trunk/components/enhancer/engines/list.mdtext Wed Mar 20 15:24:28 2013
@@ -110,11 +110,15 @@ This includes Engines that perform some 
 * __[RESTfull NLP Analysis Engine](restfulnlpanalysis)__:
     * client for the [RESTful NLP Analysis Service](../nlp/restfulnlpanalysisservice) as specified by [STANBOL-892](https://issues.apache.org/jira/browse/STANBOL-892)
 
-* __Paoding NLP Analyses Engine__:
+* __[Kuromoji NLP Engine](kuromojinlp)__:
+    * Supports Sentence Detection, Tokenizing Part of Speech tagging and Named Entity Recognition for Japanese
+
+* __Gosen NLP Analyses Engine__:
     * Supports Sentence Detection, Tokenizing, Part of Speech tagging and Named Entity Recognition for Japanese
     * Provided by the [Stanbol Gosen integration](https://github.com/westei/stanbol-gosen)
     * __NOTE:__ _This Engine is not part of Apache Stanbol and needs to be downloaded separately from https://github.com/westei/stanbol-gosen_
 
+
 ## Linking / Suggestions
 
 This category covers enhancement engines that suggest Entities for features present in the parsed content. An Entity is an uniquely identified resource. Typically it provides (or links to) further information such as the type, a description (text, pictures, videos …), spatial and/or temporal context, links to other entities … . 
@@ -187,6 +191,9 @@ Enhancement Engines in this category can
 	* converts NLP processing results stored in the [AnalyzedText](../nlp/analyzedtext) content part to RDF and adds them to the metadata of the [ContentItem](../contentitem)
 	* generated RDF uses the NIF (NLP Interchange Format)
 
+* __[TextAnnotation new Model Converter Engine](textannotationnewmodel)__
+ 	* This engine converts fise:TextAnnotation to include fise:selection-prefix and fise:selection-suffix properties.
+
 * __[Refactor Engine](refactorengine):__
 	* transforms enhancements according to a target ontology, requires KRES launcher.
 

Modified: stanbol/site/trunk/content/docs/trunk/components/enhancer/nlp/index.mdtext
URL: http://svn.apache.org/viewvc/stanbol/site/trunk/content/docs/trunk/components/enhancer/nlp/index.mdtext?rev=1458883&r1=1458882&r2=1458883&view=diff
==============================================================================
--- stanbol/site/trunk/content/docs/trunk/components/enhancer/nlp/index.mdtext (original)
+++ stanbol/site/trunk/content/docs/trunk/components/enhancer/nlp/index.mdtext Wed Mar 20 15:24:28 2013
@@ -115,7 +115,7 @@ This section provides an overview about 
 
 * __Chinese__ _(zh)_
     * [Smartcn](smartcn): _Sentence Detection_ and _Tokenization_
-    * [Paoding](paoding): _Tokenization_
+    * [Paoding](paoding): _Tokenization_ 
 
 * __Danish__ _(da)_
     * [OpenNLP](opennlp): _Sentence Detection_, _Tokenization_, _POS_ tagging
@@ -146,6 +146,7 @@ This section provides an overview about 
     * [CELI](celi): _NER_ for Persons, Organizations and Places, _Lemmatization_ and lexical analysis
 
 * __Japanese__ _(jp)_
+    * [Kuromoji](../engines/kuromojinlp): _Sentence Detection_, _Tokenization_, _POS_ tagging as well as _NER_ for Persons, Organizations and Places.
     * [Gosen](https://github.com/westei/stanbol-gosen): _Sentence Detection_, _Tokenization_, _POS_ tagging as well as _NER_ for Persons, Organizations and Places.
 
 * __Portuguese__ _(pt)_