You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@stanbol.apache.org by rw...@apache.org on 2012/11/23 18:04:13 UTC

svn commit: r1412977 - /stanbol/site/trunk/content/docs/trunk/components/enhancer/engines/list.mdtext

Author: rwesten
Date: Fri Nov 23 17:04:12 2012
New Revision: 1412977

URL: http://svn.apache.org/viewvc?rev=1412977&view=rev
Log:
STANBOL-733 - minor formatting related changes

Modified:
    stanbol/site/trunk/content/docs/trunk/components/enhancer/engines/list.mdtext

Modified: stanbol/site/trunk/content/docs/trunk/components/enhancer/engines/list.mdtext
URL: http://svn.apache.org/viewvc/stanbol/site/trunk/content/docs/trunk/components/enhancer/engines/list.mdtext?rev=1412977&r1=1412976&r2=1412977&view=diff
==============================================================================
--- stanbol/site/trunk/content/docs/trunk/components/enhancer/engines/list.mdtext (original)
+++ stanbol/site/trunk/content/docs/trunk/components/enhancer/engines/list.mdtext Fri Nov 23 17:04:12 2012
@@ -4,12 +4,12 @@ This provides an overview about all [Enh
 
 ## Preprocessing
 
-* __[Tika Engine](tikaengine.html)__ (based on [Apache Tika](http://tika.apache.org/))
+* __[Tika Engine](tikaengine):__ (based on [Apache Tika](http://tika.apache.org/))
 	* content type detection
 	* text extraction from various document formats
 	* extraction of metadata from document formats
 
-* __[Metaxa Engine](metaxaengine.html)__
+* __[Metaxa Engine](metaxaengine):__
 	* text extraction from various document formats
 	* extraction of metadata from document formats
 	* _NOTE_ this engine is not includes in the default Stanbol Launchers
@@ -23,10 +23,10 @@ This does contain Engines the process te
 
 Language detection engines add Language annotations as defined by [STANBOL-613](https://issues.apache.org/jira/browse/STANBOL-613) to the metadata of the [ContentItem](../contentitem)
 
-* __[Language Identification Engine](langidengine.html)__
+* __[Language Identification Engine](langidengine):__
 	* language detection for textual content utilizing [Apache Tika](http://tika.apache.org/)
 
-* __[Language Detection Engine](langdetectengine.html)__
+* __[Language Detection Engine](langdetectengine):__
 	* language detection for textual content utilizing [language-detection](http://code.google.com/p/language-detection/) Project
 
 * __CELI language detection Engine__: This engine is part of the CELI enhancement engines (see [STANBOL-583](https://issues.apache.org/jira/browse/STANBOL-583))
@@ -36,49 +36,49 @@ Language detection engines add Language 
 
 Sentence detection engines add _Sentences_ to the [AnalyzedText](../nlp/analyzedtext) content part 
 
-* __[OpenNLP Sentence Detection Engine](opennlpsentence)
+* __[OpenNLP Sentence Detection Engine](opennlpsentence):__
 	* Sentence Detection based on [OpenNLP](http://opennlp.apache.org)
 
 ### Tokenizer Engines
 
 The responsibility of Tokenizer Engines is to add _Tokens_ to the [AnalyzedText](../nlp/analyzedtext) content part
 
-* __[OpenNLP Tokenizer Detection Engine](opennlptoken)
+* __[OpenNLP Tokenizer Detection Engine](opennlptoken):__
 	* Tokenizer implementation based on [OpenNLP](http://opennlp.apache.org)
 
 ### Part of Speech (POS) Tagging
 
 POS tagging engines do add [Part-of-Speech annotations](../nlp/nlpannotations#part-of-speech-pos-annotations) to _Tokens_ present in the [AnalyzedText](../nlp/analyzedtext) content part
 
-* __[OpenNLP POS Tagging Engine](opennlppos)
+* __[OpenNLP POS Tagging Engine](opennlppos):__
 	* POS tagger implementation based on [OpenNLP](http://opennlp.apache.org)
 
 ### Chunk/Phrase detection
 
 Chunker (or Phrase Detection) Engines do add detected _Chunks_ to the [AnalyzedText](../nlp/analyzedtext) content part. They also annotate added _Chunks_ with the [type of the detected phrase](../nlp/nlpannotations#phrase-annotations)
 
-* __[OpenNLP Chunker Engine](opennlpchunker)
+* __[OpenNLP Chunker Engine](opennlpchunker):__
 	* Chunker implementation based on [OpenNLP](http://opennlp.apache.org)
 
 ### Named Entity Recognition (NER) Engines
 
 NER engines need to write detected Named Entities as '[fise:TextAnnotation](../enhancementstructure.html#fisetextannotation)'s to the metadata of the [ContentItem](../contentitem). In addition they may also add [NER annotations](../nlp/nlpannotations#name-entity-ner-annotations) to _Chunks_ in the [AnalyzedText](../nlp/analyzedtext) content part
 
-* __[OpenNLP NER Engine](opennlpner)__ 
+* __[OpenNLP NER Engine](opennlpner):__ 
 	* NLP processing using OpenNLP NER
 	* detects occurrences of persons, places and organizations only
 	* supports [NER annotations](../nlp/nlpannotations#name-entity-ner-annotations)
 
-* __[Custom NER Model Extraction Enhancement Engine](customnermodelengine.html)__ 
+* __[Custom NER Model Extraction Enhancement Engine](customnermodelengine.html):__ 
 	* NLP processing using OpenNLP NER
 	* uses custom NameFinder models (user configured)
 	* supports custom Named Entity types (other than persons, places and organizations
 
-* __CELI NER engine__: This engine is part of the CELI enhancement engines (see [STANBOL-583](https://issues.apache.org/jira/browse/STANBOL-583))
+* __CELI NER engine:__ This engine is part of the CELI enhancement engines (see [STANBOL-583](https://issues.apache.org/jira/browse/STANBOL-583))
 	* NER based on a linguagrid.org server hosted by CELI
 	* detects occurrences of persons, places and organizations and some other types
 
-* __[OpenCalais Enhancement Engine](opencalaisengine.html)__
+* __[OpenCalais Enhancement Engine](opencalaisengine):__
  	* integrates service from Open Calais. (Note: You need to provide a key in order to use this engine)
 	* can be configured to do only NER and no EntityLinking
 
@@ -87,7 +87,7 @@ NER engines need to write detected Named
 
 This includes Engines that perform some sort of morphological analyses (e.g. lemmatization)
 
-* __CELI AnalyzedText Lemmatizer Engine: This engine is part of the CELI enhancement engines (see [STANBOL-583](https://issues.apache.org/jira/browse/STANBOL-583) and [STANBOL-739](https://issues.apache.org/jira/browse/STANBOL-739))
+* __CELI AnalyzedText Lemmatizer Engine:__ This engine is part of the CELI enhancement engines (see [STANBOL-583](https://issues.apache.org/jira/browse/STANBOL-583) and [STANBOL-739](https://issues.apache.org/jira/browse/STANBOL-739))
 	* lemmatization support for "it", "da", "de", "ru", "ro"
 
 
@@ -95,36 +95,36 @@ This includes Engines that perform some 
 
 This category covers enhancement engines that suggest Entities for features present in the parsed content. An Entity is an uniquely identified resource. Typically it provides (or links to) further information such as the type, a description (text, pictures, videos …), spatial and/or temporal context, links to other entities … . 
 
-* __[Named Entity Linking Engine](namedentitytaggingengine)__
+* __[Named Entity Linking Engine](namedentitytaggingengine):__
 	* suggest links to several Linked Data Sources (e.g. DBpedia)
 
-* __[Entityhub Linking Engine](entityhublinking)__
+* __[Entityhub Linking Engine](entityhublinking):__
 	* [EntityLinkingEngine](entity linking) configuration for the Stanbol Entityhub
 	* consumes NLP processing results form the [AnalyzedText](../nlp/analyzedtext) content part
 	* Links Entities managed by the Entityhub, ReferencedSites or ManagedSites
 	* Supports any language however quality/performance depends on NLP processing support
 
-* __DBpedia Spotlight Annotation Engine__: Integration of the DBpedia Spotlight with the Stanbol Enhancer (see [STANBOL-706](https://issues.apache.org/jira/browse/STANBOL-706))
+* __DBpedia Spotlight Annotation Engine:__ Integration of the DBpedia Spotlight with the Stanbol Enhancer (see [STANBOL-706](https://issues.apache.org/jira/browse/STANBOL-706))
 	* includes NLP, Entity Linking and Disambiguation of Entities using [DBpedia](http://dbpedia.org) as knowledge base
 	* accesses a remote service
 
-* __[Geonames Enhancement Engine](geonamesengine)__ 
+* __[Geonames Enhancement Engine](geonamesengine):__ 
 	* suggests links to geonames.org
 	* provides hierarchical links for locations
 	* accesses a remote service, requires a user account
 
-* __[OpenCalais Enhancement Engine](opencalaisengine)__
+* __[OpenCalais Enhancement Engine](opencalaisengine):__
  	* integrates service from Open Calais. (Note: You need to provide a key in order to use this engine)
 	* provides both NER and Entity Linking
 	* accesses a remote service, requires a user account
 
-* __[Zemanta Enhancement Engine](zemantaengine)__
+* __[Zemanta Enhancement Engine](zemantaengine):__
 	* integrates the Zemanta services. (Note: You need to provide a key in order to use this engine)
 	* provides both NLP and Entity Linking
 	* accesses a remote service, requires a user account
 
 
-* _[KeywordLinkingEngine](keywordlinkingengine)_ __depreacted__ use [EntityhubLinkingEngine](entityhublinking) instead!
+* _[KeywordLinkingEngine](keywordlinkingengine):_ __depreacted__ use [EntityhubLinkingEngine](entityhublinking) instead!
 	* NLP processing using OpenNLP
 	* supports multiple languages
 	* detects occurrences of untyped entities as concepts, takes local taxonomies as linking target	
@@ -134,11 +134,11 @@ This category covers enhancement engines
 
 This includes Engines that perform word/chunk level sentiment classifications on the [AnalyzedText](../nlp/analyzedtext) content part as well as Engines that summarize those lower level annotations to Sentiments for sentences, sections or the whole text. Sentiment summarizations are represented as 'fise:SentimentAnnotation's (TODO: not yet fully specified (see [STANBOL-760](https://issues.apache.org/jira/browse/STANBOL-760)).
 
-* __Sentiment WordClassifier Engine__: This engine annotates _Tokens_ of the  [AnalyzedText](../nlp/analyzedtext) content part with sentiment annotations (a double value in the range [-1..1]
+* __Sentiment WordClassifier Engine:__ This engine annotates _Tokens_ of the  [AnalyzedText](../nlp/analyzedtext) content part with sentiment annotations (a double value in the range [-1..1]
 	* supports de and en
 	* can be extended to support additional languages by implementing the _SentimentClassifier_ interface
 
-* _Sentiment Summarization Engine_: __under development__ (see [STANBOL-760](https://issues.apache.org/jira/browse/STANBOL-760))
+* _Sentiment Summarization Engine:_ __under development__ (see [STANBOL-760](https://issues.apache.org/jira/browse/STANBOL-760))
 	* summarizes sentiments on word level to chunks, sentences and the whole text
 	* create 'fise:SentimentAnnotations'
 
@@ -146,12 +146,12 @@ This includes Engines that perform word/
 
 Enhancement Engines in this category can disambiguate Entities based on contextual information (e.g. if "Apple" in a sentence refers to the fruit or the company). Based on that such engines can adjust existing Entity suggestions or also create new one.
 
-* __DBpedia Spotlight Disambiguation Engine__: (see [STANBOL-706](https://issues.apache.org/jira/browse/STANBOL-706))
+* __DBpedia Spotlight Disambiguation Engine:__ (see [STANBOL-706](https://issues.apache.org/jira/browse/STANBOL-706))
 	* consumes existing fise:TextAnnotations and disambiguate them by using DBpedia Spotlight
 	* create Entity suggestions (fise:EntityAnnotations) for the processed fise:TextAnnotations
 	* accesses a remote service
 
-* _Solr More-like-This Disambiguation Engine_: __under development_ (see [STANBOL-723](https://issues.apache.org/jira/browse/STANBOL-723))
+* _Solr More-like-This Disambiguation Engine:_ __under development_ (see [STANBOL-723](https://issues.apache.org/jira/browse/STANBOL-723))
 	* disambiguates Entities managed by the Stanbol Entityhub by using Solr MLT queries
 	* only available via the [disambiguation-engine](http://svn.apache.org/repos/asf/stanbol/branches/disambiguation-engine/) branch
 	* adjusts the fise:confidence of existing fise:EntityAnnotations
@@ -159,11 +159,11 @@ Enhancement Engines in this category can
 
 ## Postprocessing / Other
 
-* _NLP 2 RDF Engine_: __under development__ (see [STANBOL-741](https://issues.apache.org/jira/browse/STANBOL-741))
+* _NLP 2 RDF Engine:_ __under development__ (see [STANBOL-741](https://issues.apache.org/jira/browse/STANBOL-741))
 	* converts NLP processing results stored in the [AnalyzedText](../nlp/analyzedtext) content part to RDF and adds them to the metadata of the [ContentItem](../contentitem)
 	* generated RDF uses the NIF (NLP Interchange Format)
 
-* __[Refactor Engine](refactorengine)__
+* __[Refactor Engine](refactorengine):__
 	* transforms enhancements according to a target ontology, requires KRES launcher.