Posted to commits@uima.apache.org by pk...@apache.org on 2012/11/30 18:39:54 UTC
svn commit: r1415760 -
/uima/sandbox/trunk/TextMarker/uima-docbook-textmarker/src/docbook/tools.textmarker.overview.xml
Author: pkluegl
Date: Fri Nov 30 17:39:53 2012
New Revision: 1415760
URL: http://svn.apache.org/viewvc?rev=1415760&view=rev
Log:
UIMA-2285
- added sections "What is TextMarker?" and "Getting started" in overview chapter
Modified:
uima/sandbox/trunk/TextMarker/uima-docbook-textmarker/src/docbook/tools.textmarker.overview.xml
Modified: uima/sandbox/trunk/TextMarker/uima-docbook-textmarker/src/docbook/tools.textmarker.overview.xml
URL: http://svn.apache.org/viewvc/uima/sandbox/trunk/TextMarker/uima-docbook-textmarker/src/docbook/tools.textmarker.overview.xml?rev=1415760&r1=1415759&r2=1415760&view=diff
==============================================================================
--- uima/sandbox/trunk/TextMarker/uima-docbook-textmarker/src/docbook/tools.textmarker.overview.xml (original)
+++ uima/sandbox/trunk/TextMarker/uima-docbook-textmarker/src/docbook/tools.textmarker.overview.xml Fri Nov 30 17:39:53 2012
@@ -31,35 +31,62 @@ under the License.
<section id="ugr.tools.tm.overview.intro">
<title>What is TextMarker?</title>
<para>
- - rule-based script language
- - imperative execution
- - rapid prototyping
- - intuitive and flexible, fast developement
- - extensive tooling support, writing rules is tedious, needs support
- - all about UIMA, AE, TypeSystem...
+ Apache UIMA™ TextMarker is a rule-based script language supported by Eclipse-based tooling.
+ The language is designed to enable rapid development of text processing applications within UIMA.
+ A special focus lies on an intuitive and flexible domain-specific language for defining
+ patterns of annotations. Writing rules for information extraction or other text processing
+ applications is a tedious process. The Eclipse-based tooling for TextMarker, called the TextMarker Workbench,
+ was created to support the user and to facilitate every step of writing TextMarker rules.
+ Both the TextMarker rule language and the TextMarker Workbench integrate smoothly with Apache UIMA.
</para>
</section>
<section id="ugr.tools.tm.overview.gettingstarted">
<title>Getting started</title>
<para>
- - know about the core concepts of UIMA, this documentation assumes that the reader knows about that, CAS, Feature Structure, Annotation, Type, Analysis Engine, Type System,...
- - recommended way to learn textmarker? how to read this documentation?
- - installation of the IDE
- - download the examples and play around
- - read about the core concepts of TextMarker and take a look at the language examples
- - read the chapter about language for a precise description of the language, rather a reference book
- - workbench chapter for documentation of eclipse-based tooling
- - example project in SVN
- </para>
+ This section gives a short roadmap for reading the documentation and some recommendations on how to
+ start developing TextMarker-based applications. This documentation assumes that the reader is familiar with
+ the core concepts of Apache UIMA. Knowledge of the meaning and usage of at least the terms <quote>CAS</quote>,
+ <quote>Feature Structure</quote>, <quote>Annotation</quote>, <quote>Type</quote>, <quote>Type System</quote>
+ and <quote>Analysis Engine</quote> is required. Please refer to the documentation of Apache UIMA for an introduction.
+ </para>
+ <para>
+ Inexperienced users who want to learn about TextMarker can start with the next two sections:
+ <xref linkend="ugr.tools.tm.overview.coreconcepts"/>
+ gives a short overview of the core ideas and features of the TextMarker language and Workbench.
+ It introduces the main concepts of the TextMarker language, explains how TextMarker rules
+ are composed and applied, and discusses the advantages of the TextMarker system.
+ The following <xref linkend="ugr.tools.tm.overview.examples"/> approaches the TextMarker language from a different
+ perspective. Here, the language is introduced using examples only. The first example explains what a simple rule
+ looks like, and each following example extends the syntax or semantics of the TextMarker language.
+ After reading these two sections, the reader should know enough to start writing her first TextMarker-based application.
+ </para>
+ <para>
+ The TextMarker Workbench was created to support the user and to facilitate the development process. It is strongly recommended to
+ use this Eclipse-based IDE since it, for example, automatically configures the component descriptors and provides editing support such as
+ syntax checking. <xref linkend="section.ugr.tools.tm.workbench.install"/> describes how the TextMarker Workbench is installed.
+ TextMarker rules can, of course, also be applied to a CAS without using the TextMarker Workbench.
+ <xref linkend="ugr.tools.tm.ae.basic.apply"/> contains examples of how to execute TextMarker rules in plain Java.
+ A good way to get started with TextMarker is to play around with an example TextMarker project, e.g.,
+ <uri>https://svn.apache.org/repos/asf/uima/sandbox/trunk/TextMarker/example-projects/ExampleProject</uri>. This TextMarker project
+ contains some simple rules for processing citation metadata.
+ </para>
+ <para>
+ <xref linkend="ugr.tools.tm.language.language"/> and <xref linkend="ugr.tools.tm.workbench"/> provide
+ more detailed descriptions and can be consulted for details about specific parts
+ of the TextMarker language or the TextMarker Workbench.
+ </para>
</section>
<section id="ugr.tools.tm.overview.coreconcepts">
<title>Core Concepts</title>
<para>
- what does it look like? rule composition
+ In contrast to rules in similar rule-based systems, TextMarker rules are executed
+ in an imperative manner: the rules are applied in the order in which they occur in a TextMarker script file.
- imperative execution, advantages and disadvantages
- depth first matching
+ - projects, script files
- distinctive language elements for some tasks, one action or condition for one task
- where to start with matching? annotations in CAS, default annotation seeding
- work with annotations, just spam them