You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@ctakes.apache.org by bl...@apache.org on 2012/11/15 23:37:45 UTC
svn commit: r1410075 -
/incubator/ctakes/site/trunk/content/ctakes/2.6.0/ctakes-2.6-Clinical-Documents-Pipeline.mdtext
Author: bleeker
Date: Thu Nov 15 22:37:44 2012
New Revision: 1410075
URL: http://svn.apache.org/viewvc?rev=1410075&view=rev
Log:
CMS commit to ctakes by bleeker
Added:
incubator/ctakes/site/trunk/content/ctakes/2.6.0/ctakes-2.6-Clinical-Documents-Pipeline.mdtext (with props)
Added: incubator/ctakes/site/trunk/content/ctakes/2.6.0/ctakes-2.6-Clinical-Documents-Pipeline.mdtext
URL: http://svn.apache.org/viewvc/incubator/ctakes/site/trunk/content/ctakes/2.6.0/ctakes-2.6-Clinical-Documents-Pipeline.mdtext?rev=1410075&view=auto
==============================================================================
--- incubator/ctakes/site/trunk/content/ctakes/2.6.0/ctakes-2.6-Clinical-Documents-Pipeline.mdtext (added)
+++ incubator/ctakes/site/trunk/content/ctakes/2.6.0/ctakes-2.6-Clinical-Documents-Pipeline.mdtext Thu Nov 15 22:37:44 2012
@@ -0,0 +1,70 @@
+Title: cTAKES 2.6 Clinical Documents Pipeline
+Notice: Licensed to the Apache Software Foundation (ASF) under one
+ or more contributor license agreements. See the NOTICE file
+ distributed with this work for additional information
+ regarding copyright ownership. The ASF licenses this file
+ to you under the Apache License, Version 2.0 (the
+ "License"); you may not use this file except in compliance
+ with the License. You may obtain a copy of the License at
+ .
+ http://www.apache.org/licenses/LICENSE-2.0
+ .
+ Unless required by applicable law or agreed to in writing,
+ software distributed under the License is distributed on an
+ "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
+ KIND, either express or implied. See the License for the
+ specific language governing permissions and limitations
+ under the License.
+
+# cTAKES 2.6 - Clinical Documents Pipeline
+
+
+## Overview of Clinical Documents Pipeline
+
+This project is the top-level, main project for processing a clinical document
+through the entire cTAKES pipeline, including sentence detection, part of
+speech tagging [POS](http://ohnlp.sourceforge.net/cTAKES/#cd_pos_tagger),
+[chunking](http://ohnlp.sourceforge.net/cTAKES/#cd_chunker), named entity
+recognition, context detection, and negation detection. Refer to [context in
+the cTAKES documentation on
+SourceForge](http://ohnlp.sourceforge.net/cTAKES/#cd_necontexts).
+
+The pipeline can process two types of documents
+
+ * plain text files
+ * Clinical Document Architecture (CDA) XML files that conform to the DTD provided
+
+## Analysis engines (annotators)
+
+### AggregateCdaProcessor.xml for CDA documents conforming to the provided DTD
+
+The file desc/analysis_engine/AggregateCdaProcessor.xml is the aggregate
+analysis engine to use to run the entire pipeline, including the
+CdaCasInitialzer analysis engine, which reads CDA documents that conform to
+the DTD provided, and create Segment annotations based on the sections within
+the CDA document.
+
+**Parameters**
+
+**ChunkerCreatorClass**
+the full class name of an implementation of the interface
+edu.mayo.bmi.uima.chunker.ChunkerCreator
+
+### AggregatePlaintextProcessor.xml for plain text documents
+
+The file desc/analysis_engine/AggregatePlaintextProcessor.xml is the aggregate
+analysis engine to use to run the entire pipeline, including the
+SimpleSegmentAnnotator analysis engine, which creates a Segment annotation
+that wraps the entire plain text document. Other annotators in the pipeline
+require at least one Segment annotation.
+
+**Parameters**
+SegmentID the identifier or name to assign to the Segment annotation
+ChunkerCreatorClass the full class name of an implementation of the interface
+edu.mayo.bmi.uima.chunker.ChunkerCreator
+
+![](/images/icons/emoticons/information.png)
+
+The ChunkCreatorClass parameter of both annotators is set to
+edu.mayo.bmi.uima.chunker.PhraseTypeChunkCreator so that each phrase type gets
+its own type of annotation, rather than having all chunks be of type Chunk.
\ No newline at end of file
Propchange: incubator/ctakes/site/trunk/content/ctakes/2.6.0/ctakes-2.6-Clinical-Documents-Pipeline.mdtext
------------------------------------------------------------------------------
svn:eol-style = native