You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@ctakes.apache.org by se...@apache.org on 2014/11/04 17:08:26 UTC

svn commit: r1636628 - in /ctakes/sandbox/dictionarytool/data: default/ optional/ small/ tiny/

Author: seanfinan
Date: Tue Nov  4 16:08:25 2014
New Revision: 1636628

URL: http://svn.apache.org/r1636628
Log:
Slight restructuring, including the addition of "small" and "tiny" list versions.  These do not actually create "tiny" databases - to do that you'd need to do something like select fewer or different tuis OR use cmd parameters to increase the minimum text character length, decrease the maximum word token length, disable abbreviation extraction, etc.  They do, however, remove some text extraction in favor of text ignorance.  The reduction in text count, cui count, and db MB is roughly 5%, but mostly fat trimming.

Added:
    ctakes/sandbox/dictionarytool/data/default/RemovalFunctionTriggers.txt
    ctakes/sandbox/dictionarytool/data/optional/
    ctakes/sandbox/dictionarytool/data/optional/CtakesAllTuis.txt
      - copied unchanged from r1625577, ctakes/sandbox/dictionarytool/data/default/CtakesAllTuis.txt
    ctakes/sandbox/dictionarytool/data/optional/CtakesAnatTuis.txt
      - copied unchanged from r1625577, ctakes/sandbox/dictionarytool/data/default/CtakesAnatTuis.txt
    ctakes/sandbox/dictionarytool/data/optional/CtakesDisoTuis.txt
      - copied unchanged from r1625577, ctakes/sandbox/dictionarytool/data/default/CtakesDisoTuis.txt
    ctakes/sandbox/dictionarytool/data/optional/CtakesFindTuis.txt
      - copied unchanged from r1625577, ctakes/sandbox/dictionarytool/data/default/CtakesFindTuis.txt
    ctakes/sandbox/dictionarytool/data/optional/CtakesProcTuis.txt
      - copied unchanged from r1625577, ctakes/sandbox/dictionarytool/data/default/CtakesProcTuis.txt
    ctakes/sandbox/dictionarytool/data/optional/UmlsAllSources.txt
      - copied unchanged from r1625577, ctakes/sandbox/dictionarytool/data/default/UmlsAllSources.txt
    ctakes/sandbox/dictionarytool/data/optional/UmlsAllTuis.txt
      - copied unchanged from r1625577, ctakes/sandbox/dictionarytool/data/default/UmlsAllTuis.txt
    ctakes/sandbox/dictionarytool/data/small/
    ctakes/sandbox/dictionarytool/data/small/ConversionSources.txt
    ctakes/sandbox/dictionarytool/data/small/CtakesDrugTuis.txt
    ctakes/sandbox/dictionarytool/data/small/CtakesSnomedTuis.txt
    ctakes/sandbox/dictionarytool/data/small/CtakesSources.txt
    ctakes/sandbox/dictionarytool/data/small/ModifierSuffixes.txt
    ctakes/sandbox/dictionarytool/data/small/RemovalColonTriggers.txt
    ctakes/sandbox/dictionarytool/data/small/RemovalPrefixTriggers.txt
    ctakes/sandbox/dictionarytool/data/small/RemovalSuffixTriggers.txt
    ctakes/sandbox/dictionarytool/data/small/RightAbbreviations.txt
    ctakes/sandbox/dictionarytool/data/small/UnwantedPrefixes.txt
    ctakes/sandbox/dictionarytool/data/small/UnwantedSuffixes.txt
    ctakes/sandbox/dictionarytool/data/tiny/
    ctakes/sandbox/dictionarytool/data/tiny/ConversionSources.txt
    ctakes/sandbox/dictionarytool/data/tiny/CtakesAnatTuis.txt
    ctakes/sandbox/dictionarytool/data/tiny/CtakesDrugTuis.txt
      - copied unchanged from r1625577, ctakes/sandbox/dictionarytool/data/default/CtakesDrugTuis.txt
    ctakes/sandbox/dictionarytool/data/tiny/CtakesSnomedTuis.txt
      - copied, changed from r1625577, ctakes/sandbox/dictionarytool/data/default/CtakesSnomedTuis.txt
    ctakes/sandbox/dictionarytool/data/tiny/CtakesSources.txt
    ctakes/sandbox/dictionarytool/data/tiny/ModifierSuffixes.txt
    ctakes/sandbox/dictionarytool/data/tiny/RemovalColonTriggers.txt
    ctakes/sandbox/dictionarytool/data/tiny/RemovalFunctionTriggers.txt
    ctakes/sandbox/dictionarytool/data/tiny/RemovalPrefixTriggers.txt
    ctakes/sandbox/dictionarytool/data/tiny/RemovalSuffixTriggers.txt
    ctakes/sandbox/dictionarytool/data/tiny/RightAbbreviations.txt
    ctakes/sandbox/dictionarytool/data/tiny/UnwantedPrefixes.txt
    ctakes/sandbox/dictionarytool/data/tiny/UnwantedSuffixes.txt
Removed:
    ctakes/sandbox/dictionarytool/data/default/CtakesAllTuis.txt
    ctakes/sandbox/dictionarytool/data/default/CtakesAnatTuis.txt
    ctakes/sandbox/dictionarytool/data/default/CtakesDisoTuis.txt
    ctakes/sandbox/dictionarytool/data/default/CtakesFindTuis.txt
    ctakes/sandbox/dictionarytool/data/default/CtakesProcTuis.txt
    ctakes/sandbox/dictionarytool/data/default/UmlsAllSources.txt
    ctakes/sandbox/dictionarytool/data/default/UmlsAllTuis.txt
Modified:
    ctakes/sandbox/dictionarytool/data/default/RemovalSuffixTriggers.txt
    ctakes/sandbox/dictionarytool/data/default/UnwantedPrefixes.txt
    ctakes/sandbox/dictionarytool/data/default/UnwantedSuffixes.txt

Added: ctakes/sandbox/dictionarytool/data/default/RemovalFunctionTriggers.txt
URL: http://svn.apache.org/viewvc/ctakes/sandbox/dictionarytool/data/default/RemovalFunctionTriggers.txt?rev=1636628&view=auto
==============================================================================
--- ctakes/sandbox/dictionarytool/data/default/RemovalFunctionTriggers.txt (added)
+++ ctakes/sandbox/dictionarytool/data/default/RemovalFunctionTriggers.txt Tue Nov  4 16:08:25 2014
@@ -0,0 +1,4 @@
+] or [
+] & / or [
+) or (
+) & / or (
\ No newline at end of file

Modified: ctakes/sandbox/dictionarytool/data/default/RemovalSuffixTriggers.txt
URL: http://svn.apache.org/viewvc/ctakes/sandbox/dictionarytool/data/default/RemovalSuffixTriggers.txt?rev=1636628&r1=1636627&r2=1636628&view=diff
==============================================================================
--- ctakes/sandbox/dictionarytool/data/default/RemovalSuffixTriggers.txt (original)
+++ ctakes/sandbox/dictionarytool/data/default/RemovalSuffixTriggers.txt Tue Nov  4 16:08:25 2014
@@ -4,3 +4,31 @@
 ( reassigned )
 [ obsolete ]
 ( deprecated )
+// Added for form cleanup 11/3/2014 spf
+( # _ _ _ )
+( / _ _ _ )
+( _ _ _ / )
+( j _ _ _ )
+( _ _ _ 0 , 2 )
+( _ _ _ 0 - 30 )
+( _ _ _ bpm )
+( _ _ _ cm )
+( _ _ _ _ cm )
+( _ _ _ _ _ cm )
+( _ _ _ cm2 )
+( _ _ _ degrees )
+( _ _ _ m2 )
+( _ _ _ mets )
+( _ _ _ min )
+( _ _ _ minutes each time )
+( _ _ _ per hpf )
+( minus _ _ _ )
+( _ _ _ ml / hr )
+( _ _ _ ml per second )
+( plus _ _ _ )
+( _ _ _ sec )
+: plus _ _ _
+_ _ _ %
+_ _ _ cm
+add _ _ _
+

Modified: ctakes/sandbox/dictionarytool/data/default/UnwantedPrefixes.txt
URL: http://svn.apache.org/viewvc/ctakes/sandbox/dictionarytool/data/default/UnwantedPrefixes.txt?rev=1636628&r1=1636627&r2=1636628&view=diff
==============================================================================
--- ctakes/sandbox/dictionarytool/data/default/UnwantedPrefixes.txt (original)
+++ ctakes/sandbox/dictionarytool/data/default/UnwantedPrefixes.txt Tue Nov  4 16:08:25 2014
@@ -22,9 +22,6 @@ child :
 college of american pathologists cancer checklist ;
 engaged in 
 entire 
-h / o
-o / e
-on examination
 other activity involving
 other
 phenx -

Modified: ctakes/sandbox/dictionarytool/data/default/UnwantedSuffixes.txt
URL: http://svn.apache.org/viewvc/ctakes/sandbox/dictionarytool/data/default/UnwantedSuffixes.txt?rev=1636628&r1=1636627&r2=1636628&view=diff
==============================================================================
--- ctakes/sandbox/dictionarytool/data/default/UnwantedSuffixes.txt (original)
+++ ctakes/sandbox/dictionarytool/data/default/UnwantedSuffixes.txt Tue Nov  4 16:08:25 2014
@@ -33,16 +33,15 @@
 - summary
 - symptom
 - unspec
+- unspecified
 - weather
 - sport
-( _ _ _ 0 , 2 )
 ( except leather and fur )
 ( namespace concept )
 ( process of differentiation )
 ) ( adjudicators ( & top )
  as topic
  as topics
- finding
  nec
  nec in snomedct
  nos
@@ -1022,3 +1021,4 @@ concept ( inactive concept )
 epidemiology
 reason not stated ( inactive concept )
 status ( & consent )
+

Added: ctakes/sandbox/dictionarytool/data/small/ConversionSources.txt
URL: http://svn.apache.org/viewvc/ctakes/sandbox/dictionarytool/data/small/ConversionSources.txt?rev=1636628&view=auto
==============================================================================
--- ctakes/sandbox/dictionarytool/data/small/ConversionSources.txt (added)
+++ ctakes/sandbox/dictionarytool/data/small/ConversionSources.txt Tue Nov  4 16:08:25 2014
@@ -0,0 +1,4 @@
+// ICD10PCS
+// ICD9CM
+RXNORM
+SNOMEDCT

Added: ctakes/sandbox/dictionarytool/data/small/CtakesDrugTuis.txt
URL: http://svn.apache.org/viewvc/ctakes/sandbox/dictionarytool/data/small/CtakesDrugTuis.txt?rev=1636628&view=auto
==============================================================================
--- ctakes/sandbox/dictionarytool/data/small/CtakesDrugTuis.txt (added)
+++ ctakes/sandbox/dictionarytool/data/small/CtakesDrugTuis.txt Tue Nov  4 16:08:25 2014
@@ -0,0 +1,22 @@
+T109
+T110
+T114
+T115
+T116
+T118
+T119
+T121
+T122
+T123
+T124
+T125
+T126
+T127
+T129
+T130
+T131
+T195
+T196
+T197
+T200
+T203

Added: ctakes/sandbox/dictionarytool/data/small/CtakesSnomedTuis.txt
URL: http://svn.apache.org/viewvc/ctakes/sandbox/dictionarytool/data/small/CtakesSnomedTuis.txt?rev=1636628&view=auto
==============================================================================
--- ctakes/sandbox/dictionarytool/data/small/CtakesSnomedTuis.txt (added)
+++ ctakes/sandbox/dictionarytool/data/small/CtakesSnomedTuis.txt Tue Nov  4 16:08:25 2014
@@ -0,0 +1,36 @@
+// ANAT
+T021
+T022
+T023
+T024
+T025
+T026
+T029
+T030
+// DISO
+T019
+T020
+T037
+T047
+T048
+T049
+T050
+T190
+T191
+// FIND
+T033
+T034
+T040
+T041
+T042
+T043
+T044
+T045
+T046
+T056
+T057
+T184
+// PROC
+T059
+T060
+T061

Added: ctakes/sandbox/dictionarytool/data/small/CtakesSources.txt
URL: http://svn.apache.org/viewvc/ctakes/sandbox/dictionarytool/data/small/CtakesSources.txt?rev=1636628&view=auto
==============================================================================
--- ctakes/sandbox/dictionarytool/data/small/CtakesSources.txt (added)
+++ ctakes/sandbox/dictionarytool/data/small/CtakesSources.txt Tue Nov  4 16:08:25 2014
@@ -0,0 +1 @@
+SNOMEDCT

Added: ctakes/sandbox/dictionarytool/data/small/ModifierSuffixes.txt
URL: http://svn.apache.org/viewvc/ctakes/sandbox/dictionarytool/data/small/ModifierSuffixes.txt?rev=1636628&view=auto
==============================================================================
--- ctakes/sandbox/dictionarytool/data/small/ModifierSuffixes.txt (added)
+++ ctakes/sandbox/dictionarytool/data/small/ModifierSuffixes.txt Tue Nov  4 16:08:25 2014
@@ -0,0 +1 @@
+// Don't automatically modify for small

Added: ctakes/sandbox/dictionarytool/data/small/RemovalColonTriggers.txt
URL: http://svn.apache.org/viewvc/ctakes/sandbox/dictionarytool/data/small/RemovalColonTriggers.txt?rev=1636628&view=auto
==============================================================================
--- ctakes/sandbox/dictionarytool/data/small/RemovalColonTriggers.txt (added)
+++ ctakes/sandbox/dictionarytool/data/small/RemovalColonTriggers.txt Tue Nov  4 16:08:25 2014
@@ -0,0 +1,38 @@
+: - :
+: 24 hours :
+: 8 hours :
+: address :
+: arbitrary concentration :
+: duration of encounter :
+: duration of the study :
+: finding :
+: identifier :
+: impression / interpretation of study :
+: life of the unit :
+: mass :
+: number :
+: point in time :
+: prid :
+: pt :
+: substance concentration :
+: text :
+: type :
+: volume :
+: volume rate :
+// The following are in UnwantedSuffixes for default, but trigger removal for small
+:
+: - -
+: - : point in time : ^ patient : - : phenx
+: ( & glucose ( & plasma ) )
+: ( & nos )
+: [ & named variants ]
+: [ admin ] or [ exam ]
+: [ drivers exam ] or [ dtp20003 - hgv / publ serv claim ] or [ admin ]
+: [ early morning ] or [ mid-stream ] or [ for organism ( & culture ) ]
+: [ exam ( & admin ) ] ) or [ my22 form ]
+: [ fp78 ] or [ gp78 ]
+: [ implantation ] or [ implant ]
+: [ monitoring ] or [ clinic ]
+: [ nos ]
+: [ officer ] or [ executive ]
+: [ production ] or [ product ]

Added: ctakes/sandbox/dictionarytool/data/small/RemovalPrefixTriggers.txt
URL: http://svn.apache.org/viewvc/ctakes/sandbox/dictionarytool/data/small/RemovalPrefixTriggers.txt?rev=1636628&view=auto
==============================================================================
--- ctakes/sandbox/dictionarytool/data/small/RemovalPrefixTriggers.txt (added)
+++ ctakes/sandbox/dictionarytool/data/small/RemovalPrefixTriggers.txt Tue Nov  4 16:08:25 2014
@@ -0,0 +1,32 @@
+deprecated 
+retired procedure
+// The following are in UnwantedPrefixes for the default, but here for small
+[ d ]
+[ edta ]
+[ m ]
+[ q ]
+[ v ]
+[ x ]
+[ so ]
+activities involving
+and unspecified
+( non-specific )
+child :
+college of american pathologists cancer checklist ;
+other activity involving
+other
+phenx -
+specimen source code -
+specimen source codes -
+specimen source codes =
+ or
+#
+&
+& [
+'
+*
+-
+:
+?
+
+

Added: ctakes/sandbox/dictionarytool/data/small/RemovalSuffixTriggers.txt
URL: http://svn.apache.org/viewvc/ctakes/sandbox/dictionarytool/data/small/RemovalSuffixTriggers.txt?rev=1636628&view=auto
==============================================================================
--- ctakes/sandbox/dictionarytool/data/small/RemovalSuffixTriggers.txt (added)
+++ ctakes/sandbox/dictionarytool/data/small/RemovalSuffixTriggers.txt Tue Nov  4 16:08:25 2014
@@ -0,0 +1,240 @@
+- retired code
+- retired -
+- retired
+( reassigned )
+[ obsolete ]
+( deprecated )
+// The following are in UnwantedSuffixes for default, but trigger removal for small
+- ( other )
+- duplicate -
+- nos
+- retired
+- retired -
+- unspec
+- unspecified
+ as topic
+ as topics
+ nec
+ nec in snomedct
+ nos
+ not otherwise specified
+ or
+ title 18
+( or )
+( 10 )
+( _ _ _ 0 , 2 )
+( deprecated )
+( except leather and fur )
+( hl7v3 . 0 concept )
+( inactive concept )
+( mesh category )
+( namespace concept )
+( non-specific )
+( nos )
+( not elsewhere classified in lnc )
+( obsolete notation )
+( procedure nec )
+( process of differentiation )
+) ( adjudicators ( & top )
+& / or o / e
+& / or orderly ( & [ orderlies ] )
+& [ fistula ]
+& [ symptom ]
+& [ legal ]
+& [ symptom ]
+& [ vehicle ]
+( & [ reimbursement ] )
+( & [ abnormal ] )
+( & [ acquired ] )
+( & [ acute ] )
+( & [ acute nos ] )
+( & [ broad ligament ] )
+( & [ bupa examination done ] )
+( & [ c / o - ] )
+( & [ c / o ] )
+( & [ chart ] )
+( & [ duct ] or [ gland ] )
+( & [ eating observation ] )
+( & [ etc ] )
+( & [ expiratory ] )
+( & [ eye disorder ] )
+( & [ government ] )
+( & [ hall ] )
+( & [ infection ] )
+( & [ keen ] )
+( & [ labor complicated by ] )
+( & [ laborer nos ] )
+( & [ labour complicated by ] )
+( & [ labourer nos ] )
+( & [ machine ] and / or [ press ] )
+( & [ member of phct ] )
+( & [ miners ( & coal )
+( & [ named variants ] )
+( & [ nec ] )
+( & [ nos ] )
+( & [ obstructive ] )
+( & [ occupation ] )
+( & [ occupations ] )
+( & [ officer ] )
+( & [ product ] )
+( & [ refer ] )
+( & [ referral ] )
+( & [ referral to ] )
+( & [ reimbursement ] )
+( & [ retail ] )
+( & [ rubber ] )
+( & [ staff ] )
+( & [ staghorn ] )
+( & [ symptom ] )
+( & [ teachers ] )
+( & [ unmarried ] )
+( & [ unspecified ] )
+( & [ warfarin ] )
+( & [ weightlessness ] )
+( & [ well ] )
+( & acquired )
+( & acute )
+( & admin )
+( & administration )
+( & ankle joint )
+( & assess )
+( & attic )
+( & bladder )
+( & blind )
+( & c / o )
+( & c / s )
+( & certificate )
+( & chronic )
+( & claim )
+( & claim gp / ms )
+( & congenital )
+( & conjunctiva )
+( & consent )
+( & dialysis )
+( & direct )
+( & disorder )
+( & distal )
+( & drug )
+( & expiratory )
+( & f303 )
+( & floor / wall )
+( & food poison )
+( & form )
+( & form gp45 ) )
+( & form : [ fp70a ] or [ gp / pea ] )
+( & from patient )
+( & held )
+( & hepatic duct )
+( & in blood )
+( & infection )
+( & infestation )
+( & level ( & phenylalanine ) )
+( & level )
+( & localization )
+( & marital )
+( & maternity allowances )
+( & mobile unit x-ray )
+( & named variants )
+( & nec )
+( & non urg )
+( & nos )
+( & o / e )
+( & on medical advice )
+( & operation )
+( & other )
+( & pathology )
+( & patient data )
+( & peritonsillar region )
+( & pleural )
+( & provision )
+( & record )
+( & referred to )
+( & report )
+( & result )
+( & sample )
+( & sc1 - given to staff )
+( & scrotal skin )
+( & short hip rotator )
+( & sinus )
+( & skin )
+( & skin of external ear )
+( & skin of external nose )
+( & skin of eyebrow )
+( & skin of eyelid )
+( & skin of male perineum )
+( & skin of nipple )
+( & skin of penis )
+( & skin of vulva )
+( & stock order gp10a )
+( & swab )
+( & symptom )
+( & tanner )
+( & top )
+( & top managers )
+( & traumatic )
+( & umbilical skin )
+( & unspecified )
+( & upper )
+( & vaccination )
+( & various types )
+( & viral )
+( & with stabilisation )
+( & with stabilization )
+( * )
+( _ _ _ per hpf )
+( + / - religion )
+( = breath )
+( = breath )
+( = stone )
+( = stone )
+[ & aircraft ]
+[ & bacterial ]
+[ & glucose ]
+[ & named variants ]
+[ & nos ]
+[ & professional ]
+[ & symptom ]
+[ ambiguous ]
+[ dup ]
+[ duplicate ]
+[ obsolete ]
+[ other ]
+, nec in snomedct
+, nos
+, not elsewhere classified
+, unclassified
+, unspecified
+, unspecified part
+concept ( inactive concept )
+reason not stated ( inactive concept )
+// New for tiny
+&
+: phenx
+// Added for form cleanup 11/3/2014 spf
+( # _ _ _ )
+( / _ _ _ )
+( _ _ _ / )
+( j _ _ _ )
+( _ _ _ 0 , 2 )
+( _ _ _ 0 - 30 )
+( _ _ _ bpm )
+( _ _ _ cm )
+( _ _ _ _ cm )
+( _ _ _ _ _ cm )
+( _ _ _ cm2 )
+( _ _ _ degrees )
+( _ _ _ m2 )
+( _ _ _ mets )
+( _ _ _ min )
+( _ _ _ minutes each time )
+( _ _ _ per hpf )
+( minus _ _ _ )
+( _ _ _ ml / hr )
+( _ _ _ ml per second )
+( plus _ _ _ )
+( _ _ _ sec )
+: plus _ _ _
+_ _ _ %
+_ _ _ cm
+add _ _ _
+

Added: ctakes/sandbox/dictionarytool/data/small/RightAbbreviations.txt
URL: http://svn.apache.org/viewvc/ctakes/sandbox/dictionarytool/data/small/RightAbbreviations.txt?rev=1636628&view=auto
==============================================================================
--- ctakes/sandbox/dictionarytool/data/small/RightAbbreviations.txt (added)
+++ ctakes/sandbox/dictionarytool/data/small/RightAbbreviations.txt Tue Nov  4 16:08:25 2014
@@ -0,0 +1 @@
+// No automatic abbreviation extraction for small
\ No newline at end of file

Added: ctakes/sandbox/dictionarytool/data/small/UnwantedPrefixes.txt
URL: http://svn.apache.org/viewvc/ctakes/sandbox/dictionarytool/data/small/UnwantedPrefixes.txt?rev=1636628&view=auto
==============================================================================
--- ctakes/sandbox/dictionarytool/data/small/UnwantedPrefixes.txt (added)
+++ ctakes/sandbox/dictionarytool/data/small/UnwantedPrefixes.txt Tue Nov  4 16:08:25 2014
@@ -0,0 +1,3 @@
+( congenital )
+engaged in
+entire 

Added: ctakes/sandbox/dictionarytool/data/small/UnwantedSuffixes.txt
URL: http://svn.apache.org/viewvc/ctakes/sandbox/dictionarytool/data/small/UnwantedSuffixes.txt?rev=1636628&view=auto
==============================================================================
--- ctakes/sandbox/dictionarytool/data/small/UnwantedSuffixes.txt (added)
+++ ctakes/sandbox/dictionarytool/data/small/UnwantedSuffixes.txt Tue Nov  4 16:08:25 2014
@@ -0,0 +1,806 @@
+/
+'
+-
+- ( religious affiliation )
+- a specialty area of medicine
+- action
+- activity of daily living
+- assess ( & non urg )
+- attribute
+- biologic function
+- category
+- confectionery
+- dietary
+- entertainment ( & [ musical ]
+- finding
+- function
+- general -
+- herbal supplement
+- mental defense mechanism
+- mental process
+- observation
+- organization
+- physical agent
+- related finding
+- rolecode
+- speciality
+- specialty
+- summary
+- symptom
+- weather
+- sport
+( a priority lower than stat )
+( abnormality )
+( abo blood group )
+( acholeplasma host )
+( acknowledgementcondition )
+( acquired )
+( action )
+( action )
+( activated )
+( activity )
+( activity )
+( actual )
+( adjudicators ( & top )
+( adl finding )
+( administative )
+( administration method )
+( administrative concept )
+( age group )
+( alcohol )
+( all components )
+( all fractions of )
+( allelic variant )
+( alone )
+( alphabetic group )
+( alternative )
+( amount )
+( amphibian )
+( anatomical classification )
+( anatomical coordinate )
+( anatomical descriptor )
+( anatomical feature )
+( anatomical location )
+( anatomy )
+( and accessories )
+( and related agents )
+( angiosperm hosts )
+( animal )
+( any part )
+( apparatus )
+( applies to non-numeric results )
+( area of increased mortality )
+( area of knowledge )
+( arrangement )
+( art )
+( artificial )
+( artistic )
+( ascomycota )
+( assessment )
+( assessment scale )
+( astronomy )
+( at present )
+( attitude )
+( attribute )
+( attributes )
+( audit )
+( australia )
+( available ( qualifier ) )
+( bacteria )
+( bacteria and cyanobacteria hosts )
+( battery - powered )
+( beach )
+( beach , court )
+( beet sugar )
+( behavior )
+( behavior finding )
+( beverage )
+( biologic function )
+( biological )
+( biological function )
+( biological funtion )
+( biopsy result )
+( bird )
+( blacus )
+( blood components )
+( blood pressure )
+( body part )
+( body structure )
+( body substance )
+( botanical taxonomic category below variety )
+( brain )
+( brain structure )
+( button )
+( calcified structure )
+( canal )
+( cancer stage )
+( candy )
+( cardiac conduction )
+( category )
+( catering and lodging services )
+( cell )
+( cell line )
+( cell structure )
+( cells or organisms )
+( central nervous system tumor )
+( central nervous system tumour )
+( cerebellum )
+( certification )
+( change )
+( characteristic )
+( chemical )
+( chemical / related processes )
+( chemical and related processes )
+( chemical processes , except petroleum )
+( citrus )
+( class )
+( classical )
+( classification )
+( cleansing action )
+( clinical )
+( clinical definition )
+( clinical disorder )
+( clinical specialty )
+( color )
+( combined site )
+( comfort level finding )
+( communication )
+( communicationfunctiontype )
+( community )
+( company )
+( compartment )
+( complement activation )
+( complete ) ( partial )
+( component )
+( composite )
+( compression algorithm )
+( computer )
+( computer files directory )
+( computer network )
+( concept )
+( construction )
+( container status )
+( containing tobacco )
+( context - dependent category )
+( contextual qualifier )
+( copy in notes )
+( copy within chart )
+( court )
+( crustacea )
+( crustacean )
+( data - processing )
+( data )
+( data type )
+( dataset )
+( dedicated instruments )
+( degree / certification )
+( degree / license / certification )
+( dental procedure )
+( detection and tagging )
+( deviation )
+( device )
+( device status )
+( diagnosis )
+( diagnostic , surgical , and therapeutic interventions )
+( dicotyledons hosts )
+( dietary )
+( digestive structure of animal )
+( diminutive )
+( direction )
+( discharge )
+( discipline )
+( disease )
+( disease attribute )
+( diskus intervertebralis )
+( disorder )
+( do as near as possible to requested time )
+( do immediately )
+( document )
+( document type )
+( documents )
+( dry - cleaner )
+( dysfunction )
+( economic )
+( economic descriptor )
+( economic status )
+( edible )
+( edible oils )
+( education )
+( electric power )
+( electrical )
+( embryology )
+( embryonic structure )
+( emergency )
+( emotion )
+( emotional response )
+( end postition )
+( england & wales )
+( england and wales )
+( entertainment )
+( entire collection )
+( entity )
+( environment )
+( environmental process )
+( epidemiology )
+( equipment )
+( ethnic group )
+( eukaryote )
+( event )
+( event history )
+( except advertising )
+( except aircraft and marine engines )
+( except aircraft and marine turbines )
+( except construction )
+( except electrical )
+( except lawyer or judge )
+( except motor vehicle )
+( except oil and gas wells )
+( except photo - gravers )
+( except rubber )
+( except ships ' engines )
+( except tire maker / tire vulcanizer )
+( except tire makers and tire vulcanisers )
+( except tire makers and tire vulcanizers )
+( except transport )
+( experimental )
+( experimental system )
+( exposure / injury )
+( extensiveness )
+( external anatomical feature )
+( eye )
+( family )
+( family issue )
+( female )
+( field )
+( fiinding )
+( film )
+( films )
+( finding )
+( findings )
+( fish )
+( flies )
+( flower )
+( follower of religion - person )
+( food )
+( for any age )
+( for cleaning purpose )
+( for gunter ' s chain )
+( for pain relief )
+( for staff )
+( for transmitting doses for pharmacokinetics )
+( formal meetings )
+( formation )
+( foundry )
+( freight train )
+( frequency )
+( fruit )
+( function )
+( function )
+( fungi hosts )
+( fungus )
+( garment )
+( gauge size )
+( general )
+( generic version )
+( genetics )
+( genu )
+( geographic )
+( geographic area )
+( geographic location )
+( glandula suprarenalis )
+( goal )
+( grade )
+( gramineae hosts )
+( grooming )
+( grooming )
+( group )
+( group of parts )
+( guideline )
+( gunter ' s chain )
+( hand - operated machine )
+( heading )
+( health )
+( health care )
+( healthcare )
+( helper cells )
+( hemorrhage )
+( high voltage )
+( histology )
+( history )
+( horizontal architectural support )
+( horse )
+( hosiery )
+( human only )
+( hypothalamus )
+( i . q . . . . . )
+( iadl finding )
+( idea )
+( identifier )
+( image )
+( in-store merchandise )
+( in animal tissues )
+( in vivo )
+( including digits )
+( indicator )
+( infection )
+( injury )
+( input )
+( inquiry )
+( insect )
+( insect hosts )
+( instep )
+( instruction )
+( instruction imperative )
+( intake )
+( interactions )
+( intercourse )
+( invertebrate )
+( island )
+( january 2007 release )
+( kind of quantity )
+( lab finding )
+( lab procedure )
+( lab result )
+( lab technique )
+( lab test )
+( laboratory finding )
+( laboratory procedure )
+( land - based )
+( laparotomy performed )
+( length )
+( lepidoptora hosts )
+( libraries )
+( libraries )
+( life event - finding )
+( life style )
+( line - film )
+( lingual , palatine , pharyngeal )
+( link assertion )
+( linkage concept )
+( liquid )
+( list bullets )
+( list separately in addition to code for primary procedure )
+( living organism )
+( location )
+( logging )
+( long - distance transport )
+( lower case a )
+( machine )
+( machine , except jacquard loom )
+( machines )
+( male )
+( malting )
+( mammal )
+( mapping )
+( marine pseudomonad hosts )
+( masonry and woodwork )
+( material )
+( mating )
+( measurement )
+( medical device )
+( medically )
+( medium of exchange )
+( medulla )
+( medullae )
+( medullary reticular formation )
+( member of religious order )
+( membrane )
+( mental function )
+( mental process )
+( metabolim test )
+( metal )
+( metal melting , except cupola )
+( metal reheating )
+( method )
+( military )
+( mine )
+( mine / quarry )
+( mine and quarry )
+( minor structure )
+( miscellaneous )
+( modified )
+( modifier )
+( molecular function )
+( molecular structure )
+( morphologic abnormality )
+( morphologic abnormality , calcified structure )
+( morphology )
+( motion )
+( motion picture )
+( motivation )
+( musical )
+( nanoparticle )
+( natural and artificial )
+( nature of procedure values )
+( navigational concept )
+( neck )
+( nmo )
+( non-ferrous metals )
+( non medication )
+( nonoperative )
+( not alarm reaction )
+( number )
+( numerical qualifier )
+( object )
+( observable )
+( observable entity )
+( observation )
+( observation or disorder )
+( occasion for eating )
+( occupation )
+( occupation history )
+( occupation or discipline )
+( occupational group )
+( ocular )
+( oil / gas wells )
+( oil and gas wells )
+( oilfield )
+( oils and fats )
+( old people ' s home )
+( operative )
+( or )
+( oral challenge )
+( ore smelting )
+( organ )
+( organ component )
+( organ part )
+( organisation )
+( organism )
+( organization )
+( origin )
+( origins )
+( orthodontic procedure )
+( other )
+( output )
+( package insert )
+( pairs )
+( pairs )
+( palatine )
+( parameter )
+( part organ )
+( patient )
+( patient instructions )
+( pdq )
+( per time )
+( per volume )
+( person )
+( personality )
+( persons )
+( pertaining to a node )
+( petroleum refining )
+( phenomenon )
+( philosophy )
+( physical activity )
+( physical activity )
+( physical device )
+( physical finding )
+( physical force )
+( physical force )
+( physical geography )
+( physical object )
+( physical object )
+( physician )
+( physician group )
+( physiologic )
+( physiologic function )
+( physiology )
+( planet )
+( plant )
+( plant hosts )
+( plant part )
+( plants hosts )
+( platelet )
+( polypectomy )
+( population )
+( population )
+( population group )
+( position )
+( postinfection )
+( postoperative )
+( potency unit )
+( power station )
+( ppc - c )
+( prescription )
+( present illness )
+( present time )
+( press / radio / t . v . )
+( primary )
+( primary procedure )
+( printing )
+( private service )
+( procedure )
+( procedure detail )
+( procedure or disorder )
+( process )
+( process of discrimination )
+( product )
+( profession )
+( property )
+( property of measurement )
+( prosthesis )
+( protons , ions )
+( protozoa )
+( provider role )
+( psychiatric )
+( psychiatry )
+( psychologic )
+( psychological function )
+( psychology )
+( psychophysiology )
+( psychotherapy )
+( publication )
+( publication format )
+( publications )
+( pure mathematics )
+( qualifier )
+( qualifier in lnc )
+( qualifier value )
+( qualifier value )
+( quality )
+( race )
+( racial group )
+( radio / t . v . )
+( radio / television )
+( radio / television / radar equipment )
+( radio broadcasts )
+( radioactive materials )
+( radiologist )
+( rate )
+( reassigned )
+( record )
+( record artifact )
+( recreational activity )
+( reference documentation )
+( regime / therapy )
+( region )
+( region of foot )
+( rehabilitation )
+( reimbursement )
+( relative )
+( religion )
+( religion / philosophy )
+( reptile )
+( respiratory measure )
+( response modality )
+( result )
+( revised )
+( rifle )
+( road )
+( role )
+( rolling mill )
+( route )
+( route of administration )
+( schedule frequency )
+( science )
+( scotland )
+( sea )
+( season )
+( second level )
+( section of report )
+( self - care status )
+( sensation )
+( separate procedure )
+( serotype )
+( serous membrane )
+( services standards )
+( set of actions )
+( severe )
+( severity modifier )
+( shape )
+( shin )
+( ship )
+( singles )
+( site )
+( site of care )
+( situation )
+( sleep disorder )
+( slowed )
+( small pieces )
+( snails )
+( social behavior )
+( social concept )
+( social status )
+( sound )
+( sound intensity )
+( source type )
+( spatial )
+( special concept )
+( specialised )
+( specialized )
+( specialty )
+( specific stage )
+( specify )
+( specify level )
+( specify type )
+( specimen )
+( specimen source code )
+( spice )
+( sport )
+( sports activity )
+( stage / studio )
+( stage and studio )
+( staging scale )
+( statement )
+( statistical )
+( statistical adjustment )
+( statistical procedure )
+( statistical technique )
+( status )
+( steam )
+( steel )
+( steel converting )
+( steel refining )
+( still camera )
+( strain )
+( structure )
+( subdivision )
+( substance )
+( substance / material )
+( substance status )
+( substantia innominata )
+( substitution condition )
+( suffix )
+( sugar refining )
+( suppressor cells )
+( surface )
+( surface region )
+( surface region of back )
+( surgical )
+( surgical procedure )
+( surgical speciality )
+( surgical view )
+( symbol )
+( symptom )
+( system )
+( taxon )
+( technician / technologist )
+( techniques )
+( temporal )
+( temporal concept )
+( temporal qualifier )
+( terminology entity )
+( test result )
+( testicles )
+( theater )
+( theatre )
+( theoretical )
+( therapeutic )
+( therapy )
+( therapy ) ( diagnostic id )
+( third level )
+( time frame )
+( time pattern )
+( time period )
+( time point or interval )
+( tissue )
+( tissue integrity )
+( to be done prior to surgery )
+( traffic )
+( transanal disk excision )
+( transmission )
+( transportation )
+( treatment )
+( trust territory )
+( tumor staging )
+( type )
+( type of material , substance )
+( u . s . )
+( uniformity )
+( unit )
+( unit of mass )
+( unit of measure )
+( unit of pressure )
+( unmodified )
+( upper )
+( us )
+( use calc )
+( use of drug )
+( use only with codes for immunization )
+( used to identify procedures performed on the left side of the body )
+( used to identify procedures performed on the right side of the body )
+( uterine )
+( value )
+( variant )
+( vegetable )
+( version 4 )
+( vertebral column )
+( vertebrale )
+( vocalization )
+( volume )
+( wait and see )
+( wasp )
+( waterworks )
+( weight )
+( wood distillation )
+( workshop )
+( yes / no )
+( zoology )
+,
+, cs
+, device
+, disorder
+, follower of religion
+, function
+, life event
+, services & procedure codes ( category iii , temporary )
+, united states
+[ any part ]
+[ a body part ]
+[ action ]
+[ activity ]
+[ adjective ]
+[ an instruction material ]
+[ any part ]
+[ artwork ]
+[ ball ]
+[ body ]
+[ body function ]
+[ body part ]
+[ body parts ]
+[ character ]
+[ chemical / ingredient ]
+[ clothing ]
+[ color ]
+[ device ]
+[ disease / finding ]
+[ document ]
+[ dose form ]
+[ economics ]
+[ environment ]
+[ executive duty ]
+[ extent ]
+[ finding ]
+[ fish ]
+[ follower of religion ]
+[ for organism ( & culture ) ]
+[ furniture ]
+[ game ]
+[ garment ]
+[ hiv ]
+[ in an environmental context ]
+[ in social context ]
+[ mammal ]
+[ manufactured object ]
+[ manufactured product ]
+[ mating posture ]
+[ medical device ]
+[ mental function ]
+[ mental process ]
+[ moa ]
+[ molecular function ]
+[ money ]
+[ motion ]
+[ ngu ]
+[ object ]
+[ occupation ]
+[ organism ] or [ infection - shingles ]
+[ organization ]
+[ paper report ]
+[ pe ]
+[ people ]
+[ physical phenomenon ]
+[ plan ]
+[ plant ]
+[ procedure ]
+[ product ]
+[ profession ]
+[ psychology ]
+[ publication type ]
+[ race ]
+[ reference materials ]
+[ room ]
+[ sample ]
+[ specialty ]
+[ sports ]
+[ statistical concept ]
+[ status ]
+[ substance ]
+[ surgical action ]
+[ uniformity ]
+[ unit of measurement ]
+[ unit of pressure ]
+] ( person )
+{ organization ]
+< subjective >
+epidemiology
+status ( & consent )

Added: ctakes/sandbox/dictionarytool/data/tiny/ConversionSources.txt
URL: http://svn.apache.org/viewvc/ctakes/sandbox/dictionarytool/data/tiny/ConversionSources.txt?rev=1636628&view=auto
==============================================================================
--- ctakes/sandbox/dictionarytool/data/tiny/ConversionSources.txt (added)
+++ ctakes/sandbox/dictionarytool/data/tiny/ConversionSources.txt Tue Nov  4 16:08:25 2014
@@ -0,0 +1,4 @@
+// ICD10PCS
+// ICD9CM
+RXNORM
+SNOMEDCT

Added: ctakes/sandbox/dictionarytool/data/tiny/CtakesAnatTuis.txt
URL: http://svn.apache.org/viewvc/ctakes/sandbox/dictionarytool/data/tiny/CtakesAnatTuis.txt?rev=1636628&view=auto
==============================================================================
--- ctakes/sandbox/dictionarytool/data/tiny/CtakesAnatTuis.txt (added)
+++ ctakes/sandbox/dictionarytool/data/tiny/CtakesAnatTuis.txt Tue Nov  4 16:08:25 2014
@@ -0,0 +1,8 @@
+T021
+T022
+T023
+T024
+T025
+T026
+T029
+T030

Copied: ctakes/sandbox/dictionarytool/data/tiny/CtakesSnomedTuis.txt (from r1625577, ctakes/sandbox/dictionarytool/data/default/CtakesSnomedTuis.txt)
URL: http://svn.apache.org/viewvc/ctakes/sandbox/dictionarytool/data/tiny/CtakesSnomedTuis.txt?p2=ctakes/sandbox/dictionarytool/data/tiny/CtakesSnomedTuis.txt&p1=ctakes/sandbox/dictionarytool/data/default/CtakesSnomedTuis.txt&r1=1625577&r2=1636628&rev=1636628&view=diff
==============================================================================
--- ctakes/sandbox/dictionarytool/data/default/CtakesSnomedTuis.txt (original)
+++ ctakes/sandbox/dictionarytool/data/tiny/CtakesSnomedTuis.txt Tue Nov  4 16:08:25 2014
@@ -1,12 +1,12 @@
 // ANAT
-T021
-T022
-T023
-T024
-T025
-T026
-T029
-T030
+// T021
+// T022
+// T023
+// T024
+// T025
+// T026
+// T029
+// T030
 // DISO
 T019
 T020

Added: ctakes/sandbox/dictionarytool/data/tiny/CtakesSources.txt
URL: http://svn.apache.org/viewvc/ctakes/sandbox/dictionarytool/data/tiny/CtakesSources.txt?rev=1636628&view=auto
==============================================================================
--- ctakes/sandbox/dictionarytool/data/tiny/CtakesSources.txt (added)
+++ ctakes/sandbox/dictionarytool/data/tiny/CtakesSources.txt Tue Nov  4 16:08:25 2014
@@ -0,0 +1 @@
+SNOMEDCT

Added: ctakes/sandbox/dictionarytool/data/tiny/ModifierSuffixes.txt
URL: http://svn.apache.org/viewvc/ctakes/sandbox/dictionarytool/data/tiny/ModifierSuffixes.txt?rev=1636628&view=auto
==============================================================================
--- ctakes/sandbox/dictionarytool/data/tiny/ModifierSuffixes.txt (added)
+++ ctakes/sandbox/dictionarytool/data/tiny/ModifierSuffixes.txt Tue Nov  4 16:08:25 2014
@@ -0,0 +1 @@
+// Don't automatically modify for tiny

Added: ctakes/sandbox/dictionarytool/data/tiny/RemovalColonTriggers.txt
URL: http://svn.apache.org/viewvc/ctakes/sandbox/dictionarytool/data/tiny/RemovalColonTriggers.txt?rev=1636628&view=auto
==============================================================================
--- ctakes/sandbox/dictionarytool/data/tiny/RemovalColonTriggers.txt (added)
+++ ctakes/sandbox/dictionarytool/data/tiny/RemovalColonTriggers.txt Tue Nov  4 16:08:25 2014
@@ -0,0 +1,38 @@
+: - :
+: 24 hours :
+: 8 hours :
+: address :
+: arbitrary concentration :
+: duration of encounter :
+: duration of the study :
+: finding :
+: identifier :
+: impression / interpretation of study :
+: life of the unit :
+: mass :
+: number :
+: point in time :
+: prid :
+: pt :
+: substance concentration :
+: text :
+: type :
+: volume :
+: volume rate :
+// The following are in UnwantedSuffixes for default, but trigger removal for tiny
+:
+: - -
+: - : point in time : ^ patient : - : phenx
+: ( & glucose ( & plasma ) )
+: ( & nos )
+: [ & named variants ]
+: [ admin ] or [ exam ]
+: [ drivers exam ] or [ dtp20003 - hgv / publ serv claim ] or [ admin ]
+: [ early morning ] or [ mid-stream ] or [ for organism ( & culture ) ]
+: [ exam ( & admin ) ] ) or [ my22 form ]
+: [ fp78 ] or [ gp78 ]
+: [ implantation ] or [ implant ]
+: [ monitoring ] or [ clinic ]
+: [ nos ]
+: [ officer ] or [ executive ]
+: [ production ] or [ product ]

Added: ctakes/sandbox/dictionarytool/data/tiny/RemovalFunctionTriggers.txt
URL: http://svn.apache.org/viewvc/ctakes/sandbox/dictionarytool/data/tiny/RemovalFunctionTriggers.txt?rev=1636628&view=auto
==============================================================================
--- ctakes/sandbox/dictionarytool/data/tiny/RemovalFunctionTriggers.txt (added)
+++ ctakes/sandbox/dictionarytool/data/tiny/RemovalFunctionTriggers.txt Tue Nov  4 16:08:25 2014
@@ -0,0 +1,4 @@
+] or [
+] & / or [
+) or (
+) & / or (
\ No newline at end of file

Added: ctakes/sandbox/dictionarytool/data/tiny/RemovalPrefixTriggers.txt
URL: http://svn.apache.org/viewvc/ctakes/sandbox/dictionarytool/data/tiny/RemovalPrefixTriggers.txt?rev=1636628&view=auto
==============================================================================
--- ctakes/sandbox/dictionarytool/data/tiny/RemovalPrefixTriggers.txt (added)
+++ ctakes/sandbox/dictionarytool/data/tiny/RemovalPrefixTriggers.txt Tue Nov  4 16:08:25 2014
@@ -0,0 +1,37 @@
+deprecated 
+retired procedure
+// The following are in UnwantedPrefixes for the default, but removed for small
+[ d ]
+[ edta ]
+[ m ]
+[ q ]
+[ v ]
+[ x ]
+[ so ]
+activities involving
+and unspecified
+( non-specific )
+child :
+college of american pathologists cancer checklist ;
+other activity involving
+other
+phenx -
+specimen source code -
+specimen source codes -
+specimen source codes =
+ or
+#
+&
+& [
+'
+*
+-
+:
+?
+// The following are in UnwantedPrefixes for the default, but removed for tiny
+// The idea is that such prefixes would most likely not be found by the old dictionary
+// And there should be versions without these prefixes
+( congenital )
+engaged in
+entire
+h / o :

Added: ctakes/sandbox/dictionarytool/data/tiny/RemovalSuffixTriggers.txt
URL: http://svn.apache.org/viewvc/ctakes/sandbox/dictionarytool/data/tiny/RemovalSuffixTriggers.txt?rev=1636628&view=auto
==============================================================================
--- ctakes/sandbox/dictionarytool/data/tiny/RemovalSuffixTriggers.txt (added)
+++ ctakes/sandbox/dictionarytool/data/tiny/RemovalSuffixTriggers.txt Tue Nov  4 16:08:25 2014
@@ -0,0 +1,1017 @@
+- retired code
+- retired -
+- retired
+( reassigned )
+[ obsolete ]
+( deprecated )
+// The following are in UnwantedSuffixes for default, but trigger removal for small
+- ( other )
+- duplicate -
+- nos
+- retired
+- retired -
+- unspec
+- unspecified
+ as topic
+ as topics
+ nec
+ nec in snomedct
+ nos
+ not otherwise specified
+ or
+ title 18
+( or )
+( 10 )
+( _ _ _ 0 , 2 )
+( deprecated )
+( except leather and fur )
+( hl7v3 . 0 concept )
+( inactive concept )
+( mesh category )
+( namespace concept )
+( non-specific )
+( nos )
+( not elsewhere classified in lnc )
+( obsolete notation )
+( procedure nec )
+( process of differentiation )
+) ( adjudicators ( & top )
+& / or o / e
+& / or orderly ( & [ orderlies ] )
+& [ fistula ]
+& [ symptom ]
+& [ legal ]
+& [ symptom ]
+& [ vehicle ]
+( & [ reimbursement ] )
+( & [ abnormal ] )
+( & [ acquired ] )
+( & [ acute ] )
+( & [ acute nos ] )
+( & [ broad ligament ] )
+( & [ bupa examination done ] )
+( & [ c / o - ] )
+( & [ c / o ] )
+( & [ chart ] )
+( & [ duct ] or [ gland ] )
+( & [ eating observation ] )
+( & [ etc ] )
+( & [ expiratory ] )
+( & [ eye disorder ] )
+( & [ government ] )
+( & [ hall ] )
+( & [ infection ] )
+( & [ keen ] )
+( & [ labor complicated by ] )
+( & [ laborer nos ] )
+( & [ labour complicated by ] )
+( & [ labourer nos ] )
+( & [ machine ] and / or [ press ] )
+( & [ member of phct ] )
+( & [ miners ( & coal )
+( & [ named variants ] )
+( & [ nec ] )
+( & [ nos ] )
+( & [ obstructive ] )
+( & [ occupation ] )
+( & [ occupations ] )
+( & [ officer ] )
+( & [ product ] )
+( & [ refer ] )
+( & [ referral ] )
+( & [ referral to ] )
+( & [ reimbursement ] )
+( & [ retail ] )
+( & [ rubber ] )
+( & [ staff ] )
+( & [ staghorn ] )
+( & [ symptom ] )
+( & [ teachers ] )
+( & [ unmarried ] )
+( & [ unspecified ] )
+( & [ warfarin ] )
+( & [ weightlessness ] )
+( & [ well ] )
+( & acquired )
+( & acute )
+( & admin )
+( & administration )
+( & ankle joint )
+( & assess )
+( & attic )
+( & bladder )
+( & blind )
+( & c / o )
+( & c / s )
+( & certificate )
+( & chronic )
+( & claim )
+( & claim gp / ms )
+( & congenital )
+( & conjunctiva )
+( & consent )
+( & dialysis )
+( & direct )
+( & disorder )
+( & distal )
+( & drug )
+( & expiratory )
+( & f303 )
+( & floor / wall )
+( & food poison )
+( & form )
+( & form gp45 ) )
+( & form : [ fp70a ] or [ gp / pea ] )
+( & from patient )
+( & held )
+( & hepatic duct )
+( & in blood )
+( & infection )
+( & infestation )
+( & level ( & phenylalanine ) )
+( & level )
+( & localization )
+( & marital )
+( & maternity allowances )
+( & mobile unit x-ray )
+( & named variants )
+( & nec )
+( & non urg )
+( & nos )
+( & o / e )
+( & on medical advice )
+( & operation )
+( & other )
+( & pathology )
+( & patient data )
+( & peritonsillar region )
+( & pleural )
+( & provision )
+( & record )
+( & referred to )
+( & report )
+( & result )
+( & sample )
+( & sc1 - given to staff )
+( & scrotal skin )
+( & short hip rotator )
+( & sinus )
+( & skin )
+( & skin of external ear )
+( & skin of external nose )
+( & skin of eyebrow )
+( & skin of eyelid )
+( & skin of male perineum )
+( & skin of nipple )
+( & skin of penis )
+( & skin of vulva )
+( & stock order gp10a )
+( & swab )
+( & symptom )
+( & tanner )
+( & top )
+( & top managers )
+( & traumatic )
+( & umbilical skin )
+( & unspecified )
+( & upper )
+( & vaccination )
+( & various types )
+( & viral )
+( & with stabilisation )
+( & with stabilization )
+( * )
+( _ _ _ per hpf )
+( + / - religion )
+( = breath )
+( = breath )
+( = stone )
+( = stone )
+[ & aircraft ]
+[ & bacterial ]
+[ & glucose ]
+[ & named variants ]
+[ & nos ]
+[ & professional ]
+[ & symptom ]
+[ ambiguous ]
+[ dup ]
+[ duplicate ]
+[ obsolete ]
+[ other ]
+, nec in snomedct
+, nos
+, not elsewhere classified
+, unclassified
+, unspecified
+, unspecified part
+concept ( inactive concept )
+reason not stated ( inactive concept )
+// New for small
+&
+// The following are in UnwantedSuffixes for the default, but removed for tiny
+// The idea is that such suffixes would most likely not be found by the old dictionary
+// And there should be versions without these suffixes
+- ( religious affiliation )
+- a specialty area of medicine
+- action
+- activity of daily living
+- assess ( & non urg )
+- attribute
+- biologic function
+- category
+- confectionery
+- dietary
+- entertainment ( & [ musical ]
+- finding
+- function
+- general -
+- herbal supplement
+- mental defense mechanism
+- mental process
+- observation
+- organization
+- physical agent
+- related finding
+- rolecode
+- speciality
+- specialty
+- summary
+- symptom
+- weather
+- sport
+( a priority lower than stat )
+( abnormality )
+( abo blood group )
+( acholeplasma host )
+( acknowledgementcondition )
+( acquired )
+( action )
+( action )
+( activated )
+( activity )
+( activity )
+( actual )
+( adjudicators ( & top )
+( adl finding )
+( administative )
+( administration method )
+( administrative concept )
+( age group )
+( alcohol )
+( all components )
+( all fractions of )
+( allelic variant )
+( alone )
+( alphabetic group )
+( alternative )
+( amount )
+( amphibian )
+( anatomical classification )
+( anatomical coordinate )
+( anatomical descriptor )
+( anatomical feature )
+( anatomical location )
+( anatomy )
+( and accessories )
+( and related agents )
+( angiosperm hosts )
+( animal )
+( any part )
+( apparatus )
+( applies to non-numeric results )
+( area of increased mortality )
+( area of knowledge )
+( arrangement )
+( art )
+( artificial )
+( artistic )
+( ascomycota )
+( assessment )
+( assessment scale )
+( astronomy )
+( at present )
+( attitude )
+( attribute )
+( attributes )
+( audit )
+( australia )
+( available ( qualifier ) )
+( bacteria )
+( bacteria and cyanobacteria hosts )
+( battery - powered )
+( beach )
+( beach , court )
+( beet sugar )
+( behavior )
+( behavior finding )
+( beverage )
+( biologic function )
+( biological )
+( biological function )
+( biological funtion )
+( biopsy result )
+( bird )
+( blacus )
+( blood components )
+( blood pressure )
+( body part )
+( body structure )
+( body substance )
+( botanical taxonomic category below variety )
+( brain )
+( brain structure )
+( button )
+( calcified structure )
+( canal )
+( cancer stage )
+( candy )
+( cardiac conduction )
+( category )
+( catering and lodging services )
+( cell )
+( cell line )
+( cell structure )
+( cells or organisms )
+( central nervous system tumor )
+( central nervous system tumour )
+( cerebellum )
+( certification )
+( change )
+( characteristic )
+( chemical )
+( chemical / related processes )
+( chemical and related processes )
+( chemical processes , except petroleum )
+( citrus )
+( class )
+( classical )
+( classification )
+( cleansing action )
+( clinical )
+( clinical definition )
+( clinical disorder )
+( clinical specialty )
+( color )
+( combined site )
+( comfort level finding )
+( communication )
+( communicationfunctiontype )
+( community )
+( company )
+( compartment )
+( complement activation )
+( complete ) ( partial )
+( component )
+( composite )
+( compression algorithm )
+( computer )
+( computer files directory )
+( computer network )
+( concept )
+( construction )
+( container status )
+( containing tobacco )
+( context - dependent category )
+( contextual qualifier )
+( copy in notes )
+( copy within chart )
+( court )
+( crustacea )
+( crustacean )
+( data - processing )
+( data )
+( data type )
+( dataset )
+( dedicated instruments )
+( degree / certification )
+( degree / license / certification )
+( dental procedure )
+( detection and tagging )
+( deviation )
+( device )
+( device status )
+( diagnosis )
+( diagnostic , surgical , and therapeutic interventions )
+( dicotyledons hosts )
+( dietary )
+( digestive structure of animal )
+( diminutive )
+( direction )
+( discharge )
+( discipline )
+( disease )
+( disease attribute )
+( diskus intervertebralis )
+( disorder )
+( do as near as possible to requested time )
+( do immediately )
+( document )
+( document type )
+( documents )
+( dry - cleaner )
+( dysfunction )
+( economic )
+( economic descriptor )
+( economic status )
+( edible )
+( edible oils )
+( education )
+( electric power )
+( electrical )
+( embryology )
+( embryonic structure )
+( emergency )
+( emotion )
+( emotional response )
+( end postition )
+( england & wales )
+( england and wales )
+( entertainment )
+( entire collection )
+( entity )
+( environment )
+( environmental process )
+( epidemiology )
+( equipment )
+( ethnic group )
+( eukaryote )
+( event )
+( event history )
+( except advertising )
+( except aircraft and marine engines )
+( except aircraft and marine turbines )
+( except construction )
+( except electrical )
+( except lawyer or judge )
+( except motor vehicle )
+( except oil and gas wells )
+( except photo - gravers )
+( except rubber )
+( except ships ' engines )
+( except tire maker / tire vulcanizer )
+( except tire makers and tire vulcanisers )
+( except tire makers and tire vulcanizers )
+( except transport )
+( experimental )
+( experimental system )
+( exposure / injury )
+( extensiveness )
+( external anatomical feature )
+( eye )
+( family )
+( family issue )
+( female )
+( field )
+( fiinding )
+( film )
+( films )
+( finding )
+( findings )
+( fish )
+( flies )
+( flower )
+( follower of religion - person )
+( food )
+( for any age )
+( for cleaning purpose )
+( for gunter ' s chain )
+( for pain relief )
+( for staff )
+( for transmitting doses for pharmacokinetics )
+( formal meetings )
+( formation )
+( foundry )
+( freight train )
+( frequency )
+( fruit )
+( function )
+( function )
+( fungi hosts )
+( fungus )
+( garment )
+( gauge size )
+( general )
+( generic version )
+( genetics )
+( genu )
+( geographic )
+( geographic area )
+( geographic location )
+( glandula suprarenalis )
+( goal )
+( grade )
+( gramineae hosts )
+( grooming )
+( grooming )
+( group )
+( group of parts )
+( guideline )
+( gunter ' s chain )
+( hand - operated machine )
+( heading )
+( health )
+( health care )
+( healthcare )
+( helper cells )
+( hemorrhage )
+( high voltage )
+( histology )
+( history )
+( horizontal architectural support )
+( horse )
+( hosiery )
+( human only )
+( hypothalamus )
+( i . q . . . . . )
+( iadl finding )
+( idea )
+( identifier )
+( image )
+( in-store merchandise )
+( in animal tissues )
+( in vivo )
+( including digits )
+( indicator )
+( infection )
+( injury )
+( input )
+( inquiry )
+( insect )
+( insect hosts )
+( instep )
+( instruction )
+( instruction imperative )
+( intake )
+( interactions )
+( intercourse )
+( invertebrate )
+( island )
+( january 2007 release )
+( kind of quantity )
+( lab finding )
+( lab procedure )
+( lab result )
+( lab technique )
+( lab test )
+( laboratory finding )
+( laboratory procedure )
+( land - based )
+( laparotomy performed )
+( length )
+( lepidoptora hosts )
+( libraries )
+( libraries )
+( life event - finding )
+( life style )
+( line - film )
+( lingual , palatine , pharyngeal )
+( link assertion )
+( linkage concept )
+( liquid )
+( list bullets )
+( list separately in addition to code for primary procedure )
+( living organism )
+( location )
+( logging )
+( long - distance transport )
+( lower case a )
+( machine )
+( machine , except jacquard loom )
+( machines )
+( male )
+( malting )
+( mammal )
+( mapping )
+( marine pseudomonad hosts )
+( masonry and woodwork )
+( material )
+( mating )
+( measurement )
+( medical device )
+( medically )
+( medium of exchange )
+( medulla )
+( medullae )
+( medullary reticular formation )
+( member of religious order )
+( membrane )
+( mental function )
+( mental process )
+( metabolim test )
+( metal )
+( metal melting , except cupola )
+( metal reheating )
+( method )
+( military )
+( mine )
+( mine / quarry )
+( mine and quarry )
+( minor structure )
+( miscellaneous )
+( modified )
+( modifier )
+( molecular function )
+( molecular structure )
+( morphologic abnormality )
+( morphologic abnormality , calcified structure )
+( morphology )
+( motion )
+( motion picture )
+( motivation )
+( musical )
+( nanoparticle )
+( natural and artificial )
+( nature of procedure values )
+( navigational concept )
+( neck )
+( nmo )
+( non-ferrous metals )
+( non medication )
+( nonoperative )
+( not alarm reaction )
+( number )
+( numerical qualifier )
+( object )
+( observable )
+( observable entity )
+( observation )
+( observation or disorder )
+( occasion for eating )
+( occupation )
+( occupation history )
+( occupation or discipline )
+( occupational group )
+( ocular )
+( oil / gas wells )
+( oil and gas wells )
+( oilfield )
+( oils and fats )
+( old people ' s home )
+( operative )
+( or )
+( oral challenge )
+( ore smelting )
+( organ )
+( organ component )
+( organ part )
+( organisation )
+( organism )
+( organization )
+( origin )
+( origins )
+( orthodontic procedure )
+( other )
+( output )
+( package insert )
+( pairs )
+( pairs )
+( palatine )
+( parameter )
+( part organ )
+( patient )
+( patient instructions )
+( pdq )
+( per time )
+( per volume )
+( person )
+( personality )
+( persons )
+( pertaining to a node )
+( petroleum refining )
+( phenomenon )
+( philosophy )
+( physical activity )
+( physical activity )
+( physical device )
+( physical finding )
+( physical force )
+( physical force )
+( physical geography )
+( physical object )
+( physical object )
+( physician )
+( physician group )
+( physiologic )
+( physiologic function )
+( physiology )
+( planet )
+( plant )
+( plant hosts )
+( plant part )
+( plants hosts )
+( platelet )
+( polypectomy )
+( population )
+( population )
+( population group )
+( position )
+( postinfection )
+( postoperative )
+( potency unit )
+( power station )
+( ppc - c )
+( prescription )
+( present illness )
+( present time )
+( press / radio / t . v . )
+( primary )
+( primary procedure )
+( printing )
+( private service )
+( procedure )
+( procedure detail )
+( procedure or disorder )
+( process )
+( process of discrimination )
+( product )
+( profession )
+( property )
+( property of measurement )
+( prosthesis )
+( protons , ions )
+( protozoa )
+( provider role )
+( psychiatric )
+( psychiatry )
+( psychologic )
+( psychological function )
+( psychology )
+( psychophysiology )
+( psychotherapy )
+( publication )
+( publication format )
+( publications )
+( pure mathematics )
+( qualifier )
+( qualifier in lnc )
+( qualifier value )
+( qualifier value )
+( quality )
+( race )
+( racial group )
+( radio / t . v . )
+( radio / television )
+( radio / television / radar equipment )
+( radio broadcasts )
+( radioactive materials )
+( radiologist )
+( rate )
+( reassigned )
+( record )
+( record artifact )
+( recreational activity )
+( reference documentation )
+( regime / therapy )
+( region )
+( region of foot )
+( rehabilitation )
+( reimbursement )
+( relative )
+( religion )
+( religion / philosophy )
+( reptile )
+( respiratory measure )
+( response modality )
+( result )
+( revised )
+( rifle )
+( road )
+( role )
+( rolling mill )
+( route )
+( route of administration )
+( schedule frequency )
+( science )
+( scotland )
+( sea )
+( season )
+( second level )
+( section of report )
+( self - care status )
+( sensation )
+( separate procedure )
+( serotype )
+( serous membrane )
+( services standards )
+( set of actions )
+( severe )
+( severity modifier )
+( shape )
+( shin )
+( ship )
+( singles )
+( site )
+( site of care )
+( situation )
+( sleep disorder )
+( slowed )
+( small pieces )
+( snails )
+( social behavior )
+( social concept )
+( social status )
+( sound )
+( sound intensity )
+( source type )
+( spatial )
+( special concept )
+( specialised )
+( specialized )
+( specialty )
+( specific stage )
+( specify )
+( specify level )
+( specify type )
+( specimen )
+( specimen source code )
+( spice )
+( sport )
+( sports activity )
+( stage / studio )
+( stage and studio )
+( staging scale )
+( statement )
+( statistical )
+( statistical adjustment )
+( statistical procedure )
+( statistical technique )
+( status )
+( steam )
+( steel )
+( steel converting )
+( steel refining )
+( still camera )
+( strain )
+( structure )
+( subdivision )
+( substance )
+( substance / material )
+( substance status )
+( substantia innominata )
+( substitution condition )
+( suffix )
+( sugar refining )
+( suppressor cells )
+( surface )
+( surface region )
+( surface region of back )
+( surgical )
+( surgical procedure )
+( surgical speciality )
+( surgical view )
+( symbol )
+( symptom )
+( system )
+( taxon )
+( technician / technologist )
+( techniques )
+( temporal )
+( temporal concept )
+( temporal qualifier )
+( terminology entity )
+( test result )
+( testicles )
+( theater )
+( theatre )
+( theoretical )
+( therapeutic )
+( therapy )
+( therapy ) ( diagnostic id )
+( third level )
+( time frame )
+( time pattern )
+( time period )
+( time point or interval )
+( tissue )
+( tissue integrity )
+( to be done prior to surgery )
+( traffic )
+( transanal disk excision )
+( transmission )
+( transportation )
+( treatment )
+( trust territory )
+( tumor staging )
+( type )
+( type of material , substance )
+( u . s . )
+( uniformity )
+( unit )
+( unit of mass )
+( unit of measure )
+( unit of pressure )
+( unmodified )
+( upper )
+( us )
+( use calc )
+( use of drug )
+( use only with codes for immunization )
+( used to identify procedures performed on the left side of the body )
+( used to identify procedures performed on the right side of the body )
+( uterine )
+( value )
+( variant )
+( vegetable )
+( version 4 )
+( vertebral column )
+( vertebrale )
+( vocalization )
+( volume )
+( wait and see )
+( wasp )
+( waterworks )
+( weight )
+( wood distillation )
+( workshop )
+( yes / no )
+( zoology )
+, cs
+, device
+, disorder
+, follower of religion
+, function
+, life event
+, services & procedure codes ( category iii , temporary )
+, united states
+[ any part ]
+[ a body part ]
+[ action ]
+[ activity ]
+[ adjective ]
+[ an instruction material ]
+[ any part ]
+[ artwork ]
+[ ball ]
+[ body ]
+[ body function ]
+[ body part ]
+[ body parts ]
+[ character ]
+[ chemical / ingredient ]
+[ clothing ]
+[ color ]
+[ device ]
+[ disease / finding ]
+[ document ]
+[ dose form ]
+[ economics ]
+[ environment ]
+[ executive duty ]
+[ extent ]
+[ finding ]
+[ fish ]
+[ follower of religion ]
+[ for organism ( & culture ) ]
+[ furniture ]
+[ game ]
+[ garment ]
+[ hiv ]
+[ in an environmental context ]
+[ in social context ]
+[ mammal ]
+[ manufactured object ]
+[ manufactured product ]
+[ mating posture ]
+[ medical device ]
+[ mental function ]
+[ mental process ]
+[ moa ]
+[ molecular function ]
+[ money ]
+[ motion ]
+[ ngu ]
+[ object ]
+[ occupation ]
+[ organism ] or [ infection - shingles ]
+[ organization ]
+[ paper report ]
+[ pe ]
+[ people ]
+[ physical phenomenon ]
+[ plan ]
+[ plant ]
+[ procedure ]
+[ product ]
+[ profession ]
+[ psychology ]
+[ publication type ]
+[ race ]
+[ reference materials ]
+[ room ]
+[ sample ]
+[ specialty ]
+[ sports ]
+[ statistical concept ]
+[ status ]
+[ substance ]
+[ surgical action ]
+[ uniformity ]
+[ unit of measurement ]
+[ unit of pressure ]
+] ( person )
+{ organization ]
+< subjective >
+epidemiology
+status ( & consent )
+

Added: ctakes/sandbox/dictionarytool/data/tiny/RightAbbreviations.txt
URL: http://svn.apache.org/viewvc/ctakes/sandbox/dictionarytool/data/tiny/RightAbbreviations.txt?rev=1636628&view=auto
==============================================================================
--- ctakes/sandbox/dictionarytool/data/tiny/RightAbbreviations.txt (added)
+++ ctakes/sandbox/dictionarytool/data/tiny/RightAbbreviations.txt Tue Nov  4 16:08:25 2014
@@ -0,0 +1 @@
+// No automatic abbreviation extraction for tiny
\ No newline at end of file

Added: ctakes/sandbox/dictionarytool/data/tiny/UnwantedPrefixes.txt
URL: http://svn.apache.org/viewvc/ctakes/sandbox/dictionarytool/data/tiny/UnwantedPrefixes.txt?rev=1636628&view=auto
==============================================================================
--- ctakes/sandbox/dictionarytool/data/tiny/UnwantedPrefixes.txt (added)
+++ ctakes/sandbox/dictionarytool/data/tiny/UnwantedPrefixes.txt Tue Nov  4 16:08:25 2014
@@ -0,0 +1 @@
+// No unwanted prefixes for tiny, only removal
\ No newline at end of file

Added: ctakes/sandbox/dictionarytool/data/tiny/UnwantedSuffixes.txt
URL: http://svn.apache.org/viewvc/ctakes/sandbox/dictionarytool/data/tiny/UnwantedSuffixes.txt?rev=1636628&view=auto
==============================================================================
--- ctakes/sandbox/dictionarytool/data/tiny/UnwantedSuffixes.txt (added)
+++ ctakes/sandbox/dictionarytool/data/tiny/UnwantedSuffixes.txt Tue Nov  4 16:08:25 2014
@@ -0,0 +1,4 @@
+/
+'
+-
+,