You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@ctakes.apache.org by pk...@apache.org on 2016/06/06 11:56:42 UTC

svn commit: r1746984 [1/3] - in /ctakes/sandbox/ctakes-clinical-deid/src: main/resources/wordlists/ main/ruta/org/apache/ctakes/deid/ test/java/org/apache/ctakes/deid/

Author: pkluegl
Date: Mon Jun  6 11:56:42 2016
New Revision: 1746984

URL: http://svn.apache.org/viewvc?rev=1746984&view=rev
Log:
CTAKES-384
- fixed some rules
- removed whitespaces in wordlists

Modified:
    ctakes/sandbox/ctakes-clinical-deid/src/main/resources/wordlists/age_post_ind.txt
    ctakes/sandbox/ctakes-clinical-deid/src/main/resources/wordlists/country.txt
    ctakes/sandbox/ctakes-clinical-deid/src/main/resources/wordlists/deceased_ind.txt
    ctakes/sandbox/ctakes-clinical-deid/src/main/resources/wordlists/dr_postfix.txt
    ctakes/sandbox/ctakes-clinical-deid/src/main/resources/wordlists/dr_prefix1.txt
    ctakes/sandbox/ctakes-clinical-deid/src/main/resources/wordlists/nationality.txt
    ctakes/sandbox/ctakes-clinical-deid/src/main/resources/wordlists/profession.txt
    ctakes/sandbox/ctakes-clinical-deid/src/main/resources/wordlists/spoken_language.txt
    ctakes/sandbox/ctakes-clinical-deid/src/main/resources/wordlists/us_state.txt
    ctakes/sandbox/ctakes-clinical-deid/src/main/resources/wordlists/us_state_acronym_abbreviation.txt
    ctakes/sandbox/ctakes-clinical-deid/src/main/ruta/org/apache/ctakes/deid/Date.ruta
    ctakes/sandbox/ctakes-clinical-deid/src/main/ruta/org/apache/ctakes/deid/Deid.ruta
    ctakes/sandbox/ctakes-clinical-deid/src/main/ruta/org/apache/ctakes/deid/Dictionaries.ruta
    ctakes/sandbox/ctakes-clinical-deid/src/main/ruta/org/apache/ctakes/deid/Doctor.ruta
    ctakes/sandbox/ctakes-clinical-deid/src/main/ruta/org/apache/ctakes/deid/Fax.ruta
    ctakes/sandbox/ctakes-clinical-deid/src/main/ruta/org/apache/ctakes/deid/Patient.ruta
    ctakes/sandbox/ctakes-clinical-deid/src/main/ruta/org/apache/ctakes/deid/State.ruta
    ctakes/sandbox/ctakes-clinical-deid/src/test/java/org/apache/ctakes/deid/I2B2Evaluation.java

Modified: ctakes/sandbox/ctakes-clinical-deid/src/main/resources/wordlists/age_post_ind.txt
URL: http://svn.apache.org/viewvc/ctakes/sandbox/ctakes-clinical-deid/src/main/resources/wordlists/age_post_ind.txt?rev=1746984&r1=1746983&r2=1746984&view=diff
==============================================================================
--- ctakes/sandbox/ctakes-clinical-deid/src/main/resources/wordlists/age_post_ind.txt (original)
+++ ctakes/sandbox/ctakes-clinical-deid/src/main/resources/wordlists/age_post_ind.txt Mon Jun  6 11:56:42 2016
@@ -1,4 +1,4 @@
-years old
+yearsold
 y.o
 /
 m

Modified: ctakes/sandbox/ctakes-clinical-deid/src/main/resources/wordlists/country.txt
URL: http://svn.apache.org/viewvc/ctakes/sandbox/ctakes-clinical-deid/src/main/resources/wordlists/country.txt?rev=1746984&r1=1746983&r2=1746984&view=diff
==============================================================================
--- ctakes/sandbox/ctakes-clinical-deid/src/main/resources/wordlists/country.txt (original)
+++ ctakes/sandbox/ctakes-clinical-deid/src/main/resources/wordlists/country.txt Mon Jun  6 11:56:42 2016
@@ -178,14 +178,14 @@ Israel
 Herzegovina
 England
 America
-Puerto Rico
-Sri Lanka
-Costa Rica
-United Kingdom
+PuertoRico
+SriLanka
+CostaRica
+UnitedKingdom
 UK
-United States
-Ivory Coast
-Saudi Arabia
-South Korea
-North Korea
-Trinidad and Tobago
+UnitedStates
+IvoryCoast
+SaudiArabia
+SouthKorea
+NorthKorea
+TrinidadandTobago

Modified: ctakes/sandbox/ctakes-clinical-deid/src/main/resources/wordlists/deceased_ind.txt
URL: http://svn.apache.org/viewvc/ctakes/sandbox/ctakes-clinical-deid/src/main/resources/wordlists/deceased_ind.txt?rev=1746984&r1=1746983&r2=1746984&view=diff
==============================================================================
--- ctakes/sandbox/ctakes-clinical-deid/src/main/resources/wordlists/deceased_ind.txt (original)
+++ ctakes/sandbox/ctakes-clinical-deid/src/main/resources/wordlists/deceased_ind.txt Mon Jun  6 11:56:42 2016
@@ -1,3 +1,3 @@
-passed away
+passedaway
 dies
 deceased
\ No newline at end of file

Modified: ctakes/sandbox/ctakes-clinical-deid/src/main/resources/wordlists/dr_postfix.txt
URL: http://svn.apache.org/viewvc/ctakes/sandbox/ctakes-clinical-deid/src/main/resources/wordlists/dr_postfix.txt?rev=1746984&r1=1746983&r2=1746984&view=diff
==============================================================================
--- ctakes/sandbox/ctakes-clinical-deid/src/main/resources/wordlists/dr_postfix.txt (original)
+++ ctakes/sandbox/ctakes-clinical-deid/src/main/resources/wordlists/dr_postfix.txt Mon Jun  6 11:56:42 2016
@@ -1,3 +1,4 @@
+M.D.
 MD
 NP
 PA-C

Modified: ctakes/sandbox/ctakes-clinical-deid/src/main/resources/wordlists/dr_prefix1.txt
URL: http://svn.apache.org/viewvc/ctakes/sandbox/ctakes-clinical-deid/src/main/resources/wordlists/dr_prefix1.txt?rev=1746984&r1=1746983&r2=1746984&view=diff
==============================================================================
--- ctakes/sandbox/ctakes-clinical-deid/src/main/resources/wordlists/dr_prefix1.txt (original)
+++ ctakes/sandbox/ctakes-clinical-deid/src/main/resources/wordlists/dr_prefix1.txt Mon Jun  6 11:56:42 2016
@@ -5,4 +5,4 @@ PCP
 Transcribed
 Dictated
 electronically
-signed recommended
\ No newline at end of file
+signedrecommended
\ No newline at end of file

Modified: ctakes/sandbox/ctakes-clinical-deid/src/main/resources/wordlists/nationality.txt
URL: http://svn.apache.org/viewvc/ctakes/sandbox/ctakes-clinical-deid/src/main/resources/wordlists/nationality.txt?rev=1746984&r1=1746983&r2=1746984&view=diff
==============================================================================
--- ctakes/sandbox/ctakes-clinical-deid/src/main/resources/wordlists/nationality.txt (original)
+++ ctakes/sandbox/ctakes-clinical-deid/src/main/resources/wordlists/nationality.txt Mon Jun  6 11:56:42 2016
@@ -53,7 +53,7 @@ Djibouti
 Djiboutian
 Dominican
 Dutch
-East Timorese
+EastTimorese
 Ecuadorean
 Ecuadorian
 Egyptian
@@ -137,12 +137,12 @@ Mozambican
 Namibian
 Nauruan
 Nepalese
-New Zealander
+NewZealander
 Nicaraguan
 Nigerian
 Nigerien
-Northern Irish
-North Korean
+NorthernIrish
+NorthKorean
 Norwegian
 Omani
 Pakistani
@@ -170,12 +170,12 @@ Slovak
 Slovakian
 Slovene
 Slovenian
-Solomon Islander
+SolomonIslander
 Somali
-South African
-South Korean
+SouthAfrican
+SouthKorean
 Spanish
-Sri Lankan
+SriLankan
 Sudanese
 Surinamer
 Surinamese
@@ -217,12 +217,12 @@ Zairean
 Zambian
 Zimbabwean
 English
-San Marinese
-Sao Tomean
-Papua New Guinean
-Western Samoan
-Saint Lucian
-Sierra Leonean
-Sierra Leonian
-Equatorial Guinean
+SanMarinese
+SaoTomean
+PapuaNewGuinean
+WesternSamoan
+SaintLucian
+SierraLeonean
+SierraLeonian
+EquatorialGuinean