You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@ctakes.apache.org by pk...@apache.org on 2016/06/06 11:56:42 UTC
svn commit: r1746984 [1/3] - in /ctakes/sandbox/ctakes-clinical-deid/src:
main/resources/wordlists/ main/ruta/org/apache/ctakes/deid/
test/java/org/apache/ctakes/deid/
Author: pkluegl
Date: Mon Jun 6 11:56:42 2016
New Revision: 1746984
URL: http://svn.apache.org/viewvc?rev=1746984&view=rev
Log:
CTAKES-384
- fixed some rules
- removed whitespaces in wordlists
Modified:
ctakes/sandbox/ctakes-clinical-deid/src/main/resources/wordlists/age_post_ind.txt
ctakes/sandbox/ctakes-clinical-deid/src/main/resources/wordlists/country.txt
ctakes/sandbox/ctakes-clinical-deid/src/main/resources/wordlists/deceased_ind.txt
ctakes/sandbox/ctakes-clinical-deid/src/main/resources/wordlists/dr_postfix.txt
ctakes/sandbox/ctakes-clinical-deid/src/main/resources/wordlists/dr_prefix1.txt
ctakes/sandbox/ctakes-clinical-deid/src/main/resources/wordlists/nationality.txt
ctakes/sandbox/ctakes-clinical-deid/src/main/resources/wordlists/profession.txt
ctakes/sandbox/ctakes-clinical-deid/src/main/resources/wordlists/spoken_language.txt
ctakes/sandbox/ctakes-clinical-deid/src/main/resources/wordlists/us_state.txt
ctakes/sandbox/ctakes-clinical-deid/src/main/resources/wordlists/us_state_acronym_abbreviation.txt
ctakes/sandbox/ctakes-clinical-deid/src/main/ruta/org/apache/ctakes/deid/Date.ruta
ctakes/sandbox/ctakes-clinical-deid/src/main/ruta/org/apache/ctakes/deid/Deid.ruta
ctakes/sandbox/ctakes-clinical-deid/src/main/ruta/org/apache/ctakes/deid/Dictionaries.ruta
ctakes/sandbox/ctakes-clinical-deid/src/main/ruta/org/apache/ctakes/deid/Doctor.ruta
ctakes/sandbox/ctakes-clinical-deid/src/main/ruta/org/apache/ctakes/deid/Fax.ruta
ctakes/sandbox/ctakes-clinical-deid/src/main/ruta/org/apache/ctakes/deid/Patient.ruta
ctakes/sandbox/ctakes-clinical-deid/src/main/ruta/org/apache/ctakes/deid/State.ruta
ctakes/sandbox/ctakes-clinical-deid/src/test/java/org/apache/ctakes/deid/I2B2Evaluation.java
Modified: ctakes/sandbox/ctakes-clinical-deid/src/main/resources/wordlists/age_post_ind.txt
URL: http://svn.apache.org/viewvc/ctakes/sandbox/ctakes-clinical-deid/src/main/resources/wordlists/age_post_ind.txt?rev=1746984&r1=1746983&r2=1746984&view=diff
==============================================================================
--- ctakes/sandbox/ctakes-clinical-deid/src/main/resources/wordlists/age_post_ind.txt (original)
+++ ctakes/sandbox/ctakes-clinical-deid/src/main/resources/wordlists/age_post_ind.txt Mon Jun 6 11:56:42 2016
@@ -1,4 +1,4 @@
-years old
+yearsold
y.o
/
m
Modified: ctakes/sandbox/ctakes-clinical-deid/src/main/resources/wordlists/country.txt
URL: http://svn.apache.org/viewvc/ctakes/sandbox/ctakes-clinical-deid/src/main/resources/wordlists/country.txt?rev=1746984&r1=1746983&r2=1746984&view=diff
==============================================================================
--- ctakes/sandbox/ctakes-clinical-deid/src/main/resources/wordlists/country.txt (original)
+++ ctakes/sandbox/ctakes-clinical-deid/src/main/resources/wordlists/country.txt Mon Jun 6 11:56:42 2016
@@ -178,14 +178,14 @@ Israel
Herzegovina
England
America
-Puerto Rico
-Sri Lanka
-Costa Rica
-United Kingdom
+PuertoRico
+SriLanka
+CostaRica
+UnitedKingdom
UK
-United States
-Ivory Coast
-Saudi Arabia
-South Korea
-North Korea
-Trinidad and Tobago
+UnitedStates
+IvoryCoast
+SaudiArabia
+SouthKorea
+NorthKorea
+TrinidadandTobago
Modified: ctakes/sandbox/ctakes-clinical-deid/src/main/resources/wordlists/deceased_ind.txt
URL: http://svn.apache.org/viewvc/ctakes/sandbox/ctakes-clinical-deid/src/main/resources/wordlists/deceased_ind.txt?rev=1746984&r1=1746983&r2=1746984&view=diff
==============================================================================
--- ctakes/sandbox/ctakes-clinical-deid/src/main/resources/wordlists/deceased_ind.txt (original)
+++ ctakes/sandbox/ctakes-clinical-deid/src/main/resources/wordlists/deceased_ind.txt Mon Jun 6 11:56:42 2016
@@ -1,3 +1,3 @@
-passed away
+passedaway
dies
deceased
\ No newline at end of file
Modified: ctakes/sandbox/ctakes-clinical-deid/src/main/resources/wordlists/dr_postfix.txt
URL: http://svn.apache.org/viewvc/ctakes/sandbox/ctakes-clinical-deid/src/main/resources/wordlists/dr_postfix.txt?rev=1746984&r1=1746983&r2=1746984&view=diff
==============================================================================
--- ctakes/sandbox/ctakes-clinical-deid/src/main/resources/wordlists/dr_postfix.txt (original)
+++ ctakes/sandbox/ctakes-clinical-deid/src/main/resources/wordlists/dr_postfix.txt Mon Jun 6 11:56:42 2016
@@ -1,3 +1,4 @@
+M.D.
MD
NP
PA-C
Modified: ctakes/sandbox/ctakes-clinical-deid/src/main/resources/wordlists/dr_prefix1.txt
URL: http://svn.apache.org/viewvc/ctakes/sandbox/ctakes-clinical-deid/src/main/resources/wordlists/dr_prefix1.txt?rev=1746984&r1=1746983&r2=1746984&view=diff
==============================================================================
--- ctakes/sandbox/ctakes-clinical-deid/src/main/resources/wordlists/dr_prefix1.txt (original)
+++ ctakes/sandbox/ctakes-clinical-deid/src/main/resources/wordlists/dr_prefix1.txt Mon Jun 6 11:56:42 2016
@@ -5,4 +5,4 @@ PCP
Transcribed
Dictated
electronically
-signed recommended
\ No newline at end of file
+signedrecommended
\ No newline at end of file
Modified: ctakes/sandbox/ctakes-clinical-deid/src/main/resources/wordlists/nationality.txt
URL: http://svn.apache.org/viewvc/ctakes/sandbox/ctakes-clinical-deid/src/main/resources/wordlists/nationality.txt?rev=1746984&r1=1746983&r2=1746984&view=diff
==============================================================================
--- ctakes/sandbox/ctakes-clinical-deid/src/main/resources/wordlists/nationality.txt (original)
+++ ctakes/sandbox/ctakes-clinical-deid/src/main/resources/wordlists/nationality.txt Mon Jun 6 11:56:42 2016
@@ -53,7 +53,7 @@ Djibouti
Djiboutian
Dominican
Dutch
-East Timorese
+EastTimorese
Ecuadorean
Ecuadorian
Egyptian
@@ -137,12 +137,12 @@ Mozambican
Namibian
Nauruan
Nepalese
-New Zealander
+NewZealander
Nicaraguan
Nigerian
Nigerien
-Northern Irish
-North Korean
+NorthernIrish
+NorthKorean
Norwegian
Omani
Pakistani
@@ -170,12 +170,12 @@ Slovak
Slovakian
Slovene
Slovenian
-Solomon Islander
+SolomonIslander
Somali
-South African
-South Korean
+SouthAfrican
+SouthKorean
Spanish
-Sri Lankan
+SriLankan
Sudanese
Surinamer
Surinamese
@@ -217,12 +217,12 @@ Zairean
Zambian
Zimbabwean
English
-San Marinese
-Sao Tomean
-Papua New Guinean
-Western Samoan
-Saint Lucian
-Sierra Leonean
-Sierra Leonian
-Equatorial Guinean
+SanMarinese
+SaoTomean
+PapuaNewGuinean
+WesternSamoan
+SaintLucian
+SierraLeonean
+SierraLeonian
+EquatorialGuinean