You are viewing a plain text version of this content. The canonical link for it is here.
Posted to notifications@ctakes.apache.org by "Tomasz Oliwa (JIRA)" <ji...@apache.org> on 2015/11/16 22:06:11 UTC
[jira] [Closed] (CTAKES-389) cTAKES dictionary lookup missed word
starting string bug
[ https://issues.apache.org/jira/browse/CTAKES-389?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Tomasz Oliwa closed CTAKES-389.
-------------------------------
Check out the 'ctakes/trunk' with this fix and run it on the examples from the Description. It no longer finds the incorrect annotations mentioned in the Description.
> cTAKES dictionary lookup missed word starting string bug
> --------------------------------------------------------
>
> Key: CTAKES-389
> URL: https://issues.apache.org/jira/browse/CTAKES-389
> Project: cTAKES
> Issue Type: Bug
> Components: ctakes-dictionary-lookup-fast
> Affects Versions: 3.2.2, 3.2.3
> Environment: All environments
> Reporter: Tomasz Oliwa
> Assignee: Sean Finan
> Fix For: 3.2.3
>
>
> cTAKES has a bug in its fast dictionary lookup.
> "baby to" , "baby too" gets looked up as C1305907 of "baby tooth", however "baby token" does not match it.
> "electrolyte le", "electrolyte lev" gets found as C0428284 "electrolyte level", but "electrolyte dev" does not match.
> It seems if the "missed" word contains the same characters that the word found in the fast dictionary starts with, a match is made.
> This is a bug.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)