Posted to notifications@ctakes.apache.org by "Tomasz Oliwa (JIRA)" <ji...@apache.org> on 2015/11/16 22:06:11 UTC

[jira] [Closed] (CTAKES-389) cTAKES dictionary lookup missed word starting string bug

     [ https://issues.apache.org/jira/browse/CTAKES-389?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Tomasz Oliwa closed CTAKES-389.
-------------------------------

Checked out 'ctakes/trunk' with this fix and ran it on the examples from the Description. It no longer produces the incorrect annotations mentioned there.

> cTAKES dictionary lookup missed word starting string bug
> --------------------------------------------------------
>
>                 Key: CTAKES-389
>                 URL: https://issues.apache.org/jira/browse/CTAKES-389
>             Project: cTAKES
>          Issue Type: Bug
>          Components: ctakes-dictionary-lookup-fast
>    Affects Versions: 3.2.2, 3.2.3
>         Environment: All environments
>            Reporter: Tomasz Oliwa
>            Assignee: Sean Finan
>             Fix For: 3.2.3
>
>
> cTAKES has a bug in its fast dictionary lookup.
> "baby to" and "baby too" are both looked up as C1305907 ("baby tooth"), whereas "baby token" is not matched.
> "electrolyte le" and "electrolyte lev" are both found as C0428284 ("electrolyte level"), whereas "electrolyte dev" is not matched.
> It seems that a match is made whenever the "missed" word consists of the same characters that the corresponding word of the dictionary term starts with, i.e. whenever it is a prefix of that word.
> This is a bug.
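
The reported symptom can be reproduced with a small sketch. This is not the cTAKES code: the function names (`buggy_lookup`, `exact_lookup`) and the whitespace tokenization are hypothetical, and the assumption that only the last word is compared by prefix is a guess based on the examples in the Description. The two dictionary entries and their CUIs are taken from the Description.

```python
# Illustrative sketch only; not the cTAKES implementation.
# Dictionary terms and CUIs are the two examples from the Description.
DICTIONARY = {
    "baby tooth": "C1305907",
    "electrolyte level": "C0428284",
}


def tokenize(text):
    """Lowercase and split on whitespace (an assumed, simplified tokenizer)."""
    return text.lower().split()


def buggy_lookup(text):
    """Mimic the reported symptom: the last word only needs to be a
    prefix of the last word of the dictionary term (assumed behavior)."""
    words = tokenize(text)
    for term, cui in DICTIONARY.items():
        term_words = tokenize(term)
        if len(words) != len(term_words):
            continue
        # All words except the last must be equal.
        if words[:-1] != term_words[:-1]:
            continue
        # Suspected bug: prefix comparison instead of equality.
        if term_words[-1].startswith(words[-1]):
            return cui
    return None


def exact_lookup(text):
    """Expected behavior: every word must equal the dictionary word."""
    words = tokenize(text)
    for term, cui in DICTIONARY.items():
        if words == tokenize(term):
            return cui
    return None


if __name__ == "__main__":
    for phrase in ["baby to", "baby too", "baby token", "baby tooth",
                   "electrolyte le", "electrolyte lev", "electrolyte dev"]:
        print(phrase, buggy_lookup(phrase), exact_lookup(phrase))
```

Under these assumptions, `buggy_lookup` returns a CUI for "baby to" and "electrolyte lev" but not for "baby token" or "electrolyte dev", which is the pattern described above, while `exact_lookup` only matches the complete terms.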



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)