Posted to notifications@ctakes.apache.org by "ASF subversion and git services (JIRA)" <ji...@apache.org> on 2015/11/16 18:22:11 UTC

[jira] [Commented] (CTAKES-389) cTAKES dictionary lookup missed word starting string bug

    [ https://issues.apache.org/jira/browse/CTAKES-389?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15006948#comment-15006948 ] 

ASF subversion and git services commented on CTAKES-389:
--------------------------------------------------------

Commit 1714634 from [~seanfinan] in branch 'ctakes/trunk'
[ https://svn.apache.org/r1714634 ]

CTAKES-389 : fix for erroneous terms returned when last token is a partial match

> cTAKES dictionary lookup missed word starting string bug
> --------------------------------------------------------
>
>                 Key: CTAKES-389
>                 URL: https://issues.apache.org/jira/browse/CTAKES-389
>             Project: cTAKES
>          Issue Type: Bug
>          Components: ctakes-dictionary-lookup-fast
>    Affects Versions: 3.2.2, 3.2.3
>         Environment: All environments
>            Reporter: Tomasz Oliwa
>
> cTAKES has a bug in its fast dictionary lookup.
> "baby to" and "baby too" are looked up as C1305907 ("baby tooth"), whereas "baby token" does not match it.
> "electrolyte le" and "electrolyte lev" are found as C0428284 ("electrolyte level"), whereas "electrolyte dev" does not match.
> It seems that a match is made whenever the "missed" word consists of the same characters that the word found in the fast dictionary starts with.
> This is a bug.
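
The reported behaviour can be reproduced with a minimal sketch. This is not the cTAKES code, and nothing here is taken from commit 1714634. The class and method names (PartialMatchSketch, prefixMatch, tokenMatch) are hypothetical, and the character-prefix comparison is only an assumption about how such a symptom could arise. The sketch contrasts that comparison with a token-wise one that requires the last token to match in full.

```java
import java.util.Arrays;
import java.util.List;

public class PartialMatchSketch {

    // Hypothetical faulty comparison (an assumption, not cTAKES code):
    // the text is accepted if it is a character prefix of the dictionary
    // term, so a truncated last token such as "to" or "too" still passes.
    static boolean prefixMatch(String term, String text) {
        return term.startsWith(text);
    }

    // Token-wise comparison: every token, including the last one,
    // must be equal to the corresponding token of the dictionary term.
    static boolean tokenMatch(String term, String text) {
        List<String> termTokens = Arrays.asList(term.split(" "));
        List<String> textTokens = Arrays.asList(text.split(" "));
        return termTokens.equals(textTokens);
    }

    public static void main(String[] args) {
        String term = "baby tooth";
        String[] texts = {"baby to", "baby too", "baby token", "baby tooth"};
        for (String text : texts) {
            System.out.println("\"" + text + "\""
                    + " prefix=" + prefixMatch(term, text)
                    + " token=" + tokenMatch(term, text));
        }
    }
}
```

Under these assumptions, prefixMatch accepts "baby to" and "baby too" but rejects "baby token", which mirrors the examples in the report, while tokenMatch accepts only the full term "baby tooth".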



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)