You are viewing a plain text version of this content. The canonical link for it is here.
Posted to notifications@ctakes.apache.org by "Kean Kaufmann (JIRA)" <ji...@apache.org> on 2017/07/04 19:46:00 UTC

[jira] [Created] (CTAKES-450) CDASegmentAnnotator misses all headings after empty segment

Kean Kaufmann created CTAKES-450:
------------------------------------

             Summary: CDASegmentAnnotator misses all headings after empty segment
                 Key: CTAKES-450
                 URL: https://issues.apache.org/jira/browse/CTAKES-450
             Project: cTAKES
          Issue Type: Bug
          Components: ctakes-core
            Reporter: Kean Kaufmann
         Attachments: CDASegmentAnnotator.diff

If the CDASegmentAnnotator encounters an empty segment, it throws away everything after that in the document.  You can see this in the test document provided for TestCDASegmentAnnotator. The heading "CURRENT HEALTH STATUS" is followed immediately by the heading "Medications"; the test case misses the "Medications" heading, and "FAMILY HISTORY" after that. The sorted_segments loop is only incrementing the index variable for non-empty segments.

Patch attached.

TestCDASegmentAnnotator output before fix (with getPreferredText()):

Segment:2.16.840.1.113883.10.20.22.1.1 Begin:92 End:159: Header
Segment:1.3.6.1.4.1.19376.1.5.3.1.1.13.2.1 Begin:176 End:1612: CHIEF COMPLAINT
Segment:2.16.840.1.113883.10.20.22.2.20 Begin:1634 End:1696: HISTORY OF PAST ILLNESS
Segment:2.16.840.1.113883.10.20.22.2.2.1 Begin:1711 End:2271: History of immunizations

After fix:

Segment:2.16.840.1.113883.10.20.22.1.1 Begin:92 End:159: Header
Segment:1.3.6.1.4.1.19376.1.5.3.1.1.13.2.1 Begin:176 End:1612: CHIEF COMPLAINT
Segment:2.16.840.1.113883.10.20.22.2.20 Begin:1634 End:1696: HISTORY OF PAST ILLNESS
Segment:2.16.840.1.113883.10.20.22.2.2.1 Begin:1711 End:2271: History of immunizations
Segment:2.16.840.1.113883.10.20.22.2.1.1 Begin:2307 End:3506: HISTORY OF MEDICATION USE
Segment:2.16.840.1.113883.10.20.22.2.15 Begin:3522 End:5608: Family History




--
This message was sent by Atlassian JIRA
(v6.4.14#64029)