You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@uima.apache.org by "Thilo Goetz (JIRA)" <ui...@incubator.apache.org> on 2007/05/29 11:44:16 UTC

[jira] Assigned: (UIMA-194) Tools highlight incorrect annotation offsets due to XML serialization bug in Sun Java 1.4.2

     [ https://issues.apache.org/jira/browse/UIMA-194?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Thilo Goetz reassigned UIMA-194:
--------------------------------

    Assignee: Thilo Goetz

> Tools highlight incorrect annotation offsets due to XML serialization bug in Sun Java 1.4.2
> -------------------------------------------------------------------------------------------
>
>                 Key: UIMA-194
>                 URL: https://issues.apache.org/jira/browse/UIMA-194
>             Project: UIMA
>          Issue Type: Bug
>          Components: Documentation, Tools
>         Environment: Sun Java 1.4.2_12
>            Reporter: Adam Lally
>            Assignee: Thilo Goetz
>            Priority: Minor
>             Fix For: 2.2
>
>
> The XML serialization support in Sun Java 1.4.2_12 doesn't serialize CR characters to XML.  As a result, if the document text contains CR characters, XCAS or XMI serialization will cause them to be lost, resulting in incorrect annotation offsets.  This is exposed in the DocumentAnalyzer, with the highlighting being incorrect if the input document contains CR characters.
> Unit test failure occurred in XCasToCasDataHandlerTest, but the test was modified so it passes.  In that test an assertEquals of two strings fails, yet the strings appear identical in the compare viewer.
> The problem does not occur in Sun Java 1.5 or later, or with IBM Java.
> Probably a documentation update is an appropriate way to address this.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.