Posted to notifications@ctakes.apache.org by "Sean Mullane (JIRA)" <ji...@apache.org> on 2018/05/07 19:10:00 UTC

[jira] [Created] (CTAKES-508) HTMLTextWriter encodes some characters incorrectly in Firefox

Sean Mullane created CTAKES-508:
-----------------------------------

             Summary: HTMLTextWriter encodes some characters incorrectly in Firefox
                 Key: CTAKES-508
                 URL: https://issues.apache.org/jira/browse/CTAKES-508
             Project: cTAKES
          Issue Type: Bug
          Components: ctakes-core
    Affects Versions: 4.0.0
            Reporter: Sean Mullane


Some documents written by the HTML pretty text writer (org.apache.ctakes.core.cc.pretty.html.HtmlTextWriter) display in Firefox 59.0.3 with an A-with-circumflex character (Â) where there should be a non-breaking space (&nbsp;), as described in the StackOverflow post below (in an unrelated context):

[https://stackoverflow.com/questions/1461907]


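This seems to be because the writer implicitly uses UTF-8 but does not declare an encoding in the HTML file. Firefox then falls back to its Western default rather than UTF-8 and decodes multi-byte characters incorrectly: a non-breaking space is the two bytes 0xC2 0xA0 in UTF-8, and the first of those bytes is displayed as Â.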
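
The effect can be reproduced outside the browser. The sketch below is not cTAKES code; the class and method names are made up for illustration, and it assumes that Firefox's "Western" setting corresponds to windows-1252. It encodes a string as UTF-8 and then decodes the same bytes as windows-1252.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

// Hypothetical demo class, not part of cTAKES.
class MojibakeDemo {

    // Encode the text as UTF-8, then decode those bytes as windows-1252,
    // mimicking a browser that was never told the file is UTF-8.
    static String misread(String text) {
        byte[] utf8 = text.getBytes(StandardCharsets.UTF_8);
        return new String(utf8, Charset.forName("windows-1252"));
    }

    public static void main(String[] args) {
        // "a", a non-breaking space (U+00A0), "b"
        String shown = misread("a\u00A0b");
        // Print the code points so the invisible non-breaking space is visible.
        for (char c : shown.toCharArray()) {
            System.out.printf("U+%04X ", (int) c);
        }
        System.out.println();
    }
}
```

The single non-breaking space comes back as two characters, U+00C2 (Â) followed by U+00A0, which matches the symptom reported above.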

This could be solved by having the writer add a META tag that declares UTF-8 encoding (for example, <meta charset="UTF-8"> in the document head) to every HTML document it writes.
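
A minimal sketch of what such a change might look like follows. It is not the actual HtmlTextWriter code: the class HtmlHeaderSketch and its methods are hypothetical, and the real writer's structure may differ. It shows two things together: emitting the META tag first in the head, and opening the output file with an explicit UTF-8 charset so the declaration and the bytes on disk agree.

```java
import java.io.BufferedWriter;
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;

// Hypothetical sketch, not the cTAKES implementation.
class HtmlHeaderSketch {

    // Build the document head with the charset declaration as its first child,
    // so that it appears early enough for the browser's encoding scan.
    // The title is assumed to be already HTML-escaped.
    static String header(String title) {
        return "<!DOCTYPE html>\n"
             + "<html>\n"
             + "<head>\n"
             + "<meta charset=\"UTF-8\">\n"
             + "<title>" + title + "</title>\n"
             + "</head>\n";
    }

    // Write a complete document, naming UTF-8 explicitly rather than
    // relying on any default encoding.
    static void writeDocument(Path file, String title, String bodyHtml) throws IOException {
        try (BufferedWriter writer = Files.newBufferedWriter(file, StandardCharsets.UTF_8)) {
            writer.write(header(title));
            writer.write("<body>\n" + bodyHtml + "\n</body>\n</html>\n");
        }
    }
}
```

Calling HtmlHeaderSketch.writeDocument(path, "note", "a&nbsp;b") would produce a file that Firefox should read as UTF-8 regardless of its default encoding setting.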



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)