You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@forrest.apache.org by "Sjur N. Moshagen (JIRA)" <ji...@apache.org> on 2005/10/06 18:27:48 UTC

[jira] Commented: (FOR-668) UTF-8 encoded .ihtml documents gives garbled output

    [ http://issues.apache.org/jira/browse/FOR-668?page=comments#action_12331513 ] 

Sjur N. Moshagen commented on FOR-668:
--------------------------------------

The last sentence doesn't make much sense, it should read:

The bug appears when running Forrest on MacOS X 10.4 (and 10.3) with default java (1.4.2). I have not tested it with other java versions.

The bug is not seen on (some) Linux configurations.

It appears that the HTML reader (as well as the jspwiki reader, see FOR-667) uses the Java (default) locale, irrespective of any attempts to specify otherwise. And there is no way to tell Forrest to read a (class of) file(s) using a certain encoding.

The HTML reader should obey the charset info in the header of the file.

> UTF-8 encoded .ihtml documents gives garbled output
> ---------------------------------------------------
>
>          Key: FOR-668
>          URL: http://issues.apache.org/jira/browse/FOR-668
>      Project: Forrest
>         Type: Bug
>     Versions: 0.7
>  Environment: PowerPC Linux/IBM j2se.1.4.2, x86 Linux/Sun j2se1.5
>     Reporter: Børre Gaup
>     Priority: Minor

>
> Non-ascii characters gets garbled, "á" becomes "?°", and ø becomes "??". and so on.
> It is the same phenomenon as described in FOR-667 (http://issues.apache.org/jira/browse/FOR-667), but in another setting.
> These kinds of documents work using Mac OS X with built-in java.

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
   http://www.atlassian.com/software/jira