You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@cocoon.apache.org by "Alexander Klimetschek (JIRA)" <ji...@apache.org> on 2007/05/10 11:20:15 UTC

[jira] Commented: (COCOON-2063) NekoHTMLTransformer needs to set the default-encoding of the current system to work properly with UTF-8

    [ https://issues.apache.org/jira/browse/COCOON-2063?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12494682 ] 

Alexander Klimetschek commented on COCOON-2063:
-----------------------------------------------

I forgot to mention that if someone wants to override this property via the configuration of the NekoHTMLTransformer, he can certainly do it. The manual config is applied after the dynamic setting of the encoding property, thus the manual one overrides the dynamic one.

> NekoHTMLTransformer needs to set the default-encoding of the current system to work properly with UTF-8
> -------------------------------------------------------------------------------------------------------
>
>                 Key: COCOON-2063
>                 URL: https://issues.apache.org/jira/browse/COCOON-2063
>             Project: Cocoon
>          Issue Type: Bug
>          Components: Blocks: HTML
>    Affects Versions: 2.2-dev (Current SVN)
>            Reporter: Alexander Klimetschek
>         Attachments: nekohtmltransformer-encoding.patch
>
>
> The NekoHTMLTransformer uses the cyberneko HTMLConfiguration for tidying html. Unfortunately it does not use the system's current encoding as default, instead you have to set a property to set your encoding. But this varies from one OS to another, so the best solution is to set this property automatically in the NekoHTMLTransformer depending on what Java uses as defaultCharset:
>             config.setProperty("http://cyberneko.org/html/properties/default-encoding", Charset.defaultCharset().name());

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.