You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@maven.apache.org by "Gisbert Amm (JIRA)" <ji...@codehaus.org> on 2005/11/16 10:42:06 UTC

[jira] Created: (MPXDOC-184) Encoding problem: Non-ASCII characters like German umlauts are not transcoded properly.

Encoding problem: Non-ASCII characters like German umlauts are not transcoded properly.
---------------------------------------------------------------------------------------

         Key: MPXDOC-184
         URL: http://jira.codehaus.org/browse/MPXDOC-184
     Project: maven-xdoc-plugin
        Type: Bug
 Environment: Maven 1.1 beta 1
SuSE Linux 9.1
    Reporter: Gisbert Amm


Posting to the maven-users list:

We face encoding problems with the Maven-Reports. The output is declared to be ISO-8859-1 (Latin 1), but it actually isn't (neither the XML nor the HTML). Non-ASCII characters like German umlauts are not transcoded properly.

I noticed the same problem in the docs of the official maven tasklist plugin at http://maven.apache.org/maven-1.x/reference/plugins/tasklist/checkstyle/, e.g.: La variable 'destDir' devrait être privée et avoir des accesseurs.

When I set maven.docs.outputencoding to UTF-8 it is even worse: The transformation then makes apparently two wrong bytes out of every single byte of the aready erroneous transcoded characters.

I've not found any hint in the docs or in Jira so far; can somebody please shed some light on it for me?

I'm using Maven 1.1 beta 1.

Regards,
Gisbert Amm 

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://jira.codehaus.org/secure/Administrators.jspa
-
For more information on JIRA, see:
   http://www.atlassian.com/software/jira


---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@maven.apache.org
For additional commands, e-mail: dev-help@maven.apache.org


[jira] Commented: (MPXDOC-184) Encoding problem: Non-ASCII characters like German umlauts are not transcoded properly.

Posted by "Lukas Theussl (JIRA)" <ji...@codehaus.org>.
    [ http://jira.codehaus.org/browse/MPXDOC-184?page=comments#action_51148 ] 

Lukas Theussl commented on MPXDOC-184:
--------------------------------------

For javadoc, did you try to set the maven.javadoc.locale property? If this works then we should set the default to maven.xdoc.locale.default.

In the jxr plugin, the maven.compile.encoding property from the java plugin is used (I think this is undocumented :( ), can you try if this helps?

Any other reports you have troubles with?

> Encoding problem: Non-ASCII characters like German umlauts are not transcoded properly.
> ---------------------------------------------------------------------------------------
>
>          Key: MPXDOC-184
>          URL: http://jira.codehaus.org/browse/MPXDOC-184
>      Project: maven-xdoc-plugin
>         Type: Bug
>  Environment: Maven 1.1 beta 1
> SuSE Linux 9.1
>     Reporter: Gisbert Amm

>
>
> Posting to the maven-users list:
> We face encoding problems with the Maven-Reports. The output is declared to be ISO-8859-1 (Latin 1), but it actually isn't (neither the XML nor the HTML). Non-ASCII characters like German umlauts are not transcoded properly.
> I noticed the same problem in the docs of the official maven tasklist plugin at http://maven.apache.org/maven-1.x/reference/plugins/tasklist/checkstyle/, e.g.: La variable 'destDir' devrait être privée et avoir des accesseurs.
> When I set maven.docs.outputencoding to UTF-8 it is even worse: The transformation then makes apparently two wrong bytes out of every single byte of the aready erroneous transcoded characters.
> I've not found any hint in the docs or in Jira so far; can somebody please shed some light on it for me?
> I'm using Maven 1.1 beta 1.
> Regards,
> Gisbert Amm 

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://jira.codehaus.org/secure/Administrators.jspa
-
For more information on JIRA, see:
   http://www.atlassian.com/software/jira


---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@maven.apache.org
For additional commands, e-mail: dev-help@maven.apache.org


[jira] Closed: (MPXDOC-184) Encoding problem: Non-ASCII characters like German umlauts are not transcoded properly.

Posted by "Lukas Theussl (JIRA)" <ji...@codehaus.org>.
     [ http://jira.codehaus.org/browse/MPXDOC-184?page=all ]
     
Lukas Theussl closed MPXDOC-184:
--------------------------------

    Resolution: Won't Fix

Closing as won't fix since it 's not an xdoc specific issue.Please open an issue in the respective plugins if you still have problems with certain reports.

> Encoding problem: Non-ASCII characters like German umlauts are not transcoded properly.
> ---------------------------------------------------------------------------------------
>
>          Key: MPXDOC-184
>          URL: http://jira.codehaus.org/browse/MPXDOC-184
>      Project: maven-xdoc-plugin
>         Type: Bug
>  Environment: Maven 1.1 beta 1
> SuSE Linux 9.1
>     Reporter: Gisbert Amm

>
>
> Posting to the maven-users list:
> We face encoding problems with the Maven-Reports. The output is declared to be ISO-8859-1 (Latin 1), but it actually isn't (neither the XML nor the HTML). Non-ASCII characters like German umlauts are not transcoded properly.
> I noticed the same problem in the docs of the official maven tasklist plugin at http://maven.apache.org/maven-1.x/reference/plugins/tasklist/checkstyle/, e.g.: La variable 'destDir' devrait être privée et avoir des accesseurs.
> When I set maven.docs.outputencoding to UTF-8 it is even worse: The transformation then makes apparently two wrong bytes out of every single byte of the aready erroneous transcoded characters.
> I've not found any hint in the docs or in Jira so far; can somebody please shed some light on it for me?
> I'm using Maven 1.1 beta 1.
> Regards,
> Gisbert Amm 

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://jira.codehaus.org/secure/Administrators.jspa
-
For more information on JIRA, see:
   http://www.atlassian.com/software/jira


---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@maven.apache.org
For additional commands, e-mail: dev-help@maven.apache.org


[jira] Commented: (MPXDOC-184) Encoding problem: Non-ASCII characters like German umlauts are not transcoded properly.

Posted by "Gisbert Amm (JIRA)" <ji...@codehaus.org>.
    [ http://jira.codehaus.org/browse/MPXDOC-184?page=comments#action_51112 ] 

Gisbert Amm commented on MPXDOC-184:
------------------------------------

Follow-Up on the mailing list:

I finally found something I consider a workaround for the checkstyle report (I already use the 3.0-SNAPSHOT of the plugin) and the xdocs:

I'v changed the locale settings on my SuSE 9.1 system like that

export LANG=de_DE.utf8
export LC_CTYPE=de_DE.utf8

(setting of LC_CTYPE was necessary aswell)

and set the following Maven properties:

maven.docs.outputencoding=UTF-8
maven.xdoc.locale.default=de_DE.utf8

Now the German umlauts are displayed correctly within the checkstyle report and the documents generated out of the xdocs (like index.xml). However, other reports like Javadoc, Source Xref and Test Xref still suffer from wrong encoded characters.

Regards,
Gisbert Amm

Lukas Theussl wrote:
> On Linux you also have to set the system encoding explicitly,
> see http://maven.apache.org/maven-1.x/reference/plugins/xdoc/faq.html#problems-unicode
> Arnaud HERITIER wrote:
>
>> For checkstyle it fixes only the locale.
>> It doesn't help for the encoding :-(
>>
>>> -----Message d'origine-----
>>> De : Lukas Theussl [mailto:ltheussl@apache.org] Envoyé : mercredi 16 novembre 2005 00:03
>>>
>>> For the checkstyle plugin this should be fixed in the current 3.0-SNAPSHOT (see http://jira.codehaus.org/browse/MPCHECKSTYLE-39 ), but it might still be a problem for other plugins.
>>>
>>> Gisbert, which plugins/reports do you have problems with? 

> Encoding problem: Non-ASCII characters like German umlauts are not transcoded properly.
> ---------------------------------------------------------------------------------------
>
>          Key: MPXDOC-184
>          URL: http://jira.codehaus.org/browse/MPXDOC-184
>      Project: maven-xdoc-plugin
>         Type: Bug
>  Environment: Maven 1.1 beta 1
> SuSE Linux 9.1
>     Reporter: Gisbert Amm

>
>
> Posting to the maven-users list:
> We face encoding problems with the Maven-Reports. The output is declared to be ISO-8859-1 (Latin 1), but it actually isn't (neither the XML nor the HTML). Non-ASCII characters like German umlauts are not transcoded properly.
> I noticed the same problem in the docs of the official maven tasklist plugin at http://maven.apache.org/maven-1.x/reference/plugins/tasklist/checkstyle/, e.g.: La variable 'destDir' devrait être privée et avoir des accesseurs.
> When I set maven.docs.outputencoding to UTF-8 it is even worse: The transformation then makes apparently two wrong bytes out of every single byte of the aready erroneous transcoded characters.
> I've not found any hint in the docs or in Jira so far; can somebody please shed some light on it for me?
> I'm using Maven 1.1 beta 1.
> Regards,
> Gisbert Amm 

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://jira.codehaus.org/secure/Administrators.jspa
-
For more information on JIRA, see:
   http://www.atlassian.com/software/jira


---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@maven.apache.org
For additional commands, e-mail: dev-help@maven.apache.org