You are viewing a plain text version of this content. The canonical link for it is here.
Posted to users@cocoon.apache.org by "Martinson, Theresa" <Th...@personneldecisions.com> on 2004/12/07 19:43:21 UTC

I18n Encoding Problems

Hello.

I am attempting to use UTF-8 for the internationalization of Asian
characters coming in as form request parameters.  Unfortunately, they
are not being handled or displayed correctly in my Cocoon application
and show up as blank.

I have read the Cocoon Wiki on Request Parameter Encoding and have
attempted the following suggestions from that document:

1.  declaration of html serializer in sitemap includes charset in the
mime-type attribute
	<map:serializer logger="sitemap.serializer.html"
mime-type="text/html; charset=UTF-8" 
		name="html" pool-grow="4" pool-max="32" pool-min="4" 
		src="org.apache.cocoon.serialization.HTMLSerializer">
	   <encoding>UTF-8</encoding>
	</map:serializer>

2.  web.xml initialization parameters   (This suggestion fixed the
original problem I was having with Spanish characters.)
	<init-param>
	      <param-name>container-encoding</param-name>
	      <param-value>ISO-8859-1</param-value>
	</init-param>

	<init-param>
	      <param-name>form-encoding</param-name>
	      <param-value>UTF-8</param-value>
	</init-param>	

3. locally overriding the form-encoding with the
SetCharacterEncodingAction

4. Bug #26997 Patch for changing the responsibilities of handling the
Mime Type


My Environment:
- Tomcat 4.1.3.0
- Cocoon 2.1.5.1
- j2sdk 1.4.1_07


I have also tried this with Tomcat 5.5.4 / Cocoon 2.1.5.1 / j2sdk 1.5.0
and get similar results for Asian characters.  However, with this setup
I gain a problem with Spanish where a bunch of characters are appended
to the end of my form request parameter (&#0;&#0;&#0;&#0;&#0;&#0;)

Any ideas or suggestions?

Thanks.

Theresa



---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscribe@cocoon.apache.org
For additional commands, e-mail: users-help@cocoon.apache.org