You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@rave.apache.org by "Carlucci, Tony" <ac...@mitre.org> on 2012/07/25 20:54:34 UTC

[DISCUSS] Changing Rave charset to UTF-8

Hi Rave Devs,

Is there any reason why we *shouldn't* change the Rave code base to serve out UTF-8 character sets instead of ISO-8859-1?

Thanks, Tony

Re: [DISCUSS] Changing Rave charset to UTF-8

Posted by Jasha Joachimsthal <ja...@apache.org>.
On 25 July 2012 21:45, Carlucci, Tony <ac...@mitre.org> wrote:

> >-----Original Message-----
> >From: jashaj@gmail.com [mailto:jashaj@gmail.com] On Behalf Of Jasha
> >Joachimsthal
> >Sent: Wednesday, July 25, 2012 3:09 PM
> >To: dev@rave.apache.org
> >Subject: Re: [DISCUSS] Changing Rave charset to UTF-8
> >
> >On 25 July 2012 20:54, Carlucci, Tony <ac...@mitre.org> wrote:
> >
> >> Hi Rave Devs,
> >>
> >> Is there any reason why we *shouldn't* change the Rave code base to
> serve
> >> out UTF-8 character sets instead of ISO-8859-1?
> >>
> >> Thanks, Tony
> >>
> >
> >Because you may run into issues when doing form submits and url's with
> >query strings. If you change the encoding, please do check that all the
> >form submits (both POST and GET) and AJAX calls are treated correctly with
> >diacritics. English doesn't use them a lot, but other (European) languages
> >do.
> >The default encoding for servlet requests and responses seems to be
> >ISO-8859-1: http://wiki.apache.org/tomcat/FAQ/CharacterEncoding
> >
> >Is something going wrong with encoding at the moment?
> >
> >Jasha
>
> No, nothing is wrong, just something I noticed while trying to debug our
> internal Apache/Tomcat issues.  Our Tomcat connector is configured for
> UTF-8 so I thought there might have been a conflict with the default
> ISO-8859-1 encoding we are using.  Changing it didn't solve our problem,
> but never the less I didn't know if we should be serving UTF-8 or not
> out-of-the-box to support non-Latin based charsets?
>
> Perhaps instead of forcibly changing it to UTF-8, we could make it a
> configurable property?
>

Finding out what goes wrong with encoding is always a PITA. It can be
configured in many levels of the request handling: the httpd server, Tomcat
connector, in a servlet or Spring's CharacterEncodingFilter, in the JSP (I
may have forgotten a layer or 2). Then the database may or may not handle
the data correctly.
UTF-8 may be a better choice for non-European character sets, but it should
be tested carefully.

RE: [DISCUSS] Changing Rave charset to UTF-8

Posted by "Carlucci, Tony" <ac...@mitre.org>.
>-----Original Message-----
>From: jashaj@gmail.com [mailto:jashaj@gmail.com] On Behalf Of Jasha
>Joachimsthal
>Sent: Wednesday, July 25, 2012 3:09 PM
>To: dev@rave.apache.org
>Subject: Re: [DISCUSS] Changing Rave charset to UTF-8
>
>On 25 July 2012 20:54, Carlucci, Tony <ac...@mitre.org> wrote:
>
>> Hi Rave Devs,
>>
>> Is there any reason why we *shouldn't* change the Rave code base to serve
>> out UTF-8 character sets instead of ISO-8859-1?
>>
>> Thanks, Tony
>>
>
>Because you may run into issues when doing form submits and url's with
>query strings. If you change the encoding, please do check that all the
>form submits (both POST and GET) and AJAX calls are treated correctly with
>diacritics. English doesn't use them a lot, but other (European) languages
>do.
>The default encoding for servlet requests and responses seems to be
>ISO-8859-1: http://wiki.apache.org/tomcat/FAQ/CharacterEncoding
>
>Is something going wrong with encoding at the moment?
>
>Jasha

No, nothing is wrong, just something I noticed while trying to debug our internal Apache/Tomcat issues.  Our Tomcat connector is configured for UTF-8 so I thought there might have been a conflict with the default ISO-8859-1 encoding we are using.  Changing it didn't solve our problem, but never the less I didn't know if we should be serving UTF-8 or not out-of-the-box to support non-Latin based charsets?

Perhaps instead of forcibly changing it to UTF-8, we could make it a configurable property?

Re: [DISCUSS] Changing Rave charset to UTF-8

Posted by Jasha Joachimsthal <ja...@apache.org>.
On 25 July 2012 20:54, Carlucci, Tony <ac...@mitre.org> wrote:

> Hi Rave Devs,
>
> Is there any reason why we *shouldn't* change the Rave code base to serve
> out UTF-8 character sets instead of ISO-8859-1?
>
> Thanks, Tony
>

Because you may run into issues when doing form submits and url's with
query strings. If you change the encoding, please do check that all the
form submits (both POST and GET) and AJAX calls are treated correctly with
diacritics. English doesn't use them a lot, but other (European) languages
do.
The default encoding for servlet requests and responses seems to be
ISO-8859-1: http://wiki.apache.org/tomcat/FAQ/CharacterEncoding

Is something going wrong with encoding at the moment?

Jasha