You are viewing a plain text version of this content. The canonical link for it is here.
Posted to c-dev@xerces.apache.org by Guillaume Bataille <gb...@irisa.fr> on 2001/10/17 16:50:16 UTC

French accents

I parse an xml file with french accents under win2000, vc6 and xerces
c++.
I use an IDOMParser.
When I extract words containing accents from the tree, accentuated
characters appear to have been transformed.
How can I fix that ?
It seems that it's a transcoder problem.
There is no setLocale() method for IDOMParser, maybe an alternative
solution.
How can I parameterize my system to choose the output encoding ?
--
Guillaume Bataille



---------------------------------------------------------------------
To unsubscribe, e-mail: xerces-c-dev-unsubscribe@xml.apache.org
For additional commands, e-mail: xerces-c-dev-help@xml.apache.org


Re: French accents

Posted by Curt Arnold <ca...@houston.rr.com>.
My guess is the source document is encoded in ISO-8859-1, but does not have
an XML declaration so it is interpreted as UTF-8.  Add a:

<?xml encoding="ISO-8859-1"?>

XML documents are not affected by the locale of the processor, they contain
their encoding information so that a document written in France doesn't
magically get converted to Japanese, Thai or whatever the code page used by
the reader.


---------------------------------------------------------------------
To unsubscribe, e-mail: xerces-c-dev-unsubscribe@xml.apache.org
For additional commands, e-mail: xerces-c-dev-help@xml.apache.org