Posted to j-users@xerces.apache.org by Jo...@yestelevision.com on 2001/11/21 10:55:33 UTC

Re: problem with iso-8859-1 ...

It may be that the parser does not check the characters against the
declared charset at all, and that the encoding you declare simply determines
how the parser interprets the bytes it reads.
Have you tried printing the text content of your <document> element?
If the characters do not come out as you entered them, then this is probably
the case.
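
A minimal sketch of that check, under some loudly stated assumptions: the file was really saved as ISO-8859-15, the JDK's bundled JAXP SAX parser stands in for Xerces-J 1.4.3, and validation is left off because the sample has no DOCTYPE. The class and method names (DeclaredEncodingDemo, parseText, sampleBytes) are hypothetical. It prints the code points the parser hands to characters(), which should show whether the bytes were silently read as ISO-8859-1 characters.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.Charset;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.helpers.DefaultHandler;

public class DeclaredEncodingDemo {

    // Hypothetical helper: parse the bytes and return all character data.
    static String parseText(byte[] xmlBytes) {
        final StringBuilder text = new StringBuilder();
        DefaultHandler handler = new DefaultHandler() {
            @Override
            public void characters(char[] ch, int start, int length) {
                text.append(ch, start, length);
            }
        };
        try {
            SAXParserFactory.newInstance().newSAXParser()
                    .parse(new ByteArrayInputStream(xmlBytes), handler);
        } catch (Exception e) {
            throw new IllegalStateException("parse failed", e);
        }
        return text.toString();
    }

    // The sample document, declared as ISO-8859-1 but (by assumption)
    // saved as ISO-8859-15. Unicode escapes keep this source ASCII-only.
    static byte[] sampleBytes() {
        String xml =
            "<?xml version='1.0' encoding='ISO-8859-1' standalone=\"yes\"?>\n"
            + "<document>\u20AC \u0160, \u0161, \u017D, \u017E, "
            + "\u0152, \u0153, and \u0178.</document>";
        return xml.getBytes(Charset.forName("ISO-8859-15"));
    }

    public static void main(String[] args) {
        String text = parseText(sampleBytes());
        // Print code points rather than glyphs, so the console encoding
        // cannot distort the result.
        for (int i = 0; i < text.length(); i++) {
            System.out.printf("U+%04X ", (int) text.charAt(i));
        }
        System.out.println();
    }
}
```

Every byte value has a character assigned in ISO-8859-1, so a byte written as the ISO-8859-15 euro sign (0xA4) simply decodes as the currency sign U+00A4. The parser has nothing to reject, which would explain why the ErrorHandler stays silent.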


                                                                                                                                  
Christophe Prévost <cprevost@grouperf.com>
21/11/2001 10:45
Please respond to xerces-j-user

To:      "Xerces-J" <xe...@xml.apache.org>
cc:
Subject: problem with iso-8859-1 ...




Hello everybody,

I use Xerces to parse documents via SAX. I have an ErrorHandler, and my
parser is configured like this:

XMLReader parser = XMLReaderFactory.createXMLReader(
"org.apache.xerces.parsers.SAXParser" );
parser.setErrorHandler( errHandler );
parser.setFeature( "http://xml.org/sax/features/validation", true );
parser.setFeature( "http://xml.org/sax/features/namespaces", false );

When I parse a document like this one, the Xerces ErrorHandler reports no error:
<?xml version='1.0' encoding='ISO-8859-1' standalone="yes"?>
<document>€ Š, š, Ž, ž, Œ, œ, and Ÿ.</document>
However, none of those characters belong to the ISO-8859-1 charmap; they
belong to ISO-8859-15!
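
The charmap claim can be checked independently of any XML parser. The following is only a sketch using the JDK's CharsetEncoder. The class and method names (CharmapCheck, unencodable) are hypothetical, and it assumes the JDK ships the ISO-8859-15 charset, which it normally does.

```java
import java.nio.charset.Charset;
import java.nio.charset.CharsetEncoder;
import java.nio.charset.StandardCharsets;

public class CharmapCheck {

    // The text of the <document> element, written with Unicode escapes
    // so this source file stays ASCII-only.
    static final String SAMPLE =
        "\u20AC \u0160, \u0161, \u017D, \u017E, \u0152, \u0153, and \u0178.";

    // Hypothetical helper: return the characters of 'text' that the
    // given charset has no mapping for.
    static String unencodable(String text, Charset charset) {
        CharsetEncoder encoder = charset.newEncoder();
        StringBuilder missing = new StringBuilder();
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            if (!encoder.canEncode(c)) {
                missing.append(c);
            }
        }
        return missing.toString();
    }

    public static void main(String[] args) {
        String notLatin1 = unencodable(SAMPLE, StandardCharsets.ISO_8859_1);
        String notLatin9 = unencodable(SAMPLE, Charset.forName("ISO-8859-15"));
        System.out.println("missing from ISO-8859-1:  " + notLatin1.length());
        System.out.println("missing from ISO-8859-15: " + notLatin9.length());
    }
}
```

This only tells you which characters a charset can represent. It says nothing about what a parser does with the bytes once the file claims to be ISO-8859-1.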

Maybe I've done something wrong or forgotten something... Can you help me?
I can't see what I'm missing.

Thanks

PS1: using Xerces-J 1.4.3
PS2: there are no entities in the DTD






