Posted to c-dev@xerces.apache.org by bu...@apache.org on 2001/04/19 15:50:42 UTC

[Bug 1393] New - Converting from Unicode to iso8859

http://nagoya.apache.org/bugzilla/show_bug.cgi?id=1393

*** shadow/1393	Thu Apr 19 06:50:42 2001
--- shadow/1393.tmp.28419	Thu Apr 19 06:50:42 2001
***************
*** 0 ****
--- 1,19 ----
+ +============================================================================+
+ | Converting from Unicode to iso8859                                         |
+ +----------------------------------------------------------------------------+
+ |        Bug #: 1393                        Product: Xerces-C                |
+ |       Status: NEW                         Version: 1.4                     |
+ |   Resolution:                            Platform: PC                      |
+ |     Severity: Normal                   OS/Version: All                     |
+ |     Priority: Low                       Component: Utilities               |
+ +----------------------------------------------------------------------------+
+ |  Assigned To: xerces-c-dev@xml.apache.org                                  |
+ |  Reported By: dherr@mapquest.com                                           |
+ |      CC list: Cc:                                                          |
+ +----------------------------------------------------------------------------+
+ |          URL:                                                              |
+ +============================================================================+
+ |                              DESCRIPTION                                   |
+ This applies to XERCES-C-SRC1_4_0\SRC\util\xml88591transcoder.cpp, line 154.
+ I'm not very familiar with the Unicode char set, but I'm guessing that the 
+ 0x256 should probably be either 256 or 0x100.
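
To illustrate the report above: ISO-8859-1 covers code points 0x00 through 0xFF, so a range check against it needs an exclusive upper bound of 256 (0x100), whereas the literal 0x256 is decimal 598. The following is only a minimal sketch of such a check, not the Xerces-C source; the function name toLatin1, the constant kLatin1Limit, and the '?' replacement character are assumptions made for illustration.

```cpp
#include <string>

// Hypothetical helper, NOT the Xerces-C transcoder: narrows UTF-16 code
// units to ISO-8859-1 bytes and substitutes a replacement character for
// anything outside the Latin-1 range.
std::string toLatin1(const std::u16string& src, char replacement = '?')
{
    // ISO-8859-1 covers 0x00..0xFF, so the exclusive upper bound is
    // 0x100 (decimal 256). The literal 0x256 is decimal 598.
    constexpr char16_t kLatin1Limit = 0x100;
    static_assert(0x256 == 598, "0x256 is hexadecimal, not decimal 256");

    std::string out;
    out.reserve(src.size());
    for (char16_t ch : src) {
        if (ch < kLatin1Limit) {
            // Safe to narrow: the value fits in a single byte.
            out.push_back(static_cast<char>(static_cast<unsigned char>(ch)));
        } else {
            out.push_back(replacement);
        }
    }
    return out;
}
```

With a bound of 0x256, code units from 0x100 through 0x255 would pass the check and be truncated to a single byte instead of being treated as unrepresentable.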

---------------------------------------------------------------------
To unsubscribe, e-mail: xerces-c-dev-unsubscribe@xml.apache.org
For additional commands, e-mail: xerces-c-dev-help@xml.apache.org


Re: [Bug 1393] New - Converting from Unicode to iso8859

Posted by Aaron Kaplan <ka...@xsoft.at>.

Ah, this reminds me of a bug/feature which I noticed recently: when I feed
the parser (DOM) a document declared as UTF-16 which is actually, let's say,
ASCII or ISO-8859-1 (i.e. no 16-bit characters), it core dumps.
In other words: if the < ... ENCODING= > part _says_ it's in UTF-16 but in
fact it is not, then we are in trouble.


This is a bit of a denial-of-service (DoS) problem for me, since I use Xerces
from within an Apache module, and then the module / Apache process dies.

Is this an easy error to fix, or is it sort of inherent to multibyte
characters?

aaron.
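
As a stopgap for the situation described above, a caller could reject obviously mislabeled input before handing it to the parser. The sketch below is an assumption-laden illustration, not a Xerces-C API and not a fix for the crash itself: the function name plausiblyUtf16 is hypothetical, and it relies only on the XML 1.0 rule that entities encoded in UTF-16 begin with a byte order mark, plus the fact that UTF-16 data has an even byte length.

```cpp
#include <string>

// Hypothetical pre-check, NOT part of Xerces-C: returns true only if the
// raw bytes could plausibly be UTF-16, i.e. they have an even length and
// start with a byte order mark.
bool plausiblyUtf16(const std::string& bytes)
{
    if (bytes.size() < 2 || bytes.size() % 2 != 0) {
        return false;
    }
    const unsigned char b0 = static_cast<unsigned char>(bytes[0]);
    const unsigned char b1 = static_cast<unsigned char>(bytes[1]);
    // 0xFE 0xFF marks big-endian, 0xFF 0xFE marks little-endian.
    return (b0 == 0xFE && b1 == 0xFF) || (b0 == 0xFF && b1 == 0xFE);
}
```

An ASCII or ISO-8859-1 document whose declaration merely claims UTF-16 starts with the bytes for "<?", so it fails this check. Input labeled with an explicit byte order, such as UTF-16LE, may legitimately omit the mark and would need separate handling.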


