You are viewing a plain text version of this content. The canonical link for it is here.
Posted to c-dev@xerces.apache.org by bu...@apache.org on 2001/04/19 15:50:42 UTC
[Bug 1393] New - Converting from Unicode to iso8859
http://nagoya.apache.org/bugzilla/show_bug.cgi?id=1393
*** shadow/1393 Thu Apr 19 06:50:42 2001
--- shadow/1393.tmp.28419 Thu Apr 19 06:50:42 2001
***************
*** 0 ****
--- 1,19 ----
+ +============================================================================+
+ | Converting from Unicode to iso8859 |
+ +----------------------------------------------------------------------------+
+ | Bug #: 1393 Product: Xerces-C |
+ | Status: NEW Version: 1.4 |
+ | Resolution: Platform: PC |
+ | Severity: Normal OS/Version: All |
+ | Priority: Low Component: Utilities |
+ +----------------------------------------------------------------------------+
+ | Assigned To: xerces-c-dev@xml.apache.org |
+ | Reported By: dherr@mapquest.com |
+ | CC list: Cc: |
+ +----------------------------------------------------------------------------+
+ | URL: |
+ +============================================================================+
+ | DESCRIPTION |
+ This applies to XERCES-C-SRC1_4_0\SRC\util\xml88591transcoder.cpp, line 154.
+ I'm not very familiar with the Unicode char set, but I'm guessing that the
+ 0x256 should probably be either 256 or 0x100.
---------------------------------------------------------------------
To unsubscribe, e-mail: xerces-c-dev-unsubscribe@xml.apache.org
For additional commands, e-mail: xerces-c-dev-help@xml.apache.org
Re: [Bug 1393] New - Converting from Unicode to iso8859
Posted by Aaron Kaplan <ka...@xsoft.at>.
Ah this reminds me of a bug/feature which I noticed recently: when I feed
the parser (DOM) an UTF-16 document which is actually lets say ASCII or
ISO-8859-1 (i.e. no 16 bit) it will core dump.
In other words: if the < ... ENCODING= > part _says_ its in UTF-16 but in
fact it is not, then we are in trouble.
This is a bit of a DOS attack problem for me since I use xerces from
within an apache module and then the module / apache process dies.
Is this an easy error to fix or is this sort of inherent with multi byte
characters?
aaron.
---------------------------------------------------------------------
To unsubscribe, e-mail: xerces-c-dev-unsubscribe@xml.apache.org
For additional commands, e-mail: xerces-c-dev-help@xml.apache.org