Posted to soap-dev@xml.apache.org by Deane Sloan <De...@equinox.co.nz> on 2001/05/01 07:35:35 UTC

RE: Does nobody care about this stuff?

>>I have gotten no response to my message from yesterday, and little from my
>>earlier messages.  Does nobody care about this stuff?  I find it hard to
>>believe that everybody is happy with a SOAP implementation that cannot
>>even transport accented Latin characters.

A starting point ...

In org.apache.soap.util.xml.Utils.cleanString(), as used from within
SOAPMappingRegistry.java (line 187 of rev. 1.14, line 239 of rev. 1.17), the
function's interpretation and recognition of "<![CDATA[" may be unwanted and
incorrect in this context.
At this point of the serialization, one should simply encode the 5
proscribed XML characters... maybe also chars >= 128? 
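
A minimal sketch of that idea, not the actual Apache SOAP code: the class
name XmlEscapeSketch, the method escape() and the escapeNonAscii flag are
all made up for illustration. It escapes &, <, >, " and ' and, if asked,
writes every character >= 128 as a numeric character reference. Nothing is
treated specially, so "<![CDATA[" is escaped like any other text.

```java
// Hypothetical helper, not part of Apache SOAP.
final class XmlEscapeSketch {
    private XmlEscapeSketch() {}

    /**
     * Escapes the five XML special characters. When escapeNonAscii is true,
     * code points >= 128 are written as hexadecimal character references.
     * This sketch does not check that each character is legal in XML.
     */
    static String escape(String s, boolean escapeNonAscii) {
        StringBuilder out = new StringBuilder(s.length() + 16);
        for (int i = 0; i < s.length(); ) {
            int cp = s.codePointAt(i);        // handles surrogate pairs
            i += Character.charCount(cp);
            switch (cp) {
                case '&':  out.append("&amp;");  break;
                case '<':  out.append("&lt;");   break;
                case '>':  out.append("&gt;");   break;
                case '"':  out.append("&quot;"); break;
                case '\'': out.append("&apos;"); break;
                default:
                    if (escapeNonAscii && cp >= 128) {
                        out.append("&#x")
                           .append(Integer.toHexString(cp).toUpperCase())
                           .append(';');
                    } else {
                        out.appendCodePoint(cp);
                    }
            }
        }
        return out.toString();
    }
}
```

With this, XmlEscapeSketch.escape("qqq<![CDATA[zzz", false) would give
"qqq&lt;![CDATA[zzz", so the receiving parser never sees a CDATA section
start.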

I've seen posts claiming that Apache SOAP doesn't correctly handle various
forms of line feed/new lines - maybe in this context the additional
behaviour of the getSafeString function should be examined...

Best Regards,

Deane Sloan


-----Original Message-----
From: Mike Spreitzer [mailto:mspreitz@us.ibm.com]
Sent: Friday, 27 April 2001 09:02
To: soap-dev@xml.apache.org
Subject: Does nobody care about this stuff?


I have gotten no response to my message from yesterday, and little from my 
earlier messages.  Does nobody care about this stuff?  I find it hard to 
believe that everybody is happy with a SOAP implementation that cannot 
even transport accented Latin characters.
*************************************************************************
Please respond to soap-dev@xml.apache.org 
To:     soap-dev@xml.apache.org
cc: 
Subject:        Current SOAP and Xerces 1.2.2 can NOT transport a String containing arbitrary Unicode characters



I've tried the April 24 nightly build of Apache SOAP, with Xerces 1.2.2,
and both problems reported below are still present.
*************************************************************************
Please respond to soap-dev@xml.apache.org
To:     soap-user@xml.apache.org
cc:     soap-dev@xml.apache.org
Subject:        Re: Can SOAP 2.1 and Xerces 1.2.2 transport a String containing an arbitrary Unicode character?



I used tcpdump to capture traffic containing three interesting call
messages, containing (respectively) the Strings: "qqq\u00F6zzz",
"qqq<![CDATA[zzz", and "qqq\u2030zzz" (in Java source notation) among the
data being sent.

The tcpdump output shows: (1) where \u00F6 goes, a single 00 byte was
sent; (2) the CDATA start tag was sent without being quoted; and (3) the
\u2030 was sent correctly.
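
For comparison, here is a small sketch that prints the bytes one would
expect on the wire for the two non-ASCII characters, assuming the request
is supposed to be UTF-8 encoded (that is an assumption; the capture does
not say which encoding was declared). The class name Utf8BytesSketch and
the method hex() are made up for illustration.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical helper for inspecting expected wire bytes.
final class Utf8BytesSketch {
    private Utf8BytesSketch() {}

    /** Returns the UTF-8 encoding of s as space-separated uppercase hex bytes. */
    static String hex(String s) {
        StringBuilder sb = new StringBuilder();
        for (byte b : s.getBytes(StandardCharsets.UTF_8)) {
            if (sb.length() > 0) {
                sb.append(' ');
            }
            sb.append(String.format("%02X", b & 0xFF));
        }
        return sb.toString();
    }
}
```

Under that assumption, hex("\u00F6") is "C3 B6" and hex("\u2030") is
"E2 80 B0", so a single 00 byte in place of \u00F6 is not a valid UTF-8
encoding of that character.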

Unhappily,
Mike