You are viewing a plain text version of this content. The canonical link for it is here.

Posted to c-users@xerces.apache.org by "Wons, Jean-Baptiste" <Je...@kbcfp.com> on 2008/06/19 18:08:43 UTC

Sterling pound sign encoding sith XML string

Hello. 

I am not sure if this is a bug in xerces or me not using xerces well. 

This is my code: 

<code> 

#include <string> 
#include <iostream> 

#include <xercesc/dom/DOM.hpp> 
#include <xercesc/dom/DOMException.hpp> 
#include <xercesc/dom/DOMImplementationRegistry.hpp> 
#include <xercesc/framework/MemBufInputSource.hpp> 
#include <xercesc/parsers/XercesDOMParser.hpp> 
#include <xercesc/util/PlatformUtils.hpp> 
#include <xercesc/util/XMLString.hpp> 


using namespace std; 
using namespace XERCES_CPP_NAMESPACE; 

void replaceSpecialCharactersXML(std::string &s) 
{ 
    string cp; 
    unsigned int i; 
    cp.reserve(s.size()*2); 
    for (i = 0; i < s.size(); i++) 
    { 
        const unsigned char c = s[i]; 

        if ((c < 32 && c != '\012' && c != '\015') || c > 127) 
        { 
            char buffer[10000]; 
            sprintf(buffer, "&#x%02x;", c); 
            cp += buffer; 
        } 
        else 
        { 
            cp += c; 
        } 
    } 
    s = cp; 
} 


int main() 
{ 
    XMLPlatformUtils::Initialize(); 
    string aString0 ("This will crash ££££ ..."); 
    XMLCh* fUnicodeForm = XMLString::transcode(aString0.c_str()); 
    char *pMsg = XMLString::transcode(fUnicodeForm); 
    string res(pMsg); 
    replaceSpecialCharactersXML(res); 

    cout << aString0 << " -> " << pMsg << " -> " << res << endl; 

    return 0; 
} 

</code> 

When I compile and run, I have that output: 

<output> 
sh$ ./testxerces 
This will crash ££££ ... -> This will crash ... -> This will crash &#x1a;&#x1a;&#x1a;&#x1a; ... 
</output> 

When I transcode the £ sign to XMLCh, then transcode it back to a char*, it is transformed to 0x1a. 

Is it a real bug, or is it just me missing something ? 

Regards, 
Jean-Baptiste 


-- 
This message may contain confidential, proprietary, or legally privileged information. No confidentiality or privilege is waived by any transmission to an unintended recipient. If you are not an intended recipient, please notify the sender and delete this message immediately. Any views expressed in this message are those of the sender, not those of any entity within the KBC Financial Products group of companies (together referred to as "KBC FP"). 

This message does not create any obligation, contractual or otherwise, on the part of KBC FP. It is not an offer (or solicitation of an offer) of, or a recommendation to buy or sell, any financial product. Any prices or other values included in this message are indicative only, and do not necessarily represent current market prices, prices at which KBC FP would enter into a transaction, or prices at which similar transactions may be carried on KBC FP's own books. The information contained in this message is provided "as is", without representations or warranties, express or implied, of any kind. Past performance is not indicative of future returns.