You are viewing a plain text version of this content. The canonical link for it is here.
Posted to c-dev@xerces.apache.org by bu...@apache.org on 2003/03/14 05:20:14 UTC

DO NOT REPLY [Bug 17983] New: - Formatter does not escape control characters

DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG 
RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT
<http://nagoya.apache.org/bugzilla/show_bug.cgi?id=17983>.
ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND 
INSERTED IN THE BUG DATABASE.

http://nagoya.apache.org/bugzilla/show_bug.cgi?id=17983

Formatter does not escape control characters

           Summary: Formatter does not escape control characters
           Product: Xerces-C++
           Version: 2.2.0
          Platform: All
        OS/Version: All
            Status: NEW
          Severity: Normal
          Priority: Other
         Component: Miscellaneous
        AssignedTo: xerces-c-dev@xml.apache.org
        ReportedBy: egnor-apache@ofb.net


XML 1.0 forbids control characters (#00-#1F, #7F-#9F, excluding #08, #0A, #0D)
from XML character data.  XML 1.1 relaxes this to allow #01-#1F and #7F-#9F but
requires them to be escaped (except #08, #0A, #0D).  The Xerces-C++ formatter
(framework/XMLFormatter.cpp) does not escape these characters at all -- they are
passed verbatim into the output.

This means Xerces-C++'s formatter generates non-well-formed XML under some
circumstances!

These characters should be escaped, e.g. #02 should become "&#2;".

---------------------------------------------------------------------
To unsubscribe, e-mail: xerces-c-dev-unsubscribe@xml.apache.org
For additional commands, e-mail: xerces-c-dev-help@xml.apache.org