You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@xalan.apache.org by "Dmitry Hayes (JIRA)" <xa...@xml.apache.org> on 2005/07/05 22:03:17 UTC

[jira] Commented: (XALANC-532) FormatterToXMLUnicode does not serialize UTF-8 correctly

    [ http://issues.apache.org/jira/browse/XALANC-532?page=comments#action_12315093 ] 

Dmitry Hayes commented on XALANC-532:
-------------------------------------

I'm not sure if the problem is related to UTF8: we concentrate all the encoding related code inside of the XalanUTF8(UTF16/OtherEncoding)Writers. FormatterToXMLUnicode code should be encoding independent.
I support enhance of the s_specialChars1_1 special characters table, 
but we still have some problem.
The structure "    
        	 ...
             else if (XMLVersion == XML_VERSION_1_1)
              {
                 writeXML1_1CharacterReference(ch);
              }
	...
was intendend to sure that in case we are dealing with a XML1.1 specific character , it would be printed as a charRefs for XML1.1 output. However, for XML1.0 output those characters would be just ignored. The solution is far from being perfect, but it produced well-formed XML files. 
	Right now I'm thinking about other solution: in XalanXMLSerializerBase::s_specialChars1_0 table we can mark the forbidden characters as "eForbidden" and later just check for it. In case of the forbidden character we can throw an exception. 


> FormatterToXMLUnicode does not serialize UTF-8 correctly
> --------------------------------------------------------
>
>          Key: XALANC-532
>          URL: http://issues.apache.org/jira/browse/XALANC-532
>      Project: XalanC
>         Type: Bug
>   Components: XalanC
>     Versions: CurrentCVS
>     Reporter: David Bertoni
>     Assignee: David Bertoni
>     Priority: Critical
>  Attachments: patch.txt
>
> There are some serious bugs that result in incorrect serialization.  A patch is attached.

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
   http://www.atlassian.com/software/jira


---------------------------------------------------------------------
To unsubscribe, e-mail: xalan-dev-unsubscribe@xml.apache.org
For additional commands, e-mail: xalan-dev-help@xml.apache.org