You are viewing a plain text version of this content. The canonical link for it is here.
Posted to j-dev@xerces.apache.org by "Yves Lafon (JIRA)" <xe...@xml.apache.org> on 2005/08/09 16:54:37 UTC

[jira] Created: (XERCESJ-1094) Xerces in infinite loop validating wrongly encoded XML 1.1 documents

Xerces in infinite loop validating wrongly encoded XML 1.1 documents
--------------------------------------------------------------------

         Key: XERCESJ-1094
         URL: http://issues.apache.org/jira/browse/XERCESJ-1094
     Project: Xerces2-J
        Type: Bug
    Versions: 2.7.1    
 Environment: Linux, Solaris, jdk 1.2.2/1.4/1.5
    Reporter: Yves Lafon
    Priority: Blocker


When parsing a XML1.1 document from an InputSource, where the encoding is set to iso-8859-1, with an encoding set to UTF-8 in the XML declaration, and with a iso-8859-2 character in an attribute, then xerces enters an infinite loop.
If the same character is not in the attribute, then Xerces reports an invalid XML character instead of blocking.
If the encoding of the input source is not set to iso-8859-1, Xerces works fine also.

Sample doc and modified DocumentScanner that demonstrate the issue at http://jigsaw.w3.org/Yves/xercesBug.zip
Thanks,

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
   http://www.atlassian.com/software/jira


---------------------------------------------------------------------
To unsubscribe, e-mail: j-dev-unsubscribe@xerces.apache.org
For additional commands, e-mail: j-dev-help@xerces.apache.org


[jira] Assigned: (XERCESJ-1094) Xerces in infinite loop validating wrongly encoded XML 1.1 documents

Posted by "Michael Glavassevich (JIRA)" <xe...@xml.apache.org>.
     [ http://issues.apache.org/jira/browse/XERCESJ-1094?page=all ]

Michael Glavassevich reassigned XERCESJ-1094:
---------------------------------------------

    Assign To: Michael Glavassevich

> Xerces in infinite loop validating wrongly encoded XML 1.1 documents
> --------------------------------------------------------------------
>
>          Key: XERCESJ-1094
>          URL: http://issues.apache.org/jira/browse/XERCESJ-1094
>      Project: Xerces2-J
>         Type: Bug
>     Versions: 2.7.1
>  Environment: Linux, Solaris, jdk 1.2.2/1.4/1.5
>     Reporter: Yves Lafon
>     Assignee: Michael Glavassevich
>     Priority: Blocker

>
> When parsing a XML1.1 document from an InputSource, where the encoding is set to iso-8859-1, with an encoding set to UTF-8 in the XML declaration, and with a iso-8859-2 character in an attribute, then xerces enters an infinite loop.
> If the same character is not in the attribute, then Xerces reports an invalid XML character instead of blocking.
> If the encoding of the input source is not set to iso-8859-1, Xerces works fine also.
> Sample doc and modified DocumentScanner that demonstrate the issue at http://jigsaw.w3.org/Yves/xercesBug.zip
> Thanks,

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
   http://www.atlassian.com/software/jira


---------------------------------------------------------------------
To unsubscribe, e-mail: j-dev-unsubscribe@xerces.apache.org
For additional commands, e-mail: j-dev-help@xerces.apache.org


[jira] Resolved: (XERCESJ-1094) Xerces in infinite loop validating wrongly encoded XML 1.1 documents

Posted by "Michael Glavassevich (JIRA)" <xe...@xml.apache.org>.
     [ http://issues.apache.org/jira/browse/XERCESJ-1094?page=all ]
     
Michael Glavassevich resolved XERCESJ-1094:
-------------------------------------------

    Resolution: Fixed

Thanks for reporting. This actually had nothing to do with the encoding of the document, but rather how Xerces deals with unescaped control characters in attribute values. Instead of reporting an error, it kept looping back into the scanner. Should be fixed now in CVS.

> Xerces in infinite loop validating wrongly encoded XML 1.1 documents
> --------------------------------------------------------------------
>
>          Key: XERCESJ-1094
>          URL: http://issues.apache.org/jira/browse/XERCESJ-1094
>      Project: Xerces2-J
>         Type: Bug
>     Versions: 2.7.1
>  Environment: Linux, Solaris, jdk 1.2.2/1.4/1.5
>     Reporter: Yves Lafon
>     Assignee: Michael Glavassevich
>     Priority: Blocker

>
> When parsing a XML1.1 document from an InputSource, where the encoding is set to iso-8859-1, with an encoding set to UTF-8 in the XML declaration, and with a iso-8859-2 character in an attribute, then xerces enters an infinite loop.
> If the same character is not in the attribute, then Xerces reports an invalid XML character instead of blocking.
> If the encoding of the input source is not set to iso-8859-1, Xerces works fine also.
> Sample doc and modified DocumentScanner that demonstrate the issue at http://jigsaw.w3.org/Yves/xercesBug.zip
> Thanks,

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
   http://www.atlassian.com/software/jira


---------------------------------------------------------------------
To unsubscribe, e-mail: j-dev-unsubscribe@xerces.apache.org
For additional commands, e-mail: j-dev-help@xerces.apache.org


[jira] Updated: (XERCESJ-1094) Xerces in infinite loop validating wrongly encoded XML 1.1 documents

Posted by "Michael Glavassevich (JIRA)" <xe...@xml.apache.org>.
     [ http://issues.apache.org/jira/browse/XERCESJ-1094?page=all ]

Michael Glavassevich updated XERCESJ-1094:
------------------------------------------

    Fix Version: 2.8.0

> Xerces in infinite loop validating wrongly encoded XML 1.1 documents
> --------------------------------------------------------------------
>
>          Key: XERCESJ-1094
>          URL: http://issues.apache.org/jira/browse/XERCESJ-1094
>      Project: Xerces2-J
>         Type: Bug
>     Versions: 2.7.1
>  Environment: Linux, Solaris, jdk 1.2.2/1.4/1.5
>     Reporter: Yves Lafon
>     Assignee: Michael Glavassevich
>     Priority: Blocker
>      Fix For: 2.8.0

>
> When parsing a XML1.1 document from an InputSource, where the encoding is set to iso-8859-1, with an encoding set to UTF-8 in the XML declaration, and with a iso-8859-2 character in an attribute, then xerces enters an infinite loop.
> If the same character is not in the attribute, then Xerces reports an invalid XML character instead of blocking.
> If the encoding of the input source is not set to iso-8859-1, Xerces works fine also.
> Sample doc and modified DocumentScanner that demonstrate the issue at http://jigsaw.w3.org/Yves/xercesBug.zip
> Thanks,

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
   http://www.atlassian.com/software/jira


---------------------------------------------------------------------
To unsubscribe, e-mail: j-dev-unsubscribe@xerces.apache.org
For additional commands, e-mail: j-dev-help@xerces.apache.org