You are viewing a plain text version of this content. The canonical link for it is here.
Posted to j-dev@xerces.apache.org by "Jörg Henne (JIRA)" <xe...@xml.apache.org> on 2005/03/22 12:00:27 UTC

[jira] Created: (XERCESJ-1049) org.apache.xerces.util.EncodingMap should recognise Java-style encodings

org.apache.xerces.util.EncodingMap should recognise Java-style encodings
------------------------------------------------------------------------

         Key: XERCESJ-1049
         URL: http://issues.apache.org/jira/browse/XERCESJ-1049
     Project: Xerces2-J
        Type: Bug
    Versions: 2.6.2    
 Environment: all
    Reporter: Jörg Henne


Xerces 1 supported encodings of the kind "Cp1141" etc. Xerces 2 fails to parse InputSources with this (and other) encodings, because the 4-digit code page ("CpXXXX") encodings have IANA-side entries for "CP0XXXX" only. 
For backwards compatibility, Xerces 2 should support the encodings without the leading 5th digit, too.

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://issues.apache.org/jira/secure/Administrators.jspa
-
If you want more information on JIRA, or have a bug to report see:
   http://www.atlassian.com/software/jira


---------------------------------------------------------------------
To unsubscribe, e-mail: xerces-j-dev-unsubscribe@xml.apache.org
For additional commands, e-mail: xerces-j-dev-help@xml.apache.org


[jira] Commented: (XERCESJ-1049) org.apache.xerces.util.EncodingMap should recognise Java-style encodings

Posted by "Michael Glavassevich (JIRA)" <xe...@xml.apache.org>.
     [ http://issues.apache.org/jira/browse/XERCESJ-1049?page=comments#action_61682 ]
     
Michael Glavassevich commented on XERCESJ-1049:
-----------------------------------------------

The IANA side of the EncodingMap only contains encoding names and aliases registered with IANA [1]. There's a feature [2] when enabled that will cause the parser to recognize Java encoding names. It may be that the Java encoding side of the map is missing the reverse mapping.

[1] http://www.iana.org/assignments/character-sets
[2] http://xml.apache.org/xerces2-j/features.html#allow-java-encodings

> org.apache.xerces.util.EncodingMap should recognise Java-style encodings
> ------------------------------------------------------------------------
>
>          Key: XERCESJ-1049
>          URL: http://issues.apache.org/jira/browse/XERCESJ-1049
>      Project: Xerces2-J
>         Type: Bug
>     Versions: 2.6.2
>  Environment: all
>     Reporter: Jörg Henne

>
> Xerces 1 supported encodings of the kind "Cp1141" etc. Xerces 2 fails to parse InputSources with this (and other) encodings, because the 4-digit code page ("CpXXXX") encodings have IANA-side entries for "CP0XXXX" only. 
> For backwards compatibility, Xerces 2 should support the encodings without the leading 5th digit, too.

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://issues.apache.org/jira/secure/Administrators.jspa
-
If you want more information on JIRA, or have a bug to report see:
   http://www.atlassian.com/software/jira


---------------------------------------------------------------------
To unsubscribe, e-mail: xerces-j-dev-unsubscribe@xml.apache.org
For additional commands, e-mail: xerces-j-dev-help@xml.apache.org