You are viewing a plain text version of this content. The canonical link for it is here.
Posted to j-dev@xerces.apache.org by "Michael Glavassevich (JIRA)" <xe...@xml.apache.org> on 2004/11/05 22:15:49 UTC

[jira] Commented: (XERCESJ-1027) Support RFC 3066 language codes

     [ http://nagoya.apache.org/jira/browse/XERCESJ-1027?page=comments#action_55134 ]
     
Michael Glavassevich commented on XERCESJ-1027:
-----------------------------------------------

Could you please provide an example of what you feel Xerces is doing incorrectly. We've implemented that erratum for the language type for quite some time. The pattern facet it is checking against is ([a-zA-Z]{1,8})(-[a-zA-Z0-9]{1,8})*. This is equivalent to what is in the XML Schema second edition [1].

RFC 3066 [2] defines language as:

Language-Tag = Primary-subtag *( "-" Subtag )
Primary-subtag = 1*8ALPHA
Subtag = 1*8(ALPHA / DIGIT)

That seems to me like the same thing. If there are constraints on language above and beyond this syntax XML Schema currently doesn't impose that.

[1] http://www.w3.org/TR/2004/REC-xmlschema-2-20041028/#language
[2] http://www.ietf.org/rfc/rfc3066.txt

> Support RFC 3066 language codes
> -------------------------------
>
>          Key: XERCESJ-1027
>          URL: http://nagoya.apache.org/jira/browse/XERCESJ-1027
>      Project: Xerces2-J
>         Type: Bug
>   Components: XML Schema datatypes
>     Versions: 2.6.2
>     Reporter: Chris Hubick

>
> XML instance documents with 'Language' datatype values containing RFC 3066 compliant three letter language codes (tags) will not validate with Xerces.
> Background:  The old XML Schema datatypes document
> http://www.w3.org/TR/2001/REC-xmlschema-2-20010502/#language
> referenced RFC 1766, as did the core XML 1.0 Second Edition specification.  XML 1.0 Third Edition, and the new XML Schema specification both now reference the newer RFC 3066.  I understand the major change is that RFC 3066 supports the three letter language tags from ISO 639.
> http://www.w3.org/TR/REC-xml/#sec-lang-tag
> http://www.w3.org/TR/xmlschema-2/#language
> http://www.ietf.org/rfc/rfc3066.txt

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://nagoya.apache.org/jira/secure/Administrators.jspa
-
If you want more information on JIRA, or have a bug to report see:
   http://www.atlassian.com/software/jira


---------------------------------------------------------------------
To unsubscribe, e-mail: xerces-j-dev-unsubscribe@xml.apache.org
For additional commands, e-mail: xerces-j-dev-help@xml.apache.org