Posted to c-dev@xerces.apache.org by "Alberto Massari (JIRA)" <xe...@xml.apache.org> on 2008/06/26 14:21:45 UTC
[jira] Resolved: (XERCESC-541) Regular Expressions : \w incorrectly matching punctuation characters
[ https://issues.apache.org/jira/browse/XERCESC-541?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Alberto Massari resolved XERCESC-541.
-------------------------------------
Resolution: Fixed
Fix Version/s: 3.0.0
Assignee: Alberto Massari (was: Xerces-C Developers Mailing List)
The \w category is now dynamically built from the three other categories (punctuation, separator and other) that it must exclude.
> Regular Expressions : \w incorrectly matching punctuation characters
> --------------------------------------------------------------------
>
> Key: XERCESC-541
> URL: https://issues.apache.org/jira/browse/XERCESC-541
> Project: Xerces-C++
> Issue Type: Bug
> Components: Validating Parser (XML Schema)
> Affects Versions: 1.7.0
> Environment: Operating System: Other
> Platform: PC
> Reporter: Richard Schofield
> Assignee: Alberto Massari
> Fix For: 3.0.0
>
>
> The XML Schema Spec Part 2 (Appendix F) defines the multi-character escapes
> which can be used in regular expression matching.
> \w should match all characters EXCEPT the set of "punctuation", "separator"
> and "other" characters as defined by the Unicode specification.
> However, \w sets up a range which matches all characters between x0020 and
> xD7FF (gXMLChars). This range results in the punctuation, separator and other
> characters being matched incorrectly.
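
To illustrate the difference between the reported behaviour and the definition in the spec, here is a minimal sketch in Python. It is not the Xerces-C++ code. The function names is_word_char and old_range_match are hypothetical, and the category lookup relies on Python's standard unicodedata module, whose Unicode version depends on the Python release in use.

```python
import unicodedata

# Major Unicode general categories that \w must exclude according to the
# report: P (punctuation), Z (separator), C (other).
EXCLUDED_MAJOR_CATEGORIES = {"P", "Z", "C"}


def is_word_char(ch: str) -> bool:
    # Hypothetical helper: True if ch is outside the three excluded categories.
    # unicodedata.category() returns a two-letter code such as "Ll" or "Po";
    # the first letter is the major category.
    return unicodedata.category(ch)[0] not in EXCLUDED_MAJOR_CATEGORIES


def old_range_match(ch: str) -> bool:
    # Hypothetical helper modelling the reported behaviour: a single range
    # from U+0020 to U+D7FF, regardless of category.
    return 0x0020 <= ord(ch) <= 0xD7FF


if __name__ == "__main__":
    for ch in ["a", "7", "+", "_", "!", ",", " "]:
        print(repr(ch), unicodedata.category(ch),
              "category-based:", is_word_char(ch),
              "range-based:", old_range_match(ch))
```

With this sketch, "!", "," and the space character are accepted by the range check but rejected by the category-based check, which is the mismatch the report describes. Note that "_" is also rejected, since Unicode classifies it as connector punctuation (Pc).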