Posted to oro-dev@jakarta.apache.org by df...@apache.org on 2001/11/14 02:30:08 UTC
cvs commit: jakarta-oro/src/java/org/apache/oro/text/regex Perl5Matcher.java
dfs 01/11/13 17:30:08
Modified: . CHANGES
src/java/org/apache/oro/text/regex Perl5Matcher.java
Log:
Fixed [[:punct:]] (which also affected [[:print:]] and [[:graph:]])
to conform to Single Unix Specification (some characters had been
omitted).
The Single Unix Specification (Austin Group Draft 7), the successor to POSIX
and Unix98, defines punctuation characters as the following for the POSIX
locale:
punct <exclamation-mark>;<quotation-mark>;<number-sign>;
<dollar-sign>;<percent-sign>;<ampersand>;<apostrophe>;
<left-parenthesis>;<right-parenthesis>;<asterisk>;
<plus-sign>;<comma>;<hyphen>;<period>;<slash>;
<colon>;<semicolon>;<less-than-sign>;<equals-sign>;
<greater-than-sign>;<question-mark>;<commercial-at>;
<left-square-bracket>;<backslash>;<right-square-bracket>;
<circumflex>;<underscore>;<grave-accent>;<left-curly-bracket>;
<vertical-line>;<right-curly-bracket>;<tilde>
Rather than special-casing the omitted characters, and in order to maintain
locale independence, I have added Character.MATH_SYMBOL,
Character.CURRENCY_SYMBOL, and Character.MODIFIER_SYMBOL to the various
_PUNCTUATION types that represent [[:punct:]].
PR: 4531
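The category change described in the log above can be illustrated with a small
standalone sketch. It is not part of the commit: the class and method names
(PunctCategories, isPunctBefore, isPunctAfter, missedBefore) are hypothetical,
and the pre-fix category set is an assumption, since the diff context shows
only three of the _PUNCTUATION cases. It classifies the 32 POSIX-locale
punctuation characters with Character.getType and reports which ones fall
outside the _PUNCTUATION categories.

```java
public class PunctCategories {
    // The 32 characters the Single Unix Specification lists for punct
    // in the POSIX locale, in ASCII order.
    static final String POSIX_PUNCT = "!\"#$%&'()*+,-./:;<=>?@[\\]^_`{|}~";

    // ASSUMPTION: before the fix, only the _PUNCTUATION categories were
    // accepted. The diff context confirms END, CONNECTOR, and OTHER; START
    // and DASH are assumed here.
    static boolean isPunctBefore(char c) {
        switch (Character.getType(c)) {
            case Character.START_PUNCTUATION:
            case Character.END_PUNCTUATION:
            case Character.DASH_PUNCTUATION:
            case Character.CONNECTOR_PUNCTUATION:
            case Character.OTHER_PUNCTUATION:
                return true;
            default:
                return false;
        }
    }

    // After the fix: the three symbol categories named in the log are
    // accepted as well.
    static boolean isPunctAfter(char c) {
        switch (Character.getType(c)) {
            case Character.MATH_SYMBOL:
            case Character.CURRENCY_SYMBOL:
            case Character.MODIFIER_SYMBOL:
                return true;
            default:
                return isPunctBefore(c);
        }
    }

    // POSIX punctuation characters that the assumed pre-fix test rejects.
    static String missedBefore() {
        StringBuilder missed = new StringBuilder();
        for (int i = 0; i < POSIX_PUNCT.length(); i++) {
            char c = POSIX_PUNCT.charAt(i);
            if (!isPunctBefore(c)) {
                missed.append(c);
            }
        }
        return missed.toString();
    }

    public static void main(String[] args) {
        System.out.println("Missed before the fix: " + missedBefore());
        for (int i = 0; i < POSIX_PUNCT.length(); i++) {
            char c = POSIX_PUNCT.charAt(i);
            if (!isPunctAfter(c)) {
                System.out.println("Still missed after the fix: " + c);
            }
        }
    }
}
```

Under these assumptions, the characters reported as missed are the ones Java
classifies as symbols rather than punctuation ($ + < = > ^ ` | ~), and none
should remain missed once the three symbol categories are added. Matching on
categories rather than on a fixed ASCII list also accepts non-ASCII symbols
such as other currency signs, which fits the locale-independence goal stated
in the log.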
Revision Changes Path
1.19 +5 -1 jakarta-oro/CHANGES
Index: CHANGES
===================================================================
RCS file: /home/cvs/jakarta-oro/CHANGES,v
retrieving revision 1.18
retrieving revision 1.19
diff -u -r1.18 -r1.19
--- CHANGES 2001/10/23 06:42:20 1.18
+++ CHANGES 2001/11/14 01:30:08 1.19
@@ -1,6 +1,10 @@
-$Id: CHANGES,v 1.18 2001/10/23 06:42:20 dfs Exp $
+$Id: CHANGES,v 1.19 2001/11/14 01:30:08 dfs Exp $
Version 2.x.x
+
+o Fixed [[:punct:]] (which also affected [[:print:]] and [[:graph:]])
+ to conform to Single Unix Specification (some characters had been
+ omitted).
o Fixed bug whereby a - in a Perl expression would be ignored when
it followed a builtin character class like \w. In other words,
1.16 +4 -1 jakarta-oro/src/java/org/apache/oro/text/regex/Perl5Matcher.java
Index: Perl5Matcher.java
===================================================================
RCS file: /home/cvs/jakarta-oro/src/java/org/apache/oro/text/regex/Perl5Matcher.java,v
retrieving revision 1.15
retrieving revision 1.16
diff -u -r1.15 -r1.16
--- Perl5Matcher.java 2001/06/13 01:05:57 1.15
+++ Perl5Matcher.java 2001/11/14 01:30:08 1.16
@@ -59,7 +59,7 @@
/*
- * $Id: Perl5Matcher.java,v 1.15 2001/06/13 01:05:57 dfs Exp $
+ * $Id: Perl5Matcher.java,v 1.16 2001/11/14 01:30:08 dfs Exp $
*/
import java.io.IOException;
import java.util.*;
@@ -704,6 +704,9 @@
case Character.END_PUNCTUATION:
case Character.CONNECTOR_PUNCTUATION:
case Character.OTHER_PUNCTUATION:
+ case Character.MATH_SYMBOL:
+ case Character.CURRENCY_SYMBOL:
+ case Character.MODIFIER_SYMBOL:
return isANYOF;
default:
break;