Posted to oro-dev@jakarta.apache.org by df...@apache.org on 2001/11/14 02:30:08 UTC

cvs commit: jakarta-oro/src/java/org/apache/oro/text/regex Perl5Matcher.java

dfs         01/11/13 17:30:08

  Modified:    .        CHANGES
               src/java/org/apache/oro/text/regex Perl5Matcher.java
  Log:
  Fixed [[:punct:]] (which also affected [[:print:]] and [[:graph:]])
  to conform to Single Unix Specification (some characters had been
  omitted).
  
  The Single Unix Specification (Austin Group Draft 7), the successor to POSIX
  and Unix98, defines the following punctuation characters for the POSIX
  locale:
  
  punct <exclamation-mark>;<quotation-mark>;<number-sign>;
  <dollar-sign>;<percent-sign>;<ampersand>;<apostrophe>;
  <left-parenthesis>;<right-parenthesis>;<asterisk>;
  <plus-sign>;<comma>;<hyphen>;<period>;<slash>;
  <colon>;<semicolon>;<less-than-sign>;<equals-sign>;
  <greater-than-sign>;<question-mark>;<commercial-at>;
  <left-square-bracket>;<backslash>;<right-square-bracket>;
  <circumflex>;<underscore>;<grave-accent>;<left-curly-bracket>;
  <vertical-line>;<right-curly-bracket>;<tilde>
  
  Rather than special-case the characters that had been omitted, and in
  order to maintain locale independence, I have added Character.MATH_SYMBOL,
  Character.CURRENCY_SYMBOL, and Character.MODIFIER_SYMBOL to the various
  _PUNCTUATION types that represent [[:punct:]].
  
  PR: 4531
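
  As an illustration of the log message above, the following standalone
  sketch checks which of the 32 listed characters are covered by the
  java.lang.Character general categories before and after the change. The
  class and method names are hypothetical and are not part of jakarta-oro.
  The "before" set of five _PUNCTUATION categories is an assumption, since
  the diff context shows only three of the existing case labels.

```java
final class PunctCategories {
    // The 32 characters listed for "punct" in the POSIX locale.
    static final String POSIX_PUNCT = "!\"#$%&'()*+,-./:;<=>?@[\\]^_`{|}~";

    // Assumed pre-change test: only the *_PUNCTUATION general categories.
    static boolean isPunctuationOnly(char ch) {
        switch (Character.getType(ch)) {
            case Character.DASH_PUNCTUATION:
            case Character.START_PUNCTUATION:
            case Character.END_PUNCTUATION:
            case Character.CONNECTOR_PUNCTUATION:
            case Character.OTHER_PUNCTUATION:
                return true;
            default:
                return false;
        }
    }

    // Post-change test: the three symbol categories are accepted as well.
    static boolean isPunctWithSymbols(char ch) {
        switch (Character.getType(ch)) {
            case Character.MATH_SYMBOL:
            case Character.CURRENCY_SYMBOL:
            case Character.MODIFIER_SYMBOL:
                return true;
            default:
                return isPunctuationOnly(ch);
        }
    }

    // Returns the POSIX punct characters that the chosen test rejects.
    static String missed(boolean withSymbols) {
        StringBuilder out = new StringBuilder();
        for (int i = 0; i < POSIX_PUNCT.length(); i++) {
            char ch = POSIX_PUNCT.charAt(i);
            boolean ok = withSymbols ? isPunctWithSymbols(ch)
                                     : isPunctuationOnly(ch);
            if (!ok) {
                out.append(ch);
            }
        }
        return out.toString();
    }

    public static void main(String[] args) {
        System.out.println("missed without symbol categories: "
                           + missed(false));
        System.out.println("missed with symbol categories:    "
                           + missed(true));
    }
}
```

  Under these assumptions, the punctuation-only test rejects the nine
  characters $ + < = > ^ ` | ~ and the extended test rejects none. Because
  the test is category based, it also accepts non-ASCII characters in the
  same categories, which fits the stated goal of locale independence.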
  
  Revision  Changes    Path
  1.19      +5 -1      jakarta-oro/CHANGES
  
  Index: CHANGES
  ===================================================================
  RCS file: /home/cvs/jakarta-oro/CHANGES,v
  retrieving revision 1.18
  retrieving revision 1.19
  diff -u -r1.18 -r1.19
  --- CHANGES	2001/10/23 06:42:20	1.18
  +++ CHANGES	2001/11/14 01:30:08	1.19
  @@ -1,6 +1,10 @@
  -$Id: CHANGES,v 1.18 2001/10/23 06:42:20 dfs Exp $
  +$Id: CHANGES,v 1.19 2001/11/14 01:30:08 dfs Exp $
   
   Version 2.x.x
  +
  +o Fixed [[:punct:]] (which also affected [[:print:]] and [[:graph:]])
  +  to conform to Single Unix Specification (some characters had been
  +  omitted).
   
   o Fixed bug whereby a - in a Perl expression would be ignored when
     it followed a builtin character class like \w.  In other words,
  
  
  
  1.16      +4 -1      jakarta-oro/src/java/org/apache/oro/text/regex/Perl5Matcher.java
  
  Index: Perl5Matcher.java
  ===================================================================
  RCS file: /home/cvs/jakarta-oro/src/java/org/apache/oro/text/regex/Perl5Matcher.java,v
  retrieving revision 1.15
  retrieving revision 1.16
  diff -u -r1.15 -r1.16
  --- Perl5Matcher.java	2001/06/13 01:05:57	1.15
  +++ Perl5Matcher.java	2001/11/14 01:30:08	1.16
  @@ -59,7 +59,7 @@
   
   
   /*
  - * $Id: Perl5Matcher.java,v 1.15 2001/06/13 01:05:57 dfs Exp $
  + * $Id: Perl5Matcher.java,v 1.16 2001/11/14 01:30:08 dfs Exp $
    */
   import java.io.IOException;
   import java.util.*;
  @@ -704,6 +704,9 @@
   	    case Character.END_PUNCTUATION:
   	    case Character.CONNECTOR_PUNCTUATION:
   	    case Character.OTHER_PUNCTUATION:
  +            case Character.MATH_SYMBOL:
  +            case Character.CURRENCY_SYMBOL:
  +            case Character.MODIFIER_SYMBOL:
   	      return isANYOF;
   	    default:
   	      break;
  
  
  
