You are viewing a plain text version of this content. The canonical link for it is here.
Posted to oro-dev@jakarta.apache.org by Takashi Okamoto <to...@rd.nttdata.co.jp> on 2001/01/28 01:17:13 UTC

[PATCH] fixed unicode problem and added POSIX expression

Hello, ORO (especially daniel).

I have posted a patch for unicode. But I found following problems at
my patch:

 - It doesn't work when I use special character in OpCode
   (ex. \18 is mapped to _ALNUM).

 - I add escape '\u' for unicode, but it doesn't exist in
   Perl5.6 expression.

I wrote new patch and post again. It has following features:

 - fix bug above mentioned.(remove escape '\u')

 - gave up to use __setCharacterClassBits().
   (don't use it but exist at Perl5Compiler.java)
   It's difficult that I keep bit shift approach with a few
   memory for unicode.

 - I found POSIX expressin in Perl5.6, and added following
   POSIX expression:

  [;alnum:]
  [:alpha:]
  [:blank:]
  [:cntrl:]
  [:digit:]
  [:graph:]
  [:lower:]
  [:print:]
  [:punct:]
  [:space:]
  [:upper:]
  [:xdigit:]

I refered Jakarta-Regexp code, and added '__parsePOSIX()' method and
Hashtable 'hashPOSIX'. I also modified OpCode.java. Now I must write
these expresiion within [] (ex. m/[[:alnum:]]/ shows one alphabet or
one digit). I can't use POSIX expression without []
(ex./[:alnum:]/ is invalid).

You can apply my patch to CVS version ORO.

Regards.
-----------------------------
Takashi Okamoto