Posted to oro-dev@jakarta.apache.org by Takashi Okamoto <to...@rd.nttdata.co.jp> on 2001/01/28 01:17:13 UTC
[PATCH] fixed unicode problem and added POSIX expression
Hello, ORO developers (especially Daniel).
I previously posted a patch for Unicode support, but I found the
following problems in it:
- It doesn't work when I use a special character in OpCode
(e.g. \18 is mapped to _ALNUM).
- I added the escape '\u' for Unicode, but it doesn't exist in
Perl 5.6 expressions.
I wrote a new patch and am posting it again. It has the following
features:
- Fixes the bugs mentioned above (the '\u' escape is removed).
- Gave up using __setCharacterClassBits().
(It is no longer used, but it still exists in Perl5Compiler.java.)
It is difficult to keep the bit-shift approach within a small
amount of memory for Unicode.
- I found POSIX expressions in Perl 5.6, and added the following
POSIX expressions:
[:alnum:]
[:alpha:]
[:blank:]
[:cntrl:]
[:digit:]
[:graph:]
[:lower:]
[:print:]
[:punct:]
[:space:]
[:upper:]
[:xdigit:]
I referred to the Jakarta-Regexp code, and added a '__parsePOSIX()'
method and a Hashtable 'hashPOSIX'. I also modified OpCode.java. For
now, these expressions must be written inside [] (e.g.
m/[[:alnum:]]/ matches one letter or one digit). A POSIX expression
cannot be used without the enclosing [] (e.g. /[:alnum:]/ is
invalid).
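To illustrate the idea of a name-to-class table, here is a minimal
sketch of looking up a POSIX class name found inside a bracket
expression. It is not the code from the patch: the class
PosixClassSketch and the methods parsePosixName() and matches() are
hypothetical names, it uses a HashMap and java.lang.Character tests
rather than ORO's OpCode values, and it leaves out [:graph:],
[:print:] and [:punct:] for brevity.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.function.IntPredicate;

// Hypothetical sketch only; not taken from the ORO patch.
final class PosixClassSketch {
    // Maps a POSIX class name to a Unicode-aware character test.
    private static final Map<String, IntPredicate> POSIX = new HashMap<>();
    static {
        POSIX.put("alnum", Character::isLetterOrDigit);
        POSIX.put("alpha", Character::isLetter);
        POSIX.put("blank", c -> c == ' ' || c == '\t');
        POSIX.put("cntrl", Character::isISOControl);
        POSIX.put("digit", Character::isDigit);
        POSIX.put("lower", Character::isLowerCase);
        POSIX.put("space", Character::isWhitespace);
        POSIX.put("upper", Character::isUpperCase);
        POSIX.put("xdigit", c -> Character.digit(c, 16) >= 0);
    }

    // Returns the class name if "[:name:]" starts at pos and the name
    // is known, otherwise null. A compiler would call this only while
    // it is already inside an enclosing [ ].
    static String parsePosixName(String pattern, int pos) {
        if (!pattern.startsWith("[:", pos)) {
            return null;
        }
        int end = pattern.indexOf(":]", pos + 2);
        if (end < 0) {
            return null;
        }
        String name = pattern.substring(pos + 2, end);
        return POSIX.containsKey(name) ? name : null;
    }

    // Tests one code point against a named class.
    static boolean matches(String name, int codePoint) {
        IntPredicate test = POSIX.get(name);
        if (test == null) {
            throw new IllegalArgumentException("unknown POSIX class: " + name);
        }
        return test.test(codePoint);
    }

    public static void main(String[] args) {
        String pattern = "[[:alnum:]]";
        // Index 1 is where the inner "[:alnum:]" begins.
        String name = parsePosixName(pattern, 1);
        System.out.println(name);
        System.out.println(matches(name, 'a'));
        System.out.println(matches(name, '_'));
    }
}
```

The table keeps the parser small: supporting another class name only
means adding one entry, and an unknown name is simply rejected.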
You can apply my patch to the CVS version of ORO.
Regards.
-----------------------------
Takashi Okamoto