Posted to dev@harmony.apache.org by Richard Liang <ri...@gmail.com> on 2006/06/29 05:14:38 UTC

Re: [jira] Created: (HARMONY-696) Harmony regex does not support Character.isXXX character classes

Hello Nik,
Just a kind reminder: as you may already know, a set of new isXXXX 
methods that take an int (code point) parameter has been defined since 
Java 1.5. This may make things tricky. ;-)
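
To illustrate the reminder above, here is a minimal sketch of how the char and int (code point) overloads can disagree for a supplementary character. The class name, method names, and the sample code point U+10400 are chosen for illustration only and do not come from the Harmony sources.

```java
class CodePointOverloadDemo {
    // U+10400 DESERET CAPITAL LETTER LONG I, picked here only as an example
    // of an upper-case letter outside the Basic Multilingual Plane.
    static final int SAMPLE_CODE_POINT = 0x10400;

    // Uses the char overload: it sees only the first UTF-16 code unit,
    // which is a high surrogate for a supplementary character.
    static boolean upperCaseByChar(String s) {
        return Character.isUpperCase(s.charAt(0));
    }

    // Uses the int overload added in Java 1.5: it sees the whole code point.
    static boolean upperCaseByCodePoint(String s) {
        return Character.isUpperCase(s.codePointAt(0));
    }

    public static void main(String[] args) {
        String s = new String(Character.toChars(SAMPLE_CODE_POINT));
        System.out.println("UTF-16 length: " + s.length());
        System.out.println("char overload: " + upperCaseByChar(s));
        System.out.println("int overload:  " + upperCaseByCodePoint(s));
    }
}
```

The sample string occupies two chars, so the char overload inspects a lone surrogate and reports false, while the int overload reports true. A regex engine that evaluates \p{javaXXX} one char at a time would presumably run into the same mismatch.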

Best regards,
Richard.

Nikolay Kuznetsov (JIRA) wrote:
> Harmony regex does not support Character.isXXX character classes
> ----------------------------------------------------------------
>
>          Key: HARMONY-696
>          URL: http://issues.apache.org/jira/browse/HARMONY-696
>      Project: Harmony
>         Type: Bug
>
>   Components: Classlib  
>  Environment: All
>     Reporter: Nikolay Kuznetsov
>
>
> According to the Javadoc specification, java.util.regex.Pattern should provide character classes corresponding to all Character.isXXX methods except the deprecated ones:
> "Categories that behave like the java.lang.Character boolean ismethodname methods (except for the deprecated ones) are available through the same \p{prop} syntax where the specified property has the name javamethodname."
>
> The Harmony implementation supports only the four that are named directly in the spec:
> java.lang.Character classes (simple java character type) 
> \p{javaLowerCase} Equivalent to java.lang.Character.isLowerCase() 
> \p{javaUpperCase} Equivalent to java.lang.Character.isUpperCase() 
> \p{javaWhitespace} Equivalent to java.lang.Character.isWhitespace() 
> \p{javaMirrored} Equivalent to java.lang.Character.isMirrored() 
>
> The others should also be implemented.
>
>   
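
As a rough illustration of the report above, the sketch below exercises the \p{javaXXX} syntax on an implementation that follows the quoted Javadoc. The helper matchesProperty and the class name are hypothetical; javaDigit and javaJavaIdentifierStart are examples of properties derived from Character.isDigit and Character.isJavaIdentifierStart that are not among the four listed in the report.

```java
import java.util.regex.Pattern;

class JavaPropertyClassDemo {
    // Hypothetical helper: true if every character of input belongs to the
    // named \p{...} property class. The property name is inserted verbatim.
    static boolean matchesProperty(String property, String input) {
        return Pattern.matches("\\p{" + property + "}+", input);
    }

    public static void main(String[] args) {
        // Named directly in the spec text quoted in the report.
        System.out.println(matchesProperty("javaLowerCase", "abc"));
        System.out.println(matchesProperty("javaUpperCase", "abc"));
        // Derived from other Character.isXXX methods.
        System.out.println(matchesProperty("javaDigit", "2006"));
        System.out.println(matchesProperty("javaJavaIdentifierStart", "_"));
    }
}
```

How an implementation that lacks these properties reacts (for example, by rejecting the pattern at compile time) is not stated in the report, so the sketch makes no assumption about it.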

-- 
Richard Liang
China Software Development Lab, IBM 



---------------------------------------------------------------------
Terms of use : http://incubator.apache.org/harmony/mailing.html
To unsubscribe, e-mail: harmony-dev-unsubscribe@incubator.apache.org
For additional commands, e-mail: harmony-dev-help@incubator.apache.org


Re: [jira] Created: (HARMONY-696) Harmony regex does not support Character.isXXX character classes

Posted by Richard Liang <ri...@gmail.com>.

Nikolay Kuznetsov wrote:
>> Unfortunately, I can provide some test cases to demonstrate other
>> constructs, such as X{n}, DO support supplementary characters. But I'm
>> not sure if other constructs need special processing.
>
> OK, I see. I'll provide general support for supplementary characters
> instead of patching some special functionality like character classes.
> Thanks for the clue.
>
> And if you'll update the regex unit tests with test cases, that would 
> be great.
>
> BTW, what's DO?
>
Hi Nik,

I just wanted to emphasize that other constructs should also support 
supplementary characters.  I'm sorry if the "DO" confused you; as you 
know, my English is still not fluent ;-) 

Richard

-- 
Richard Liang
China Software Development Lab, IBM 





Re: [jira] Created: (HARMONY-696) Harmony regex does not support Character.isXXX character classes

Posted by Nikolay Kuznetsov <ni...@gmail.com>.
> Unfortunately, I can provide some test cases to demonstrate other
> constructs, such as X{n}, DO support supplementary characters. But I'm
> not sure if other constructs need special processing.

OK, I see. I'll provide general support for supplementary characters
instead of patching some special functionality like character classes.
Thanks for the clue.

And if you'll update the regex unit tests with test cases, that would be great.

BTW, what's DO?



Re: [jira] Created: (HARMONY-696) Harmony regex does not support Character.isXXX character classes

Posted by Richard Liang <ri...@gmail.com>.

Nikolay Kuznetsov wrote:
> Richard,
>
> We have two issues here:
> [688] java.util.regex.Matcher does not support Unicode supplementary 
> characters
> [696] Harmony regex does not support Character.isXXX character classes
>
> I would prefer, if you don't mind, to fix them separately. I know that
> supplementary character support is a tricky thing and will take some
> time, while the {javaXXXX} char. classes could be fixed relatively
> quickly (w/o supplementary character support).
>
I agree, Nik.
> And I have a question: am I right that supplementary characters mostly
> affect ranges and character classes, and that all the other constructs
> will work fine w/o special processing? What do you think?
>
Unfortunately, I can provide some test cases to demonstrate other 
constructs, such as X{n}, DO support supplementary characters. But I'm 
not sure if other constructs need special processing.  Thanks a lot.
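
A test case of the kind mentioned above might look like the following sketch. It assumes a quantifier should apply to a whole supplementary code point rather than to its trailing surrogate; the class name, helper name, and the sample code point U+10400 are illustrative only.

```java
import java.util.regex.Pattern;

class QuantifierSupplementaryDemo {
    // One supplementary character (two UTF-16 code units), for illustration.
    static final String SAMPLE = new String(Character.toChars(0x10400));

    // Hypothetical helper: builds the pattern X{n} for a single-code-point
    // literal X and checks whether the whole input matches it.
    static boolean matchesRepeated(String literal, int n, String input) {
        return Pattern.matches(literal + "{" + n + "}", input);
    }

    public static void main(String[] args) {
        // X{2} against two copies of X: a code-point-aware engine matches.
        System.out.println(matchesRepeated(SAMPLE, 2, SAMPLE + SAMPLE));
        // X{2} against a single copy of X: should not match.
        System.out.println(matchesRepeated(SAMPLE, 2, SAMPLE));
    }
}
```

An engine that works on individual chars would read the same pattern as a high surrogate followed by a low surrogate repeated twice, so the first check is the one that distinguishes the two behaviours.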
> Thank you.
>   Nik.
>
> On 6/29/06, Richard Liang <ri...@gmail.com> wrote:
>> Hello Nik,
>> Just a kind reminder: as you may already know, a set of new isXXXX
>> methods that take an int (code point) parameter has been defined since
>> Java 1.5. This may make things tricky. ;-)
>>
>> Best regards,
>> Richard.
>

-- 
Richard Liang
China Software Development Lab, IBM 





Re: [jira] Created: (HARMONY-696) Harmony regex does not support Character.isXXX character classes

Posted by Nikolay Kuznetsov <ni...@gmail.com>.
Richard,

We have two issues here:
[688] java.util.regex.Matcher does not support Unicode supplementary characters
[696] Harmony regex does not support Character.isXXX character classes

I would prefer, if you don't mind, to fix them separately. I know that
supplementary character support is a tricky thing and will take some
time, while the {javaXXXX} char. classes could be fixed relatively
quickly (w/o supplementary character support).

And I have a question: am I right that supplementary characters mostly
affect ranges and character classes, and that all the other constructs
will work fine w/o special processing? What do you think?
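
For the range and character-class case raised in the question above, a minimal sketch of what code-point-aware behaviour would look like is given below. The class name, the helpers cp and inRange, and the Deseret block bounds used in main are assumptions made for illustration; the helper is only meant for code points that have no special meaning inside a character class.

```java
import java.util.regex.Pattern;

class SupplementaryRangeDemo {
    // Converts a code point to its String form (one or two chars).
    static String cp(int codePoint) {
        return new String(Character.toChars(codePoint));
    }

    // Hypothetical helper: builds the class [lo-hi] from two code points and
    // tests whether a single candidate code point matches it.
    static boolean inRange(int lo, int hi, int candidate) {
        String regex = "[" + cp(lo) + "-" + cp(hi) + "]";
        return Pattern.matches(regex, cp(candidate));
    }

    public static void main(String[] args) {
        // Range over supplementary characters U+10400..U+10427.
        System.out.println(inRange(0x10400, 0x10427, 0x10410));
        System.out.println(inRange(0x10400, 0x10427, 0x10428));
        System.out.println(inRange(0x10400, 0x10427, 'A'));
    }
}
```

An engine that parses the class one char at a time would see the range endpoints as separate surrogates, which is why ranges and classes are the obvious place for supplementary characters to go wrong.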

Thank you.
   Nik.

On 6/29/06, Richard Liang <ri...@gmail.com> wrote:
> Hello Nik,
> Just a kind reminder: as you may already know, a set of new isXXXX
> methods that take an int (code point) parameter has been defined since
> Java 1.5. This may make things tricky. ;-)
>
> Best regards,
> Richard.
