You are viewing a plain text version of this content. The canonical link for it is here.
Posted to oro-dev@jakarta.apache.org by bu...@apache.org on 2001/10/31 01:12:04 UTC

DO NOT REPLY [Bug 4531] New: - print/graph/punct character classes missing some chars

DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG 
RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT
<http://nagoya.apache.org/bugzilla/show_bug.cgi?id=4531>.
ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND 
INSERTED IN THE BUG DATABASE.

http://nagoya.apache.org/bugzilla/show_bug.cgi?id=4531

print/graph/punct character classes missing some chars

           Summary: print/graph/punct character classes missing some chars
           Product: ORO
           Version: 2.0.4
          Platform: PC
        OS/Version: Windows NT/2K
            Status: NEW
          Severity: Minor
          Priority: Other
         Component: Main
        AssignedTo: oro-dev@jakarta.apache.org
        ReportedBy: tlane@macromedia.com


The print, graph & punct POSIX character classes (i.e. [[:print:]]) don't match 
the following chars:

$+<=>^`|~

While it is arguable that some of all of these are not punctuation characters, 
I suspect most or all existing POSIX implementations do include them in the 
punct class. Clearly print & graph should include these no matter what.

The trouble in the code seems to be that punct relies exclusively on the 
Unicode punctuation categories, which don't include these chars, and then print 
& graph rely on punct.

You might look into including the Unicode symbol character categories in the 
punct class (Character.MATH_SYMBOL, .CURRENCY_SYMBOL, etc.) I haven't checked 
myself to see if they include everything you want (and nothing you don't), but 
they seem like good candidates.

--
To unsubscribe, e-mail:   <ma...@jakarta.apache.org>
For additional commands, e-mail: <ma...@jakarta.apache.org>