You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@tajo.apache.org by "Hyunsik Choi (JIRA)" <ji...@apache.org> on 2015/10/15 06:46:05 UTC

[jira] [Resolved] (TAJO-230) Support unicode identifiers

     [ https://issues.apache.org/jira/browse/TAJO-230?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Hyunsik Choi resolved TAJO-230.
-------------------------------
    Resolution: Won't Fix

We already support unicode in quoted identifiers.

> Support unicode identifiers
> ---------------------------
>
>                 Key: TAJO-230
>                 URL: https://issues.apache.org/jira/browse/TAJO-230
>             Project: Tajo
>          Issue Type: New Feature
>          Components: SQL Parser
>            Reporter: Hyunsik Choi
>
> The current parse only recognizes a combination of alphabet characters and underscore as an identifier used for function names, column names, and table names. This is because of the following antlr lexer rules:
> {code}
> Identifier
>   : Nonreserved_keywords
>   | Regular_Identifier
>   ;
> Regular_Identifier
>   : ('a'..'z'|'A'..'Z'|'_') ('a'..'z'|'A'..'Z'|Digit|'_')*
>   ;
> {code}
> In some CJK country, their characters can be used as identifiers. We need to support unicode identifier.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)