You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Zheng Shao (JIRA)" <ji...@apache.org> on 2009/05/02 02:05:30 UTC
[jira] Issue Comment Edited: (HIVE-420) Support regular expressions
for column in select clause
[ https://issues.apache.org/jira/browse/HIVE-420?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12704565#action_12704565 ]
Zheng Shao edited comment on HIVE-420 at 5/1/09 5:05 PM:
---------------------------------------------------------
With Java regex library we can easily support exclusion of several columns:
{code} "(ds|hr)?+.+" {code} will match all columns except ds and hr.
{code} "(dsa|ds)?+.+" {code} will match all columns except ds and dsa. Note that "dsa" has to come before "ds" otherwise it won't work as expected - basically if there exists s and t, and s is a prefix of t, then t has to come before s.
When the user is specifying column names, they should not need to specify "^" and "$" and we should assume the regex matches the whole string.
Use http://www.fileformat.info/tool/regex.htm for testing the above examples.
{code}
"(ds|hr)?+.+"
Try these:
ds
hr
d
h
s
r
ads
dsa
ahr
hra
dsds
dshr
hrds
hrhr
abc
{code}
was (Author: zshao):
With Java regex library we can easily support exclusion of several columns:
{code} "^(ds|hr)?+.+$" {code} will match all columns except ds and hr.
{code} "^(dsa|ds)?+.+$" {code} will match all columns except ds and dsa. Note that "dsa" has to come before "ds" otherwise it won't work as expected - basically if there exists s and t, and s is a prefix of t, then t has to come before s.
When the user is specifying column names, they should not need to specify "^" and "$" and we should assume the regex matches the whole string.
Use http://www.fileformat.info/tool/regex.htm for testing the above examples.
{code}
"^(ds|hr)?+.+$"
Try these:
ds
hr
d
h
s
r
ads
dsa
ahr
hra
dsds
dshr
hrds
hrhr
abc
{code}
> Support regular expressions for column in select clause
> -------------------------------------------------------
>
> Key: HIVE-420
> URL: https://issues.apache.org/jira/browse/HIVE-420
> Project: Hadoop Hive
> Issue Type: New Feature
> Components: Query Processor
> Affects Versions: 0.2.0, 0.3.0, 0.3.1, 0.4.0
> Reporter: Prasad Chakka
> Assignee: Zheng Shao
> Priority: Minor
> Fix For: 0.4.0
>
> Attachments: HIVE-420.1.patch, HIVE-420.2.patch, HIVE-420.3.patch, HIVE-420.4.patch
>
>
> support queries like `select a.[^ds] from a where a.ds='2009-09-09'` which are useful when there are humongous number of columns in a table and want to select all columns except the partitions columns.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.