You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Zheng Shao (JIRA)" <ji...@apache.org> on 2009/02/24 00:34:03 UTC

[jira] Created: (HIVE-300) DISTRIBUTE BY should support expressions

DISTRIBUTE BY should support expressions
----------------------------------------

                 Key: HIVE-300
                 URL: https://issues.apache.org/jira/browse/HIVE-300
             Project: Hadoop Hive
          Issue Type: Improvement
          Components: Query Processor
    Affects Versions: 0.2.0, 0.3.0
            Reporter: Zheng Shao


We should support this kind of query.  Distribute By only generates the hash-code, so it's easy to allow expressions (while it will be hard for Sort By and Cluster By).

{code}
SELECT a.key, a.value FROM a
DISTRIBUTE BY rand()
{code}


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (HIVE-300) DISTRIBUTE BY should support expressions

Posted by "Zheng Shao (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HIVE-300?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Zheng Shao updated HIVE-300:
----------------------------

    Attachment: HIVE-300.1.patch

Modified the syntax. No code change is needed.

> DISTRIBUTE BY should support expressions
> ----------------------------------------
>
>                 Key: HIVE-300
>                 URL: https://issues.apache.org/jira/browse/HIVE-300
>             Project: Hadoop Hive
>          Issue Type: Improvement
>          Components: Query Processor
>    Affects Versions: 0.2.0, 0.3.0
>            Reporter: Zheng Shao
>         Attachments: HIVE-300.1.patch
>
>
> We should support this kind of query.  Distribute By only generates the hash-code, so it's easy to allow expressions (while it will be hard for Sort By and Cluster By).
> {code}
> SELECT a.key, a.value FROM a
> DISTRIBUTE BY rand()
> {code}

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Resolved: (HIVE-300) DISTRIBUTE BY should support expressions

Posted by "Zheng Shao (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HIVE-300?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Zheng Shao resolved HIVE-300.
-----------------------------

       Resolution: Fixed
    Fix Version/s: 0.3.0
                   0.2.0
         Assignee: Zheng Shao
     Release Note: HIVE-300. DISTRIBUTE BY now supports expressions. (zshao)
     Hadoop Flags: [Reviewed]

branch 0.2: Committed revision 747640.
trunk: Committed revision 747600.

> DISTRIBUTE BY should support expressions
> ----------------------------------------
>
>                 Key: HIVE-300
>                 URL: https://issues.apache.org/jira/browse/HIVE-300
>             Project: Hadoop Hive
>          Issue Type: Improvement
>          Components: Query Processor
>    Affects Versions: 0.2.0, 0.3.0
>            Reporter: Zheng Shao
>            Assignee: Zheng Shao
>             Fix For: 0.2.0, 0.3.0
>
>         Attachments: HIVE-300.1.patch
>
>
> We should support this kind of query.  Distribute By only generates the hash-code, so it's easy to allow expressions (while it will be hard for Sort By and Cluster By).
> {code}
> SELECT a.key, a.value FROM a
> DISTRIBUTE BY rand()
> {code}

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (HIVE-300) DISTRIBUTE BY should support expressions

Posted by "Namit Jain (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HIVE-300?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12676433#action_12676433 ] 

Namit Jain commented on HIVE-300:
---------------------------------

+1

looks good

> DISTRIBUTE BY should support expressions
> ----------------------------------------
>
>                 Key: HIVE-300
>                 URL: https://issues.apache.org/jira/browse/HIVE-300
>             Project: Hadoop Hive
>          Issue Type: Improvement
>          Components: Query Processor
>    Affects Versions: 0.2.0, 0.3.0
>            Reporter: Zheng Shao
>         Attachments: HIVE-300.1.patch
>
>
> We should support this kind of query.  Distribute By only generates the hash-code, so it's easy to allow expressions (while it will be hard for Sort By and Cluster By).
> {code}
> SELECT a.key, a.value FROM a
> DISTRIBUTE BY rand()
> {code}

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (HIVE-300) DISTRIBUTE BY should support expressions

Posted by "Joydeep Sen Sarma (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HIVE-300?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12676435#action_12676435 ] 

Joydeep Sen Sarma commented on HIVE-300:
----------------------------------------

+1 - isn't the test non-deterministic?

> DISTRIBUTE BY should support expressions
> ----------------------------------------
>
>                 Key: HIVE-300
>                 URL: https://issues.apache.org/jira/browse/HIVE-300
>             Project: Hadoop Hive
>          Issue Type: Improvement
>          Components: Query Processor
>    Affects Versions: 0.2.0, 0.3.0
>            Reporter: Zheng Shao
>         Attachments: HIVE-300.1.patch
>
>
> We should support this kind of query.  Distribute By only generates the hash-code, so it's easy to allow expressions (while it will be hard for Sort By and Cluster By).
> {code}
> SELECT a.key, a.value FROM a
> DISTRIBUTE BY rand()
> {code}

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (HIVE-300) DISTRIBUTE BY should support expressions

Posted by "Zheng Shao (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HIVE-300?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12676436#action_12676436 ] 

Zheng Shao commented on HIVE-300:
---------------------------------

rand with a seed is deterministic.


> DISTRIBUTE BY should support expressions
> ----------------------------------------
>
>                 Key: HIVE-300
>                 URL: https://issues.apache.org/jira/browse/HIVE-300
>             Project: Hadoop Hive
>          Issue Type: Improvement
>          Components: Query Processor
>    Affects Versions: 0.2.0, 0.3.0
>            Reporter: Zheng Shao
>         Attachments: HIVE-300.1.patch
>
>
> We should support this kind of query.  Distribute By only generates the hash-code, so it's easy to allow expressions (while it will be hard for Sort By and Cluster By).
> {code}
> SELECT a.key, a.value FROM a
> DISTRIBUTE BY rand()
> {code}

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (HIVE-300) DISTRIBUTE BY should support expressions

Posted by "Carl Steinbach (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HIVE-300?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Carl Steinbach updated HIVE-300:
--------------------------------

    Affects Version/s:     (was: 0.6.0)
        Fix Version/s:     (was: 0.6.0)

> DISTRIBUTE BY should support expressions
> ----------------------------------------
>
>                 Key: HIVE-300
>                 URL: https://issues.apache.org/jira/browse/HIVE-300
>             Project: Hadoop Hive
>          Issue Type: Improvement
>          Components: Query Processor
>    Affects Versions: 0.3.0
>            Reporter: Zheng Shao
>            Assignee: Zheng Shao
>             Fix For: 0.3.0
>
>         Attachments: HIVE-300.1.patch
>
>
> We should support this kind of query.  Distribute By only generates the hash-code, so it's easy to allow expressions (while it will be hard for Sort By and Cluster By).
> {code}
> SELECT a.key, a.value FROM a
> DISTRIBUTE BY rand()
> {code}

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (HIVE-300) DISTRIBUTE BY should support expressions

Posted by "Prasad Chakka (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HIVE-300?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12676432#action_12676432 ] 

Prasad Chakka commented on HIVE-300:
------------------------------------

+1

> DISTRIBUTE BY should support expressions
> ----------------------------------------
>
>                 Key: HIVE-300
>                 URL: https://issues.apache.org/jira/browse/HIVE-300
>             Project: Hadoop Hive
>          Issue Type: Improvement
>          Components: Query Processor
>    Affects Versions: 0.2.0, 0.3.0
>            Reporter: Zheng Shao
>         Attachments: HIVE-300.1.patch
>
>
> We should support this kind of query.  Distribute By only generates the hash-code, so it's easy to allow expressions (while it will be hard for Sort By and Cluster By).
> {code}
> SELECT a.key, a.value FROM a
> DISTRIBUTE BY rand()
> {code}

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.