You are viewing a plain text version of this content. The canonical link for it is here.

Posted to dev@hive.apache.org by "Namit Jain (JIRA)" <ji...@apache.org> on 2009/11/19 01:17:39 UTC

[jira] Created: (HIVE-942) use bucketing for group by

use bucketing for group by
--------------------------

                 Key: HIVE-942
                 URL: https://issues.apache.org/jira/browse/HIVE-942
             Project: Hadoop Hive
          Issue Type: New Feature
          Components: Query Processor
            Reporter: Namit Jain
            Assignee: He Yongqiang
             Fix For: 0.5.0


Group by on a bucketed column can be completely performed on the mapper if the split can be adjusted to span the key boundary.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (HIVE-942) use bucketing for group by

Posted by "Namit Jain (JIRA)" <ji...@apache.org>.

    [ https://issues.apache.org/jira/browse/HIVE-942?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12779802#action_12779802 ] 

Namit Jain commented on HIVE-942:
---------------------------------

However, a single mapper may take a long time if there is skew

> use bucketing for group by
> --------------------------
>
>                 Key: HIVE-942
>                 URL: https://issues.apache.org/jira/browse/HIVE-942
>             Project: Hadoop Hive
>          Issue Type: New Feature
>          Components: Query Processor
>            Reporter: Namit Jain
>            Assignee: He Yongqiang
>             Fix For: 0.5.0
>
>
> Group by on a bucketed column can be completely performed on the mapper if the split can be adjusted to span the key boundary.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Updated: (HIVE-942) use bucketing for group by

Posted by "Namit Jain (JIRA)" <ji...@apache.org>.

     [ https://issues.apache.org/jira/browse/HIVE-942?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Namit Jain updated HIVE-942:
----------------------------

    Fix Version/s:     (was: 0.5.0)

> use bucketing for group by
> --------------------------
>
>                 Key: HIVE-942
>                 URL: https://issues.apache.org/jira/browse/HIVE-942
>             Project: Hadoop Hive
>          Issue Type: New Feature
>          Components: Query Processor
>            Reporter: Namit Jain
>            Assignee: He Yongqiang
>
> Group by on a bucketed column can be completely performed on the mapper if the split can be adjusted to span the key boundary.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.