You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Namit Jain (JIRA)" <ji...@apache.org> on 2009/11/19 01:17:39 UTC
[jira] Created: (HIVE-942) use bucketing for group by
use bucketing for group by
--------------------------
Key: HIVE-942
URL: https://issues.apache.org/jira/browse/HIVE-942
Project: Hadoop Hive
Issue Type: New Feature
Components: Query Processor
Reporter: Namit Jain
Assignee: He Yongqiang
Fix For: 0.5.0
Group by on a bucketed column can be completely performed on the mapper if the split can be adjusted to span the key boundary.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
[jira] Commented: (HIVE-942) use bucketing for group by
Posted by "Namit Jain (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HIVE-942?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12779802#action_12779802 ]
Namit Jain commented on HIVE-942:
---------------------------------
However, a single mapper may take a long time if there is skew
> use bucketing for group by
> --------------------------
>
> Key: HIVE-942
> URL: https://issues.apache.org/jira/browse/HIVE-942
> Project: Hadoop Hive
> Issue Type: New Feature
> Components: Query Processor
> Reporter: Namit Jain
> Assignee: He Yongqiang
> Fix For: 0.5.0
>
>
> Group by on a bucketed column can be completely performed on the mapper if the split can be adjusted to span the key boundary.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
[jira] Updated: (HIVE-942) use bucketing for group by
Posted by "Namit Jain (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HIVE-942?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Namit Jain updated HIVE-942:
----------------------------
Fix Version/s: (was: 0.5.0)
> use bucketing for group by
> --------------------------
>
> Key: HIVE-942
> URL: https://issues.apache.org/jira/browse/HIVE-942
> Project: Hadoop Hive
> Issue Type: New Feature
> Components: Query Processor
> Reporter: Namit Jain
> Assignee: He Yongqiang
>
> Group by on a bucketed column can be completely performed on the mapper if the split can be adjusted to span the key boundary.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.