You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Prasad Chakka (JIRA)" <ji...@apache.org> on 2009/06/03 22:54:07 UTC

[jira] Created: (HIVE-539) Support range bucketing of hive tables/partitions

Support range bucketing of hive tables/partitions
-------------------------------------------------

                 Key: HIVE-539
                 URL: https://issues.apache.org/jira/browse/HIVE-539
             Project: Hadoop Hive
          Issue Type: New Feature
          Components: Metastore, Query Processor
    Affects Versions: 0.4.0
            Reporter: Prasad Chakka


Hive uses hash partitioner to distribute keys to reducers and thus creating hash bucketed tables/partitions. There are some cases where range partitioning will help in further query processing such as joins/filters.

Terasort (http://hadoop.apache.org/core/docs/current/api/org/apache/hadoop/examples/terasort/package-summary.html) seems to have implemented a sampling based range partitioner and Hive can reuse this or implement something similar.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.