Posted to issues@hive.apache.org by "Charles Pritchard (JIRA)" <ji...@apache.org> on 2016/02/05 02:34:39 UTC

[jira] [Commented] (HIVE-7148) Use murmur hash to create bucketed tables

    [ https://issues.apache.org/jira/browse/HIVE-7148?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15133477#comment-15133477 ] 

Charles Pritchard commented on HIVE-7148:
-----------------------------------------

I could really use custom bucketing functions, as I want to use buckets instead of partitions based on a derived value.

> Use murmur hash to create bucketed tables
> -----------------------------------------
>
>                 Key: HIVE-7148
>                 URL: https://issues.apache.org/jira/browse/HIVE-7148
>             Project: Hive
>          Issue Type: Bug
>            Reporter: Gunther Hagleitner
>
> HIVE-7121 introduced murmur hashing for queries that don't insert into bucketed tables, in order to achieve better distribution of the data. The same should be done for bucketed tables as well, but this requires making sure we don't break backward compatibility. This probably means that we have to store the partitioning function used in the metadata and use it to determine whether SMB (sort-merge-bucket) and bucketed map-join optimizations apply.
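
To illustrate the compatibility concern in the quoted description, here is a minimal Python sketch. It is not Hive code. The names BucketedTable, bucket_for, and can_use_bucketed_join are hypothetical, CRC32 is used only as a stand-in for murmur, the Java-style string hash stands in for the legacy function, and requiring equal bucket counts is a simplifying assumption. The point is only that two tables bucketed with different hash functions place the same key in different buckets, so a bucket-wise join is safe only if the function recorded in the metadata matches.

```python
import zlib
from dataclasses import dataclass


def java_string_hash(key: str) -> int:
    """Java-style 31-multiplier string hash (assumed stand-in for the legacy hash).

    Uses code points, so it matches Java's String.hashCode only for BMP characters.
    """
    h = 0
    for ch in key:
        h = (31 * h + ord(ch)) & 0xFFFFFFFF
    return h


def crc32_hash(key: str) -> int:
    """CRC32, used here only as a stand-in for a newer hash such as murmur."""
    return zlib.crc32(key.encode("utf-8"))


# Hypothetical registry of hash functions that table metadata can name.
HASH_FUNCTIONS = {"legacy": java_string_hash, "new": crc32_hash}


@dataclass(frozen=True)
class BucketedTable:
    """Hypothetical table metadata: the hash function is stored with the table."""
    name: str
    num_buckets: int
    hash_function: str


def bucket_for(table: BucketedTable, key: str) -> int:
    """Map a key to a bucket using the function recorded in the metadata."""
    h = HASH_FUNCTIONS[table.hash_function](key)
    # Clear the sign bit, then take the remainder by the bucket count.
    return (h & 0x7FFFFFFF) % table.num_buckets


def can_use_bucketed_join(left: BucketedTable, right: BucketedTable) -> bool:
    """Allow a bucket-wise join only if both tables agree on how keys map to buckets.

    Simplifying assumption: bucket counts must be equal.
    """
    return (left.hash_function == right.hash_function
            and left.num_buckets == right.num_buckets)
```

With an existing table recorded as "legacy" and a newly written one recorded as "new", can_use_bucketed_join returns False, so a planner following this sketch would fall back to a regular join rather than silently pairing mismatched buckets.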



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)