Posted to dev@hive.apache.org by "binlijin (Created) (JIRA)" <ji...@apache.org> on 2012/02/11 14:23:59 UTC
[jira] [Created] (HIVE-2801) When join key is null, random
distribute this tuple
When join key is null, random distribute this tuple
---------------------------------------------------
Key: HIVE-2801
URL: https://issues.apache.org/jira/browse/HIVE-2801
Project: Hive
Issue Type: Improvement
Reporter: binlijin
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HIVE-2801) When join key is null, random
distribute this tuple
Posted by "Ashutosh Chauhan (Commented) (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HIVE-2801?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13208953#comment-13208953 ]
Ashutosh Chauhan commented on HIVE-2801:
----------------------------------------
I didn't get the context. Can you expand a bit more? Better still, you can add a test case which illustrates the "fix" for the problem.
[jira] [Updated] (HIVE-2801) When join key is null, random
distribute this tuple
Posted by "binlijin (Updated) (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HIVE-2801?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
binlijin updated HIVE-2801:
---------------------------
Status: Patch Available (was: Open)
[jira] [Updated] (HIVE-2801) When join key is null, random
distribute this tuple
Posted by "binlijin (Updated) (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HIVE-2801?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
binlijin updated HIVE-2801:
---------------------------
Attachment: HIVE-2801.patch
[jira] [Updated] (HIVE-2801) When join key is null, random
distribute this tuple
Posted by "Namit Jain (Updated) (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HIVE-2801?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Namit Jain updated HIVE-2801:
-----------------------------
Assignee: binlijin
Status: Open (was: Patch Available)
Please make it 'Patch Available' after addressing Ashutosh's concerns.
[jira] [Commented] (HIVE-2801) When join key is null, random
distribute this tuple
Posted by "binlijin (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HIVE-2801?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13260169#comment-13260169 ]
binlijin commented on HIVE-2801:
--------------------------------
All records whose join key is null will go to the same reducer.
[jira] [Commented] (HIVE-2801) When join key is null, random
distribute this tuple
Posted by "binlijin (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HIVE-2801?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13260166#comment-13260166 ]
binlijin commented on HIVE-2801:
--------------------------------
If one table's join key has many null values, the data is skewed in the reduce phase. Randomly distributing the rows whose join key is null will solve the skew problem.
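
The skew described here can be illustrated with a small simulation. This is only a sketch under stated assumptions: it is not the code in HIVE-2801.patch and not Hive's actual partitioner. The function names, the reducer count, and the input data are all hypothetical. It compares plain hash partitioning, where every null key lands on one reducer, with a variant that sends null-key rows to a random reducer. Scattering them should be safe for join results because a null key never compares equal to another key, so such rows cannot match anything.

```python
import random
from collections import Counter

NUM_REDUCERS = 4  # hypothetical reducer count, chosen for illustration


def hash_partition(key, num_reducers):
    """Send rows with the same key to the same reducer; a null key always maps to reducer 0."""
    if key is None:
        return 0
    return hash(key) % num_reducers


def null_scattering_partition(key, num_reducers, rng):
    """Same as hash_partition, except rows with a null key go to a random reducer."""
    if key is None:
        return rng.randrange(num_reducers)
    return hash(key) % num_reducers


def reducer_loads(keys, partition):
    """Count how many rows each reducer receives under the given partition function."""
    loads = Counter(partition(k) for k in keys)
    return [loads.get(r, 0) for r in range(NUM_REDUCERS)]


# Hypothetical input: 1,000 rows with a null join key plus 400 rows with keys 0..399.
keys = [None] * 1000 + list(range(400))
rng = random.Random(0)  # fixed seed so the run is repeatable

skewed = reducer_loads(keys, lambda k: hash_partition(k, NUM_REDUCERS))
spread = reducer_loads(keys, lambda k: null_scattering_partition(k, NUM_REDUCERS, rng))

print("hash only:      ", skewed)
print("nulls scattered:", spread)
```

With this input, hash-only partitioning places 1,100 of the 1,400 rows on reducer 0, while the scattering variant spreads the null-key rows across all four reducers.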