You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "binlijin (Created) (JIRA)" <ji...@apache.org> on 2012/02/11 14:23:59 UTC

[jira] [Created] (HIVE-2801) When join key is null, random distribute this tuple

When join key is null, random distribute this tuple
---------------------------------------------------

                 Key: HIVE-2801
                 URL: https://issues.apache.org/jira/browse/HIVE-2801
             Project: Hive
          Issue Type: Improvement
            Reporter: binlijin




--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Commented] (HIVE-2801) When join key is null, random distribute this tuple

Posted by "Ashutosh Chauhan (Commented) (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HIVE-2801?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13208953#comment-13208953 ] 

Ashutosh Chauhan commented on HIVE-2801:
----------------------------------------

I didn't get the context. Can you expand a bit more? Better still, you can add a testcase which illustrate the "fix" for the problem.
                
> When join key is null, random distribute this tuple
> ---------------------------------------------------
>
>                 Key: HIVE-2801
>                 URL: https://issues.apache.org/jira/browse/HIVE-2801
>             Project: Hive
>          Issue Type: Improvement
>            Reporter: binlijin
>         Attachments: HIVE-2801.patch
>
>


--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Updated] (HIVE-2801) When join key is null, random distribute this tuple

Posted by "binlijin (Updated) (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HIVE-2801?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

binlijin updated HIVE-2801:
---------------------------

    Status: Patch Available  (was: Open)
    
> When join key is null, random distribute this tuple
> ---------------------------------------------------
>
>                 Key: HIVE-2801
>                 URL: https://issues.apache.org/jira/browse/HIVE-2801
>             Project: Hive
>          Issue Type: Improvement
>            Reporter: binlijin
>         Attachments: HIVE-2801.patch
>
>


--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Updated] (HIVE-2801) When join key is null, random distribute this tuple

Posted by "binlijin (Updated) (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HIVE-2801?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

binlijin updated HIVE-2801:
---------------------------

    Attachment: HIVE-2801.patch
    
> When join key is null, random distribute this tuple
> ---------------------------------------------------
>
>                 Key: HIVE-2801
>                 URL: https://issues.apache.org/jira/browse/HIVE-2801
>             Project: Hive
>          Issue Type: Improvement
>            Reporter: binlijin
>         Attachments: HIVE-2801.patch
>
>


--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Updated] (HIVE-2801) When join key is null, random distribute this tuple

Posted by "Namit Jain (Updated) (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HIVE-2801?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Namit Jain updated HIVE-2801:
-----------------------------

    Assignee: binlijin
      Status: Open  (was: Patch Available)

Please make it 'Patch Available' after addresssing Ashutosh's concerns
                
> When join key is null, random distribute this tuple
> ---------------------------------------------------
>
>                 Key: HIVE-2801
>                 URL: https://issues.apache.org/jira/browse/HIVE-2801
>             Project: Hive
>          Issue Type: Improvement
>            Reporter: binlijin
>            Assignee: binlijin
>         Attachments: HIVE-2801.patch
>
>


--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Commented] (HIVE-2801) When join key is null, random distribute this tuple

Posted by "binlijin (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HIVE-2801?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13260169#comment-13260169 ] 

binlijin commented on HIVE-2801:
--------------------------------

All records who's join key is null will go to the same reduce.
                
> When join key is null, random distribute this tuple
> ---------------------------------------------------
>
>                 Key: HIVE-2801
>                 URL: https://issues.apache.org/jira/browse/HIVE-2801
>             Project: Hive
>          Issue Type: Improvement
>            Reporter: binlijin
>            Assignee: binlijin
>         Attachments: HIVE-2801.patch
>
>


--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Commented] (HIVE-2801) When join key is null, random distribute this tuple

Posted by "binlijin (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HIVE-2801?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13260166#comment-13260166 ] 

binlijin commented on HIVE-2801:
--------------------------------

If one table's join key have many null key, in reduce the data is skew. so if we random distribute the null key will sovle the skew problem.
                
> When join key is null, random distribute this tuple
> ---------------------------------------------------
>
>                 Key: HIVE-2801
>                 URL: https://issues.apache.org/jira/browse/HIVE-2801
>             Project: Hive
>          Issue Type: Improvement
>            Reporter: binlijin
>            Assignee: binlijin
>         Attachments: HIVE-2801.patch
>
>


--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira