You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Namit Jain (JIRA)" <ji...@apache.org> on 2010/04/02 09:12:27 UTC

[jira] Created: (HIVE-1290) sort merge join does not work with bucketizedhiveinputformat

sort merge join does not work with bucketizedhiveinputformat
------------------------------------------------------------

                 Key: HIVE-1290
                 URL: https://issues.apache.org/jira/browse/HIVE-1290
             Project: Hadoop Hive
          Issue Type: Bug
          Components: Query Processor
            Reporter: Namit Jain
            Assignee: Namit Jain
             Fix For: 0.6.0
         Attachments: hive.1290.1.patch

The mappers are assigned in the order of the sizes of the files which violates the output bucketing of the result of sort merge join

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (HIVE-1290) sort merge join does not work with bucketizedhiveinputformat

Posted by "Namit Jain (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HIVE-1290?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Namit Jain updated HIVE-1290:
-----------------------------

    Status: Patch Available  (was: Open)

> sort merge join does not work with bucketizedhiveinputformat
> ------------------------------------------------------------
>
>                 Key: HIVE-1290
>                 URL: https://issues.apache.org/jira/browse/HIVE-1290
>             Project: Hadoop Hive
>          Issue Type: Bug
>          Components: Query Processor
>            Reporter: Namit Jain
>            Assignee: Namit Jain
>             Fix For: 0.6.0
>
>         Attachments: hive.1290.1.patch
>
>
> The mappers are assigned in the order of the sizes of the files which violates the output bucketing of the result of sort merge join

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (HIVE-1290) sort merge join does not work with bucketizedhiveinputformat

Posted by "Namit Jain (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HIVE-1290?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Namit Jain updated HIVE-1290:
-----------------------------

    Attachment: hive.1290.2.patch

> sort merge join does not work with bucketizedhiveinputformat
> ------------------------------------------------------------
>
>                 Key: HIVE-1290
>                 URL: https://issues.apache.org/jira/browse/HIVE-1290
>             Project: Hadoop Hive
>          Issue Type: Bug
>          Components: Query Processor
>            Reporter: Namit Jain
>            Assignee: Namit Jain
>             Fix For: 0.6.0
>
>         Attachments: hive.1290.1.patch, hive.1290.2.patch
>
>
> The mappers are assigned in the order of the sizes of the files which violates the output bucketing of the result of sort merge join

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (HIVE-1290) sort merge join does not work with bucketizedhiveinputformat

Posted by "Ning Zhang (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HIVE-1290?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12853604#action_12853604 ] 

Ning Zhang commented on HIVE-1290:
----------------------------------

+1. will commit after tests.

> sort merge join does not work with bucketizedhiveinputformat
> ------------------------------------------------------------
>
>                 Key: HIVE-1290
>                 URL: https://issues.apache.org/jira/browse/HIVE-1290
>             Project: Hadoop Hive
>          Issue Type: Bug
>          Components: Query Processor
>            Reporter: Namit Jain
>            Assignee: Namit Jain
>             Fix For: 0.6.0
>
>         Attachments: hive.1290.1.patch, hive.1290.2.patch, hive.1290.3.patch
>
>
> The mappers are assigned in the order of the sizes of the files which violates the output bucketing of the result of sort merge join

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (HIVE-1290) sort merge join does not work with bucketizedhiveinputformat

Posted by "Namit Jain (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HIVE-1290?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Namit Jain updated HIVE-1290:
-----------------------------

    Attachment: hive.1290.4.patch

> sort merge join does not work with bucketizedhiveinputformat
> ------------------------------------------------------------
>
>                 Key: HIVE-1290
>                 URL: https://issues.apache.org/jira/browse/HIVE-1290
>             Project: Hadoop Hive
>          Issue Type: Bug
>          Components: Query Processor
>            Reporter: Namit Jain
>            Assignee: Namit Jain
>             Fix For: 0.6.0
>
>         Attachments: hive.1290.1.patch, hive.1290.2.patch, hive.1290.3.patch, hive.1290.4.patch
>
>
> The mappers are assigned in the order of the sizes of the files which violates the output bucketing of the result of sort merge join

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (HIVE-1290) sort merge join does not work with bucketizedhiveinputformat

Posted by "Namit Jain (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HIVE-1290?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Namit Jain updated HIVE-1290:
-----------------------------

    Status: Patch Available  (was: Open)

> sort merge join does not work with bucketizedhiveinputformat
> ------------------------------------------------------------
>
>                 Key: HIVE-1290
>                 URL: https://issues.apache.org/jira/browse/HIVE-1290
>             Project: Hadoop Hive
>          Issue Type: Bug
>          Components: Query Processor
>            Reporter: Namit Jain
>            Assignee: Namit Jain
>             Fix For: 0.6.0
>
>         Attachments: hive.1290.1.patch, hive.1290.2.patch
>
>
> The mappers are assigned in the order of the sizes of the files which violates the output bucketing of the result of sort merge join

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (HIVE-1290) sort merge join does not work with bucketizedhiveinputformat

Posted by "Namit Jain (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HIVE-1290?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Namit Jain updated HIVE-1290:
-----------------------------

    Attachment: hive.1290.3.patch

> sort merge join does not work with bucketizedhiveinputformat
> ------------------------------------------------------------
>
>                 Key: HIVE-1290
>                 URL: https://issues.apache.org/jira/browse/HIVE-1290
>             Project: Hadoop Hive
>          Issue Type: Bug
>          Components: Query Processor
>            Reporter: Namit Jain
>            Assignee: Namit Jain
>             Fix For: 0.6.0
>
>         Attachments: hive.1290.1.patch, hive.1290.2.patch, hive.1290.3.patch
>
>
> The mappers are assigned in the order of the sizes of the files which violates the output bucketing of the result of sort merge join

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (HIVE-1290) sort merge join does not work with bucketizedhiveinputformat

Posted by "Namit Jain (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HIVE-1290?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12853734#action_12853734 ] 

Namit Jain commented on HIVE-1290:
----------------------------------

new patch

> sort merge join does not work with bucketizedhiveinputformat
> ------------------------------------------------------------
>
>                 Key: HIVE-1290
>                 URL: https://issues.apache.org/jira/browse/HIVE-1290
>             Project: Hadoop Hive
>          Issue Type: Bug
>          Components: Query Processor
>            Reporter: Namit Jain
>            Assignee: Namit Jain
>             Fix For: 0.6.0
>
>         Attachments: hive.1290.1.patch, hive.1290.2.patch, hive.1290.3.patch, hive.1290.4.patch
>
>
> The mappers are assigned in the order of the sizes of the files which violates the output bucketing of the result of sort merge join

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (HIVE-1290) sort merge join does not work with bucketizedhiveinputformat

Posted by "Namit Jain (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HIVE-1290?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Namit Jain updated HIVE-1290:
-----------------------------

    Status: Open  (was: Patch Available)

> sort merge join does not work with bucketizedhiveinputformat
> ------------------------------------------------------------
>
>                 Key: HIVE-1290
>                 URL: https://issues.apache.org/jira/browse/HIVE-1290
>             Project: Hadoop Hive
>          Issue Type: Bug
>          Components: Query Processor
>            Reporter: Namit Jain
>            Assignee: Namit Jain
>             Fix For: 0.6.0
>
>         Attachments: hive.1290.1.patch
>
>
> The mappers are assigned in the order of the sizes of the files which violates the output bucketing of the result of sort merge join

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (HIVE-1290) sort merge join does not work with bucketizedhiveinputformat

Posted by "Namit Jain (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HIVE-1290?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Namit Jain updated HIVE-1290:
-----------------------------

    Attachment: hive.1290.1.patch

> sort merge join does not work with bucketizedhiveinputformat
> ------------------------------------------------------------
>
>                 Key: HIVE-1290
>                 URL: https://issues.apache.org/jira/browse/HIVE-1290
>             Project: Hadoop Hive
>          Issue Type: Bug
>          Components: Query Processor
>            Reporter: Namit Jain
>            Assignee: Namit Jain
>             Fix For: 0.6.0
>
>         Attachments: hive.1290.1.patch
>
>
> The mappers are assigned in the order of the sizes of the files which violates the output bucketing of the result of sort merge join

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (HIVE-1290) sort merge join does not work with bucketizedhiveinputformat

Posted by "Ning Zhang (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HIVE-1290?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ning Zhang updated HIVE-1290:
-----------------------------

    Resolution: Fixed
        Status: Resolved  (was: Patch Available)

Committed. Thanks Namit!

> sort merge join does not work with bucketizedhiveinputformat
> ------------------------------------------------------------
>
>                 Key: HIVE-1290
>                 URL: https://issues.apache.org/jira/browse/HIVE-1290
>             Project: Hadoop Hive
>          Issue Type: Bug
>          Components: Query Processor
>            Reporter: Namit Jain
>            Assignee: Namit Jain
>             Fix For: 0.6.0
>
>         Attachments: hive.1290.1.patch, hive.1290.2.patch, hive.1290.3.patch, hive.1290.4.patch
>
>
> The mappers are assigned in the order of the sizes of the files which violates the output bucketing of the result of sort merge join

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (HIVE-1290) sort merge join does not work with bucketizedhiveinputformat

Posted by "Ning Zhang (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HIVE-1290?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12853596#action_12853596 ] 

Ning Zhang commented on HIVE-1290:
----------------------------------

Looks good, one comment:

The taskID was changed in execContext for use in FileSinkOperator. It is not intuitive why we change the set the taskID in execContext. Should we change the name to something like "fileID"?
  

> sort merge join does not work with bucketizedhiveinputformat
> ------------------------------------------------------------
>
>                 Key: HIVE-1290
>                 URL: https://issues.apache.org/jira/browse/HIVE-1290
>             Project: Hadoop Hive
>          Issue Type: Bug
>          Components: Query Processor
>            Reporter: Namit Jain
>            Assignee: Namit Jain
>             Fix For: 0.6.0
>
>         Attachments: hive.1290.1.patch, hive.1290.2.patch
>
>
> The mappers are assigned in the order of the sizes of the files which violates the output bucketing of the result of sort merge join

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.