You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Namit Jain (JIRA)" <ji...@apache.org> on 2009/09/12 03:20:57 UTC

[jira] Created: (HIVE-827) increase hive.mapjoin.cache.numrows default value

increase hive.mapjoin.cache.numrows default value
-------------------------------------------------

                 Key: HIVE-827
                 URL: https://issues.apache.org/jira/browse/HIVE-827
             Project: Hadoop Hive
          Issue Type: Bug
          Components: Query Processor
            Reporter: Namit Jain


Currently, it is 10000 and leads to bad performance

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Assigned: (HIVE-827) increase hive.mapjoin.cache.numrows default value

Posted by "Namit Jain (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HIVE-827?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Namit Jain reassigned HIVE-827:
-------------------------------

    Assignee: Namit Jain

> increase hive.mapjoin.cache.numrows default value
> -------------------------------------------------
>
>                 Key: HIVE-827
>                 URL: https://issues.apache.org/jira/browse/HIVE-827
>             Project: Hadoop Hive
>          Issue Type: Bug
>          Components: Query Processor
>            Reporter: Namit Jain
>            Assignee: Namit Jain
>         Attachments: hive.827.1.patch
>
>
> Currently, it is 10000 and leads to bad performance

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (HIVE-827) increase hive.mapjoin.cache.numrows default value

Posted by "Jeff Hammerbacher (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HIVE-827?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12754488#action_12754488 ] 

Jeff Hammerbacher commented on HIVE-827:
----------------------------------------

For posterity, why does the value of 10,000 lead to bad performance?

> increase hive.mapjoin.cache.numrows default value
> -------------------------------------------------
>
>                 Key: HIVE-827
>                 URL: https://issues.apache.org/jira/browse/HIVE-827
>             Project: Hadoop Hive
>          Issue Type: Bug
>          Components: Query Processor
>            Reporter: Namit Jain
>         Attachments: hive.827.1.patch
>
>
> Currently, it is 10000 and leads to bad performance

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (HIVE-827) increase hive.mapjoin.cache.numrows default value

Posted by "Zheng Shao (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HIVE-827?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12757453#action_12757453 ] 

Zheng Shao commented on HIVE-827:
---------------------------------

+1. Will commit if tests pass.

> increase hive.mapjoin.cache.numrows default value
> -------------------------------------------------
>
>                 Key: HIVE-827
>                 URL: https://issues.apache.org/jira/browse/HIVE-827
>             Project: Hadoop Hive
>          Issue Type: Bug
>          Components: Query Processor
>            Reporter: Namit Jain
>            Assignee: Namit Jain
>         Attachments: hive.827.1.patch
>
>
> Currently, it is 10000 and leads to bad performance

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (HIVE-827) increase hive.mapjoin.cache.numrows default value

Posted by "Namit Jain (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HIVE-827?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12754827#action_12754827 ] 

Namit Jain commented on HIVE-827:
---------------------------------

What I meant to say is that there is enough memory on the machines to fit more than 10,000 rows easily - of course, that depends on the row size.
But, based on the mails in the mailing lists, I felt that, a higher default may be better

> increase hive.mapjoin.cache.numrows default value
> -------------------------------------------------
>
>                 Key: HIVE-827
>                 URL: https://issues.apache.org/jira/browse/HIVE-827
>             Project: Hadoop Hive
>          Issue Type: Bug
>          Components: Query Processor
>            Reporter: Namit Jain
>         Attachments: hive.827.1.patch
>
>
> Currently, it is 10000 and leads to bad performance

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (HIVE-827) increase hive.mapjoin.cache.numrows default value

Posted by "Namit Jain (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HIVE-827?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Namit Jain updated HIVE-827:
----------------------------

    Attachment: hive.827.1.patch

> increase hive.mapjoin.cache.numrows default value
> -------------------------------------------------
>
>                 Key: HIVE-827
>                 URL: https://issues.apache.org/jira/browse/HIVE-827
>             Project: Hadoop Hive
>          Issue Type: Bug
>          Components: Query Processor
>            Reporter: Namit Jain
>         Attachments: hive.827.1.patch
>
>
> Currently, it is 10000 and leads to bad performance

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (HIVE-827) increase hive.mapjoin.cache.numrows default value

Posted by "Zheng Shao (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HIVE-827?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Zheng Shao updated HIVE-827:
----------------------------

       Resolution: Fixed
    Fix Version/s: 0.5.0
     Release Note: HIVE-827. Increase hive.mapjoin.cache.numrows default value. (Namit Jain via zshao)
     Hadoop Flags: [Reviewed]
           Status: Resolved  (was: Patch Available)

Committed. Thanks Namit!

> increase hive.mapjoin.cache.numrows default value
> -------------------------------------------------
>
>                 Key: HIVE-827
>                 URL: https://issues.apache.org/jira/browse/HIVE-827
>             Project: Hadoop Hive
>          Issue Type: Bug
>          Components: Query Processor
>            Reporter: Namit Jain
>            Assignee: Namit Jain
>             Fix For: 0.5.0
>
>         Attachments: hive.827.1.patch
>
>
> Currently, it is 10000 and leads to bad performance

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (HIVE-827) increase hive.mapjoin.cache.numrows default value

Posted by "Namit Jain (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HIVE-827?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Namit Jain updated HIVE-827:
----------------------------

    Status: Patch Available  (was: Open)

> increase hive.mapjoin.cache.numrows default value
> -------------------------------------------------
>
>                 Key: HIVE-827
>                 URL: https://issues.apache.org/jira/browse/HIVE-827
>             Project: Hadoop Hive
>          Issue Type: Bug
>          Components: Query Processor
>            Reporter: Namit Jain
>         Attachments: hive.827.1.patch
>
>
> Currently, it is 10000 and leads to bad performance

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (HIVE-827) increase hive.mapjoin.cache.numrows default value

Posted by "Namit Jain (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HIVE-827?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12754829#action_12754829 ] 

Namit Jain commented on HIVE-827:
---------------------------------

It is actually backed my persistent disk, so there is no correctness issue, but most of the applications may be able to use a higher default value

> increase hive.mapjoin.cache.numrows default value
> -------------------------------------------------
>
>                 Key: HIVE-827
>                 URL: https://issues.apache.org/jira/browse/HIVE-827
>             Project: Hadoop Hive
>          Issue Type: Bug
>          Components: Query Processor
>            Reporter: Namit Jain
>         Attachments: hive.827.1.patch
>
>
> Currently, it is 10000 and leads to bad performance

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.