You are viewing a plain text version of this content. The canonical link for it is here.

Posted to dev@hive.apache.org by "Namit Jain (JIRA)" <ji...@apache.org> on 2009/03/24 18:56:51 UTC

[jira] Created: (HIVE-363) [hive] extra rows for count distinct

[hive] extra rows for count distinct
------------------------------------

                 Key: HIVE-363
                 URL: https://issues.apache.org/jira/browse/HIVE-363
             Project: Hadoop Hive
          Issue Type: Bug
          Components: Query Processor
    Affects Versions: 0.3.0
            Reporter: Namit Jain
            Assignee: Namit Jain


select count(distinct a) from T 

returns dummy rows from all reducers if number of reducers are more than 1

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (HIVE-363) [hive] extra rows for count distinct

Posted by "Raghotham Murthy (JIRA)" <ji...@apache.org>.

    [ https://issues.apache.org/jira/browse/HIVE-363?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12688797#action_12688797 ] 

Raghotham Murthy commented on HIVE-363:
---------------------------------------

looks like the dummyRow flag is not being used anywhere during execution time.

> [hive] extra rows for count distinct
> ------------------------------------
>
>                 Key: HIVE-363
>                 URL: https://issues.apache.org/jira/browse/HIVE-363
>             Project: Hadoop Hive
>          Issue Type: Bug
>          Components: Query Processor
>    Affects Versions: 0.3.0
>            Reporter: Namit Jain
>            Assignee: Namit Jain
>         Attachments: hive.363.1.patch
>
>
> select count(distinct a) from T 
> returns dummy rows from all reducers if number of reducers are more than 1

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Updated: (HIVE-363) [hive] extra rows for count distinct

Posted by "Namit Jain (JIRA)" <ji...@apache.org>.

     [ https://issues.apache.org/jira/browse/HIVE-363?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Namit Jain updated HIVE-363:
----------------------------

    Status: Open  (was: Patch Available)

> [hive] extra rows for count distinct
> ------------------------------------
>
>                 Key: HIVE-363
>                 URL: https://issues.apache.org/jira/browse/HIVE-363
>             Project: Hadoop Hive
>          Issue Type: Bug
>          Components: Query Processor
>    Affects Versions: 0.3.0
>            Reporter: Namit Jain
>            Assignee: Namit Jain
>         Attachments: hive.363.1.patch, hive.363.2.patch
>
>
> select count(distinct a) from T 
> returns dummy rows from all reducers if number of reducers are more than 1

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Updated: (HIVE-363) [hive] extra rows for count distinct

Posted by "Namit Jain (JIRA)" <ji...@apache.org>.

     [ https://issues.apache.org/jira/browse/HIVE-363?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Namit Jain updated HIVE-363:
----------------------------

    Attachment: hive.363.2.patch

> [hive] extra rows for count distinct
> ------------------------------------
>
>                 Key: HIVE-363
>                 URL: https://issues.apache.org/jira/browse/HIVE-363
>             Project: Hadoop Hive
>          Issue Type: Bug
>          Components: Query Processor
>    Affects Versions: 0.3.0
>            Reporter: Namit Jain
>            Assignee: Namit Jain
>         Attachments: hive.363.1.patch, hive.363.2.patch
>
>
> select count(distinct a) from T 
> returns dummy rows from all reducers if number of reducers are more than 1

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Updated: (HIVE-363) [hive] extra rows for count distinct

Posted by "Namit Jain (JIRA)" <ji...@apache.org>.

     [ https://issues.apache.org/jira/browse/HIVE-363?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Namit Jain updated HIVE-363:
----------------------------

    Attachment: hive.363.3.patch

> [hive] extra rows for count distinct
> ------------------------------------
>
>                 Key: HIVE-363
>                 URL: https://issues.apache.org/jira/browse/HIVE-363
>             Project: Hadoop Hive
>          Issue Type: Bug
>          Components: Query Processor
>    Affects Versions: 0.3.0
>            Reporter: Namit Jain
>            Assignee: Namit Jain
>         Attachments: hive.363.1.patch, hive.363.2.patch, hive.363.3.patch
>
>
> select count(distinct a) from T 
> returns dummy rows from all reducers if number of reducers are more than 1

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (HIVE-363) [hive] extra rows for count distinct

Posted by "Raghotham Murthy (JIRA)" <ji...@apache.org>.

    [ https://issues.apache.org/jira/browse/HIVE-363?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12688921#action_12688921 ] 

Raghotham Murthy commented on HIVE-363:
---------------------------------------

Looks like the dummyRow flag is not needed at all. All we need to do is force using exactly one reducer for full-table aggregates, i.e., if there are no grouping keys.

> [hive] extra rows for count distinct
> ------------------------------------
>
>                 Key: HIVE-363
>                 URL: https://issues.apache.org/jira/browse/HIVE-363
>             Project: Hadoop Hive
>          Issue Type: Bug
>          Components: Query Processor
>    Affects Versions: 0.3.0
>            Reporter: Namit Jain
>            Assignee: Namit Jain
>         Attachments: hive.363.1.patch, hive.363.2.patch
>
>
> select count(distinct a) from T 
> returns dummy rows from all reducers if number of reducers are more than 1

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Updated: (HIVE-363) [hive] extra rows for count distinct

Posted by "Namit Jain (JIRA)" <ji...@apache.org>.

     [ https://issues.apache.org/jira/browse/HIVE-363?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Namit Jain updated HIVE-363:
----------------------------

       Resolution: Fixed
    Fix Version/s: 0.3.0
     Hadoop Flags: [Reviewed]
           Status: Resolved  (was: Patch Available)

committed

> [hive] extra rows for count distinct
> ------------------------------------
>
>                 Key: HIVE-363
>                 URL: https://issues.apache.org/jira/browse/HIVE-363
>             Project: Hadoop Hive
>          Issue Type: Bug
>          Components: Query Processor
>    Affects Versions: 0.3.0
>            Reporter: Namit Jain
>            Assignee: Namit Jain
>             Fix For: 0.3.0
>
>         Attachments: hive.363.1.patch, hive.363.2.patch, hive.363.3.patch
>
>
> select count(distinct a) from T 
> returns dummy rows from all reducers if number of reducers are more than 1

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (HIVE-363) [hive] extra rows for count distinct

Posted by "Raghotham Murthy (JIRA)" <ji...@apache.org>.

    [ https://issues.apache.org/jira/browse/HIVE-363?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12688966#action_12688966 ] 

Raghotham Murthy commented on HIVE-363:
---------------------------------------

+1

looks good.

> [hive] extra rows for count distinct
> ------------------------------------
>
>                 Key: HIVE-363
>                 URL: https://issues.apache.org/jira/browse/HIVE-363
>             Project: Hadoop Hive
>          Issue Type: Bug
>          Components: Query Processor
>    Affects Versions: 0.3.0
>            Reporter: Namit Jain
>            Assignee: Namit Jain
>         Attachments: hive.363.1.patch, hive.363.2.patch, hive.363.3.patch
>
>
> select count(distinct a) from T 
> returns dummy rows from all reducers if number of reducers are more than 1

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Updated: (HIVE-363) [hive] extra rows for count distinct

Posted by "Namit Jain (JIRA)" <ji...@apache.org>.

     [ https://issues.apache.org/jira/browse/HIVE-363?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Namit Jain updated HIVE-363:
----------------------------

    Status: Patch Available  (was: Open)

> [hive] extra rows for count distinct
> ------------------------------------
>
>                 Key: HIVE-363
>                 URL: https://issues.apache.org/jira/browse/HIVE-363
>             Project: Hadoop Hive
>          Issue Type: Bug
>          Components: Query Processor
>    Affects Versions: 0.3.0
>            Reporter: Namit Jain
>            Assignee: Namit Jain
>         Attachments: hive.363.1.patch
>
>
> select count(distinct a) from T 
> returns dummy rows from all reducers if number of reducers are more than 1

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Updated: (HIVE-363) [hive] extra rows for count distinct

Posted by "Namit Jain (JIRA)" <ji...@apache.org>.

     [ https://issues.apache.org/jira/browse/HIVE-363?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Namit Jain updated HIVE-363:
----------------------------

    Status: Open  (was: Patch Available)

> [hive] extra rows for count distinct
> ------------------------------------
>
>                 Key: HIVE-363
>                 URL: https://issues.apache.org/jira/browse/HIVE-363
>             Project: Hadoop Hive
>          Issue Type: Bug
>          Components: Query Processor
>    Affects Versions: 0.3.0
>            Reporter: Namit Jain
>            Assignee: Namit Jain
>         Attachments: hive.363.1.patch, hive.363.2.patch
>
>
> select count(distinct a) from T 
> returns dummy rows from all reducers if number of reducers are more than 1

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Updated: (HIVE-363) [hive] extra rows for count distinct

Posted by "Namit Jain (JIRA)" <ji...@apache.org>.

     [ https://issues.apache.org/jira/browse/HIVE-363?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Namit Jain updated HIVE-363:
----------------------------

    Status: Patch Available  (was: Open)

> [hive] extra rows for count distinct
> ------------------------------------
>
>                 Key: HIVE-363
>                 URL: https://issues.apache.org/jira/browse/HIVE-363
>             Project: Hadoop Hive
>          Issue Type: Bug
>          Components: Query Processor
>    Affects Versions: 0.3.0
>            Reporter: Namit Jain
>            Assignee: Namit Jain
>         Attachments: hive.363.1.patch, hive.363.2.patch
>
>
> select count(distinct a) from T 
> returns dummy rows from all reducers if number of reducers are more than 1

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Updated: (HIVE-363) [hive] extra rows for count distinct

Posted by "Namit Jain (JIRA)" <ji...@apache.org>.

     [ https://issues.apache.org/jira/browse/HIVE-363?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Namit Jain updated HIVE-363:
----------------------------

    Status: Patch Available  (was: Open)

> [hive] extra rows for count distinct
> ------------------------------------
>
>                 Key: HIVE-363
>                 URL: https://issues.apache.org/jira/browse/HIVE-363
>             Project: Hadoop Hive
>          Issue Type: Bug
>          Components: Query Processor
>    Affects Versions: 0.3.0
>            Reporter: Namit Jain
>            Assignee: Namit Jain
>         Attachments: hive.363.1.patch, hive.363.2.patch, hive.363.3.patch
>
>
> select count(distinct a) from T 
> returns dummy rows from all reducers if number of reducers are more than 1

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Updated: (HIVE-363) [hive] extra rows for count distinct

Posted by "Namit Jain (JIRA)" <ji...@apache.org>.

     [ https://issues.apache.org/jira/browse/HIVE-363?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Namit Jain updated HIVE-363:
----------------------------

    Attachment: hive.363.1.patch

> [hive] extra rows for count distinct
> ------------------------------------
>
>                 Key: HIVE-363
>                 URL: https://issues.apache.org/jira/browse/HIVE-363
>             Project: Hadoop Hive
>          Issue Type: Bug
>          Components: Query Processor
>    Affects Versions: 0.3.0
>            Reporter: Namit Jain
>            Assignee: Namit Jain
>         Attachments: hive.363.1.patch
>
>
> select count(distinct a) from T 
> returns dummy rows from all reducers if number of reducers are more than 1

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.