You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Navis (JIRA)" <ji...@apache.org> on 2013/11/04 05:50:17 UTC
[jira] [Commented] (HIVE-5657) TopN produces incorrect results with
count(distinct)
[ https://issues.apache.org/jira/browse/HIVE-5657?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13812617#comment-13812617 ]
Navis commented on HIVE-5657:
-----------------------------
HIVE-5503 is committed and I cannot rebase on that. Was it necessary to "refactor" the codes which is not yet confirmed solid and has on-going issues on it? I appreciate progresses made by members of the company. But not like this.
> TopN produces incorrect results with count(distinct)
> ----------------------------------------------------
>
> Key: HIVE-5657
> URL: https://issues.apache.org/jira/browse/HIVE-5657
> Project: Hive
> Issue Type: Bug
> Reporter: Sergey Shelukhin
> Assignee: Navis
> Priority: Critical
> Attachments: D13797.1.patch, D13797.2.patch, HIVE-5657.1.patch.txt, example.patch
>
>
> Attached patch illustrates the problem.
> limit_pushdown test has various other cases of aggregations and distincts, incl. count-distinct, that work correctly (that said, src dataset is bad for testing these things because every count, for example, produces one record only), so something must be special about this.
> I am not very familiar with distinct- code and these nuances; if someone knows a quick fix feel free to take this, otherwise I will probably start looking next week.
--
This message was sent by Atlassian JIRA
(v6.1#6144)