Posted to dev@hive.apache.org by "Sergey Shelukhin (JIRA)" <ji...@apache.org> on 2013/10/26 03:26:30 UTC
[jira] [Created] (HIVE-5657) TopN produces incorrect results with count(distinct)
Sergey Shelukhin created HIVE-5657:
--------------------------------------
Summary: TopN produces incorrect results with count(distinct)
Key: HIVE-5657
URL: https://issues.apache.org/jira/browse/HIVE-5657
Project: Hive
Issue Type: Bug
Reporter: Sergey Shelukhin
Priority: Critical
Attachments: example.patch
Attached patch illustrates the problem.
The limit_pushdown test has various other cases of aggregations and distincts, including count(distinct), that work correctly, so something must be special about this case. That said, the src dataset is bad for testing these things because every count, for example, produces only one record.
I am not very familiar with the distinct code and its nuances. If someone knows a quick fix, feel free to take this; otherwise I will probably start looking next week.
--
This message was sent by Atlassian JIRA
(v6.1#6144)