You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Jesus Camacho Rodriguez (JIRA)" <ji...@apache.org> on 2016/02/18 21:56:18 UTC
[jira] [Created] (HIVE-13089) Rounding in Stats for equality
expressions
Jesus Camacho Rodriguez created HIVE-13089:
----------------------------------------------
Summary: Rounding in Stats for equality expressions
Key: HIVE-13089
URL: https://issues.apache.org/jira/browse/HIVE-13089
Project: Hive
Issue Type: Bug
Components: Statistics
Affects Versions: 2.1.0
Reporter: Jesus Camacho Rodriguez
Assignee: Jesus Camacho Rodriguez
Currently we divide numRows(long) by countDistinct(long), thus ignoring the decimals. We should do proper rounding.
This is specially useful for equality expressions over columns whose values are unique. As NDV estimates allow for a certain error, if countDistinct > numRows, we end up with 0 rows in the estimate for the expression.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)