You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Prasanth J (JIRA)" <ji...@apache.org> on 2014/08/11 20:38:11 UTC

[jira] [Updated] (HIVE-7679) JOIN operator should update the column stats when number of rows changes

     [ https://issues.apache.org/jira/browse/HIVE-7679?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Prasanth J updated HIVE-7679:
-----------------------------

    Description: JOIN operator does not update the column stats when the number of rows changes. All other operators scales up/down the column statistics when the number of rows changes. Same should be done for JOIN operator as well. Because of this dataSize might become negative as numNulls can get bigger than numRows (if scaling down of column stats is not done).  (was: JOIN operator does not update the column stats when the number of rows changes. All other operators scales up/down the column statistics when the number of rows changes. Same should be done for JOIN operator as well. )

> JOIN operator should update the column stats when number of rows changes
> ------------------------------------------------------------------------
>
>                 Key: HIVE-7679
>                 URL: https://issues.apache.org/jira/browse/HIVE-7679
>             Project: Hive
>          Issue Type: Sub-task
>          Components: Query Processor, Statistics
>    Affects Versions: 0.14.0
>            Reporter: Prasanth J
>            Assignee: Prasanth J
>            Priority: Minor
>             Fix For: 0.13.0
>
>
> JOIN operator does not update the column stats when the number of rows changes. All other operators scales up/down the column statistics when the number of rows changes. Same should be done for JOIN operator as well. Because of this dataSize might become negative as numNulls can get bigger than numRows (if scaling down of column stats is not done).



--
This message was sent by Atlassian JIRA
(v6.2#6252)