Posted to issues@hive.apache.org by "Krisztian Kasa (Jira)" <ji...@apache.org> on 2021/09/28 16:40:00 UTC

[jira] [Updated] (HIVE-25568) Estimate TopNKey operator statistics.

     [ https://issues.apache.org/jira/browse/HIVE-25568?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Krisztian Kasa updated HIVE-25568:
----------------------------------
    Description: 
Currently the TopNKey operator has the same statistics as its parent operator:
{code}
TableScan
  alias: src
  Statistics: Num rows: 500 Data size: 89000 Basic stats: COMPLETE Column stats: COMPLETE
  Top N Key Operator
    sort order: +
    keys: key (type: string)
    null sort order: z
    Statistics: Num rows: 500 Data size: 89000 Basic stats: COMPLETE Column stats: COMPLETE
    top n: 5
{code}
This operator filters out rows, and its statistics should reflect the reduced row count.

  was:
Currently TopNKey operator has the same statistics as its parent operator:
{code}
 TableScan
                  alias: src
                  Statistics: Num rows: 500 Data size: 89000 Basic stats: COMPLETE Column stats: COMPLETE
                  Top N Key Operator
                    sort order: +
                    keys: key (type: string)
                    null sort order: z
                    Statistics: Num rows: 500 Data size: 89000 Basic stats: COMPLETE Column stats: COMPLETE
                    top n: 5
{code}
This operator filters out rows and this should be indicated in statistics.


> Estimate TopNKey operator statistics.
> -------------------------------------
>
>                 Key: HIVE-25568
>                 URL: https://issues.apache.org/jira/browse/HIVE-25568
>             Project: Hive
>          Issue Type: Improvement
>            Reporter: Krisztian Kasa
>            Priority: Major
>
> Currently the TopNKey operator has the same statistics as its parent operator:
> {code}
> TableScan
>   alias: src
>   Statistics: Num rows: 500 Data size: 89000 Basic stats: COMPLETE Column stats: COMPLETE
>   Top N Key Operator
>     sort order: +
>     keys: key (type: string)
>     null sort order: z
>     Statistics: Num rows: 500 Data size: 89000 Basic stats: COMPLETE Column stats: COMPLETE
>     top n: 5
> {code}
> This operator filters out rows, and its statistics should reflect the reduced row count.
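
To make the request concrete, here is a minimal sketch of one possible heuristic for scaling the parent's statistics down. It is not necessarily what Hive implements for this issue. The class name TopNKeyStatsSketch, both method names, and the key NDV of 250 are hypothetical; only the 500 rows, the data size of 89000, and top n = 5 come from the plan above. The heuristic assumes rows are spread evenly over the distinct key values. TopNKey is also a best-effort filter whose real output depends on row arrival order, so the result is a rough figure at best.

```java
// Hypothetical sketch: estimate TopNKey output statistics from the parent's.
final class TopNKeyStatsSketch {

    // Estimated rows forwarded by TopNKey, assuming rows are spread evenly
    // across keyNdv distinct key values (keyNdv is an assumed input).
    static long estimateRows(long parentRows, int topN, long keyNdv) {
        if (parentRows <= 0 || topN <= 0) {
            return 0; // nothing to forward in this sketch
        }
        if (keyNdv <= 0 || topN >= keyNdv) {
            return parentRows; // NDV unknown, or every key fits in the top N
        }
        long rowsPerKey = (parentRows + keyNdv - 1) / keyNdv; // ceiling division
        return Math.min(parentRows, rowsPerKey * topN);
    }

    // Scale the data size by the parent's average row size.
    static long estimateDataSize(long parentDataSize, long parentRows, long estimatedRows) {
        if (parentRows <= 0) {
            return 0;
        }
        return Math.round((double) parentDataSize / parentRows * estimatedRows);
    }

    public static void main(String[] args) {
        long rows = estimateRows(500, 5, 250);           // NDV of 250 is assumed
        long size = estimateDataSize(89000, 500, rows);
        System.out.println("Num rows: " + rows + " Data size: " + size);
        // prints "Num rows: 10 Data size: 1780"
    }
}
```

When the NDV is unknown, the sketch keeps the parent's row count, which matches the behaviour the issue describes today.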



--
This message was sent by Atlassian Jira
(v8.3.4#803005)