You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hive.apache.org by "Eugene Koifman (JIRA)" <ji...@apache.org> on 2018/06/26 14:44:00 UTC

[jira] [Commented] (HIVE-19995) Aggregate row traffic for acid tables

    [ https://issues.apache.org/jira/browse/HIVE-19995?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16523822#comment-16523822 ] 

Eugene Koifman commented on HIVE-19995:
---------------------------------------

There is logic in the compactor to recompute column level stats but that doesn't run very often - currently only for major compaction.  Perhaps this is worth considering

 

> Aggregate row traffic for acid tables
> -------------------------------------
>
>                 Key: HIVE-19995
>                 URL: https://issues.apache.org/jira/browse/HIVE-19995
>             Project: Hive
>          Issue Type: Sub-task
>          Components: Statistics, Transactions
>            Reporter: Zoltan Haindrich
>            Assignee: Zoltan Haindrich
>            Priority: Major
>
> for transactional tables we store basic stats in case of explicit analyze/rewrite; but doesn't do anything in other cases....which may even lead to plans which oom...
> It would be better to aggregate the total row traffic...because that is already available; so that operator tree estimations could work with a real upper bound of the row numbers.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)