You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hive.apache.org by "Eugene Koifman (JIRA)" <ji...@apache.org> on 2018/06/26 14:44:00 UTC
[jira] [Commented] (HIVE-19995) Aggregate row traffic for acid
tables
[ https://issues.apache.org/jira/browse/HIVE-19995?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16523822#comment-16523822 ]
Eugene Koifman commented on HIVE-19995:
---------------------------------------
There is logic in the compactor to recompute column level stats but that doesn't run very often - currently only for major compaction. Perhaps this is worth considering
> Aggregate row traffic for acid tables
> -------------------------------------
>
> Key: HIVE-19995
> URL: https://issues.apache.org/jira/browse/HIVE-19995
> Project: Hive
> Issue Type: Sub-task
> Components: Statistics, Transactions
> Reporter: Zoltan Haindrich
> Assignee: Zoltan Haindrich
> Priority: Major
>
> for transactional tables we store basic stats in case of explicit analyze/rewrite; but doesn't do anything in other cases....which may even lead to plans which oom...
> It would be better to aggregate the total row traffic...because that is already available; so that operator tree estimations could work with a real upper bound of the row numbers.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)