You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues-all@impala.apache.org by "Vuk Ercegovac (JIRA)" <ji...@apache.org> on 2018/08/11 04:55:00 UTC

[jira] [Assigned] (IMPALA-7425) Add option to load incremental statistics from catalog

     [ https://issues.apache.org/jira/browse/IMPALA-7425?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Vuk Ercegovac reassigned IMPALA-7425:
-------------------------------------

    Assignee: Vuk Ercegovac

> Add option to load incremental statistics from catalog
> ------------------------------------------------------
>
>                 Key: IMPALA-7425
>                 URL: https://issues.apache.org/jira/browse/IMPALA-7425
>             Project: IMPALA
>          Issue Type: Improvement
>          Components: Catalog
>    Affects Versions: Impala 3.1.0
>            Reporter: Vuk Ercegovac
>            Assignee: Vuk Ercegovac
>            Priority: Major
>
> Incremental statistics currently store all required data in catalogd and all impalad coordinators. However, this data is only required when computing incremental statistics. In cases where incremental statistics is used on many partition columns (due to tables with many columns, many partitions or both), this data can dominate the overall memory footprint. This can lead to OOM's, increased network usage, and instability.
> Add an option to avoid propagating incremental stats to all coordinators and instead, pull it on demand from the catalog only when needed by the compute incremental statistics statement.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-all-unsubscribe@impala.apache.org
For additional commands, e-mail: issues-all-help@impala.apache.org