You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@impala.apache.org by "Alexander Behm (JIRA)" <ji...@apache.org> on 2017/11/21 05:27:00 UTC

[jira] [Created] (IMPALA-6228) More flexible configuration of stats extrapolation

Alexander Behm created IMPALA-6228:
--------------------------------------

             Summary: More flexible configuration of stats extrapolation
                 Key: IMPALA-6228
                 URL: https://issues.apache.org/jira/browse/IMPALA-6228
             Project: IMPALA
          Issue Type: Improvement
          Components: Frontend
    Affects Versions: Impala 2.10.0, Impala 2.11.0
            Reporter: Alexander Behm


For stats extrapolation (IMPALA-2373) and COMPUTE STATS TABLESMAPLE (IMPALA-5310) we currently require an impalad startup option -enable_stats_extrapolation to be set.

It would be nice if changing that configuration would not require a service restart.
For example, we could consider a query option instead of adding a table property to tables where extrapolation should be enabled.

The reason for the current behavior is as follows:
It is technically not required to be a startup option, but it reduces the number of ways users can shoot themselves in the foot. For example, first running COMPUTE STATS TABLESAMPLE on a table T and then running a query against table T without stats extrapolation does not make sense and will not work well. This subtle behavior might not be clear to users. Yes, that can be addressed with warnings etc., but preventing non-sensical combinations seems better until we have strong evidence against that conservative approach.





--
This message was sent by Atlassian JIRA
(v6.4.14#64029)