You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues-all@impala.apache.org by "Tim Armstrong (Jira)" <ji...@apache.org> on 2020/04/01 17:25:00 UTC

[jira] [Assigned] (IMPALA-9317) Improve number of instances estimate for scans in planner

     [ https://issues.apache.org/jira/browse/IMPALA-9317?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Tim Armstrong reassigned IMPALA-9317:
-------------------------------------

    Assignee: Tim Armstrong

> Improve number of instances estimate for scans in planner
> ---------------------------------------------------------
>
>                 Key: IMPALA-9317
>                 URL: https://issues.apache.org/jira/browse/IMPALA-9317
>             Project: IMPALA
>          Issue Type: Improvement
>          Components: Frontend
>            Reporter: Tim Armstrong
>            Assignee: Tim Armstrong
>            Priority: Major
>              Labels: multithreading
>
> Currently the number of instances (used for various estimates) is calculated as estimated # hosts * mt_dop. This is suboptimal, because in many cases there isn't enough input data to allow creating that many instances per host.
> The per-host estimate for scans actually already handles this correctly by capping the hosts estimate at the number of input splits - we should use the same approach for number of instance estimates.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-all-unsubscribe@impala.apache.org
For additional commands, e-mail: issues-all-help@impala.apache.org