You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues-all@impala.apache.org by "Qifan Chen (Jira)" <ji...@apache.org> on 2022/03/07 13:54:00 UTC

[jira] [Created] (IMPALA-11163) To scan small dimensional tables, the number of nodes selected by FE can be less

Qifan Chen created IMPALA-11163:
-----------------------------------

             Summary: To scan small dimensional tables, the number of nodes selected by FE can be less
                 Key: IMPALA-11163
                 URL: https://issues.apache.org/jira/browse/IMPALA-11163
             Project: IMPALA
          Issue Type: Improvement
            Reporter: Qifan Chen


In Impala, FE determines the # of exec nodes to use for scan based on the # of local/remote nodes hosting data blocks. For example for a dimensional table,  assume its #local nodes = 3, and its #remote nodes = 17. Then # of exec nodes for scan is 20. The final value is min(20, #exec nodes in cluster).

In the case of a partitioned join(f, d) where f is the fact table and d is the dimensional table, the # of network opens from join to table d can be made less (say 2 instead of 20). Therefore, the system can handle more # of queries.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-all-unsubscribe@impala.apache.org
For additional commands, e-mail: issues-all-help@impala.apache.org