Posted to issues@impala.apache.org by "Mostafa Mokhtar (JIRA)" <ji...@apache.org> on 2017/10/20 06:07:00 UTC
[jira] [Created] (IMPALA-6088) Rack aware broadcast operator
Mostafa Mokhtar created IMPALA-6088:
---------------------------------------
Summary: Rack aware broadcast operator
Key: IMPALA-6088
URL: https://issues.apache.org/jira/browse/IMPALA-6088
Project: IMPALA
Issue Type: Sub-task
Components: Distributed Exec
Reporter: Mostafa Mokhtar
During large-scale experiments on a 6-rack cluster with an aggregator-core network topology, overall cluster bandwidth utilization was limited.
In aggregator-core networks, nodes and racks are not equidistant, so a broadcast operation can be inefficient: the broadcasting node has to send the same data across the inter-rack links N times, once for each of the N nodes on a remote rack.
Ideally, row batches should be sent once per remote rack, and a node on each remote rack would then broadcast them within its rack.
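To make the difference concrete, here is a minimal sketch in Python that counts cross-rack transfers for a flat broadcast versus the rack-aware scheme described above. It is not Impala code: the function names, the node naming, the 4-nodes-per-rack layout, and the "first node listed acts as relay" rule are all assumptions made for illustration.

```python
from collections import defaultdict


def flat_broadcast(sender, rack_of):
    """Every receiver gets the batch directly from the sender."""
    return [(sender, node) for node in rack_of if node != sender]


def rack_aware_broadcast(sender, rack_of):
    """Send once per remote rack; a relay node fans out inside its rack."""
    by_rack = defaultdict(list)
    for node, rack in rack_of.items():
        by_rack[rack].append(node)

    sends = []
    for rack, nodes in by_rack.items():
        if rack == rack_of[sender]:
            # Local rack: the sender reaches its neighbours directly.
            sends += [(sender, n) for n in nodes if n != sender]
        else:
            # Hypothetical relay choice: the first node listed for the rack.
            relay = nodes[0]
            sends.append((sender, relay))
            sends += [(relay, n) for n in nodes[1:]]
    return sends


def cross_rack_sends(sends, rack_of):
    """Count transfers whose source and destination are on different racks."""
    return sum(1 for src, dst in sends if rack_of[src] != rack_of[dst])


# Hypothetical cluster: 6 racks with 4 nodes each.
rack_of = {f"r{r}n{n}": f"r{r}" for r in range(6) for n in range(4)}
sender = "r0n0"

flat = flat_broadcast(sender, rack_of)
aware = rack_aware_broadcast(sender, rack_of)
print(cross_rack_sends(flat, rack_of), cross_rack_sends(aware, rack_of))
```

Under these assumptions both plans deliver the batch to all 23 receivers with 23 transfers in total, but the flat plan crosses rack boundaries 20 times while the rack-aware plan crosses them 5 times, once per remote rack. The trade-off is an extra hop for nodes behind a relay.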
The table below shows rack-to-rack latency at the 90th percentile of operations; the ratio between the best and worst case is 7.3x.
|| || va|| vc|| vd1|| vd3|| ve||
|va| 4,238| 4,290| 9,692| 8,897| 8,208|
|vc| 9,290| 4,396| 30,952| 13,529| 14,578|
|vd1| 9,131| 29,066| 4,346| 17,265| 16,849|
|vd3| 7,409| 15,517| 17,265| 4,370| 4,687|
|ve| 4,914| 16,894| 16,430| 4,713| 4,472|
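The best-to-worst ratio can be checked directly from the table. The short Python sketch below copies the values as listed; the variable names are illustrative, and since the issue does not state the latency unit or which axis is the source rack, the code treats the numbers as unitless and simply labels each cell by (row, column).

```python
# Rack labels in the order used by the table's rows and columns.
racks = ["va", "vc", "vd1", "vd3", "ve"]

# Latency values copied from the table (unit not stated in the issue).
latency = [
    [4238, 4290, 9692, 8897, 8208],
    [9290, 4396, 30952, 13529, 14578],
    [9131, 29066, 4346, 17265, 16849],
    [7409, 15517, 17265, 4370, 4687],
    [4914, 16894, 16430, 4713, 4472],
]

# Flatten to (value, row_rack, column_rack) so min/max keep their labels.
cells = [
    (value, racks[i], racks[j])
    for i, row in enumerate(latency)
    for j, value in enumerate(row)
]

best = min(cells)
worst = max(cells)
ratio = worst[0] / best[0]

print(f"best:  {best[0]} ({best[1]}, {best[2]})")
print(f"worst: {worst[0]} ({worst[1]}, {worst[2]})")
print(f"ratio: {ratio:.1f}x")
```

The smallest entry is the va/va cell (4,238) and the largest is the vc/vd1 cell (30,952), which gives the ratio of roughly 7.3 quoted in the description.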