You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@drill.apache.org by "Aman Sinha (JIRA)" <ji...@apache.org> on 2015/04/29 07:20:05 UTC
[jira] [Created] (DRILL-2900) Joins are not getting parallelized in
TestTpchDistributed unit tests
Aman Sinha created DRILL-2900:
---------------------------------
Summary: Joins are not getting parallelized in TestTpchDistributed unit tests
Key: DRILL-2900
URL: https://issues.apache.org/jira/browse/DRILL-2900
Project: Apache Drill
Issue Type: Bug
Components: Query Planning & Optimization
Affects Versions: 0.8.0
Reporter: Aman Sinha
Assignee: Jinfeng Ni
The suite of unit tests in org.apache.drill.TestTpchDistributed are supposed to have parallel joins, parallel aggregations and sort. In order to force parallelism, the slice_target value is set to 10 (default is 100K). However, it looks like the plans of at least a few queries that I tried (query 03, 05, 14) did not have parallelism for the hash joins. The aggregations and sort do have Exchanges, so they are getting parallelized.
There is a likelihood that many plans have BroadcastExchange on the right input of the join and since the left input of the join is typically a single file we may be dropping the exchange on the right input. This should be fixed.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)