Posted to issues@drill.apache.org by "Aman Sinha (JIRA)" <ji...@apache.org> on 2015/04/29 07:20:05 UTC

[jira] [Created] (DRILL-2900) Joins are not getting parallelized in TestTpchDistributed unit tests

Aman Sinha created DRILL-2900:
---------------------------------

             Summary: Joins are not getting parallelized in TestTpchDistributed unit tests
                 Key: DRILL-2900
                 URL: https://issues.apache.org/jira/browse/DRILL-2900
             Project: Apache Drill
          Issue Type: Bug
          Components: Query Planning & Optimization
    Affects Versions: 0.8.0
            Reporter: Aman Sinha
            Assignee: Jinfeng Ni


The unit tests in org.apache.drill.TestTpchDistributed are supposed to exercise parallel joins, parallel aggregations and sort. To force parallelism, the slice_target value is set to 10 (the default is 100K). However, the plans of at least the few queries that I tried (queries 03, 05 and 14) did not have parallelism for the hash joins. The aggregations and sort do have Exchanges, so they are getting parallelized.

It is likely that many plans have a BroadcastExchange on the right input of the join, and since the left input of the join is typically a single file, we may be dropping the exchange on the right input. This should be fixed.
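
One way to spot the affected plans is to scan the text plan for hash joins that have no Exchange anywhere beneath them. The following is only a rough sketch, not part of Drill or of the test suite: the function name joins_without_exchange is hypothetical, the sample plan is hand-written for illustration, and the code assumes each plan line has the shape "<fragment>-<operator>", then indentation, then the operator name, with deeper indentation meaning a child operator.

```python
import re

# Assumed line shape: "<major>-<operator><spaces><OperatorName>...".
# The width of the gap after the id is treated as the operator's depth.
PLAN_LINE = re.compile(r"^(\d+-\d+)(\s+)(\w+)")


def joins_without_exchange(plan_text):
    """Return ids of HashJoin operators with no Exchange among their descendants."""
    rows = []
    for line in plan_text.splitlines():
        match = PLAN_LINE.match(line)
        if match:
            op_id, gap, name = match.groups()
            rows.append((op_id, len(gap), name))

    missing = []
    for index, (op_id, depth, name) in enumerate(rows):
        if name != "HashJoin":
            continue
        has_exchange = False
        # Descendants are the following rows that are indented deeper than the join.
        for _, child_depth, child_name in rows[index + 1:]:
            if child_depth <= depth:
                break
            if child_name.endswith("Exchange"):
                has_exchange = True
                break
        if not has_exchange:
            missing.append(op_id)
    return missing


# Hypothetical plan: the sort and aggregation sit above Exchanges,
# but neither input of the join has one.
SERIAL_JOIN_PLAN = """\
00-00    Screen
00-01      Sort
00-02        UnionExchange
01-01          HashAgg
01-02            HashToRandomExchange
02-01              HashJoin
02-03                Scan
02-02                Scan
"""

# Hypothetical plan: the right input of the join keeps its BroadcastExchange.
BROADCAST_JOIN_PLAN = """\
00-00    Screen
00-01      UnionExchange
01-01        HashJoin
01-03          Scan
01-02          BroadcastExchange
02-01            Scan
"""

print(joins_without_exchange(SERIAL_JOIN_PLAN))
print(joins_without_exchange(BROADCAST_JOIN_PLAN))
```

A check along these lines only looks at the shape of the plan text. It does not tell us how many minor fragments actually ran, so it would be a first filter for finding suspicious queries rather than a proof that a join was serialized.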



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)