You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@drill.apache.org by "Vitalii Diravka (JIRA)" <ji...@apache.org> on 2018/05/02 12:43:00 UTC

[jira] [Created] (DRILL-6379) Pushing additional filters for disjunction of expressions past JOIN

Vitalii Diravka created DRILL-6379:
--------------------------------------

             Summary: Pushing additional filters for disjunction of expressions past JOIN
                 Key: DRILL-6379
                 URL: https://issues.apache.org/jira/browse/DRILL-6379
             Project: Apache Drill
          Issue Type: Improvement
          Components: Query Planning &amp; Optimization
    Affects Versions: 1.13.0
            Reporter: Vitalii Diravka
            Assignee: Vitalii Diravka
             Fix For: Future


For queries with JOIN operator and filter as disjunction of expressions
the additional filters can be derived and pushed down to prevent of unnecessary scanning.
Query example:
{code}
SELECT * FROM t1 JOIN t2 ON T1.COLUMN = T2.COLUMN WHERE
(PC = X AND <other filters>) OR (PC = Y AND <some other filters>)
{code}

Unit test for _TestParquetFilterPushdownWithTransitivePredicates.java_:
{code}
  @Test
  public void testForOrOperatorTestOr() throws Exception {
    String query = String.format("SELECT * FROM %s t1 " +
            "JOIN %s t2 ON t1.`month` = t2.`month` " +
            "WHERE ((t1.`period` = 4 AND t2.`year` = 1991) OR (t1.`period` = 3 AND t1.`year` = 1991)) ",
        FIRST_TABLE_NAME, SECOND_TABLE_NAME);

    final String[] expectedPlan = {"first.*numRowGroups=2", "second.*numRowGroups=1"};
    testPlanMatchingPatterns(query, expectedPlan);
  }
{code}

{code}
LogicalProject(**=[$0], **0=[$4])
  LogicalFilter(condition=[OR(AND(=($2, 4), =($6, 1991)), AND(=($2, 3), =($3, 1991)))])
    LogicalJoin(condition=[=($1, $5)], joinType=[inner])
      EnumerableTableScan(table=[[dfs, parquetFilterPush/transitiveClosure/first]])
      EnumerableTableScan(table=[[dfs, parquetFilterPush/transitiveClosure/second]])
{code}

This improvement can be solved by CALCITE-2296.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)