You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@tajo.apache.org by "Jaehwa Jung (JIRA)" <ji...@apache.org> on 2016/03/23 08:06:25 UTC
[jira] [Resolved] (TAJO-2093) Partition Pruning doesn't handle
Constant folding occasionally for BETWEEN clause.
[ https://issues.apache.org/jira/browse/TAJO-2093?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Jaehwa Jung resolved TAJO-2093.
-------------------------------
Resolution: Fixed
> Partition Pruning doesn't handle Constant folding occasionally for BETWEEN clause.
> ----------------------------------------------------------------------------------
>
> Key: TAJO-2093
> URL: https://issues.apache.org/jira/browse/TAJO-2093
> Project: Tajo
> Issue Type: Bug
> Components: Planner/Optimizer
> Affects Versions: 0.12.0, 0.11.1
> Reporter: Jaehwa Jung
> Assignee: Jaehwa Jung
> Fix For: 0.11.2
>
>
> In some cases, Partition Pruning doesn't handle Constant folding for {{BETWEEN|| clause as shown below.
> {code:xml}
> default> CREATE EXTERNAL TABLE student (
> id INT,
> name TEXT,
> grade TEXT,
> phone TEXT
> ) USING TEXT
> PARTITION BY COLUMN (entrance_date TEXT)
> LOCATION '/data/student';
> default> \dfs -ls /data/student;
> Found 6 items
> drwx------ - tajo supergroup 0 2016-03-14 11:13 /data/student/entrance_date=20160308
> drwx------ - tajo supergroup 0 2016-03-14 11:13 /data/student/entrance_date=20160309
> drwx------ - tajo supergroup 0 2016-03-14 11:13 /data/student/entrance_date=20160310
> drwx------ - tajo supergroup 0 2016-03-14 11:13 /data/student/entrance_date=20160311
> drwx------ - tajo supergroup 0 2016-03-14 11:13 /data/student/entrance_date=20160312
> drwx------ - tajo supergroup 0 2016-03-14 11:13 /data/student/entrance_date=20160313
> default> \! date
> 2016. 03. 14. (MON) 15:19:27 KST
> default> SELECT * FROM student WHERE entrance_date BETWEEN TO_CHAR(ADD_DAYS(CURRENT_DATE(), -2) , 'YYYYMMDD') AND TO_CHAR( ADD_DAYS(CURRENT_DATE(), -1) , 'YYYYMMDD');
> {code}
> Above select query returns two rows as expected. But when I found logical plan on Tajo web UI, tajo retrieves all sub-directories of the example table as following:
> {code:xml}
> PARTITIONS_SCAN(4) on default.student
> => filter: default.student.entrance_date (TEXT) BETWEEN ASYMMETRIC to_char(add_days(current_date(),-2),YYYYMMDD) AND to_char(add_days(current_date(),-1),YYYYMMDD)
> => target list: default.student.id (INT4), default.student.name (TEXT), default.student.grade (TEXT), default.student.phone (TEXT), default.student.entrance_date (TEXT)
> => num of filtered paths: 6
> => out schema: {(5) default.student.id (INT4), default.student.name (TEXT), default.student.grade (TEXT), default.student.phone (TEXT), default.student.entrance_date (TEXT)}
> => in schema: {(4) default.student.id (INT4), default.student.name (TEXT), default.student.grade (TEXT), default.student.phone (TEXT)}
> => 0: hdfs://localhost:9010/data/student/entrance_date=20160308
> => 1: hdfs://localhost:9010/data/student/entrance_date=20160309
> => 2: hdfs://localhost:9010/data/student/entrance_date=20160310
> => 3: hdfs://localhost:9010/data/student/entrance_date=20160311
> => 4: hdfs://localhost:9010/data/student/entrance_date=20160312
> => 5: hdfs://localhost:9010/data/student/entrance_date=20160313
> {code}
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)