You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@tajo.apache.org by "Hyoungjun Kim (JIRA)" <ji...@apache.org> on 2014/07/01 16:41:24 UTC

[jira] [Created] (TAJO-897) PartitionedTableRewriter is repeated several times with same table.

Hyoungjun Kim created TAJO-897:
----------------------------------

             Summary: PartitionedTableRewriter is repeated several times with same table. 
                 Key: TAJO-897
                 URL: https://issues.apache.org/jira/browse/TAJO-897
             Project: Tajo
          Issue Type: Bug
            Reporter: Hyoungjun Kim
            Assignee: Hyoungjun Kim
            Priority: Minor


See the title. 
If there is some block which contains partitioned table, PartitionedTableRewriter runs several time. At first time after finding partition path, PartitionedTableRewriter removes partitioned filter condition. So next time all partition is selected for scanning.
I ran the next query. customer_parts table is partitioned by c_nationkey.
{code:sql}
select a.c_custkey, b.c_custkey from 
 (select c_custkey, c_nationkey from customer_parts where c_nationkey < 0 
 union all 
  select c_custkey, c_nationkey from customer_parts where c_nationkey < 0 
) a
left outer join customer_parts b
on a.c_custkey = b.c_custkey 
and a.c_nationkey > 0
{code}


{noformat}
=======================================================
Block Id: eb_1404224996147_0002_000001 [LEAF]
=======================================================

[Outgoing]
[q_1404224996147_0002] 1 => 3 (type=HASH_SHUFFLE, key=default.a.c_custkey (INT4), num=32)

TABLE_SUBQUERY(19) as default.a
  => Targets: default.a.c_custkey (INT4) as default.a.c_custkey
  => out schema: {(1) default.a.c_custkey (INT4)}
  => in  schema: {(2) default.a.c_custkey (INT4),default.a.c_nationkey (INT4)}
   PARTITIONS_SCAN(16) on default.customer_parts
     => target list: default.customer_parts.c_custkey (INT4), default.customer_parts.c_nationkey (INT4)
     => num of filtered paths: 5
     => out schema: {(2) default.customer_parts.c_custkey (INT4),default.customer_parts.c_nationkey (INT4)}
     => in schema: {(7) default.customer_parts.c_custkey (INT4),default.customer_parts.c_name (TEXT),default.customer_parts.c_address (TEXT),default.customer_parts.c_phone (TEXT),default.customer_parts.c_acctbal (FLOAT8),default.customer_parts.c_mktsegment (TEXT),default.customer_parts.c_comment (TEXT)}
     => 0: hdfs://localhost:49896/tajo/warehouse/default/customer_parts/c_nationkey=1
     => 1: hdfs://localhost:49896/tajo/warehouse/default/customer_parts/c_nationkey=13
     => 2: hdfs://localhost:49896/tajo/warehouse/default/customer_parts/c_nationkey=15
     => 3: hdfs://localhost:49896/tajo/warehouse/default/customer_parts/c_nationkey=3
     => 4: hdfs://localhost:49896/tajo/warehouse/default/customer_parts/c_nationkey=4

=======================================================
Block Id: eb_1404224996147_0002_000002 [LEAF]
=======================================================

[Outgoing]
[q_1404224996147_0002] 2 => 3 (type=HASH_SHUFFLE, key=default.a.c_custkey (INT4), num=32)

TABLE_SUBQUERY(20) as default.a
  => Targets: default.a.c_custkey (INT4)
  => out schema: {(1) default.a.c_custkey (INT4)}
  => in  schema: {(2) default.a.c_custkey (INT4),default.a.c_nationkey (INT4)}
   PARTITIONS_SCAN(17) on default.customer_parts
     => target list: default.customer_parts.c_custkey (INT4), default.customer_parts.c_nationkey (INT4)
     => num of filtered paths: 5
     => out schema: {(2) default.customer_parts.c_custkey (INT4),default.customer_parts.c_nationkey (INT4)}
     => in schema: {(7) default.customer_parts.c_custkey (INT4),default.customer_parts.c_name (TEXT),default.customer_parts.c_address (TEXT),default.customer_parts.c_phone (TEXT),default.customer_parts.c_acctbal (FLOAT8),default.customer_parts.c_mktsegment (TEXT),default.customer_parts.c_comment (TEXT)}
     => 0: hdfs://localhost:49896/tajo/warehouse/default/customer_parts/c_nationkey=1
     => 1: hdfs://localhost:49896/tajo/warehouse/default/customer_parts/c_nationkey=13
     => 2: hdfs://localhost:49896/tajo/warehouse/default/customer_parts/c_nationkey=15
     => 3: hdfs://localhost:49896/tajo/warehouse/default/customer_parts/c_nationkey=3
     => 4: hdfs://localhost:49896/tajo/warehouse/default/customer_parts/c_nationkey=4

=======================================================
Block Id: eb_1404224996147_0002_000004 [LEAF]
=======================================================

[Outgoing]
[q_1404224996147_0002] 4 => 3 (type=HASH_SHUFFLE, key=default.b.c_custkey (INT4), num=32)

PARTITIONS_SCAN(15) on default.customer_parts
  => target list: default.b.c_custkey (INT4)
  => num of filtered paths: 5
  => out schema: {(1) default.b.c_custkey (INT4)}
  => in schema: {(7) default.b.c_custkey (INT4),default.b.c_name (TEXT),default.b.c_address (TEXT),default.b.c_phone (TEXT),default.b.c_acctbal (FLOAT8),default.b.c_mktsegment (TEXT),default.b.c_comment (TEXT)}
  => 0: hdfs://localhost:49896/tajo/warehouse/default/customer_parts/c_nationkey=1
  => 1: hdfs://localhost:49896/tajo/warehouse/default/customer_parts/c_nationkey=13
  => 2: hdfs://localhost:49896/tajo/warehouse/default/customer_parts/c_nationkey=15
  => 3: hdfs://localhost:49896/tajo/warehouse/default/customer_parts/c_nationkey=3
  => 4: hdfs://localhost:49896/tajo/warehouse/default/customer_parts/c_nationkey=4

=======================================================
Block Id: eb_1404224996147_0002_000003 [ROOT]
=======================================================
{noformat}



--
This message was sent by Atlassian JIRA
(v6.2#6252)