You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@drill.apache.org by "Ramana Inukonda Nagaraj (JIRA)" <ji...@apache.org> on 2014/06/26 00:28:24 UTC

[jira] [Commented] (DRILL-989) TPCH 20 returning 0 rows

    [ https://issues.apache.org/jira/browse/DRILL-989?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14044127#comment-14044127 ] 

Ramana Inukonda Nagaraj commented on DRILL-989:
-----------------------------------------------

Query as of build 79c1502 executes successfully and returns results but results are wrong even for small datasets.
SF 0.01:
>From drill:
s_name	s_address
Supplier#000000006	tQxuVm7s7CnK
Supplier#000000006	tQxuVm7s7CnK
Supplier#000000006	tQxuVm7s7CnK
Supplier#000000006	tQxuVm7s7CnK
Supplier#000000047	3XM1x,Pcxqw,HK4XNlgbnZMbLhBHLA
Supplier#000000047	3XM1x,Pcxqw,HK4XNlgbnZMbLhBHLA
Supplier#000000048	jg0U FNPMQDuyuKvTnLXXaLf3Wl6OtONA6mQlWJ
Supplier#000000076	JBhSBa3cLYvNgHUYtUHmtECCD
Supplier#000000079	p0u3tztSXUD2J8vFfLNFNKsrRRv7qyUtTBTA
Supplier#000000083	WRJUkzCn050seVz57oAfrbCuw
Supplier#000000083	WRJUkzCn050seVz57oAfrbCuw

Baseline:
Supplier#000000006	tQxuVm7s7CnK
Supplier#000000047	3XM1x,Pcxqw,HK4XNlgbnZMbLhBHLA
Supplier#000000048	jg0U FNPMQDuyuKvTnLXXaLf3Wl6OtONA6mQlWJ
Supplier#000000076	JBhSBa3cLYvNgHUYtUHmtECCD
Supplier#000000079	p0u3tztSXUD2J8vFfLNFNKsrRRv7qyUtTBTA
Supplier#000000083	WRJUkzCn050seVz57oAfrbCuw





> TPCH 20 returning 0 rows
> ------------------------
>
>                 Key: DRILL-989
>                 URL: https://issues.apache.org/jira/browse/DRILL-989
>             Project: Apache Drill
>          Issue Type: Bug
>          Components: Execution - Flow
>            Reporter: Ramana Inukonda Nagaraj
>
> TPCH 20 returns 0 rows, successfully executes though.
> Text version of physical plan:
> {code}
> 00-00    Screen
> 00-01      Project(s_name=[$0], s_address=[$1])
> 00-02        SingleMergeExchange(sort0=[0 ASC])
> 01-01          SelectionVectorRemover
> 01-02            Sort(sort0=[$0], dir0=[ASC])
> 01-03              HashToRandomExchange(dist0=[[$0]])
> 02-01                Project(s_name=[$0], s_address=[$1])
> 02-02                  HashJoin(condition=[=($2, $3)], joinType=[inner])
> 02-04                    HashToRandomExchange(dist0=[[$2]])
> 03-01                      Project($f3=[$2], $f4=[$3], $f8=[$0])
> 03-02                        HashJoin(condition=[=($1, $5)], joinType=[inner])
> 03-04                          Project(s_suppkey=[$3], s_nationkey=[$2], s_name=[$1], s_address=[$0])
> 03-05                            Scan(groupscan=[ParquetGroupScan [entries=[ReadEntryWithPath [path=maprfs:/drill/testdata/tpch-multi/supplier]], selectionRoot=/drill/testdata/tpch-multi/supplier, columns=[SchemaPath [`s_suppkey`], SchemaPath [`s_nationkey`], SchemaPath [`s_name`], SchemaPath [`s_address`]]]])
> 03-03                          BroadcastExchange
> 06-01                            SelectionVectorRemover
> 06-02                              Filter(condition=[=(CAST($0):CHAR(5) CHARACTER SET "ISO-8859-1" COLLATE "ISO-8859-1$en_US$primary", 'KENYA')])
> 06-03                                Project(n_name=[$1], n_nationkey=[$0])
> 06-04                                  Scan(groupscan=[ParquetGroupScan [entries=[ReadEntryWithPath [path=maprfs:/drill/testdata/tpch-multi/nation]], selectionRoot=/drill/testdata/tpch-multi/nation, columns=[SchemaPath [`n_name`], SchemaPath [`n_nationkey`]]]])
> 02-03                    StreamAgg(group=[{0}])
> 02-05                      Project(ps_suppkey=[$2])
> 02-06                        SelectionVectorRemover
> 02-07                          Filter(condition=[AND(true, >($1, CAST(*(0.5, $5)):ANY))])
> 02-08                            HashJoin(condition=[AND(=($0, $3), =($2, $4))], joinType=[left])
> 02-10                              HashToRandomExchange(dist0=[[$0]], dist1=[[$2]])
> 04-01                                Project($f1=[$0], $f2=[$1], $f3=[$2])
> 04-02                                  HashJoin(condition=[=($3, $4)], joinType=[inner])
> 04-04                                    Project($f1=[$0], $f2=[$2], $f3=[$1], $f4=[$0])
> 04-05                                      Scan(groupscan=[ParquetGroupScan [entries=[ReadEntryWithPath [path=maprfs:/drill/testdata/tpch-multi/partsupp]], selectionRoot=/drill/testdata/tpch-multi/partsupp, columns=[SchemaPath [`ps_partkey`], SchemaPath [`ps_availqty`], SchemaPath [`ps_suppkey`]]]])
> 04-03                                    BroadcastExchange
> 07-01                                      HashAgg(group=[{0}])
> 07-02                                        HashToRandomExchange(dist0=[[$0]])
> 09-01                                          Project(p_partkey=[$1])
> 09-02                                            SelectionVectorRemover
> 09-03                                              Filter(condition=[LIKE($0, 'antique%')])
> 09-04                                                Scan(groupscan=[ParquetGroupScan [entries=[ReadEntryWithPath [path=maprfs:/drill/testdata/tpch-multi/part]], selectionRoot=/drill/testdata/tpch-multi/part, columns=[SchemaPath [`p_name`], SchemaPath [`p_partkey`]]]])
> 02-09                              Project($f0=[$0], $f10=[$1], $f20=[$2])
> 02-11                                HashAgg(group=[{0, 1}], agg#0=[SUM($2)])
> 02-12                                  HashToRandomExchange(dist0=[[$0]])
> 05-01                                    Project($f0=[$4], $f1=[$5], l_quantity=[$3])
> 05-02                                      HashJoin(condition=[AND(=($0, $4), =($1, $5))], joinType=[inner])
> 05-04                                        SelectionVectorRemover
> 05-05                                          Filter(condition=[AND(>=($2, 1993-01-01), <($2, +(1993-01-01, 12)))])
> 05-06                                            Project(l_partkey=[$2], l_suppkey=[$1], l_shipdate=[$3], l_quantity=[$0])
> 05-07                                              Scan(groupscan=[ParquetGroupScan [entries=[ReadEntryWithPath [path=maprfs:/drill/testdata/tpch-multi/lineitem]], selectionRoot=/drill/testdata/tpch-multi/lineitem, columns=[SchemaPath [`l_partkey`], SchemaPath [`l_suppkey`], SchemaPath [`l_shipdate`], SchemaPath [`l_quantity`]]]])
> 05-03                                        BroadcastExchange
> 08-01                                          HashAgg(group=[{0, 1}])
> 08-02                                            HashToRandomExchange(dist0=[[$0]])
> 10-01                                              Project($f0=[$0], $f1=[$1])
> 10-02                                                HashJoin(condition=[=($2, $3)], joinType=[inner])
> 10-04                                                  Project($f1=[$0], $f3=[$1], $f4=[$0])
> 10-05                                                    Scan(groupscan=[ParquetGroupScan [entries=[ReadEntryWithPath [path=maprfs:/drill/testdata/tpch-multi/partsupp]], selectionRoot=/drill/testdata/tpch-multi/partsupp, columns=[SchemaPath [`ps_partkey`], SchemaPath [`ps_suppkey`]]]])
> 10-03                                                  BroadcastExchange
> 11-01                                                    HashAgg(group=[{0}])
> 11-02                                                      HashToRandomExchange(dist0=[[$0]])
> 12-01                                                        Project(p_partkey=[$1])
> 12-02                                                          SelectionVectorRemover
> 12-03                                                            Filter(condition=[LIKE($0, 'antique%')])
> 12-04                                                              Scan(groupscan=[ParquetGroupScan [entries=[ReadEntryWithPath [path=maprfs:/drill/testdata/tpch-multi/part]], selectionRoot=/drill/testdata/tpch-multi/part, columns=[SchemaPath [`p_name`], SchemaPath [`p_partkey`]]]])
> {code}



--
This message was sent by Atlassian JIRA
(v6.2#6252)