You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@drill.apache.org by "Jacques Nadeau (JIRA)" <ji...@apache.org> on 2014/06/09 07:35:21 UTC

[jira] [Resolved] (DRILL-740) Re-enable TPCH05 in unit tests after fixing underlying reason for timeout

     [ https://issues.apache.org/jira/browse/DRILL-740?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jacques Nadeau resolved DRILL-740.
----------------------------------

    Resolution: Fixed

> Re-enable TPCH05 in unit tests after fixing underlying reason for timeout 
> --------------------------------------------------------------------------
>
>                 Key: DRILL-740
>                 URL: https://issues.apache.org/jira/browse/DRILL-740
>             Project: Apache Drill
>          Issue Type: Bug
>            Reporter: Aman Sinha
>
> TPCH05 is currently marked Ignored since it started timing out after we enabled new plans with hash join and hash aggregate.  The logs show that the query progresses until a point where  some downstream fragment is waiting for data or end of stream and the query times out.  More diagnostic information needs to be added to track exactly what each fragment is doing.   We need to re-enable this test after fixing the underlying cause for timeout. 
> Here's the physical plan for query 05: 
> Drill Physical : 
> ScreenPrel: rowcount = 9.638871459960938E8, cumulative cost = {8.14099147957207E10 rows, 1.3306858338906152E12 cpu, 0.0 io, 1.521937908014208E15 network}, id = 17939
>   SingleMergeExchangePrel(sort0=[1 DESC]): rowcount = 9.638871459960938E8, cumulative cost = {8.13135260811211E10 rows, 1.3305894451760156E12 cpu, 0.0 io, 1.521937908014208E15 network}, id = 17938
>     SortPrel(sort0=[$1], dir0=[DESC]): rowcount = 9.638871459960938E8, cumulative cost = {8.0349638935125E10 rows, 1.3228783480080469E12 cpu, 0.0 io, 1.514041744514208E15 network}, id = 17937
>       HashToRandomExchangePrel(dist0=[[$1]]): rowcount = 9.638871459960938E8, cumulative cost = {7.93857517891289E10 rows, 1.2078122418080312E12 cpu, 0.0 io, 1.514041744514208E15 network}, id = 17936
>         HashAggPrel(group=[{0}], revenue=[SUM($1)]): rowcount = 9.638871459960938E8, cumulative cost = {7.842186464313281E10 rows, 1.1923900474720938E12 cpu, 0.0 io, 1.506145581014208E15 network}, id = 17935
>           HashToRandomExchangePrel(dist0=[[$0]]): rowcount = 9.638871459960938E9, cumulative cost = {6.878299318317188E10 rows, 9.99612618272875E11 cpu, 0.0 io, 1.506145581014208E15 network}, id = 17934
>             ProjectPrel(n_name=[$4], $f1=[*($17, -(1, $18))]): rowcount = 9.638871459960938E9, cumulative cost = {5.914412172321094E10 rows, 8.453906749135E11 cpu, 0.0 io, 1.427183946014208E15 network}, id = 17933
>               HashJoinPrel(condition=[=($20, $6)], joinType=[inner]): rowcount = 9.638871459960938E9, cumulative cost = {4.950525026325E10 rows, 8.453906749055E11 cpu, 0.0 io, 1.427183946014208E15 network}, id = 17932
>                 HashToRandomExchangePrel(dist0=[[$6]]): rowcount = 3.75, cumulative cost = {90.75 rows, 923.0 cpu, 0.0 io, 529408.0 network}, id = 17920
>                   HashJoinPrel(condition=[=($5, $2)], joinType=[inner]): rowcount = 3.75, cumulative cost = {87.0 rows, 863.0 cpu, 0.0 io, 421888.0 network}, id = 17919
>                     HashToRandomExchangePrel(dist0=[[$2]]): rowcount = 1.0, cumulative cost = {11.0 rows, 51.0 cpu, 0.0 io, 12288.0 network}, id = 17917
>                       FilterPrel(condition=[=(CAST($1):CHAR(6) CHARACTER SET "ISO-8859-1" COLLATE "ISO-8859-1$en_US$primary", 'EUROPE')]): rowcount = 1.0, cumulative cost = {10.0 rows, 35.0 cpu, 0.0 io, 0.0 network}, id = 17916
>                         ScanPrel(groupscan=[ParquetGroupScan [entries=[ReadEntryWithPath [path=/tpch/region.parquet]], selectionRoot=/tpch/region.parquet, columns=[SchemaPath [`r_name`], SchemaPath [`r_regionkey`]]]]): rowcount = 5.0, cumulative cost = {5.0 rows, 15.0 cpu, 0.0 io, 0.0 network}, id = 16822
>                     HashToRandomExchangePrel(dist0=[[$2]]): rowcount = 25.0, cumulative cost = {50.0 rows, 500.0 cpu, 0.0 io, 409600.0 network}, id = 17918
>                       ScanPrel(groupscan=[ParquetGroupScan [entries=[ReadEntryWithPath [path=/tpch/nation.parquet]], selectionRoot=/tpch/nation.parquet, columns=[SchemaPath [`n_name`], SchemaPath [`n_regionkey`], SchemaPath [`n_nationkey`]]]]): rowcount = 25.0, cumulative cost = {25.0 rows, 100.0 cpu, 0.0 io, 0.0 network}, id = 16833
>                 HashToRandomExchangePrel(dist0=[[$13]]): rowcount = 1.7135771484375E10, cumulative cost = {3.2369478684375E10 rows, 6.39761416125E11 cpu, 0.0 io, 1.4271839454848E15 network}, id = 17931
>                   HashJoinPrel(condition=[AND(=($2, $13), =($9, $14))], joinType=[inner]): rowcount = 1.7135771484375E10, cumulative cost = {1.52337072E10 rows, 3.65589072375E11 cpu, 0.0 io, 3.743621454848E14 network}, id = 17930
>                     HashToRandomExchangePrel(dist0=[[$2]], dist1=[[$9]]): rowcount = 7.6158984375E9, cumulative cost = {7.6178084625E9 rows, 1.82807504775E11 cpu, 0.0 io, 3.74362144256E14 network}, id = 17928
>                       HashJoinPrel(condition=[=($8, $6)], joinType=[inner]): rowcount = 7.6158984375E9, cumulative cost = {1910025.0 rows, 2.5942275E7 cpu, 0.0 io, 2.5504256E10 network}, id = 17927
>                         HashToRandomExchangePrel(dist0=[[$6]]): rowcount = 843750.0, cumulative cost = {885750.0 rows, 1.38315E7 cpu, 0.0 io, 2.4271872E10 network}, id = 17925
>                           HashJoinPrel(condition=[=($1, $5)], joinType=[inner]): rowcount = 843750.0, cumulative cost = {42000.0 rows, 331500.0 cpu, 0.0 io, 7.9872E7 network}, id = 17924
>                             HashToRandomExchangePrel(dist0=[[$1]]): rowcount = 1500.0, cumulative cost = {3000.0 rows, 28500.0 cpu, 0.0 io, 1.8432E7 network}, id = 17921
>                               ScanPrel(groupscan=[ParquetGroupScan [entries=[ReadEntryWithPath [path=/tpch/customer.parquet]], selectionRoot=/tpch/customer.parquet, columns=[SchemaPath [`c_custkey`], SchemaPath [`c_nationkey`]]]]): rowcount = 1500.0, cumulative cost = {1500.0 rows, 4500.0 cpu, 0.0 io, 0.0 network}, id = 16829
>                             HashToRandomExchangePrel(dist0=[[$2]]): rowcount = 3750.0, cumulative cost = {33750.0 rows, 240000.0 cpu, 0.0 io, 6.144E7 network}, id = 17923
>                               FilterPrel(condition=[AND(>=($1, 1997-01-01), <($1, +(1997-01-01, 12)))]): rowcount = 3750.0, cumulative cost = {30000.0 rows, 180000.0 cpu, 0.0 io, 0.0 network}, id = 17922
>                                 ScanPrel(groupscan=[ParquetGroupScan [entries=[ReadEntryWithPath [path=/tpch/orders.parquet]], selectionRoot=/tpch/orders.parquet, columns=[SchemaPath [`o_orderdate`], SchemaPath [`o_custkey`], SchemaPath [`o_orderkey`]]]]): rowcount = 15000.0, cumulative cost = {15000.0 rows, 60000.0 cpu, 0.0 io, 0.0 network}, id = 16879
>                         HashToRandomExchangePrel(dist0=[[$1]]): rowcount = 60175.0, cumulative cost = {120350.0 rows, 1263675.0 cpu, 0.0 io, 1.232384E9 network}, id = 17926
>                           ScanPrel(groupscan=[ParquetGroupScan [entries=[ReadEntryWithPath [path=/tpch/lineitem.parquet]], selectionRoot=/tpch/lineitem.parquet, columns=[SchemaPath [`l_orderkey`], SchemaPath [`l_suppkey`], SchemaPath [`l_extendedprice`], SchemaPath [`l_discount`]]]]): rowcount = 60175.0, cumulative cost = {60175.0 rows, 300875.0 cpu, 0.0 io, 0.0 network}, id = 16839
>                     HashToRandomExchangePrel(dist0=[[$1]], dist1=[[$2]]): rowcount = 100.0, cumulative cost = {200.0 rows, 2700.0 cpu, 0.0 io, 1228800.0 network}, id = 17929
>                       ScanPrel(groupscan=[ParquetGroupScan [entries=[ReadEntryWithPath [path=/tpch/supplier.parquet]], selectionRoot=/tpch/supplier.parquet, columns=[SchemaPath [`s_nationkey`], SchemaPath [`s_suppkey`]]]]): rowcount = 100.0, cumulative cost = {100.0 rows, 300.0 cpu, 0.0 io, 0.0 network}, id = 17110



--
This message was sent by Atlassian JIRA
(v6.2#6252)