You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@impala.apache.org by st...@apache.org on 2022/10/28 07:38:14 UTC

[impala] 01/02: IMPALA-11680: Fix test_krpc_datastream_sender_shuffle

This is an automated email from the ASF dual-hosted git repository.

stigahuang pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/impala.git

commit 8e96820b9e0d60b188cfa9e96aeeeceb158d366d
Author: Riza Suminto <ri...@cloudera.com>
AuthorDate: Sat Oct 22 02:44:45 2022 -0700

    IMPALA-11680: Fix test_krpc_datastream_sender_shuffle
    
    test_krpc_datastream_sender_shuffle failed in erasure-coding
    environment, possibly due to variability in data loading. This patch
    adjust the test to replace o_comment with o_clerk column that has
    consistent string length.
    
    Testing:
    - Run and pass test_krpc_datastream_sender_shuffle both in
      erasure-coding and regular hdfs environment, each on 100 iterations.
    
    Change-Id: I46fe6b37e089f853d54aa5d8f4ecf0d1f6d12191
    Reviewed-on: http://gerrit.cloudera.org:8080/19161
    Reviewed-by: Impala Public Jenkins <im...@cloudera.com>
    Tested-by: Impala Public Jenkins <im...@cloudera.com>
---
 testdata/workloads/tpch/queries/datastream-sender.test | 10 +++++-----
 1 file changed, 5 insertions(+), 5 deletions(-)

diff --git a/testdata/workloads/tpch/queries/datastream-sender.test b/testdata/workloads/tpch/queries/datastream-sender.test
index 01b03dca7..7656fe12a 100644
--- a/testdata/workloads/tpch/queries/datastream-sender.test
+++ b/testdata/workloads/tpch/queries/datastream-sender.test
@@ -6,16 +6,16 @@ WITH cte1
     AS (SELECT  c1,
                 c2,
                 orderkey
-         FROM   (SELECT Repeat(Group_concat(o_comment, o_comment), 20) AS c1,
-                        Repeat(Group_concat(o_comment, o_comment), 20) AS c2,
+         FROM   (SELECT Repeat(Group_concat(distinct o_clerk, '|'), 200) AS c1,
+                        Repeat(Group_concat(distinct o_clerk, '|'), 200) AS c2,
                         o_orderkey % 700                               AS
                         orderkey
                 FROM   orders
                 WHERE  o_orderkey < 800000000
                 GROUP  BY orderkey
                 UNION ALL
-                SELECT  Repeat(Group_concat(o_comment, o_comment), 20) AS c1,
-                        Repeat(Group_concat(o_comment, o_comment), 20) AS c2,
+                SELECT  Repeat(Group_concat(distinct o_clerk, '|'), 200) AS c1,
+                        Repeat(Group_concat(distinct o_clerk, '|'), 200) AS c2,
                         o_orderkey % 700                               AS
                         orderkey
                 FROM    orders
@@ -40,7 +40,7 @@ FROM   cte1 t1
        JOIN /* +shuffle */ cte2 t2
          ON t1.orderkey = t2.orderkey
 ---- RESULTS
-703,4251520,4251520,30,30
+703,2905400,2905400,30,30
 ---- TYPES
 BIGINT, INT, INT, INT, INT
 ---- RUNTIME_PROFILE