You are viewing a plain text version of this content. The canonical link for it is here.
Posted to reviews@spark.apache.org by maropu <gi...@git.apache.org> on 2018/01/21 15:55:49 UTC

[GitHub] spark pull request #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCD...

GitHub user maropu opened a pull request:

    https://github.com/apache/spark/pull/20343

    [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQuerySuite

    ## What changes were proposed in this pull request?
    This pr added TPCDS v2.7 (latest) queries in `TPCDSQuerySuite` because the current `TPCDSQuerySuite` tests older one (v1.4) and some queries are different from v1.4 and v2.7. Since the original v2.7 queries have the syntaxes that Spark cannot parse, I changed these queries in a following way:
    
     - [date] + 14 days -> date + `INTERVAL` 14 days
     - [column name] as "30 days" -> [column name] as \`30 days\`
     - Fix some syntax errors, e.g., missing brackets
    
    ## How was this patch tested?
    Added tests in `TPCDSQuerySuite`.

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/maropu/spark TPCDSV2_7

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/20343.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #20343
    
----
commit 71e0e1ad2f6ee772103ef43d2270ed347bc497d0
Author: Takeshi Yamamuro <ya...@...>
Date:   2018-01-19T11:04:59Z

    Update TPCD queries to v2.7.0

----


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    Merged build finished. Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCD...

Posted by asfgit <gi...@git.apache.org>.
Github user asfgit closed the pull request at:

    https://github.com/apache/spark/pull/20343


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    **[Test build #86465 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/86465/testReport)** for PR 20343 at commit [`5d6092c`](https://github.com/apache/spark/commit/5d6092c4bf029a021930a4ba66e6e1de3a4b15ed).
     * This patch passes all tests.
     * This patch merges cleanly.
     * This patch adds no public classes.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    Merged build finished. Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/86812/
    Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    Merged build finished. Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by gatorsmile <gi...@git.apache.org>.
Github user gatorsmile commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    retest this please


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    **[Test build #86457 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/86457/testReport)** for PR 20343 at commit [`12f687c`](https://github.com/apache/spark/commit/12f687c3c4338478f7f0cc40474c90f55aab8ecf).


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    Merged build finished. Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCD...

Posted by maropu <gi...@git.apache.org>.
Github user maropu commented on a diff in the pull request:

    https://github.com/apache/spark/pull/20343#discussion_r162835887
  
    --- Diff: sql/core/src/test/scala/org/apache/spark/sql/TPCDSQuerySuite.scala ---
    @@ -339,6 +340,30 @@ class TPCDSQuerySuite extends BenchmarkQueryTest {
         }
       }
     
    +  val tpcdsQueriesV2_7_0 = Seq(
    +    "q1", "q2", "q3", "q4", "q5", "q5a", "q6", "q7", "q8", "q9", "q10", "q10a", "q11",
    +    "q12", "q13", "q14_1", "q14_2", "q14a_1", "q14a_2",  "q15", "q16", "q17", "q18", "q18a", "q19",
    +    "q20", "q21", "q22", "q22a", "q23_1", "q23_2", "q24_1", "q24_2", "q25", "q26", "q27", "q27a",
    +    "q28", "q29", "q30", "q31", "q32", "q33", "q34", "q35", "q35a", "q36", "q36a", "q37", "q38",
    +    "q39_1", "q39_2", "q40", "q41", "q42", "q43", "q44", "q45", "q46", "q47", "q48", "q49",
    +    "q50", "q51", "q51a", "q52", "q53", "q54", "q55", "q56", "q57", "q58", "q59",
    +    "q60", "q61", "q62", "q63", "q64", "q65", "q66", "q67", "q67a", "q68", "q69",
    +    "q70", "q70a", "q71", "q72", "q73", "q74", "q75", "q76", "q77", "q77a", "q78", "q79",
    +    "q80", "q80a", "q81", "q82", "q83", "q84", "q85", "q86", "q86a", "q87", "q88", "q89",
    +    "q90", "q91", "q92", "q93", "q94", "q95", "q96", "q97", "q98", "q99")
    +
    +  tpcdsQueriesV2_7_0.foreach { name =>
    +    val queryString = resourceToString(s"tpcds-v2.7.0/$name.sql",
    --- End diff --
    
    ok, thanks.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCD...

Posted by gatorsmile <gi...@git.apache.org>.
Github user gatorsmile commented on a diff in the pull request:

    https://github.com/apache/spark/pull/20343#discussion_r163016890
  
    --- Diff: sql/core/src/test/scala/org/apache/spark/sql/TPCDSQuerySuite.scala ---
    @@ -339,6 +340,24 @@ class TPCDSQuerySuite extends BenchmarkQueryTest {
         }
       }
     
    +  val tpcdsQueriesV2_7_0 = Seq(
    --- End diff --
    
    Please add a comment to explain what the list contains?


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    **[Test build #86457 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/86457/testReport)** for PR 20343 at commit [`12f687c`](https://github.com/apache/spark/commit/12f687c3c4338478f7f0cc40474c90f55aab8ecf).
     * This patch **fails Spark unit tests**.
     * This patch merges cleanly.
     * This patch adds no public classes.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    **[Test build #86812 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/86812/testReport)** for PR 20343 at commit [`d04b087`](https://github.com/apache/spark/commit/d04b0872bcc02b5eadd309c560cda77ff1b8da0a).
     * This patch passes all tests.
     * This patch merges cleanly.
     * This patch adds no public classes.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    Merged build finished. Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by maropu <gi...@git.apache.org>.
Github user maropu commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    We need to update `TPCDSQueryBenchmark`, too? I think we could replace the update queries there.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by maropu <gi...@git.apache.org>.
Github user maropu commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    A support of the optional intervals will be planed in 3.x (#20433), so is it okay to restart this again? @gatorsmile 


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    **[Test build #88569 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/88569/testReport)** for PR 20343 at commit [`d04b087`](https://github.com/apache/spark/commit/d04b0872bcc02b5eadd309c560cda77ff1b8da0a).
     * This patch passes all tests.
     * This patch merges cleanly.
     * This patch adds no public classes.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/testing-k8s-prb-make-spark-distribution/1742/
    Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by maropu <gi...@git.apache.org>.
Github user maropu commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    retest this please.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCD...

Posted by gatorsmile <gi...@git.apache.org>.
Github user gatorsmile commented on a diff in the pull request:

    https://github.com/apache/spark/pull/20343#discussion_r162816645
  
    --- Diff: sql/core/src/test/scala/org/apache/spark/sql/TPCDSQuerySuite.scala ---
    @@ -339,6 +340,30 @@ class TPCDSQuerySuite extends BenchmarkQueryTest {
         }
       }
     
    +  val tpcdsQueriesV2_7_0 = Seq(
    +    "q1", "q2", "q3", "q4", "q5", "q5a", "q6", "q7", "q8", "q9", "q10", "q10a", "q11",
    +    "q12", "q13", "q14_1", "q14_2", "q14a_1", "q14a_2",  "q15", "q16", "q17", "q18", "q18a", "q19",
    +    "q20", "q21", "q22", "q22a", "q23_1", "q23_2", "q24_1", "q24_2", "q25", "q26", "q27", "q27a",
    +    "q28", "q29", "q30", "q31", "q32", "q33", "q34", "q35", "q35a", "q36", "q36a", "q37", "q38",
    +    "q39_1", "q39_2", "q40", "q41", "q42", "q43", "q44", "q45", "q46", "q47", "q48", "q49",
    +    "q50", "q51", "q51a", "q52", "q53", "q54", "q55", "q56", "q57", "q58", "q59",
    +    "q60", "q61", "q62", "q63", "q64", "q65", "q66", "q67", "q67a", "q68", "q69",
    +    "q70", "q70a", "q71", "q72", "q73", "q74", "q75", "q76", "q77", "q77a", "q78", "q79",
    +    "q80", "q80a", "q81", "q82", "q83", "q84", "q85", "q86", "q86a", "q87", "q88", "q89",
    +    "q90", "q91", "q92", "q93", "q94", "q95", "q96", "q97", "q98", "q99")
    +
    +  tpcdsQueriesV2_7_0.foreach { name =>
    +    val queryString = resourceToString(s"tpcds-v2.7.0/$name.sql",
    --- End diff --
    
    Could we just add the queries that are different from the v1.4 version?


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by gatorsmile <gi...@git.apache.org>.
Github user gatorsmile commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    @maropu Yeah. As long as the queries are different, we should keep both versions. This is to help the others understand we fully support TPC-DS queries without the changes. Thanks!


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    **[Test build #88569 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/88569/testReport)** for PR 20343 at commit [`d04b087`](https://github.com/apache/spark/commit/d04b0872bcc02b5eadd309c560cda77ff1b8da0a).


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by maropu <gi...@git.apache.org>.
Github user maropu commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    ok, I'll fix soon. Many Thanks!


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/testing-k8s-prb-make-spark-distribution/85/
    Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    Merged build finished. Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    **[Test build #86462 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/86462/testReport)** for PR 20343 at commit [`5d6092c`](https://github.com/apache/spark/commit/5d6092c4bf029a021930a4ba66e6e1de3a4b15ed).
     * This patch **fails due to an unknown error code, -9**.
     * This patch merges cleanly.
     * This patch adds no public classes.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    Merged build finished. Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/testing-k8s-prb-make-spark-distribution/115/
    Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/testing-k8s-prb-make-spark-distribution/75/
    Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    Test FAILed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/86444/
    Test FAILed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    Test FAILed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/86452/
    Test FAILed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by gatorsmile <gi...@git.apache.org>.
Github user gatorsmile commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    Thanks for submitting the PR https://github.com/apache/spark/pull/20433. It sounds like there are still some test failure. Will review it after 2.3 release. Thanks!


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    Merged build finished. Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCD...

Posted by gatorsmile <gi...@git.apache.org>.
Github user gatorsmile commented on a diff in the pull request:

    https://github.com/apache/spark/pull/20343#discussion_r164279795
  
    --- Diff: sql/core/src/test/resources/tpcds-v2.7.0/q11.sql ---
    @@ -0,0 +1,78 @@
    +with year_total as (
    + select c_customer_id customer_id
    +       ,c_first_name customer_first_name
    +       ,c_last_name customer_last_name
    +       ,c_preferred_cust_flag customer_preferred_cust_flag
    +       ,c_birth_country customer_birth_country
    +       ,c_login customer_login
    +       ,c_email_address customer_email_address
    +       ,d_year dyear
    +       ,sum(ss_ext_list_price-ss_ext_discount_amt) year_total
    +       ,'s' sale_type
    + from customer
    +     ,store_sales
    +     ,date_dim
    + where c_customer_sk = ss_customer_sk
    +   and ss_sold_date_sk = d_date_sk
    + group by c_customer_id
    +         ,c_first_name
    +         ,c_last_name
    +         ,c_preferred_cust_flag 
    +         ,c_birth_country
    +         ,c_login
    +         ,c_email_address
    +         ,d_year 
    + union all
    + select c_customer_id customer_id
    +       ,c_first_name customer_first_name
    +       ,c_last_name customer_last_name
    +       ,c_preferred_cust_flag customer_preferred_cust_flag
    +       ,c_birth_country customer_birth_country
    +       ,c_login customer_login
    +       ,c_email_address customer_email_address
    +       ,d_year dyear
    +       ,sum(ws_ext_list_price-ws_ext_discount_amt) year_total
    +       ,'w' sale_type
    + from customer
    +     ,web_sales
    +     ,date_dim
    + where c_customer_sk = ws_bill_customer_sk
    +   and ws_sold_date_sk = d_date_sk
    + group by c_customer_id
    +         ,c_first_name
    +         ,c_last_name
    +         ,c_preferred_cust_flag 
    +         ,c_birth_country
    +         ,c_login
    +         ,c_email_address
    +         ,d_year
    +         )
    +  select  
    +                  t_s_secyear.customer_id
    +                 ,t_s_secyear.customer_first_name
    +                 ,t_s_secyear.customer_last_name
    +                 ,t_s_secyear.customer_email_address
    --- End diff --
    
    Regarding a keywords capitalization rule, this is just for readability. We do not enforce it, but it is preferred. 


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by maropu <gi...@git.apache.org>.
Github user maropu commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    We need to review #20433 first


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    **[Test build #86465 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/86465/testReport)** for PR 20343 at commit [`5d6092c`](https://github.com/apache/spark/commit/5d6092c4bf029a021930a4ba66e6e1de3a4b15ed).


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    **[Test build #86444 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/86444/testReport)** for PR 20343 at commit [`71e0e1a`](https://github.com/apache/spark/commit/71e0e1ad2f6ee772103ef43d2270ed347bc497d0).


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    **[Test build #86501 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/86501/testReport)** for PR 20343 at commit [`fff88d2`](https://github.com/apache/spark/commit/fff88d2f20cc5cd3177abd865ea44d0d47146f35).


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/86501/
    Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    Test FAILed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/86800/
    Test FAILed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    Merged build finished. Test FAILed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/testing-k8s-prb-make-spark-distribution/78/
    Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by maropu <gi...@git.apache.org>.
Github user maropu commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    I opened a new pr to support `[date] + 14 days`: https://github.com/apache/spark/pull/20433


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/testing-k8s-prb-make-spark-distribution/89/
    Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/88569/
    Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/testing-k8s-prb-make-spark-distribution/376/
    Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    Merged build finished. Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    **[Test build #86452 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/86452/testReport)** for PR 20343 at commit [`9ac04ed`](https://github.com/apache/spark/commit/9ac04edc5aa770fb04b9ad4c12de75fa6d4ac2c8).
     * This patch **fails Spark unit tests**.
     * This patch merges cleanly.
     * This patch adds no public classes.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    Merged build finished. Test FAILed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/86465/
    Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    **[Test build #86452 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/86452/testReport)** for PR 20343 at commit [`9ac04ed`](https://github.com/apache/spark/commit/9ac04edc5aa770fb04b9ad4c12de75fa6d4ac2c8).


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/testing-k8s-prb-make-spark-distribution/364/
    Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/testing-k8s-prb-make-spark-distribution/81/
    Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by maropu <gi...@git.apache.org>.
Github user maropu commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    retest this please.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by maropu <gi...@git.apache.org>.
Github user maropu commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    retest this please.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    **[Test build #88321 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/88321/testReport)** for PR 20343 at commit [`d04b087`](https://github.com/apache/spark/commit/d04b0872bcc02b5eadd309c560cda77ff1b8da0a).


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    Merged build finished. Test FAILed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    Merged build finished. Test FAILed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by kiszk <gi...@git.apache.org>.
Github user kiszk commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    ping @gatorsmile


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCD...

Posted by gatorsmile <gi...@git.apache.org>.
Github user gatorsmile commented on a diff in the pull request:

    https://github.com/apache/spark/pull/20343#discussion_r163020873
  
    --- Diff: sql/core/src/test/resources/tpcds-v2.7.0/q11.sql ---
    @@ -0,0 +1,78 @@
    +with year_total as (
    + select c_customer_id customer_id
    +       ,c_first_name customer_first_name
    +       ,c_last_name customer_last_name
    +       ,c_preferred_cust_flag customer_preferred_cust_flag
    +       ,c_birth_country customer_birth_country
    +       ,c_login customer_login
    +       ,c_email_address customer_email_address
    +       ,d_year dyear
    +       ,sum(ss_ext_list_price-ss_ext_discount_amt) year_total
    +       ,'s' sale_type
    + from customer
    +     ,store_sales
    +     ,date_dim
    + where c_customer_sk = ss_customer_sk
    +   and ss_sold_date_sk = d_date_sk
    + group by c_customer_id
    +         ,c_first_name
    +         ,c_last_name
    +         ,c_preferred_cust_flag 
    +         ,c_birth_country
    +         ,c_login
    +         ,c_email_address
    +         ,d_year 
    + union all
    + select c_customer_id customer_id
    +       ,c_first_name customer_first_name
    +       ,c_last_name customer_last_name
    +       ,c_preferred_cust_flag customer_preferred_cust_flag
    +       ,c_birth_country customer_birth_country
    +       ,c_login customer_login
    +       ,c_email_address customer_email_address
    +       ,d_year dyear
    +       ,sum(ws_ext_list_price-ws_ext_discount_amt) year_total
    +       ,'w' sale_type
    + from customer
    +     ,web_sales
    +     ,date_dim
    + where c_customer_sk = ws_bill_customer_sk
    +   and ws_sold_date_sk = d_date_sk
    + group by c_customer_id
    +         ,c_first_name
    +         ,c_last_name
    +         ,c_preferred_cust_flag 
    +         ,c_birth_country
    +         ,c_login
    +         ,c_email_address
    +         ,d_year
    +         )
    +  select  
    +                  t_s_secyear.customer_id
    +                 ,t_s_secyear.customer_first_name
    +                 ,t_s_secyear.customer_last_name
    +                 ,t_s_secyear.customer_email_address
    --- End diff --
    
    Could we highlight the changes we made in version 2.7 compared with the original version by adding the comments like `-- `?


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by maropu <gi...@git.apache.org>.
Github user maropu commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    ok, I'll try and check. Just s sec.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    Test FAILed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/86462/
    Test FAILed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCD...

Posted by gatorsmile <gi...@git.apache.org>.
Github user gatorsmile commented on a diff in the pull request:

    https://github.com/apache/spark/pull/20343#discussion_r163021098
  
    --- Diff: sql/core/src/test/resources/tpcds-v2.7.0/q11.sql ---
    @@ -0,0 +1,78 @@
    +with year_total as (
    + select c_customer_id customer_id
    +       ,c_first_name customer_first_name
    +       ,c_last_name customer_last_name
    +       ,c_preferred_cust_flag customer_preferred_cust_flag
    +       ,c_birth_country customer_birth_country
    +       ,c_login customer_login
    +       ,c_email_address customer_email_address
    +       ,d_year dyear
    +       ,sum(ss_ext_list_price-ss_ext_discount_amt) year_total
    +       ,'s' sale_type
    + from customer
    +     ,store_sales
    +     ,date_dim
    + where c_customer_sk = ss_customer_sk
    +   and ss_sold_date_sk = d_date_sk
    + group by c_customer_id
    +         ,c_first_name
    +         ,c_last_name
    +         ,c_preferred_cust_flag 
    +         ,c_birth_country
    +         ,c_login
    +         ,c_email_address
    +         ,d_year 
    + union all
    + select c_customer_id customer_id
    +       ,c_first_name customer_first_name
    +       ,c_last_name customer_last_name
    +       ,c_preferred_cust_flag customer_preferred_cust_flag
    +       ,c_birth_country customer_birth_country
    +       ,c_login customer_login
    +       ,c_email_address customer_email_address
    +       ,d_year dyear
    +       ,sum(ws_ext_list_price-ws_ext_discount_amt) year_total
    +       ,'w' sale_type
    + from customer
    +     ,web_sales
    +     ,date_dim
    + where c_customer_sk = ws_bill_customer_sk
    +   and ws_sold_date_sk = d_date_sk
    + group by c_customer_id
    +         ,c_first_name
    +         ,c_last_name
    +         ,c_preferred_cust_flag 
    +         ,c_birth_country
    +         ,c_login
    +         ,c_email_address
    +         ,d_year
    +         )
    +  select  
    +                  t_s_secyear.customer_id
    +                 ,t_s_secyear.customer_first_name
    +                 ,t_s_secyear.customer_last_name
    +                 ,t_s_secyear.customer_email_address
    --- End diff --
    
    If possible, could you also use the original SQL query files with the same styles/indents?


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/testing-k8s-prb-make-spark-distribution/92/
    Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    **[Test build #86448 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/86448/testReport)** for PR 20343 at commit [`9ac04ed`](https://github.com/apache/spark/commit/9ac04edc5aa770fb04b9ad4c12de75fa6d4ac2c8).
     * This patch **fails Spark unit tests**.
     * This patch merges cleanly.
     * This patch adds no public classes.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    Merged build finished. Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    **[Test build #86444 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/86444/testReport)** for PR 20343 at commit [`71e0e1a`](https://github.com/apache/spark/commit/71e0e1ad2f6ee772103ef43d2270ed347bc497d0).
     * This patch **fails Scala style tests**.
     * This patch merges cleanly.
     * This patch adds the following public classes _(experimental)_:
      * `i_class in ('personal','portable','reference','self-help') and`
      * `i_class in ('accessories','classical','fragrances','pants') and`
      * `              and i_class in ('personal','portable','reference','self-help')`
      * `              and i_class in ('accessories','classical','fragrances','pants')`
      * `(     select i_category ,i_class ,i_brand ,i_product_name ,d_year ,d_qoy ,d_moy ,s_store_id`
      * `( select sum(ws_net_paid) as total_sum, i_category, i_class, 0 as g_category, 0 as g_class `
      * `          i_class in ('wallpaper','parenting','musical')`
      * `          i_class in ('womens','birdal','pants') `


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    Merged build finished. Test FAILed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    **[Test build #86800 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/86800/testReport)** for PR 20343 at commit [`d04b087`](https://github.com/apache/spark/commit/d04b0872bcc02b5eadd309c560cda77ff1b8da0a).
     * This patch **fails due to an unknown error code, -9**.
     * This patch merges cleanly.
     * This patch adds no public classes.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    Merged build finished. Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by gatorsmile <gi...@git.apache.org>.
Github user gatorsmile commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    Regarding the updates of `TPCDSQueryBenchmark`, we can do it in a separate PR.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    **[Test build #86501 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/86501/testReport)** for PR 20343 at commit [`fff88d2`](https://github.com/apache/spark/commit/fff88d2f20cc5cd3177abd865ea44d0d47146f35).
     * This patch passes all tests.
     * This patch merges cleanly.
     * This patch adds no public classes.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by maropu <gi...@git.apache.org>.
Github user maropu commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    I checked all the queries again and I found that some queries (q6, q11, q20, q22, q24, q34, q35, q47, q49, q57, q64, q72, q74, q75, q78, q98) only have minor changes (See the comments to point out the  changes). So, how about directly applying these changes in `sql/core/src/test/resources/tpcds`?


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCD...

Posted by gatorsmile <gi...@git.apache.org>.
Github user gatorsmile commented on a diff in the pull request:

    https://github.com/apache/spark/pull/20343#discussion_r163019362
  
    --- Diff: sql/core/src/test/scala/org/apache/spark/sql/TPCDSQuerySuite.scala ---
    @@ -244,7 +244,7 @@ class TPCDSQuerySuite extends BenchmarkQueryTest {
             |`web_company_id` INT, `web_company_name` STRING, `web_street_number` STRING,
             |`web_street_name` STRING, `web_street_type` STRING, `web_suite_number` STRING,
             |`web_city` STRING, `web_county` STRING, `web_state` STRING, `web_zip` STRING,
    -        |`web_country` STRING, `web_gmt_offset` STRING, `web_tax_percentage` DECIMAL(5,2))
    +        |`web_country` STRING, `web_gmt_offset` DECIMAL(5,2), `web_tax_percentage` DECIMAL(5,2))
    --- End diff --
    
    I double checked the schema changes made in this PR is consistent with the TPC-DS doc http://www.tpc.org/tpc_documents_current_versions/pdf/tpc-ds_v2.1.0.pdf



---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    **[Test build #88321 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/88321/testReport)** for PR 20343 at commit [`d04b087`](https://github.com/apache/spark/commit/d04b0872bcc02b5eadd309c560cda77ff1b8da0a).
     * This patch passes all tests.
     * This patch merges cleanly.
     * This patch adds no public classes.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    **[Test build #86812 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/86812/testReport)** for PR 20343 at commit [`d04b087`](https://github.com/apache/spark/commit/d04b0872bcc02b5eadd309c560cda77ff1b8da0a).


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    Merged build finished. Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCD...

Posted by gatorsmile <gi...@git.apache.org>.
Github user gatorsmile commented on a diff in the pull request:

    https://github.com/apache/spark/pull/20343#discussion_r162850692
  
    --- Diff: sql/core/src/test/scala/org/apache/spark/sql/TPCDSQuerySuite.scala ---
    @@ -339,6 +340,30 @@ class TPCDSQuerySuite extends BenchmarkQueryTest {
         }
       }
     
    +  val tpcdsQueriesV2_7_0 = Seq(
    +    "q1", "q2", "q3", "q4", "q5", "q5a", "q6", "q7", "q8", "q9", "q10", "q10a", "q11",
    +    "q12", "q13", "q14_1", "q14_2", "q14a_1", "q14a_2",  "q15", "q16", "q17", "q18", "q18a", "q19",
    +    "q20", "q21", "q22", "q22a", "q23_1", "q23_2", "q24_1", "q24_2", "q25", "q26", "q27", "q27a",
    +    "q28", "q29", "q30", "q31", "q32", "q33", "q34", "q35", "q35a", "q36", "q36a", "q37", "q38",
    +    "q39_1", "q39_2", "q40", "q41", "q42", "q43", "q44", "q45", "q46", "q47", "q48", "q49",
    +    "q50", "q51", "q51a", "q52", "q53", "q54", "q55", "q56", "q57", "q58", "q59",
    +    "q60", "q61", "q62", "q63", "q64", "q65", "q66", "q67", "q67a", "q68", "q69",
    +    "q70", "q70a", "q71", "q72", "q73", "q74", "q75", "q76", "q77", "q77a", "q78", "q79",
    +    "q80", "q80a", "q81", "q82", "q83", "q84", "q85", "q86", "q86a", "q87", "q88", "q89",
    +    "q90", "q91", "q92", "q93", "q94", "q95", "q96", "q97", "q98", "q99")
    +
    +  tpcdsQueriesV2_7_0.foreach { name =>
    +    val queryString = resourceToString(s"tpcds-v2.7.0/$name.sql",
    --- End diff --
    
    This is only related to test cases. Thus, it is fine Spark 2.3 release does not have it. You can do it in this PR.
    
    Actually, this PR can be merged as long as we can fix all the issues.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCD...

Posted by dongjoon-hyun <gi...@git.apache.org>.
Github user dongjoon-hyun commented on a diff in the pull request:

    https://github.com/apache/spark/pull/20343#discussion_r163030125
  
    --- Diff: sql/core/src/test/scala/org/apache/spark/sql/TPCDSQuerySuite.scala ---
    @@ -244,7 +244,7 @@ class TPCDSQuerySuite extends BenchmarkQueryTest {
             |`web_company_id` INT, `web_company_name` STRING, `web_street_number` STRING,
             |`web_street_name` STRING, `web_street_type` STRING, `web_suite_number` STRING,
             |`web_city` STRING, `web_county` STRING, `web_state` STRING, `web_zip` STRING,
    -        |`web_country` STRING, `web_gmt_offset` STRING, `web_tax_percentage` DECIMAL(5,2))
    +        |`web_country` STRING, `web_gmt_offset` DECIMAL(5,2), `web_tax_percentage` DECIMAL(5,2))
    --- End diff --
    
    Thank you a lot, @maropu and @gatorsmile !


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCD...

Posted by gatorsmile <gi...@git.apache.org>.
Github user gatorsmile commented on a diff in the pull request:

    https://github.com/apache/spark/pull/20343#discussion_r163021292
  
    --- Diff: sql/core/src/test/resources/tpcds-v2.7.0/q11.sql ---
    @@ -0,0 +1,78 @@
    +with year_total as (
    + select c_customer_id customer_id
    +       ,c_first_name customer_first_name
    +       ,c_last_name customer_last_name
    +       ,c_preferred_cust_flag customer_preferred_cust_flag
    +       ,c_birth_country customer_birth_country
    +       ,c_login customer_login
    +       ,c_email_address customer_email_address
    +       ,d_year dyear
    +       ,sum(ss_ext_list_price-ss_ext_discount_amt) year_total
    +       ,'s' sale_type
    + from customer
    +     ,store_sales
    +     ,date_dim
    + where c_customer_sk = ss_customer_sk
    +   and ss_sold_date_sk = d_date_sk
    + group by c_customer_id
    +         ,c_first_name
    +         ,c_last_name
    +         ,c_preferred_cust_flag 
    +         ,c_birth_country
    +         ,c_login
    +         ,c_email_address
    +         ,d_year 
    + union all
    + select c_customer_id customer_id
    +       ,c_first_name customer_first_name
    +       ,c_last_name customer_last_name
    +       ,c_preferred_cust_flag customer_preferred_cust_flag
    +       ,c_birth_country customer_birth_country
    +       ,c_login customer_login
    +       ,c_email_address customer_email_address
    +       ,d_year dyear
    +       ,sum(ws_ext_list_price-ws_ext_discount_amt) year_total
    +       ,'w' sale_type
    + from customer
    +     ,web_sales
    +     ,date_dim
    + where c_customer_sk = ws_bill_customer_sk
    +   and ws_sold_date_sk = d_date_sk
    + group by c_customer_id
    +         ,c_first_name
    +         ,c_last_name
    +         ,c_preferred_cust_flag 
    +         ,c_birth_country
    +         ,c_login
    +         ,c_email_address
    +         ,d_year
    +         )
    +  select  
    +                  t_s_secyear.customer_id
    +                 ,t_s_secyear.customer_first_name
    +                 ,t_s_secyear.customer_last_name
    +                 ,t_s_secyear.customer_email_address
    --- End diff --
    
    ![screen shot 2018-01-22 at 10 12 44 am](https://user-images.githubusercontent.com/11567269/35236633-d6b31c5a-ff5c-11e7-9ee8-40191a5a136e.png)



---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    Merged build finished. Test FAILed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    **[Test build #86800 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/86800/testReport)** for PR 20343 at commit [`d04b087`](https://github.com/apache/spark/commit/d04b0872bcc02b5eadd309c560cda77ff1b8da0a).


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/88321/
    Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    **[Test build #86462 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/86462/testReport)** for PR 20343 at commit [`5d6092c`](https://github.com/apache/spark/commit/5d6092c4bf029a021930a4ba66e6e1de3a4b15ed).


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by gatorsmile <gi...@git.apache.org>.
Github user gatorsmile commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    Regarding the change you made in `[date] + 14 days -> date + INTERVAL 14 days`, could we first support it before we merge this PR? It sounds like this is trivial to support, right?


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCD...

Posted by maropu <gi...@git.apache.org>.
Github user maropu commented on a diff in the pull request:

    https://github.com/apache/spark/pull/20343#discussion_r162818134
  
    --- Diff: sql/core/src/test/scala/org/apache/spark/sql/TPCDSQuerySuite.scala ---
    @@ -339,6 +340,30 @@ class TPCDSQuerySuite extends BenchmarkQueryTest {
         }
       }
     
    +  val tpcdsQueriesV2_7_0 = Seq(
    +    "q1", "q2", "q3", "q4", "q5", "q5a", "q6", "q7", "q8", "q9", "q10", "q10a", "q11",
    +    "q12", "q13", "q14_1", "q14_2", "q14a_1", "q14a_2",  "q15", "q16", "q17", "q18", "q18a", "q19",
    +    "q20", "q21", "q22", "q22a", "q23_1", "q23_2", "q24_1", "q24_2", "q25", "q26", "q27", "q27a",
    +    "q28", "q29", "q30", "q31", "q32", "q33", "q34", "q35", "q35a", "q36", "q36a", "q37", "q38",
    +    "q39_1", "q39_2", "q40", "q41", "q42", "q43", "q44", "q45", "q46", "q47", "q48", "q49",
    +    "q50", "q51", "q51a", "q52", "q53", "q54", "q55", "q56", "q57", "q58", "q59",
    +    "q60", "q61", "q62", "q63", "q64", "q65", "q66", "q67", "q67a", "q68", "q69",
    +    "q70", "q70a", "q71", "q72", "q73", "q74", "q75", "q76", "q77", "q77a", "q78", "q79",
    +    "q80", "q80a", "q81", "q82", "q83", "q84", "q85", "q86", "q86a", "q87", "q88", "q89",
    +    "q90", "q91", "q92", "q93", "q94", "q95", "q96", "q97", "q98", "q99")
    +
    +  tpcdsQueriesV2_7_0.foreach { name =>
    +    val queryString = resourceToString(s"tpcds-v2.7.0/$name.sql",
    --- End diff --
    
    ok


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    Merged build finished. Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    Test FAILed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/86448/
    Test FAILed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    Merged build finished. Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCD...

Posted by maropu <gi...@git.apache.org>.
Github user maropu commented on a diff in the pull request:

    https://github.com/apache/spark/pull/20343#discussion_r163112223
  
    --- Diff: sql/core/src/test/scala/org/apache/spark/sql/TPCDSQuerySuite.scala ---
    @@ -339,6 +340,24 @@ class TPCDSQuerySuite extends BenchmarkQueryTest {
         }
       }
     
    +  val tpcdsQueriesV2_7_0 = Seq(
    --- End diff --
    
    ok


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    **[Test build #86448 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/86448/testReport)** for PR 20343 at commit [`9ac04ed`](https://github.com/apache/spark/commit/9ac04edc5aa770fb04b9ad4c12de75fa6d4ac2c8).


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCD...

Posted by maropu <gi...@git.apache.org>.
Github user maropu commented on a diff in the pull request:

    https://github.com/apache/spark/pull/20343#discussion_r162860188
  
    --- Diff: sql/core/src/test/scala/org/apache/spark/sql/TPCDSQuerySuite.scala ---
    @@ -339,6 +340,30 @@ class TPCDSQuerySuite extends BenchmarkQueryTest {
         }
       }
     
    +  val tpcdsQueriesV2_7_0 = Seq(
    +    "q1", "q2", "q3", "q4", "q5", "q5a", "q6", "q7", "q8", "q9", "q10", "q10a", "q11",
    +    "q12", "q13", "q14_1", "q14_2", "q14a_1", "q14a_2",  "q15", "q16", "q17", "q18", "q18a", "q19",
    +    "q20", "q21", "q22", "q22a", "q23_1", "q23_2", "q24_1", "q24_2", "q25", "q26", "q27", "q27a",
    +    "q28", "q29", "q30", "q31", "q32", "q33", "q34", "q35", "q35a", "q36", "q36a", "q37", "q38",
    +    "q39_1", "q39_2", "q40", "q41", "q42", "q43", "q44", "q45", "q46", "q47", "q48", "q49",
    +    "q50", "q51", "q51a", "q52", "q53", "q54", "q55", "q56", "q57", "q58", "q59",
    +    "q60", "q61", "q62", "q63", "q64", "q65", "q66", "q67", "q67a", "q68", "q69",
    +    "q70", "q70a", "q71", "q72", "q73", "q74", "q75", "q76", "q77", "q77a", "q78", "q79",
    +    "q80", "q80a", "q81", "q82", "q83", "q84", "q85", "q86", "q86a", "q87", "q88", "q89",
    +    "q90", "q91", "q92", "q93", "q94", "q95", "q96", "q97", "q98", "q99")
    +
    +  tpcdsQueriesV2_7_0.foreach { name =>
    +    val queryString = resourceToString(s"tpcds-v2.7.0/$name.sql",
    --- End diff --
    
    ok


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    Test FAILed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/86457/
    Test FAILed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCD...

Posted by dongjoon-hyun <gi...@git.apache.org>.
Github user dongjoon-hyun commented on a diff in the pull request:

    https://github.com/apache/spark/pull/20343#discussion_r162835844
  
    --- Diff: sql/core/src/test/scala/org/apache/spark/sql/TPCDSQuerySuite.scala ---
    @@ -339,6 +340,30 @@ class TPCDSQuerySuite extends BenchmarkQueryTest {
         }
       }
     
    +  val tpcdsQueriesV2_7_0 = Seq(
    +    "q1", "q2", "q3", "q4", "q5", "q5a", "q6", "q7", "q8", "q9", "q10", "q10a", "q11",
    +    "q12", "q13", "q14_1", "q14_2", "q14a_1", "q14a_2",  "q15", "q16", "q17", "q18", "q18a", "q19",
    +    "q20", "q21", "q22", "q22a", "q23_1", "q23_2", "q24_1", "q24_2", "q25", "q26", "q27", "q27a",
    +    "q28", "q29", "q30", "q31", "q32", "q33", "q34", "q35", "q35a", "q36", "q36a", "q37", "q38",
    +    "q39_1", "q39_2", "q40", "q41", "q42", "q43", "q44", "q45", "q46", "q47", "q48", "q49",
    +    "q50", "q51", "q51a", "q52", "q53", "q54", "q55", "q56", "q57", "q58", "q59",
    +    "q60", "q61", "q62", "q63", "q64", "q65", "q66", "q67", "q67a", "q68", "q69",
    +    "q70", "q70a", "q71", "q72", "q73", "q74", "q75", "q76", "q77", "q77a", "q78", "q79",
    +    "q80", "q80a", "q81", "q82", "q83", "q84", "q85", "q86", "q86a", "q87", "q88", "q89",
    +    "q90", "q91", "q92", "q93", "q94", "q95", "q96", "q97", "q98", "q99")
    +
    +  tpcdsQueriesV2_7_0.foreach { name =>
    +    val queryString = resourceToString(s"tpcds-v2.7.0/$name.sql",
    --- End diff --
    
    Thanks. @maropu . 
    While reviewing this, I found that we missed that bug at the original PR of @gatorsmile .
    If the fix is able to be included in Apache Spark 2.3, I think the followup PR also sounds good to me.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    Merged build finished. Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCD...

Posted by maropu <gi...@git.apache.org>.
Github user maropu commented on a diff in the pull request:

    https://github.com/apache/spark/pull/20343#discussion_r163119029
  
    --- Diff: sql/core/src/test/resources/tpcds-v2.7.0/q11.sql ---
    @@ -0,0 +1,78 @@
    +with year_total as (
    + select c_customer_id customer_id
    +       ,c_first_name customer_first_name
    +       ,c_last_name customer_last_name
    +       ,c_preferred_cust_flag customer_preferred_cust_flag
    +       ,c_birth_country customer_birth_country
    +       ,c_login customer_login
    +       ,c_email_address customer_email_address
    +       ,d_year dyear
    +       ,sum(ss_ext_list_price-ss_ext_discount_amt) year_total
    +       ,'s' sale_type
    + from customer
    +     ,store_sales
    +     ,date_dim
    + where c_customer_sk = ss_customer_sk
    +   and ss_sold_date_sk = d_date_sk
    + group by c_customer_id
    +         ,c_first_name
    +         ,c_last_name
    +         ,c_preferred_cust_flag 
    +         ,c_birth_country
    +         ,c_login
    +         ,c_email_address
    +         ,d_year 
    + union all
    + select c_customer_id customer_id
    +       ,c_first_name customer_first_name
    +       ,c_last_name customer_last_name
    +       ,c_preferred_cust_flag customer_preferred_cust_flag
    +       ,c_birth_country customer_birth_country
    +       ,c_login customer_login
    +       ,c_email_address customer_email_address
    +       ,d_year dyear
    +       ,sum(ws_ext_list_price-ws_ext_discount_amt) year_total
    +       ,'w' sale_type
    + from customer
    +     ,web_sales
    +     ,date_dim
    + where c_customer_sk = ws_bill_customer_sk
    +   and ws_sold_date_sk = d_date_sk
    + group by c_customer_id
    +         ,c_first_name
    +         ,c_last_name
    +         ,c_preferred_cust_flag 
    +         ,c_birth_country
    +         ,c_login
    +         ,c_email_address
    +         ,d_year
    +         )
    +  select  
    +                  t_s_secyear.customer_id
    +                 ,t_s_secyear.customer_first_name
    +                 ,t_s_secyear.customer_last_name
    +                 ,t_s_secyear.customer_email_address
    --- End diff --
    
    ok, I'll do as much as possible. one question; I think the sql format is not consistent in all the files, e.g., `SQLQueryTestSuite`. Probably, I think uppercase letters for SQL reserved words and 2 indents seem to be de-facto, but we don't have any format rule for that anywhere, right? we'd better to write the rule somewhere? We don't need to re-format existing code though, IMHO we'd better to make them consistent in future prs.  


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCD...

Posted by dongjoon-hyun <gi...@git.apache.org>.
Github user dongjoon-hyun commented on a diff in the pull request:

    https://github.com/apache/spark/pull/20343#discussion_r162824441
  
    --- Diff: sql/core/src/test/scala/org/apache/spark/sql/TPCDSQuerySuite.scala ---
    @@ -339,6 +340,30 @@ class TPCDSQuerySuite extends BenchmarkQueryTest {
         }
       }
     
    +  val tpcdsQueriesV2_7_0 = Seq(
    +    "q1", "q2", "q3", "q4", "q5", "q5a", "q6", "q7", "q8", "q9", "q10", "q10a", "q11",
    +    "q12", "q13", "q14_1", "q14_2", "q14a_1", "q14a_2",  "q15", "q16", "q17", "q18", "q18a", "q19",
    +    "q20", "q21", "q22", "q22a", "q23_1", "q23_2", "q24_1", "q24_2", "q25", "q26", "q27", "q27a",
    +    "q28", "q29", "q30", "q31", "q32", "q33", "q34", "q35", "q35a", "q36", "q36a", "q37", "q38",
    +    "q39_1", "q39_2", "q40", "q41", "q42", "q43", "q44", "q45", "q46", "q47", "q48", "q49",
    +    "q50", "q51", "q51a", "q52", "q53", "q54", "q55", "q56", "q57", "q58", "q59",
    +    "q60", "q61", "q62", "q63", "q64", "q65", "q66", "q67", "q67a", "q68", "q69",
    +    "q70", "q70a", "q71", "q72", "q73", "q74", "q75", "q76", "q77", "q77a", "q78", "q79",
    +    "q80", "q80a", "q81", "q82", "q83", "q84", "q85", "q86", "q86a", "q87", "q88", "q89",
    +    "q90", "q91", "q92", "q93", "q94", "q95", "q96", "q97", "q98", "q99")
    +
    +  tpcdsQueriesV2_7_0.foreach { name =>
    +    val queryString = resourceToString(s"tpcds-v2.7.0/$name.sql",
    --- End diff --
    
    @maropu . It's great to have v2.7.
    Could you check the schema too?
    For example, we had better update [the following in the schema](https://github.com/apache/spark/pull/20343/files#diff-38fa80d1dc9860f07e135dd02d259269R247)?
    ```
    -        |`web_country` STRING, `web_gmt_offset` STRING, `web_tax_percentage` DECIMAL(5,2))
    +        |`web_country` STRING, `web_gmt_offset` DECIMAL(5,2), `web_tax_percentage` DECIMAL(5,2))
    ```


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20343: [SPARK-23167][SQL] Add TPCDS queries v2.7 in TPCDSQueryS...

Posted by gatorsmile <gi...@git.apache.org>.
Github user gatorsmile commented on the issue:

    https://github.com/apache/spark/pull/20343
  
    Thanks! Merged to master


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org