You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Apache Spark (JIRA)" <ji...@apache.org> on 2018/01/21 15:57:00 UTC

[jira] [Assigned] (SPARK-23167) Update TPCDS queries from v1.4 to v2.7 (latest)

     [ https://issues.apache.org/jira/browse/SPARK-23167?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Apache Spark reassigned SPARK-23167:
------------------------------------

    Assignee: Apache Spark

> Update TPCDS queries from v1.4 to v2.7 (latest)
> -----------------------------------------------
>
>                 Key: SPARK-23167
>                 URL: https://issues.apache.org/jira/browse/SPARK-23167
>             Project: Spark
>          Issue Type: Improvement
>          Components: SQL
>    Affects Versions: 2.2.1
>            Reporter: Takeshi Yamamuro
>            Assignee: Apache Spark
>            Priority: Minor
>
> We currently use TPCDS v1.4 ([https://github.com/apache/spark/commits/master/sql/core/src/test/resources/tpcds)] though, the latest one is v2.7 ([http://www.tpc.org/tpc_documents_current_versions/current_specifications.asp]). I found that some queries are different from v1.4 and v2.7 (e.g., q4, q5, q6, ...) and some queries newly might appear (e.g., q10a, ..). I think it might make some sense to update the queries for more correct evaluation.
> Raw generated queries from TPCDS v2.7 query templates:
>  [https://github.com/maropu/spark_tpcds_v2.7.0/tree/master/generated]
> Modified TPCDS v2.7 queries to pass TPCDSQuerySuite (e.g., replacing unsupported syntaxes, + 14 days -> interval 14 days):
>  [https://github.com/apache/spark/compare/master...maropu:TPCDSV2_7]
>  



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org