Posted to reviews@spark.apache.org by GitBox <gi...@apache.org> on 2020/10/16 12:27:18 UTC

[GitHub] [spark] MaxGekk opened a new pull request #30067: [SPARK-33169][SQL][TESTS] Check propagation of datasource options to underlying file system for built-in file-based datasources

MaxGekk opened a new pull request #30067:
URL: https://github.com/apache/spark/pull/30067


   ### What changes were proposed in this pull request?
   1. Add the common trait `CommonFileDataSourceSuite` with tests that can be executed for all built-in file-based datasources.
   2. Add a test to `CommonFileDataSourceSuite` that checks that datasource options are propagated to the underlying file systems as Hadoop configs.
   3. Mix `CommonFileDataSourceSuite` into `AvroSuite`, `OrcSourceSuite`, `TextSuite`, `JsonSuite`, `CSVSuite`, and `ParquetFileFormatSuite`.
   4. Remove duplicated tests from `AvroSuite` and from `OrcSourceSuite`.
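
The mix-in pattern described in the list above can be sketched roughly as follows. This is a minimal, self-contained illustration and not the code from the PR: `MiniSuite`, `OptionCheckingFs`, `CommonFormatTests`, `CsvLikeSuite`, `JsonLikeSuite` and the `read` helper are hypothetical stand-ins, and neither Spark nor ScalaTest is used. It assumes only that the shared trait registers one test per format and that the test fails when the `ds_option` option does not reach the file system configuration.

```scala
import scala.collection.mutable

// Tiny stand-in for a test framework (hypothetical, not ScalaTest).
trait MiniSuite {
  private val registered = mutable.LinkedHashMap.empty[String, () => Unit]

  // Registers a named test body without running it.
  def test(name: String)(body: => Unit): Unit = {
    registered.update(name, () => body)
  }

  def testNames: List[String] = registered.keys.toList

  def runAll(): Unit = registered.values.foreach(run => run())
}

// Stand-in for a file system that refuses to work unless the data source
// option reached its configuration (an assumption about the fake used in the PR).
object OptionCheckingFs {
  def open(conf: Map[String, String]): String = {
    require(conf.contains("ds_option"), "ds_option was not propagated")
    "opened"
  }
}

// Shared tests, written once. The self-type restricts mixing to MiniSuite.
trait CommonFormatTests { self: MiniSuite =>
  // A def, not a val, so it is safe to call while this trait initializes.
  def dataSourceFormat: String

  // Hypothetical read path: data source options are layered over a base config.
  def read(options: Map[String, String]): String =
    OptionCheckingFs.open(Map("fs.defaultFS" -> "fake:///") ++ options)

  test(s"Propagate options to the file system - $dataSourceFormat") {
    assert(read(Map("ds_option" -> "value")) == "opened")
  }
}

// MiniSuite comes first so its registry exists before the shared trait registers tests.
class CsvLikeSuite extends MiniSuite with CommonFormatTests {
  override def dataSourceFormat: String = "csv"
}

class JsonLikeSuite extends MiniSuite with CommonFormatTests {
  override def dataSourceFormat: String = "json"
}

object Demo {
  def main(args: Array[String]): Unit = {
    val suites = List(new CsvLikeSuite, new JsonLikeSuite)
    suites.foreach(_.runAll())
    suites.flatMap(_.testNames).foreach(println)
  }
}
```

Each concrete suite only supplies its format name, so the same check runs for every format without duplicated test bodies, which is the motivation given for removing the per-suite copies.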
   
   ### Why are the changes needed?
   To improve test coverage and test all built-in file-based datasources.
   
   ### Does this PR introduce _any_ user-facing change?
   No
   
   ### How was this patch tested?
   By running the affected test suites.


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] [spark] SparkQA removed a comment on pull request #30067: [SPARK-33169][SQL][TESTS] Check propagation of datasource options to underlying file system for built-in file-based datasources

Posted by GitBox <gi...@apache.org>.
SparkQA removed a comment on pull request #30067:
URL: https://github.com/apache/spark/pull/30067#issuecomment-711788169


   **[Test build #129996 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129996/testReport)** for PR 30067 at commit [`a397ead`](https://github.com/apache/spark/commit/a397ead711695946c32e81f29e29cbdb422d25bc).




[GitHub] [spark] AmplabJenkins commented on pull request #30067: [SPARK-33169][SQL][TESTS] Check propagation of datasource options to underlying file system for built-in file-based datasources

Posted by GitBox <gi...@apache.org>.
AmplabJenkins commented on pull request #30067:
URL: https://github.com/apache/spark/pull/30067#issuecomment-711714669








[GitHub] [spark] SparkQA commented on pull request #30067: [SPARK-33169][SQL][TESTS] Check propagation of datasource options to underlying file system for built-in file-based datasources

Posted by GitBox <gi...@apache.org>.
SparkQA commented on pull request #30067:
URL: https://github.com/apache/spark/pull/30067#issuecomment-711146558


   **[Test build #129958 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129958/testReport)** for PR 30067 at commit [`1a22f1d`](https://github.com/apache/spark/commit/1a22f1d63b1ede49e42402eac1adb1b04de577c2).




[GitHub] [spark] MaxGekk commented on a change in pull request #30067: [SPARK-33169][SQL][TESTS] Check propagation of datasource options to underlying file system for built-in file-based datasources

Posted by GitBox <gi...@apache.org>.
MaxGekk commented on a change in pull request #30067:
URL: https://github.com/apache/spark/pull/30067#discussion_r507181580



##########
File path: sql/core/src/test/scala/org/apache/spark/sql/execution/datasources/CommonFileDataSourceSuite.scala
##########
@@ -0,0 +1,55 @@
+/*
+ * Licensed to the Apache Software Foundation (ASF) under one or more
+ * contributor license agreements.  See the NOTICE file distributed with
+ * this work for additional information regarding copyright ownership.
+ * The ASF licenses this file to You under the Apache License, Version 2.0
+ * (the "License"); you may not use this file except in compliance with
+ * the License.  You may obtain a copy of the License at
+ *
+ *    http://www.apache.org/licenses/LICENSE-2.0
+ *
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+
+package org.apache.spark.sql.execution.datasources
+
+import org.scalatest.funsuite.AnyFunSuite
+
+import org.apache.spark.sql.{Encoders, FakeFileSystemRequiringDSOption, QueryTest, Row}
+import org.apache.spark.sql.catalyst.plans.SQLHelper
+import org.apache.spark.sql.test.SQLTestData
+
+// The trait contains tests for all file-based data sources.

Review comment:
       I included libsvm as well.






[GitHub] [spark] AmplabJenkins commented on pull request #30067: [SPARK-33169][SQL][TESTS] Check propagation of datasource options to underlying file system for built-in file-based datasources

Posted by GitBox <gi...@apache.org>.
AmplabJenkins commented on pull request #30067:
URL: https://github.com/apache/spark/pull/30067#issuecomment-710201973








[GitHub] [spark] HyukjinKwon commented on pull request #30067: [SPARK-33169][SQL][TESTS] Check propagation of datasource options to underlying file system for built-in file-based datasources

Posted by GitBox <gi...@apache.org>.
HyukjinKwon commented on pull request #30067:
URL: https://github.com/apache/spark/pull/30067#issuecomment-711867360


   Merged to master.




[GitHub] [spark] SparkQA commented on pull request #30067: [SPARK-33169][SQL][TESTS] Check propagation of datasource options to underlying file system for built-in file-based datasources

Posted by GitBox <gi...@apache.org>.
SparkQA commented on pull request #30067:
URL: https://github.com/apache/spark/pull/30067#issuecomment-711268491


   **[Test build #129960 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129960/testReport)** for PR 30067 at commit [`e579d48`](https://github.com/apache/spark/commit/e579d483403780e774e6d8148aac4cf71df6cf60).




[GitHub] [spark] SparkQA commented on pull request #30067: [SPARK-33169][SQL][TESTS] Check propagation of datasource options to underlying file system for built-in file-based datasources

Posted by GitBox <gi...@apache.org>.
SparkQA commented on pull request #30067:
URL: https://github.com/apache/spark/pull/30067#issuecomment-711371099


   **[Test build #129960 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129960/testReport)** for PR 30067 at commit [`e579d48`](https://github.com/apache/spark/commit/e579d483403780e774e6d8148aac4cf71df6cf60).
    * This patch **fails Spark unit tests**.
    * This patch merges cleanly.
    * This patch adds the following public classes _(experimental)_:
     * `trait CommonFileDataSourceSuite extends SQLHelper `




[GitHub] [spark] HyukjinKwon commented on a change in pull request #30067: [SPARK-33169][SQL][TESTS] Check propagation of datasource options to underlying file system for built-in file-based datasources

Posted by GitBox <gi...@apache.org>.
HyukjinKwon commented on a change in pull request #30067:
URL: https://github.com/apache/spark/pull/30067#discussion_r507064326



##########
File path: sql/core/src/test/scala/org/apache/spark/sql/execution/datasources/CommonFileDataSourceSuite.scala
##########
@@ -0,0 +1,55 @@
+/*
+ * Licensed to the Apache Software Foundation (ASF) under one or more
+ * contributor license agreements.  See the NOTICE file distributed with
+ * this work for additional information regarding copyright ownership.
+ * The ASF licenses this file to You under the Apache License, Version 2.0
+ * (the "License"); you may not use this file except in compliance with
+ * the License.  You may obtain a copy of the License at
+ *
+ *    http://www.apache.org/licenses/LICENSE-2.0
+ *
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+
+package org.apache.spark.sql.execution.datasources
+
+import org.scalatest.funsuite.AnyFunSuite
+
+import org.apache.spark.sql.{Encoders, FakeFileSystemRequiringDSOption, QueryTest, Row}
+import org.apache.spark.sql.catalyst.plans.SQLHelper
+import org.apache.spark.sql.test.SQLTestData
+
+// The trait contains tests for all file-based data sources.

Review comment:
       Can you add a comment here that explains `FileBasedDataSourceSuite` vs. `CommonFileDataSourceSuite`, with proper links between the two?
   
   For example, we could say: `CommonFileDataSourceSuite` is for tests that run the same code against all file-based data sources, while `FileBasedDataSourceSuite` is a looser place for tests that have some variants/differences.
   
   We should also mention that libsvm is not included.






[GitHub] [spark] SparkQA commented on pull request #30067: [SPARK-33169][SQL][TESTS] Check propagation of datasource options to underlying file system for built-in file-based datasources

Posted by GitBox <gi...@apache.org>.
SparkQA commented on pull request #30067:
URL: https://github.com/apache/spark/pull/30067#issuecomment-711153363


   Kubernetes integration test status success
   URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34564/
   




[GitHub] [spark] AmplabJenkins removed a comment on pull request #30067: [SPARK-33169][SQL][TESTS] Check propagation of datasource options to underlying file system for built-in file-based datasources

Posted by GitBox <gi...@apache.org>.
AmplabJenkins removed a comment on pull request #30067:
URL: https://github.com/apache/spark/pull/30067#issuecomment-711714669


   Merged build finished. Test FAILed.




[GitHub] [spark] SparkQA commented on pull request #30067: [SPARK-33169][SQL][TESTS] Check propagation of datasource options to underlying file system for built-in file-based datasources

Posted by GitBox <gi...@apache.org>.
SparkQA commented on pull request #30067:
URL: https://github.com/apache/spark/pull/30067#issuecomment-711317270


   Kubernetes integration test status failure
   URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34566/
   




[GitHub] [spark] AmplabJenkins commented on pull request #30067: [SPARK-33169][SQL][TESTS] Check propagation of datasource options to underlying file system for built-in file-based datasources

Posted by GitBox <gi...@apache.org>.
AmplabJenkins commented on pull request #30067:
URL: https://github.com/apache/spark/pull/30067#issuecomment-711183220








[GitHub] [spark] AmplabJenkins commented on pull request #30067: [SPARK-33169][SQL][TESTS] Check propagation of datasource options to underlying file system for built-in file-based datasources

Posted by GitBox <gi...@apache.org>.
AmplabJenkins commented on pull request #30067:
URL: https://github.com/apache/spark/pull/30067#issuecomment-711317314








[GitHub] [spark] AmplabJenkins removed a comment on pull request #30067: [SPARK-33169][SQL][TESTS] Check propagation of datasource options to underlying file system for built-in file-based datasources

Posted by GitBox <gi...@apache.org>.
AmplabJenkins removed a comment on pull request #30067:
URL: https://github.com/apache/spark/pull/30067#issuecomment-711371362


   Test FAILed.
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/129960/
   Test FAILed.




[GitHub] [spark] SparkQA removed a comment on pull request #30067: [SPARK-33169][SQL][TESTS] Check propagation of datasource options to underlying file system for built-in file-based datasources

Posted by GitBox <gi...@apache.org>.
SparkQA removed a comment on pull request #30067:
URL: https://github.com/apache/spark/pull/30067#issuecomment-710017515


   **[Test build #129896 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129896/testReport)** for PR 30067 at commit [`dc7142a`](https://github.com/apache/spark/commit/dc7142aa62e105bb306f225112057c4be4197ecb).




[GitHub] [spark] AmplabJenkins removed a comment on pull request #30067: [SPARK-33169][SQL][TESTS] Check propagation of datasource options to underlying file system for built-in file-based datasources

Posted by GitBox <gi...@apache.org>.
AmplabJenkins removed a comment on pull request #30067:
URL: https://github.com/apache/spark/pull/30067#issuecomment-711371341


   Merged build finished. Test FAILed.




[GitHub] [spark] AmplabJenkins commented on pull request #30067: [SPARK-33169][SQL][TESTS] Check propagation of datasource options to underlying file system for built-in file-based datasources

Posted by GitBox <gi...@apache.org>.
AmplabJenkins commented on pull request #30067:
URL: https://github.com/apache/spark/pull/30067#issuecomment-711371341








[GitHub] [spark] SparkQA commented on pull request #30067: [SPARK-33169][SQL][TESTS] Check propagation of datasource options to underlying file system for built-in file-based datasources

Posted by GitBox <gi...@apache.org>.
SparkQA commented on pull request #30067:
URL: https://github.com/apache/spark/pull/30067#issuecomment-711181386


   **[Test build #129958 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129958/testReport)** for PR 30067 at commit [`1a22f1d`](https://github.com/apache/spark/commit/1a22f1d63b1ede49e42402eac1adb1b04de577c2).
    * This patch passes all tests.
    * This patch merges cleanly.
    * This patch adds no public classes.




[GitHub] [spark] SparkQA commented on pull request #30067: [SPARK-33169][SQL][TESTS] Check propagation of datasource options to underlying file system for built-in file-based datasources

Posted by GitBox <gi...@apache.org>.
SparkQA commented on pull request #30067:
URL: https://github.com/apache/spark/pull/30067#issuecomment-711873041


   Kubernetes integration test starting
   URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34602/
   




[GitHub] [spark] SparkQA commented on pull request #30067: [SPARK-33169][SQL][TESTS] Check propagation of datasource options to underlying file system for built-in file-based datasources

Posted by GitBox <gi...@apache.org>.
SparkQA commented on pull request #30067:
URL: https://github.com/apache/spark/pull/30067#issuecomment-710048233


   Kubernetes integration test status success
   URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34501/
   




[GitHub] [spark] AmplabJenkins removed a comment on pull request #30067: [SPARK-33169][SQL][TESTS] Check propagation of datasource options to underlying file system for built-in file-based datasources

Posted by GitBox <gi...@apache.org>.
AmplabJenkins removed a comment on pull request #30067:
URL: https://github.com/apache/spark/pull/30067#issuecomment-710048301








[GitHub] [spark] AmplabJenkins removed a comment on pull request #30067: [SPARK-33169][SQL][TESTS] Check propagation of datasource options to underlying file system for built-in file-based datasources

Posted by GitBox <gi...@apache.org>.
AmplabJenkins removed a comment on pull request #30067:
URL: https://github.com/apache/spark/pull/30067#issuecomment-711153366








[GitHub] [spark] SparkQA commented on pull request #30067: [SPARK-33169][SQL][TESTS] Check propagation of datasource options to underlying file system for built-in file-based datasources

Posted by GitBox <gi...@apache.org>.
SparkQA commented on pull request #30067:
URL: https://github.com/apache/spark/pull/30067#issuecomment-711788169


   **[Test build #129996 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129996/testReport)** for PR 30067 at commit [`a397ead`](https://github.com/apache/spark/commit/a397ead711695946c32e81f29e29cbdb422d25bc).




[GitHub] [spark] AmplabJenkins commented on pull request #30067: [SPARK-33169][SQL][TESTS] Check propagation of datasource options to underlying file system for built-in file-based datasources

Posted by GitBox <gi...@apache.org>.
AmplabJenkins commented on pull request #30067:
URL: https://github.com/apache/spark/pull/30067#issuecomment-712121535








[GitHub] [spark] AmplabJenkins removed a comment on pull request #30067: [SPARK-33169][SQL][TESTS] Check propagation of datasource options to underlying file system for built-in file-based datasources

Posted by GitBox <gi...@apache.org>.
AmplabJenkins removed a comment on pull request #30067:
URL: https://github.com/apache/spark/pull/30067#issuecomment-711737999








[GitHub] [spark] SparkQA commented on pull request #30067: [SPARK-33169][SQL][TESTS] Check propagation of datasource options to underlying file system for built-in file-based datasources

Posted by GitBox <gi...@apache.org>.
SparkQA commented on pull request #30067:
URL: https://github.com/apache/spark/pull/30067#issuecomment-711150024


   Kubernetes integration test starting
   URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34564/
   




[GitHub] [spark] MaxGekk commented on a change in pull request #30067: [SPARK-33169][SQL][TESTS] Check propagation of datasource options to underlying file system for built-in file-based datasources

Posted by GitBox <gi...@apache.org>.
MaxGekk commented on a change in pull request #30067:
URL: https://github.com/apache/spark/pull/30067#discussion_r507181634



##########
File path: sql/core/src/test/scala/org/apache/spark/sql/execution/datasources/CommonFileDataSourceSuite.scala
##########
@@ -0,0 +1,55 @@
+/*
+ * Licensed to the Apache Software Foundation (ASF) under one or more
+ * contributor license agreements.  See the NOTICE file distributed with
+ * this work for additional information regarding copyright ownership.
+ * The ASF licenses this file to You under the Apache License, Version 2.0
+ * (the "License"); you may not use this file except in compliance with
+ * the License.  You may obtain a copy of the License at
+ *
+ *    http://www.apache.org/licenses/LICENSE-2.0
+ *
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+
+package org.apache.spark.sql.execution.datasources
+
+import org.scalatest.funsuite.AnyFunSuite
+
+import org.apache.spark.sql.{Encoders, FakeFileSystemRequiringDSOption, QueryTest, Row}
+import org.apache.spark.sql.catalyst.plans.SQLHelper
+import org.apache.spark.sql.test.SQLTestData
+
+// The trait contains tests for all file-based data sources.
+trait CommonFileDataSourceSuite {
+  self: QueryTest with AnyFunSuite with SQLTestData with SQLHelper =>

Review comment:
       Requiring `QueryTest` is too much. I made it simpler.
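
For readers unfamiliar with self-type annotations, the following is a minimal sketch of what narrowing one means. It does not show the actual change made in this PR: `TinyFunSuite`, `CommonChecks` and `JsonChecks` are hypothetical stand-ins written in plain Scala, not the ScalaTest or Spark classes. The point is only that a shared trait which asks for less in its self-type can be mixed into more suites.

```scala
object SelfTypeSketch {
  // Hypothetical stand-in for a test framework base; not ScalaTest's AnyFunSuite.
  trait TinyFunSuite {
    private val registered =
      scala.collection.mutable.ListBuffer.empty[(String, () => Unit)]

    // Registers a named test body to run later.
    def test(name: String)(body: => Unit): Unit =
      registered.append((name, () => body))

    def testNames: List[String] = registered.map(_._1).toList

    def runAll(): Unit = registered.foreach { case (_, body) => body() }
  }

  // Narrow self-type: the shared trait asks only for test registration,
  // so any suite built on TinyFunSuite can mix it in.
  trait CommonChecks { self: TinyFunSuite =>
    // A def (not a val) so the override is visible while this trait initializes.
    protected def dataSourceFormat: String

    test(s"common check for $dataSourceFormat") {
      assert(dataSourceFormat.nonEmpty)
    }
  }

  // TinyFunSuite comes first so its registry exists before CommonChecks registers a test.
  class JsonChecks extends TinyFunSuite with CommonChecks {
    override protected def dataSourceFormat: String = "json"
  }

  def main(args: Array[String]): Unit = {
    val suite = new JsonChecks
    suite.runAll()
    println(suite.testNames.mkString(", "))
  }
}
```

Had `CommonChecks` required further traits in its self-type, every suite mixing it in would have to provide them too, even if the shared test never used them.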






[GitHub] [spark] AmplabJenkins removed a comment on pull request #30067: [SPARK-33169][SQL][TESTS] Check propagation of datasource options to underlying file system for built-in file-based datasources

Posted by GitBox <gi...@apache.org>.
AmplabJenkins removed a comment on pull request #30067:
URL: https://github.com/apache/spark/pull/30067#issuecomment-711896829








[GitHub] [spark] AmplabJenkins removed a comment on pull request #30067: [SPARK-33169][SQL][TESTS] Check propagation of datasource options to underlying file system for built-in file-based datasources

Posted by GitBox <gi...@apache.org>.
AmplabJenkins removed a comment on pull request #30067:
URL: https://github.com/apache/spark/pull/30067#issuecomment-710201973








[GitHub] [spark] MaxGekk commented on a change in pull request #30067: [SPARK-33169][SQL][TESTS] Check propagation of datasource options to underlying file system for built-in file-based datasources

Posted by GitBox <gi...@apache.org>.
MaxGekk commented on a change in pull request #30067:
URL: https://github.com/apache/spark/pull/30067#discussion_r507053187



##########
File path: sql/core/src/test/scala/org/apache/spark/sql/execution/datasources/CommonFileDataSourceSuite.scala
##########
@@ -0,0 +1,55 @@
+/*
+ * Licensed to the Apache Software Foundation (ASF) under one or more
+ * contributor license agreements.  See the NOTICE file distributed with
+ * this work for additional information regarding copyright ownership.
+ * The ASF licenses this file to You under the Apache License, Version 2.0
+ * (the "License"); you may not use this file except in compliance with
+ * the License.  You may obtain a copy of the License at
+ *
+ *    http://www.apache.org/licenses/LICENSE-2.0
+ *
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+
+package org.apache.spark.sql.execution.datasources
+
+import org.scalatest.funsuite.AnyFunSuite
+
+import org.apache.spark.sql.{Encoders, FakeFileSystemRequiringDSOption, QueryTest, Row}
+import org.apache.spark.sql.catalyst.plans.SQLHelper
+import org.apache.spark.sql.test.SQLTestData
+
+// The trait contains tests for all file-based data sources.
+trait CommonFileDataSourceSuite {
+  self: QueryTest with AnyFunSuite with SQLTestData with SQLHelper =>
+
+  protected def dataSourceFormat: String
+
+  test(s"Propagate Hadoop configs from $dataSourceFormat options to underlying file system") {

Review comment:
       Do you mean adding it to `FileBasedDataSourceSuite` and removing it from `JsonSuite`, `CSVSuite` and so on? In that case, we would no longer test both DS v1 and v2.
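
The v1/v2 point above can be illustrated with a small sketch. It assumes a layout in which a per-format suite is abstract and has one concrete subclass per code path; all names here (`TinyFunSuite`, `CommonChecks`, `JsonChecks`, `JsonV1Checks`, `JsonV2Checks`, `codePath`) are hypothetical and written in plain Scala, not taken from Spark or ScalaTest. A test defined in a trait that is mixed into the abstract suite is inherited by every concrete subclass, so it runs once per code path.

```scala
object PerSuiteMixinSketch {
  import scala.collection.mutable.ListBuffer

  // Hypothetical minimal test base; not ScalaTest.
  trait TinyFunSuite {
    private val tests = ListBuffer.empty[(String, () => String)]

    def test(name: String)(body: => String): Unit =
      tests.append((name, () => body))

    // Runs every registered test and returns what each one reported.
    def run(): List[String] =
      tests.map { case (name, body) => s"$name: ${body()}" }.toList
  }

  // Shared test, written once and mixed into each per-format suite.
  trait CommonChecks { self: TinyFunSuite =>
    protected def dataSourceFormat: String
    // Hypothetical knob standing in for whatever selects the v1 or v2 path.
    protected def codePath: String

    test("shared check") { s"$dataSourceFormat via $codePath" }
  }

  abstract class JsonChecks extends TinyFunSuite with CommonChecks {
    override protected def dataSourceFormat: String = "json"
  }

  class JsonV1Checks extends JsonChecks {
    override protected def codePath: String = "v1"
  }

  class JsonV2Checks extends JsonChecks {
    override protected def codePath: String = "v2"
  }

  // Each concrete subclass inherits the shared test, so it runs once per path.
  def runBoth(): List[String] =
    List(new JsonV1Checks, new JsonV2Checks).flatMap(_.run())

  def main(args: Array[String]): Unit =
    runBoth().foreach(println)
}
```

A single standalone suite holding the test would run it under only one configuration unless it looped over the code paths itself.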






[GitHub] [spark] SparkQA commented on pull request #30067: [SPARK-33169][SQL][TESTS] Check propagation of datasource options to underlying file system for built-in file-based datasources

Posted by GitBox <gi...@apache.org>.
SparkQA commented on pull request #30067:
URL: https://github.com/apache/spark/pull/30067#issuecomment-712120461


   **[Test build #129996 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129996/testReport)** for PR 30067 at commit [`a397ead`](https://github.com/apache/spark/commit/a397ead711695946c32e81f29e29cbdb422d25bc).
    * This patch passes all tests.
    * This patch merges cleanly.
    * This patch adds no public classes.




[GitHub] [spark] SparkQA removed a comment on pull request #30067: [SPARK-33169][SQL][TESTS] Check propagation of datasource options to underlying file system for built-in file-based datasources

Posted by GitBox <gi...@apache.org>.
SparkQA removed a comment on pull request #30067:
URL: https://github.com/apache/spark/pull/30067#issuecomment-711627124


   **[Test build #129987 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129987/testReport)** for PR 30067 at commit [`a397ead`](https://github.com/apache/spark/commit/a397ead711695946c32e81f29e29cbdb422d25bc).




[GitHub] [spark] SparkQA commented on pull request #30067: [SPARK-33169][SQL][TESTS] Check propagation of datasource options to underlying file system for built-in file-based datasources

Posted by GitBox <gi...@apache.org>.
SparkQA commented on pull request #30067:
URL: https://github.com/apache/spark/pull/30067#issuecomment-711713830


   **[Test build #129987 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129987/testReport)** for PR 30067 at commit [`a397ead`](https://github.com/apache/spark/commit/a397ead711695946c32e81f29e29cbdb422d25bc).
    * This patch **fails due to an unknown error code, -9**.
    * This patch merges cleanly.
    * This patch adds no public classes.




[GitHub] [spark] AmplabJenkins removed a comment on pull request #30067: [SPARK-33169][SQL][TESTS] Check propagation of datasource options to underlying file system for built-in file-based datasources

Posted by GitBox <gi...@apache.org>.
AmplabJenkins removed a comment on pull request #30067:
URL: https://github.com/apache/spark/pull/30067#issuecomment-712121535








[GitHub] [spark] HyukjinKwon commented on pull request #30067: [SPARK-33169][SQL][TESTS] Check propagation of datasource options to underlying file system for built-in file-based datasources

Posted by GitBox <gi...@apache.org>.
HyukjinKwon commented on pull request #30067:
URL: https://github.com/apache/spark/pull/30067#issuecomment-711784560


   retest this please




[GitHub] [spark] AmplabJenkins removed a comment on pull request #30067: [SPARK-33169][SQL][TESTS] Check propagation of datasource options to underlying file system for built-in file-based datasources

Posted by GitBox <gi...@apache.org>.
AmplabJenkins removed a comment on pull request #30067:
URL: https://github.com/apache/spark/pull/30067#issuecomment-711317340


   Test FAILed.
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/34566/
   Test FAILed.




[GitHub] [spark] SparkQA commented on pull request #30067: [SPARK-33169][SQL][TESTS] Check propagation of datasource options to underlying file system for built-in file-based datasources

Posted by GitBox <gi...@apache.org>.
SparkQA commented on pull request #30067:
URL: https://github.com/apache/spark/pull/30067#issuecomment-711627124


   **[Test build #129987 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129987/testReport)** for PR 30067 at commit [`a397ead`](https://github.com/apache/spark/commit/a397ead711695946c32e81f29e29cbdb422d25bc).




[GitHub] [spark] HyukjinKwon commented on a change in pull request #30067: [SPARK-33169][SQL][TESTS] Check propagation of datasource options to underlying file system for built-in file-based datasources

Posted by GitBox <gi...@apache.org>.
HyukjinKwon commented on a change in pull request #30067:
URL: https://github.com/apache/spark/pull/30067#discussion_r507045951



##########
File path: sql/core/src/test/scala/org/apache/spark/sql/execution/datasources/CommonFileDataSourceSuite.scala
##########
@@ -0,0 +1,55 @@
+/*
+ * Licensed to the Apache Software Foundation (ASF) under one or more
+ * contributor license agreements.  See the NOTICE file distributed with
+ * this work for additional information regarding copyright ownership.
+ * The ASF licenses this file to You under the Apache License, Version 2.0
+ * (the "License"); you may not use this file except in compliance with
+ * the License.  You may obtain a copy of the License at
+ *
+ *    http://www.apache.org/licenses/LICENSE-2.0
+ *
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+
+package org.apache.spark.sql.execution.datasources
+
+import org.scalatest.funsuite.AnyFunSuite
+
+import org.apache.spark.sql.{Encoders, FakeFileSystemRequiringDSOption, QueryTest, Row}
+import org.apache.spark.sql.catalyst.plans.SQLHelper
+import org.apache.spark.sql.test.SQLTestData
+
+// The trait contains tests for all file-based data sources.
+trait CommonFileDataSourceSuite {
+  self: QueryTest with AnyFunSuite with SQLTestData with SQLHelper =>
+
+  protected def dataSourceFormat: String
+
+  test(s"Propagate Hadoop configs from $dataSourceFormat options to underlying file system") {

Review comment:
       Shall we add this test into `FileBasedDataSourceSuite`?






[GitHub] [spark] MaxGekk commented on a change in pull request #30067: [SPARK-33169][SQL][TESTS] Check propagation of datasource options to underlying file system for built-in file-based datasources

Posted by GitBox <gi...@apache.org>.
MaxGekk commented on a change in pull request #30067:
URL: https://github.com/apache/spark/pull/30067#discussion_r507054876



##########
File path: sql/core/src/test/scala/org/apache/spark/sql/execution/datasources/CommonFileDataSourceSuite.scala
##########
@@ -0,0 +1,55 @@
+/*
+ * Licensed to the Apache Software Foundation (ASF) under one or more
+ * contributor license agreements.  See the NOTICE file distributed with
+ * this work for additional information regarding copyright ownership.
+ * The ASF licenses this file to You under the Apache License, Version 2.0
+ * (the "License"); you may not use this file except in compliance with
+ * the License.  You may obtain a copy of the License at
+ *
+ *    http://www.apache.org/licenses/LICENSE-2.0
+ *
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+
+package org.apache.spark.sql.execution.datasources
+
+import org.scalatest.funsuite.AnyFunSuite
+
+import org.apache.spark.sql.{Encoders, FakeFileSystemRequiringDSOption, QueryTest, Row}
+import org.apache.spark.sql.catalyst.plans.SQLHelper
+import org.apache.spark.sql.test.SQLTestData
+
+// The trait contains tests for all file-based data sources.
+trait CommonFileDataSourceSuite {
+  self: QueryTest with AnyFunSuite with SQLTestData with SQLHelper =>
+
+  protected def dataSourceFormat: String
+
+  test(s"Propagate Hadoop configs from $dataSourceFormat options to underlying file system") {

Review comment:
       If I place the test in `FileBasedDataSourceSuite`, will I be able to test Avro? It would look slightly strange to test `external/avro` from `sql/core`, wouldn't it?






[GitHub] [spark] MaxGekk commented on a change in pull request #30067: [SPARK-33169][SQL][TESTS] Check propagation of datasource options to underlying file system for built-in file-based datasources

Posted by GitBox <gi...@apache.org>.
MaxGekk commented on a change in pull request #30067:
URL: https://github.com/apache/spark/pull/30067#discussion_r507070826



##########
File path: sql/core/src/test/scala/org/apache/spark/sql/execution/datasources/CommonFileDataSourceSuite.scala
##########
@@ -0,0 +1,55 @@
+/*
+ * Licensed to the Apache Software Foundation (ASF) under one or more
+ * contributor license agreements.  See the NOTICE file distributed with
+ * this work for additional information regarding copyright ownership.
+ * The ASF licenses this file to You under the Apache License, Version 2.0
+ * (the "License"); you may not use this file except in compliance with
+ * the License.  You may obtain a copy of the License at
+ *
+ *    http://www.apache.org/licenses/LICENSE-2.0
+ *
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+
+package org.apache.spark.sql.execution.datasources
+
+import org.scalatest.funsuite.AnyFunSuite
+
+import org.apache.spark.sql.{Encoders, FakeFileSystemRequiringDSOption, QueryTest, Row}
+import org.apache.spark.sql.catalyst.plans.SQLHelper
+import org.apache.spark.sql.test.SQLTestData
+
+// The trait contains tests for all file-based data sources.
+trait CommonFileDataSourceSuite {
+  self: QueryTest with AnyFunSuite with SQLTestData with SQLHelper =>

Review comment:
       Let me check that.






[GitHub] [spark] SparkQA commented on pull request #30067: [SPARK-33169][SQL][TESTS] Check propagation of datasource options to underlying file system for built-in file-based datasources

Posted by GitBox <gi...@apache.org>.
SparkQA commented on pull request #30067:
URL: https://github.com/apache/spark/pull/30067#issuecomment-711693322


   Kubernetes integration test starting
   URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34594/
   




[GitHub] [spark] SparkQA commented on pull request #30067: [SPARK-33169][SQL][TESTS] Check propagation of datasource options to underlying file system for built-in file-based datasources

Posted by GitBox <gi...@apache.org>.
SparkQA commented on pull request #30067:
URL: https://github.com/apache/spark/pull/30067#issuecomment-711896799


   Kubernetes integration test status success
   URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34602/
   




[GitHub] [spark] AmplabJenkins removed a comment on pull request #30067: [SPARK-33169][SQL][TESTS] Check propagation of datasource options to underlying file system for built-in file-based datasources

Posted by GitBox <gi...@apache.org>.
AmplabJenkins removed a comment on pull request #30067:
URL: https://github.com/apache/spark/pull/30067#issuecomment-711183220








[GitHub] [spark] SparkQA commented on pull request #30067: [SPARK-33169][SQL][TESTS] Check propagation of datasource options to underlying file system for built-in file-based datasources

Posted by GitBox <gi...@apache.org>.
SparkQA commented on pull request #30067:
URL: https://github.com/apache/spark/pull/30067#issuecomment-710037274


   Kubernetes integration test starting
   URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34501/
   




[GitHub] [spark] HyukjinKwon commented on pull request #30067: [SPARK-33169][SQL][TESTS] Check propagation of datasource options to underlying file system for built-in file-based datasources

Posted by GitBox <gi...@apache.org>.
HyukjinKwon commented on pull request #30067:
URL: https://github.com/apache/spark/pull/30067#issuecomment-711866313


   I am going to merge this. The tests all passed at https://github.com/apache/spark/runs/1271560150, and the last change is only comments.




[GitHub] [spark] MaxGekk commented on pull request #30067: [SPARK-33169][SQL][TESTS] Check propagation of datasource options to underlying file system for built-in file-based datasources

Posted by GitBox <gi...@apache.org>.
MaxGekk commented on pull request #30067:
URL: https://github.com/apache/spark/pull/30067#issuecomment-710015456


   @cloud-fan @HyukjinKwon Please take a look when you have time.




[GitHub] [spark] SparkQA commented on pull request #30067: [SPARK-33169][SQL][TESTS] Check propagation of datasource options to underlying file system for built-in file-based datasources

Posted by GitBox <gi...@apache.org>.
SparkQA commented on pull request #30067:
URL: https://github.com/apache/spark/pull/30067#issuecomment-711295944


   Kubernetes integration test starting
   URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/34566/
   




[GitHub] [spark] AmplabJenkins commented on pull request #30067: [SPARK-33169][SQL][TESTS] Check propagation of datasource options to underlying file system for built-in file-based datasources

Posted by GitBox <gi...@apache.org>.
AmplabJenkins commented on pull request #30067:
URL: https://github.com/apache/spark/pull/30067#issuecomment-710048301








[GitHub] [spark] SparkQA commented on pull request #30067: [SPARK-33169][SQL][TESTS] Check propagation of datasource options to underlying file system for built-in file-based datasources

Posted by GitBox <gi...@apache.org>.
SparkQA commented on pull request #30067:
URL: https://github.com/apache/spark/pull/30067#issuecomment-710198536


   **[Test build #129896 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129896/testReport)** for PR 30067 at commit [`dc7142a`](https://github.com/apache/spark/commit/dc7142aa62e105bb306f225112057c4be4197ecb).
    * This patch passes all tests.
    * This patch merges cleanly.
    * This patch adds no public classes.




[GitHub] [spark] HyukjinKwon commented on a change in pull request #30067: [SPARK-33169][SQL][TESTS] Check propagation of datasource options to underlying file system for built-in file-based datasources

Posted by GitBox <gi...@apache.org>.
HyukjinKwon commented on a change in pull request #30067:
URL: https://github.com/apache/spark/pull/30067#discussion_r507060706



##########
File path: sql/core/src/test/scala/org/apache/spark/sql/execution/datasources/CommonFileDataSourceSuite.scala
##########
@@ -0,0 +1,55 @@
+/*
+ * Licensed to the Apache Software Foundation (ASF) under one or more
+ * contributor license agreements.  See the NOTICE file distributed with
+ * this work for additional information regarding copyright ownership.
+ * The ASF licenses this file to You under the Apache License, Version 2.0
+ * (the "License"); you may not use this file except in compliance with
+ * the License.  You may obtain a copy of the License at
+ *
+ *    http://www.apache.org/licenses/LICENSE-2.0
+ *
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+
+package org.apache.spark.sql.execution.datasources
+
+import org.scalatest.funsuite.AnyFunSuite
+
+import org.apache.spark.sql.{Encoders, FakeFileSystemRequiringDSOption, QueryTest, Row}
+import org.apache.spark.sql.catalyst.plans.SQLHelper
+import org.apache.spark.sql.test.SQLTestData
+
+// The trait contains tests for all file-based data sources.
+trait CommonFileDataSourceSuite {
+  self: QueryTest with AnyFunSuite with SQLTestData with SQLHelper =>
+
+  protected def dataSourceFormat: String
+
+  test(s"Propagate Hadoop configs from $dataSourceFormat options to underlying file system") {

Review comment:
       Hmmm.. okay
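
The check named by this test can be modelled without Spark or Hadoop. The following is only a sketch under stated assumptions: `ConfSketch`, `FakeFsSketch`, `canOpen` and the option key `ds_option` are hypothetical stand-ins, not the PR's `FakeFileSystemRequiringDSOption` or any real API. It shows the idea that a fake file system which insists on seeing a datasource option can only be initialized when that option was propagated into its configuration.

```scala
// Hypothetical model of option propagation; no Spark or Hadoop classes are used.
object PropagationSketch {
  // Stand-in for a Hadoop-style configuration: a plain key/value map.
  final case class ConfSketch(entries: Map[String, String]) {
    // Datasource options are layered on top of the base entries.
    def withOptions(options: Map[String, String]): ConfSketch =
      ConfSketch(entries ++ options)
  }

  // Stand-in for a fake file system that refuses to initialize
  // unless a given key reached its configuration.
  final class FakeFsSketch(requiredKey: String) {
    def initialize(conf: ConfSketch): Unit =
      require(conf.entries.contains(requiredKey), s"missing option: $requiredKey")
  }

  // Returns true only when the options made it into the configuration
  // that the file system sees. "ds_option" is an assumed key name.
  def canOpen(options: Map[String, String]): Boolean = {
    val base = ConfSketch(Map("fs.file.impl" -> "FakeFsSketch"))
    val fs = new FakeFsSketch("ds_option")
    scala.util.Try(fs.initialize(base.withOptions(options))).isSuccess
  }
}
```

With this model, `PropagationSketch.canOpen(Map("ds_option" -> "value"))` succeeds, while calling it with an empty map fails, which is the kind of failure a propagation test is meant to surface.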






[GitHub] [spark] HyukjinKwon commented on a change in pull request #30067: [SPARK-33169][SQL][TESTS] Check propagation of datasource options to underlying file system for built-in file-based datasources

Posted by GitBox <gi...@apache.org>.
HyukjinKwon commented on a change in pull request #30067:
URL: https://github.com/apache/spark/pull/30067#discussion_r507065249



##########
File path: sql/core/src/test/scala/org/apache/spark/sql/execution/datasources/CommonFileDataSourceSuite.scala
##########
@@ -0,0 +1,55 @@
+/*
+ * Licensed to the Apache Software Foundation (ASF) under one or more
+ * contributor license agreements.  See the NOTICE file distributed with
+ * this work for additional information regarding copyright ownership.
+ * The ASF licenses this file to You under the Apache License, Version 2.0
+ * (the "License"); you may not use this file except in compliance with
+ * the License.  You may obtain a copy of the License at
+ *
+ *    http://www.apache.org/licenses/LICENSE-2.0
+ *
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+
+package org.apache.spark.sql.execution.datasources
+
+import org.scalatest.funsuite.AnyFunSuite
+
+import org.apache.spark.sql.{Encoders, FakeFileSystemRequiringDSOption, QueryTest, Row}
+import org.apache.spark.sql.catalyst.plans.SQLHelper
+import org.apache.spark.sql.test.SQLTestData
+
+// The trait contains tests for all file-based data sources.
+trait CommonFileDataSourceSuite {
+  self: QueryTest with AnyFunSuite with SQLTestData with SQLHelper =>

Review comment:
       Can't we simply self-type `QueryTest` alone, without the other traits?
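
A minimal sketch of what this suggestion could look like, assuming the self-typed class already brings in the test-registration and helper members the trait relies on. `FunSuiteLike`, `SqlHelperLike`, `QueryTestLike`, `CommonSuiteSketch` and `JsonSuiteSketch` are hypothetical stand-ins so that the example compiles on its own; they are not the Spark or ScalaTest types.

```scala
import scala.collection.mutable.ArrayBuffer

// Hypothetical stand-in for a FunSuite-style base: it only records test names.
trait FunSuiteLike {
  private val registered = ArrayBuffer.empty[String]
  def test(name: String)(body: => Unit): Unit = registered += name
  def testNames: Seq[String] = registered.toSeq
}

// Hypothetical stand-in for a helper trait.
trait SqlHelperLike

// Hypothetical stand-in for a base class that already mixes in both traits above.
abstract class QueryTestLike extends FunSuiteLike with SqlHelperLike

// A single self-type is enough: `test` is reachable through QueryTestLike.
trait CommonSuiteSketch {
  self: QueryTestLike =>

  protected def dataSourceFormat: String

  test(s"Propagate Hadoop configs from $dataSourceFormat options to underlying file system") {
    // Test body omitted in this sketch.
  }
}

// A concrete suite only has to supply the format name.
class JsonSuiteSketch extends QueryTestLike with CommonSuiteSketch {
  override protected def dataSourceFormat: String = "json"
}
```

Because `dataSourceFormat` is a `def`, the interpolated test name is safe to build while the trait is being initialized. A narrower self-type also means fewer traits for every mixing suite to satisfy.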






[GitHub] [spark] SparkQA removed a comment on pull request #30067: [SPARK-33169][SQL][TESTS] Check propagation of datasource options to underlying file system for built-in file-based datasources

Posted by GitBox <gi...@apache.org>.
SparkQA removed a comment on pull request #30067:
URL: https://github.com/apache/spark/pull/30067#issuecomment-711146558


   **[Test build #129958 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129958/testReport)** for PR 30067 at commit [`1a22f1d`](https://github.com/apache/spark/commit/1a22f1d63b1ede49e42402eac1adb1b04de577c2).




[GitHub] [spark] SparkQA commented on pull request #30067: [SPARK-33169][SQL][TESTS] Check propagation of datasource options to underlying file system for built-in file-based datasources

Posted by GitBox <gi...@apache.org>.
SparkQA commented on pull request #30067:
URL: https://github.com/apache/spark/pull/30067#issuecomment-710017515


   **[Test build #129896 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129896/testReport)** for PR 30067 at commit [`dc7142a`](https://github.com/apache/spark/commit/dc7142aa62e105bb306f225112057c4be4197ecb).




[GitHub] [spark] HyukjinKwon commented on a change in pull request #30067: [SPARK-33169][SQL][TESTS] Check propagation of datasource options to underlying file system for built-in file-based datasources

Posted by GitBox <gi...@apache.org>.
HyukjinKwon commented on a change in pull request #30067:
URL: https://github.com/apache/spark/pull/30067#discussion_r507298870



##########
File path: sql/core/src/test/scala/org/apache/spark/sql/execution/datasources/CommonFileDataSourceSuite.scala
##########
@@ -0,0 +1,59 @@
+/*
+ * Licensed to the Apache Software Foundation (ASF) under one or more
+ * contributor license agreements.  See the NOTICE file distributed with
+ * this work for additional information regarding copyright ownership.
+ * The ASF licenses this file to You under the Apache License, Version 2.0
+ * (the "License"); you may not use this file except in compliance with
+ * the License.  You may obtain a copy of the License at
+ *
+ *    http://www.apache.org/licenses/LICENSE-2.0
+ *
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+
+package org.apache.spark.sql.execution.datasources
+
+import org.scalatest.funsuite.AnyFunSuite
+
+import org.apache.spark.sql.{Dataset, Encoders, FakeFileSystemRequiringDSOption, SparkSession}
+import org.apache.spark.sql.catalyst.plans.SQLHelper
+
+// The trait contains tests for all file-based data sources. The tests that are not applicable to
+// all file-based data sources should be placed to `FileBasedDataSourceSuite`.

Review comment:
       I would make this a proper class doc. Since `FileBasedDataSourceSuite` is a proper class, the class doc should be able to refer to it as `[[org.apache.spark.sql.FileBasedDataSourceSuite]]`.
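
As a rough illustration of this suggestion, the two `//` comment lines could become a Scaladoc block along the following lines. The wording is only a sketch, and the trait name `CommonFileDataSourceSuiteDocSketch` is hypothetical and has no self-type, so that the snippet compiles without Spark on the classpath.

```scala
/**
 * The trait contains tests for all file-based data sources.
 *
 * The tests that are not applicable to all file-based data sources should be placed in
 * [[org.apache.spark.sql.FileBasedDataSourceSuite]].
 */
trait CommonFileDataSourceSuiteDocSketch {
  // Each concrete suite supplies the name of the format under test.
  protected def dataSourceFormat: String
}
```

The `[[...]]` form is Scaladoc's link syntax, so the reference should render as a link when the target class is visible to the documentation build.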






[GitHub] [spark] HyukjinKwon closed pull request #30067: [SPARK-33169][SQL][TESTS] Check propagation of datasource options to underlying file system for built-in file-based datasources

Posted by GitBox <gi...@apache.org>.
HyukjinKwon closed pull request #30067:
URL: https://github.com/apache/spark/pull/30067


   




[GitHub] [spark] AmplabJenkins commented on pull request #30067: [SPARK-33169][SQL][TESTS] Check propagation of datasource options to underlying file system for built-in file-based datasources

Posted by GitBox <gi...@apache.org>.
AmplabJenkins commented on pull request #30067:
URL: https://github.com/apache/spark/pull/30067#issuecomment-711153366








[GitHub] [spark] AmplabJenkins removed a comment on pull request #30067: [SPARK-33169][SQL][TESTS] Check propagation of datasource options to underlying file system for built-in file-based datasources

Posted by GitBox <gi...@apache.org>.
AmplabJenkins removed a comment on pull request #30067:
URL: https://github.com/apache/spark/pull/30067#issuecomment-711317314


   Merged build finished. Test FAILed.




[GitHub] [spark] AmplabJenkins commented on pull request #30067: [SPARK-33169][SQL][TESTS] Check propagation of datasource options to underlying file system for built-in file-based datasources

Posted by GitBox <gi...@apache.org>.
AmplabJenkins commented on pull request #30067:
URL: https://github.com/apache/spark/pull/30067#issuecomment-711896829








[GitHub] [spark] SparkQA removed a comment on pull request #30067: [SPARK-33169][SQL][TESTS] Check propagation of datasource options to underlying file system for built-in file-based datasources

Posted by GitBox <gi...@apache.org>.
SparkQA removed a comment on pull request #30067:
URL: https://github.com/apache/spark/pull/30067#issuecomment-711268491


   **[Test build #129960 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/129960/testReport)** for PR 30067 at commit [`e579d48`](https://github.com/apache/spark/commit/e579d483403780e774e6d8148aac4cf71df6cf60).




[GitHub] [spark] AmplabJenkins removed a comment on pull request #30067: [SPARK-33169][SQL][TESTS] Check propagation of datasource options to underlying file system for built-in file-based datasources

Posted by GitBox <gi...@apache.org>.
AmplabJenkins removed a comment on pull request #30067:
URL: https://github.com/apache/spark/pull/30067#issuecomment-711714746


   Test FAILed.
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/129987/

