You are viewing a plain text version of this content. The canonical link for it is here.
Posted to reviews@spark.apache.org by cloud-fan <gi...@git.apache.org> on 2018/02/07 15:00:50 UTC

[GitHub] spark pull request #20535: [SPARK-23341][SQL] define some standard options f...

GitHub user cloud-fan opened a pull request:

    https://github.com/apache/spark/pull/20535

    [SPARK-23341][SQL] define some standard options for data source v2

    ## What changes were proposed in this pull request?
    
    Each data source implementation can define its own options and teach its users how to set them. Spark doesn't have any restrictions about what options a data source should or should not have. It's possible that some options are very common and many data sources use them. However different data sources may define the common options(key and meaning) differently, which is quite confusing to end users.
    
    This PR defines some standard options that data sources can optionally adopt: path, table and database.
    
    ## How was this patch tested?
    
    a new test case.

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/cloud-fan/spark options

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/20535.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #20535
    
----
commit c9009d85d3f147eb652f141bbcb0424767e90477
Author: Wenchen Fan <we...@...>
Date:   2018-02-07T14:56:53Z

    define some standard options for data source v2

----


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20535: [SPARK-23341][SQL] define some standard options for data...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20535
  
    Merged build finished. Test FAILed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20535: [SPARK-23341][SQL] define some standard options for data...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20535
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/testing-k8s-prb-make-spark-distribution/672/
    Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20535: [SPARK-23341][SQL] define some standard options for data...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20535
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/89382/
    Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20535: [SPARK-23341][SQL] define some standard options for data...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20535
  
    Merged build finished. Test FAILed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20535: [SPARK-23341][SQL] define some standard options for data...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20535
  
    Merged build finished. Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20535: [SPARK-23341][SQL] define some standard options for data...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20535
  
    Merged build finished. Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20535: [SPARK-23341][SQL] define some standard options for data...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/20535
  
    **[Test build #89098 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/89098/testReport)** for PR 20535 at commit [`c5e403c`](https://github.com/apache/spark/commit/c5e403c960cdfb68755df754abf7aa96ac6d40bc).
     * This patch **fails Spark unit tests**.
     * This patch merges cleanly.
     * This patch adds no public classes.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20535: [SPARK-23341][SQL] define some standard options for data...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/20535
  
    **[Test build #89116 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/89116/testReport)** for PR 20535 at commit [`c5e403c`](https://github.com/apache/spark/commit/c5e403c960cdfb68755df754abf7aa96ac6d40bc).
     * This patch passes all tests.
     * This patch merges cleanly.
     * This patch adds no public classes.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20535: [SPARK-23341][SQL] define some standard options for data...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/20535
  
    **[Test build #89098 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/89098/testReport)** for PR 20535 at commit [`c5e403c`](https://github.com/apache/spark/commit/c5e403c960cdfb68755df754abf7aa96ac6d40bc).


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20535: [SPARK-23341][SQL] define some standard options for data...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/20535
  
    **[Test build #87187 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/87187/testReport)** for PR 20535 at commit [`3e8f71b`](https://github.com/apache/spark/commit/3e8f71b749f41d9ff1acb550a3ede6b8a6651ebd).
     * This patch **fails to generate documentation**.
     * This patch merges cleanly.
     * This patch adds no public classes.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20535: [SPARK-23341][SQL] define some standard options for data...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20535
  
    Merged build finished. Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #20535: [SPARK-23341][SQL] define some standard options f...

Posted by rxin <gi...@git.apache.org>.
Github user rxin commented on a diff in the pull request:

    https://github.com/apache/spark/pull/20535#discussion_r166701501
  
    --- Diff: sql/core/src/main/java/org/apache/spark/sql/sources/v2/DataSourceOptions.java ---
    @@ -27,6 +27,39 @@
     /**
      * An immutable string-to-string map in which keys are case-insensitive. This is used to represent
      * data source options.
    + *
    + * Each data source implementation can define its own options and teach its users how to set them.
    + * Spark doesn't have any restrictions about what options a data source should or should not have.
    + * Instead Spark defines some standard options that data sources can optionally adopt. It's possible
    + * that some options are very common and many data sources use them. However different data
    + * sources may define the common options(key and meaning) differently, which is quite confusing to
    + * end users.
    + *
    + * The standard options defined by Spark:
    + * <table>
    + *   <tr>
    + *     <th><b>Option key</b></th>
    + *     <th><b>Option value</b></th>
    + *   </tr>
    + *   <tr>
    + *     <td>path</td>
    + *     <td>A comma separated paths string of the data files/directories, like
    + *     <code>path1,/absolute/file2,path3/*</code>. Each path can either be relative or absolute,
    + *     points to either file or directory, and can contain wildcards. This option is commonly used
    + *     by file-based data sources.</td>
    + *   </tr>
    + *   <tr>
    + *     <td>table</td>
    + *     <td>A table name string representing the table name directly without any interpretation.
    --- End diff --
    
    what do you mean by "without any interpretation"?


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20535: [SPARK-23341][SQL] define some standard options for data...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20535
  
    Merged build finished. Test FAILed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20535: [SPARK-23341][SQL] define some standard options for data...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20535
  
    Merged build finished. Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20535: [SPARK-23341][SQL] define some standard options for data...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/20535
  
    **[Test build #87194 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/87194/testReport)** for PR 20535 at commit [`e92b6b2`](https://github.com/apache/spark/commit/e92b6b2083c4dbf31c27c961096a45cd8d84f16e).


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20535: [SPARK-23341][SQL] define some standard options for data...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20535
  
    Test FAILed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/87194/
    Test FAILed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #20535: [SPARK-23341][SQL] define some standard options f...

Posted by gengliangwang <gi...@git.apache.org>.
Github user gengliangwang commented on a diff in the pull request:

    https://github.com/apache/spark/pull/20535#discussion_r180714763
  
    --- Diff: sql/core/src/main/java/org/apache/spark/sql/sources/v2/DataSourceOptions.java ---
    @@ -17,16 +17,61 @@
     
     package org.apache.spark.sql.sources.v2;
     
    +import java.io.IOException;
     import java.util.HashMap;
     import java.util.Locale;
     import java.util.Map;
     import java.util.Optional;
    +import java.util.stream.Stream;
    +
    +import com.fasterxml.jackson.databind.ObjectMapper;
     
     import org.apache.spark.annotation.InterfaceStability;
     
     /**
      * An immutable string-to-string map in which keys are case-insensitive. This is used to represent
      * data source options.
    + *
    + * Each data source implementation can define its own options and teach its users how to set them.
    + * Spark doesn't have any restrictions about what options a data source should or should not have.
    + * Instead Spark defines some standard options that data sources can optionally adopt. It's possible
    + * that some options are very common and many data sources use them. However different data
    + * sources may define the common options(key and meaning) differently, which is quite confusing to
    + * end users.
    + *
    + * The standard options defined by Spark:
    + * <table summary="standard data source options">
    + *   <tr>
    + *     <th><b>Option key</b></th>
    + *     <th><b>Option value</b></th>
    + *   </tr>
    + *   <tr>
    + *     <td>path</td>
    + *     <td>A path string of the data files/directories, like
    + *     <code>path1</code>, <code>/absolute/file2</code>, <code>path3/*</code>. The path can
    + *     either be relative or absolute, points to either file or directory, and can contain
    + *     wildcards. This option is commonly used by file-based data sources.</td>
    + *   </tr>
    + *   <tr>
    + *     <td>paths</td>
    + *     <td>A JSON array style paths string of the data files/directories, like
    + *     <code>["path1", "/absolute/file2"]</code>. The format of each path is same as the
    + *     <code>path</code> option, plus it should follow JSON string literal format, e.g. quotes
    + *     should be escaped, <code>pa\"th</code> means pa"th.
    --- End diff --
    
    pa\"th?


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20535: [SPARK-23341][SQL] define some standard options for data...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20535
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/testing-k8s-prb-make-spark-distribution/694/
    Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20535: [SPARK-23341][SQL] define some standard options for data...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/20535
  
    **[Test build #89069 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/89069/testReport)** for PR 20535 at commit [`c811d72`](https://github.com/apache/spark/commit/c811d72f88552a30a985bdbb2c0005eddc56b5ff).
     * This patch **fails Spark unit tests**.
     * This patch merges cleanly.
     * This patch adds no public classes.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #20535: [SPARK-23341][SQL] define some standard options f...

Posted by rdblue <gi...@git.apache.org>.
Github user rdblue commented on a diff in the pull request:

    https://github.com/apache/spark/pull/20535#discussion_r169707938
  
    --- Diff: sql/core/src/main/java/org/apache/spark/sql/sources/v2/DataSourceOptions.java ---
    @@ -97,4 +130,20 @@ public double getDouble(String key, double defaultValue) {
         return keyLowerCasedMap.containsKey(lcaseKey) ?
           Double.parseDouble(keyLowerCasedMap.get(lcaseKey)) : defaultValue;
       }
    +
    +  public static final String KEY_PATH = "path";
    +  public static final String KEY_TABLE = "table";
    +  public static final String KEY_DATABASE = "database";
    +
    +  public Optional<String> getPath() {
    --- End diff --
    
    I think it is more friendly when using this in scala to drop the `get` and use just `path` or `database`.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20535: [SPARK-23341][SQL] define some standard options for data...

Posted by cloud-fan <gi...@git.apache.org>.
Github user cloud-fan commented on the issue:

    https://github.com/apache/spark/pull/20535
  
    thanks, merging to master!


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #20535: [SPARK-23341][SQL] define some standard options f...

Posted by rdblue <gi...@git.apache.org>.
Github user rdblue commented on a diff in the pull request:

    https://github.com/apache/spark/pull/20535#discussion_r169707644
  
    --- Diff: sql/core/src/main/java/org/apache/spark/sql/sources/v2/DataSourceOptions.java ---
    @@ -27,6 +27,39 @@
     /**
      * An immutable string-to-string map in which keys are case-insensitive. This is used to represent
      * data source options.
    + *
    + * Each data source implementation can define its own options and teach its users how to set them.
    + * Spark doesn't have any restrictions about what options a data source should or should not have.
    + * Instead Spark defines some standard options that data sources can optionally adopt. It's possible
    + * that some options are very common and many data sources use them. However different data
    + * sources may define the common options(key and meaning) differently, which is quite confusing to
    + * end users.
    + *
    + * The standard options defined by Spark:
    + * <table>
    + *   <tr>
    + *     <th><b>Option key</b></th>
    + *     <th><b>Option value</b></th>
    + *   </tr>
    + *   <tr>
    + *     <td>path</td>
    + *     <td>A comma separated paths string of the data files/directories, like
    + *     <code>path1,/absolute/file2,path3/*</code>. Each path can either be relative or absolute,
    + *     points to either file or directory, and can contain wildcards. This option is commonly used
    + *     by file-based data sources.</td>
    + *   </tr>
    + *   <tr>
    + *     <td>table</td>
    + *     <td>A table name string representing the table name directly without any interpretation.
    --- End diff --
    
    I think this is clear with the examples.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #20535: [SPARK-23341][SQL] define some standard options f...

Posted by cloud-fan <gi...@git.apache.org>.
Github user cloud-fan commented on a diff in the pull request:

    https://github.com/apache/spark/pull/20535#discussion_r166826138
  
    --- Diff: sql/core/src/main/java/org/apache/spark/sql/sources/v2/DataSourceOptions.java ---
    @@ -27,6 +27,39 @@
     /**
      * An immutable string-to-string map in which keys are case-insensitive. This is used to represent
      * data source options.
    + *
    + * Each data source implementation can define its own options and teach its users how to set them.
    + * Spark doesn't have any restrictions about what options a data source should or should not have.
    + * Instead Spark defines some standard options that data sources can optionally adopt. It's possible
    + * that some options are very common and many data sources use them. However different data
    + * sources may define the common options(key and meaning) differently, which is quite confusing to
    + * end users.
    + *
    + * The standard options defined by Spark:
    + * <table>
    + *   <tr>
    + *     <th><b>Option key</b></th>
    + *     <th><b>Option value</b></th>
    + *   </tr>
    + *   <tr>
    + *     <td>path</td>
    + *     <td>A comma separated paths string of the data files/directories, like
    + *     <code>path1,/absolute/file2,path3/*</code>. Each path can either be relative or absolute,
    + *     points to either file or directory, and can contain wildcards. This option is commonly used
    + *     by file-based data sources.</td>
    + *   </tr>
    + *   <tr>
    + *     <td>table</td>
    + *     <td>A table name string representing the table name directly without any interpretation.
    --- End diff --
    
    It means it's a pure string, there is not parsing rule for it like SQL identifier. I put some examples below and hopefully they can explain it well.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20535: [SPARK-23341][SQL] define some standard options for data...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20535
  
    Merged build finished. Test FAILed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20535: [SPARK-23341][SQL] define some standard options for data...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20535
  
    Merged build finished. Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20535: [SPARK-23341][SQL] define some standard options for data...

Posted by cloud-fan <gi...@git.apache.org>.
Github user cloud-fan commented on the issue:

    https://github.com/apache/spark/pull/20535
  
    retest this please


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #20535: [SPARK-23341][SQL] define some standard options f...

Posted by gengliangwang <gi...@git.apache.org>.
Github user gengliangwang commented on a diff in the pull request:

    https://github.com/apache/spark/pull/20535#discussion_r180171421
  
    --- Diff: sql/core/src/main/java/org/apache/spark/sql/sources/v2/DataSourceOptions.java ---
    @@ -97,4 +142,66 @@ public double getDouble(String key, double defaultValue) {
         return keyLowerCasedMap.containsKey(lcaseKey) ?
           Double.parseDouble(keyLowerCasedMap.get(lcaseKey)) : defaultValue;
       }
    +
    +  /**
    +   * The option key for singular path.
    +   */
    +  public static final String PATH_KEY = "path";
    +
    +  /**
    +   * The option key for multiple paths.
    +   */
    +  public static final String PATHS_KEY = "paths";
    +
    +  /**
    +   * The option key for table name.
    +   */
    +  public static final String TABLE_KEY = "table";
    +
    +  /**
    +   * The option key for database name.
    +   */
    +  public static final String DATABASE_KEY = "database";
    +
    +  /**
    +   * Returns the value of the singular path option.
    +   */
    +  public Optional<String> path() {
    +    return get(PATH_KEY);
    +  }
    +
    +  /**
    +   * Returns all the paths specified by both the singular path option and the multiple
    +   * paths option.
    +   */
    +  public String[] paths() {
    +    String[] singularPath = path().map(s -> new String[]{s}).orElseGet(() -> new String[0]);
    +    Optional<String> pathsStr = get(PATHS_KEY);
    +    System.out.println(pathsStr);
    --- End diff --
    
    remove println :)


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20535: [SPARK-23341][SQL] define some standard options for data...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20535
  
    Merged build finished. Test FAILed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20535: [SPARK-23341][SQL] define some standard options for data...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/20535
  
    **[Test build #87186 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/87186/testReport)** for PR 20535 at commit [`86bcda9`](https://github.com/apache/spark/commit/86bcda949f24e475c8284412874fe1c871512a27).


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20535: [SPARK-23341][SQL] define some standard options for data...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20535
  
    Merged build finished. Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20535: [SPARK-23341][SQL] define some standard options for data...

Posted by cloud-fan <gi...@git.apache.org>.
Github user cloud-fan commented on the issue:

    https://github.com/apache/spark/pull/20535
  
    cc @rxin @rdblue @gatorsmile 


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20535: [SPARK-23341][SQL] define some standard options for data...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20535
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/testing-k8s-prb-make-spark-distribution/707/
    Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20535: [SPARK-23341][SQL] define some standard options for data...

Posted by rdblue <gi...@git.apache.org>.
Github user rdblue commented on the issue:

    https://github.com/apache/spark/pull/20535
  
    This should move the standard options to `DataSourceV2Relation` to avoid needing to instantiate `DataSourceOptions` wherever the relation is created.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20535: [SPARK-23341][SQL] define some standard options for data...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/20535
  
    **[Test build #89116 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/89116/testReport)** for PR 20535 at commit [`c5e403c`](https://github.com/apache/spark/commit/c5e403c960cdfb68755df754abf7aa96ac6d40bc).


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #20535: [SPARK-23341][SQL] define some standard options f...

Posted by cloud-fan <gi...@git.apache.org>.
Github user cloud-fan commented on a diff in the pull request:

    https://github.com/apache/spark/pull/20535#discussion_r183580907
  
    --- Diff: sql/core/src/main/scala/org/apache/spark/sql/DataFrameReader.scala ---
    @@ -193,10 +196,13 @@ class DataFrameReader private[sql](sparkSession: SparkSession) extends Logging {
           if (ds.isInstanceOf[ReadSupport] || ds.isInstanceOf[ReadSupportWithSchema]) {
             val sessionOptions = DataSourceV2Utils.extractSessionConfigs(
               ds = ds, conf = sparkSession.sessionState.conf)
    +        val pathsOption = {
    +          val objectMapper = new ObjectMapper()
    +          DataSourceOptions.PATHS_KEY -> objectMapper.writeValueAsString(paths.toArray)
    +        }
             Dataset.ofRows(sparkSession, DataSourceV2Relation.create(
    -          ds, extraOptions.toMap ++ sessionOptions,
    +          ds, extraOptions.toMap ++ sessionOptions + pathsOption,
    --- End diff --
    
    Basically we may have duplicated entries in session configs and `DataFrameReader/Writer` options, not only path. The rule is, `DataFrameReader/Writer` options should overwrite session configs.
    
    cc @jiangxb1987 can you submit a PR to explicitly document it in `SessionConfigSupport`?


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20535: [SPARK-23341][SQL] define some standard options for data...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20535
  
    Test FAILed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/89098/
    Test FAILed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20535: [SPARK-23341][SQL] define some standard options for data...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20535
  
    Test FAILed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/87168/
    Test FAILed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20535: [SPARK-23341][SQL] define some standard options for data...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20535
  
    Merged build finished. Test FAILed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20535: [SPARK-23341][SQL] define some standard options for data...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20535
  
    Merged build finished. Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20535: [SPARK-23341][SQL] define some standard options for data...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20535
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/testing-k8s-prb-make-spark-distribution/685/
    Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20535: [SPARK-23341][SQL] define some standard options for data...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20535
  
    Merged build finished. Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20535: [SPARK-23341][SQL] define some standard options for data...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20535
  
    Merged build finished. Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20535: [SPARK-23341][SQL] define some standard options for data...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/20535
  
    **[Test build #87207 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/87207/testReport)** for PR 20535 at commit [`e92b6b2`](https://github.com/apache/spark/commit/e92b6b2083c4dbf31c27c961096a45cd8d84f16e).


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20535: [SPARK-23341][SQL] define some standard options for data...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20535
  
    Merged build finished. Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20535: [SPARK-23341][SQL] define some standard options for data...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/20535
  
    **[Test build #87187 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/87187/testReport)** for PR 20535 at commit [`3e8f71b`](https://github.com/apache/spark/commit/3e8f71b749f41d9ff1acb550a3ede6b8a6651ebd).


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20535: [SPARK-23341][SQL] define some standard options for data...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20535
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/89116/
    Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20535: [SPARK-23341][SQL] define some standard options for data...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/20535
  
    **[Test build #87207 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/87207/testReport)** for PR 20535 at commit [`e92b6b2`](https://github.com/apache/spark/commit/e92b6b2083c4dbf31c27c961096a45cd8d84f16e).
     * This patch passes all tests.
     * This patch merges cleanly.
     * This patch adds no public classes.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #20535: [SPARK-23341][SQL] define some standard options f...

Posted by asfgit <gi...@git.apache.org>.
Github user asfgit closed the pull request at:

    https://github.com/apache/spark/pull/20535


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20535: [SPARK-23341][SQL] define some standard options for data...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/20535
  
    **[Test build #89069 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/89069/testReport)** for PR 20535 at commit [`c811d72`](https://github.com/apache/spark/commit/c811d72f88552a30a985bdbb2c0005eddc56b5ff).


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20535: [SPARK-23341][SQL] define some standard options for data...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/20535
  
    **[Test build #87168 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/87168/testReport)** for PR 20535 at commit [`c9009d8`](https://github.com/apache/spark/commit/c9009d85d3f147eb652f141bbcb0424767e90477).


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20535: [SPARK-23341][SQL] define some standard options for data...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20535
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/testing-k8s-prb-make-spark-distribution/691/
    Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20535: [SPARK-23341][SQL] define some standard options for data...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20535
  
    Merged build finished. Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #20535: [SPARK-23341][SQL] define some standard options f...

Posted by cloud-fan <gi...@git.apache.org>.
Github user cloud-fan commented on a diff in the pull request:

    https://github.com/apache/spark/pull/20535#discussion_r166826237
  
    --- Diff: sql/core/src/main/scala/org/apache/spark/sql/DataFrameReader.scala ---
    @@ -171,7 +171,8 @@ class DataFrameReader private[sql](sparkSession: SparkSession) extends Logging {
        * @since 1.4.0
        */
       def load(path: String): DataFrame = {
    -    option("path", path).load(Seq.empty: _*) // force invocation of `load(...varargs...)`
    +    // force invocation of `load(...varargs...)`
    +    option(DataSourceOptions.KEY_PATH, path).load(Seq.empty: _*)
    --- End diff --
    
    makes sense, let me change it back.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #20535: [SPARK-23341][SQL] define some standard options f...

Posted by rdblue <gi...@git.apache.org>.
Github user rdblue commented on a diff in the pull request:

    https://github.com/apache/spark/pull/20535#discussion_r166709990
  
    --- Diff: sql/core/src/main/scala/org/apache/spark/sql/DataFrameReader.scala ---
    @@ -171,7 +171,8 @@ class DataFrameReader private[sql](sparkSession: SparkSession) extends Logging {
        * @since 1.4.0
        */
       def load(path: String): DataFrame = {
    -    option("path", path).load(Seq.empty: _*) // force invocation of `load(...varargs...)`
    +    // force invocation of `load(...varargs...)`
    +    option(DataSourceOptions.KEY_PATH, path).load(Seq.empty: _*)
    --- End diff --
    
    It seems odd to me to change this string. While there's no behavior change, the constant is for a key in v2's DataSourceOptions, not for the DataFrameReader API. We could change it to "PATH" and it would be perfectly fine for v2, but would change the behavior here. Such a change is incredibly unlikely, which is why I say it is just "odd".


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20535: [SPARK-23341][SQL] define some standard options for data...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/20535
  
    **[Test build #87191 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/87191/testReport)** for PR 20535 at commit [`6644e49`](https://github.com/apache/spark/commit/6644e49ce41e971103298fe3966e921765a82804).


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20535: [SPARK-23341][SQL] define some standard options for data...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20535
  
    Test FAILed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/89069/
    Test FAILed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20535: [SPARK-23341][SQL] define some standard options for data...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/20535
  
    **[Test build #89382 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/89382/testReport)** for PR 20535 at commit [`c5e403c`](https://github.com/apache/spark/commit/c5e403c960cdfb68755df754abf7aa96ac6d40bc).


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20535: [SPARK-23341][SQL] define some standard options for data...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/20535
  
    **[Test build #89382 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/89382/testReport)** for PR 20535 at commit [`c5e403c`](https://github.com/apache/spark/commit/c5e403c960cdfb68755df754abf7aa96ac6d40bc).
     * This patch passes all tests.
     * This patch merges cleanly.
     * This patch adds no public classes.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #20535: [SPARK-23341][SQL] define some standard options f...

Posted by rdblue <gi...@git.apache.org>.
Github user rdblue commented on a diff in the pull request:

    https://github.com/apache/spark/pull/20535#discussion_r169708283
  
    --- Diff: sql/core/src/main/scala/org/apache/spark/sql/DataFrameReader.scala ---
    @@ -190,10 +190,15 @@ class DataFrameReader private[sql](sparkSession: SparkSession) extends Logging {
         val cls = DataSource.lookupDataSource(source, sparkSession.sessionState.conf)
         if (classOf[DataSourceV2].isAssignableFrom(cls)) {
           val ds = cls.newInstance()
    -      val options = new DataSourceOptions((extraOptions ++
    -        DataSourceV2Utils.extractSessionConfigs(
    -          ds = ds.asInstanceOf[DataSourceV2],
    -          conf = sparkSession.sessionState.conf)).asJava)
    +      val sessionOptions = DataSourceV2Utils.extractSessionConfigs(
    +        ds = ds.asInstanceOf[DataSourceV2],
    +        conf = sparkSession.sessionState.conf)
    +      val pathOption = if (paths.isEmpty) {
    +        Map.empty
    +      } else {
    +        Map(DataSourceOptions.KEY_PATH -> paths.mkString(","))
    --- End diff --
    
    `KEY_PATH` sounds like the path for a key. It would be more clear if the name was `PATH_KEY`.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20535: [SPARK-23341][SQL] define some standard options for data...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20535
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/testing-k8s-prb-make-spark-distribution/687/
    Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20535: [SPARK-23341][SQL] define some standard options for data...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20535
  
    Merged build finished. Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20535: [SPARK-23341][SQL] define some standard options for data...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20535
  
    Test FAILed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/87191/
    Test FAILed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20535: [SPARK-23341][SQL] define some standard options for data...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/20535
  
    **[Test build #87191 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/87191/testReport)** for PR 20535 at commit [`6644e49`](https://github.com/apache/spark/commit/6644e49ce41e971103298fe3966e921765a82804).
     * This patch **fails to generate documentation**.
     * This patch merges cleanly.
     * This patch adds no public classes.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #20535: [SPARK-23341][SQL] define some standard options f...

Posted by gatorsmile <gi...@git.apache.org>.
Github user gatorsmile commented on a diff in the pull request:

    https://github.com/apache/spark/pull/20535#discussion_r183470867
  
    --- Diff: sql/core/src/main/scala/org/apache/spark/sql/DataFrameReader.scala ---
    @@ -193,10 +196,13 @@ class DataFrameReader private[sql](sparkSession: SparkSession) extends Logging {
           if (ds.isInstanceOf[ReadSupport] || ds.isInstanceOf[ReadSupportWithSchema]) {
             val sessionOptions = DataSourceV2Utils.extractSessionConfigs(
               ds = ds, conf = sparkSession.sessionState.conf)
    +        val pathsOption = {
    +          val objectMapper = new ObjectMapper()
    +          DataSourceOptions.PATHS_KEY -> objectMapper.writeValueAsString(paths.toArray)
    +        }
             Dataset.ofRows(sparkSession, DataSourceV2Relation.create(
    -          ds, extraOptions.toMap ++ sessionOptions,
    +          ds, extraOptions.toMap ++ sessionOptions + pathsOption,
    --- End diff --
    
    issue an exception when extraOptions("path") is not empty?


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20535: [SPARK-23341][SQL] define some standard options for data...

Posted by cloud-fan <gi...@git.apache.org>.
Github user cloud-fan commented on the issue:

    https://github.com/apache/spark/pull/20535
  
    > This should move the standard options to DataSourceV2Relation to avoid needing to instantiate DataSourceOptions wherever the relation is created.
    
    @rdblue We don't have this problem now, so I'd like to not touch `DataSourceV2Relation` here and rethink about it when the problem really comes out.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20535: [SPARK-23341][SQL] define some standard options for data...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20535
  
    Merged build finished. Test FAILed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20535: [SPARK-23341][SQL] define some standard options for data...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/20535
  
    **[Test build #87186 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/87186/testReport)** for PR 20535 at commit [`86bcda9`](https://github.com/apache/spark/commit/86bcda949f24e475c8284412874fe1c871512a27).
     * This patch **fails to generate documentation**.
     * This patch merges cleanly.
     * This patch adds no public classes.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20535: [SPARK-23341][SQL] define some standard options for data...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20535
  
    Test FAILed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/87186/
    Test FAILed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20535: [SPARK-23341][SQL] define some standard options for data...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20535
  
    Merged build finished. Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20535: [SPARK-23341][SQL] define some standard options for data...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20535
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/testing-k8s-prb-make-spark-distribution/2106/
    Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20535: [SPARK-23341][SQL] define some standard options for data...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20535
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/testing-k8s-prb-make-spark-distribution/2333/
    Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20535: [SPARK-23341][SQL] define some standard options for data...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20535
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/87207/
    Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20535: [SPARK-23341][SQL] define some standard options for data...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20535
  
    Test FAILed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/87187/
    Test FAILed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20535: [SPARK-23341][SQL] define some standard options for data...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/20535
  
    **[Test build #87168 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/87168/testReport)** for PR 20535 at commit [`c9009d8`](https://github.com/apache/spark/commit/c9009d85d3f147eb652f141bbcb0424767e90477).
     * This patch **fails to generate documentation**.
     * This patch merges cleanly.
     * This patch adds no public classes.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20535: [SPARK-23341][SQL] define some standard options for data...

Posted by cloud-fan <gi...@git.apache.org>.
Github user cloud-fan commented on the issue:

    https://github.com/apache/spark/pull/20535
  
    retest this please


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20535: [SPARK-23341][SQL] define some standard options for data...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20535
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/testing-k8s-prb-make-spark-distribution/2147/
    Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20535: [SPARK-23341][SQL] define some standard options for data...

Posted by cloud-fan <gi...@git.apache.org>.
Github user cloud-fan commented on the issue:

    https://github.com/apache/spark/pull/20535
  
    retest this please


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20535: [SPARK-23341][SQL] define some standard options for data...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20535
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/testing-k8s-prb-make-spark-distribution/2132/
    Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20535: [SPARK-23341][SQL] define some standard options for data...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/20535
  
    **[Test build #87194 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/87194/testReport)** for PR 20535 at commit [`e92b6b2`](https://github.com/apache/spark/commit/e92b6b2083c4dbf31c27c961096a45cd8d84f16e).
     * This patch **fails due to an unknown error code, -9**.
     * This patch merges cleanly.
     * This patch adds no public classes.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org