You are viewing a plain text version of this content. The canonical link for it is here.
Posted to reviews@spark.apache.org by MaxGekk <gi...@git.apache.org> on 2018/11/06 13:20:20 UTC

[GitHub] spark pull request #22956: [SPARK-25950][SQL] from_csv should respect to spa...

GitHub user MaxGekk opened a pull request:

    https://github.com/apache/spark/pull/22956

    [SPARK-25950][SQL] from_csv should respect to spark.sql.columnNameOfCorruptRecord

    ## What changes were proposed in this pull request?
    
    Fix for `CsvToStructs` to take into account SQL config `spark.sql.columnNameOfCorruptRecord` similar to `from_json`.
    
    ## How was this patch tested?
    
    Added new test where `spark.sql.columnNameOfCorruptRecord` is set to corrupt column name different from default.


You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/MaxGekk/spark-1 csv-tests

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/22956.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #22956
    
----
commit 797dfc68da7a1038cd9c2e725d44ca4561a16edd
Author: Maxim Gekk <ma...@...>
Date:   2018-11-06T13:15:19Z

    Added a test

commit 0767c50dc9419060ce9ef446fa58db4c2c95a9ab
Author: Maxim Gekk <ma...@...>
Date:   2018-11-06T13:15:40Z

    Taking into account SQL config

----


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #22956: [SPARK-25950][SQL] from_csv should respect to spark.sql....

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/22956
  
    **[Test build #98518 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/98518/testReport)** for PR 22956 at commit [`0767c50`](https://github.com/apache/spark/commit/0767c50dc9419060ce9ef446fa58db4c2c95a9ab).


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #22956: [SPARK-25950][SQL] from_csv should respect to spark.sql....

Posted by HyukjinKwon <gi...@git.apache.org>.
Github user HyukjinKwon commented on the issue:

    https://github.com/apache/spark/pull/22956
  
    Looks good. I or someone else should take a closer look before getting this in.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #22956: [SPARK-25950][SQL] from_csv should respect to spark.sql....

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/22956
  
    Merged build finished. Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #22956: [SPARK-25950][SQL] from_csv should respect to spark.sql....

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/22956
  
    Can one of the admins verify this patch?


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #22956: [SPARK-25950][SQL] from_csv should respect to spark.sql....

Posted by HyukjinKwon <gi...@git.apache.org>.
Github user HyukjinKwon commented on the issue:

    https://github.com/apache/spark/pull/22956
  
    Merged to master.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #22956: [SPARK-25950][SQL] from_csv should respect to spa...

Posted by asfgit <gi...@git.apache.org>.
Github user asfgit closed the pull request at:

    https://github.com/apache/spark/pull/22956


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #22956: [SPARK-25950][SQL] from_csv should respect to spark.sql....

Posted by MaxGekk <gi...@git.apache.org>.
Github user MaxGekk commented on the issue:

    https://github.com/apache/spark/pull/22956
  
    @HyukjinKwon @cloud-fan May I ask you to take a look at the PR.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #22956: [SPARK-25950][SQL] from_csv should respect to spark.sql....

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/22956
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/98518/
    Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #22956: [SPARK-25950][SQL] from_csv should respect to spark.sql....

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/22956
  
    **[Test build #98518 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/98518/testReport)** for PR 22956 at commit [`0767c50`](https://github.com/apache/spark/commit/0767c50dc9419060ce9ef446fa58db4c2c95a9ab).
     * This patch passes all tests.
     * This patch merges cleanly.
     * This patch adds no public classes.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #22956: [SPARK-25950][SQL] from_csv should respect to spa...

Posted by cloud-fan <gi...@git.apache.org>.
Github user cloud-fan commented on a diff in the pull request:

    https://github.com/apache/spark/pull/22956#discussion_r231359024
  
    --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/csvExpressions.scala ---
    @@ -92,8 +93,14 @@ case class CsvToStructs(
         }
       }
     
    +  val nameOfCorruptRecord = SQLConf.get.getConf(SQLConf.COLUMN_NAME_OF_CORRUPT_RECORD)
    --- End diff --
    
    should this be private?


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #22956: [SPARK-25950][SQL] from_csv should respect to spark.sql....

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/22956
  
    Can one of the admins verify this patch?


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #22956: [SPARK-25950][SQL] from_csv should respect to spa...

Posted by HyukjinKwon <gi...@git.apache.org>.
Github user HyukjinKwon commented on a diff in the pull request:

    https://github.com/apache/spark/pull/22956#discussion_r231370599
  
    --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/csvExpressions.scala ---
    @@ -92,8 +93,14 @@ case class CsvToStructs(
         }
       }
     
    +  val nameOfCorruptRecord = SQLConf.get.getConf(SQLConf.COLUMN_NAME_OF_CORRUPT_RECORD)
    --- End diff --
    
    Yea, I think so.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #22956: [SPARK-25950][SQL] from_csv should respect to spark.sql....

Posted by cloud-fan <gi...@git.apache.org>.
Github user cloud-fan commented on the issue:

    https://github.com/apache/spark/pull/22956
  
    LGTM


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org