You are viewing a plain text version of this content. The canonical link for it is here.
Posted to reviews@spark.apache.org by gatorsmile <gi...@git.apache.org> on 2018/02/14 23:29:25 UTC

[GitHub] spark pull request #20614: Revert [SPARK-23094] Fix invalid character handli...

GitHub user gatorsmile opened a pull request:

    https://github.com/apache/spark/pull/20614

    Revert [SPARK-23094] Fix invalid character handling in JsonDataSource

    ## What changes were proposed in this pull request?
    This PR is to revert the PR https://github.com/apache/spark/pull/20302, because it causes a regression.
    
    ## How was this patch tested?
    Added a test case. Without the revert, it return an empty result set. 

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/gatorsmile/spark revertJsonFix

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/20614.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #20614
    
----
commit 3cd91b1d07d3cb2451045913d0c1e27226a67816
Author: gatorsmile <ga...@...>
Date:   2018-02-14T23:23:34Z

    added test case

commit d4015d0c3c9b5cae0309bd6b9486b4990c7f4479
Author: gatorsmile <ga...@...>
Date:   2018-02-14T23:24:38Z

    Revert "[SPARK-23094] Fix invalid character handling in JsonDataSource"
    
    This reverts commit e01919e834d301e13adc8919932796ebae900576.

----


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20614: Revert [SPARK-23094] Fix invalid character handling in J...

Posted by HyukjinKwon <gi...@git.apache.org>.
Github user HyukjinKwon commented on the issue:

    https://github.com/apache/spark/pull/20614
  
    Wait .. do we formally support UTF-16 encoded files in line by line parsing via JSON? I think this accidentally works because the newline character in UTF-8 is the part of UTF-16.
    
    I think I am fine to revert it but how about we avoid adding a test?


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20614: Revert [SPARK-23094] Fix invalid character handling in J...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20614
  
    Merged build finished. Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20614: Revert [SPARK-23094] Fix invalid character handling in J...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20614
  
    Merged build finished. Test FAILed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20614: Revert [SPARK-23094] Fix invalid character handling in J...

Posted by gatorsmile <gi...@git.apache.org>.
Github user gatorsmile commented on the issue:

    https://github.com/apache/spark/pull/20614
  
    Since this is just to remove the file and the previous test already passed, I would merge it to master/2.3. Thanks!


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20614: Revert [SPARK-23094] Fix invalid character handling in J...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/20614
  
    **[Test build #87473 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/87473/testReport)** for PR 20614 at commit [`0bb86c6`](https://github.com/apache/spark/commit/0bb86c6e45967de2fe92980422b5a3ccd83fb15f).


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20614: Revert [SPARK-23094] Fix invalid character handling in J...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20614
  
    Merged build finished. Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20614: Revert [SPARK-23094] Fix invalid character handling in J...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20614
  
    Test FAILed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/87456/
    Test FAILed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20614: Revert [SPARK-23094] Fix invalid character handling in J...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/20614
  
    **[Test build #87456 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/87456/testReport)** for PR 20614 at commit [`d4015d0`](https://github.com/apache/spark/commit/d4015d0c3c9b5cae0309bd6b9486b4990c7f4479).
     * This patch **fails Spark unit tests**.
     * This patch merges cleanly.
     * This patch adds no public classes.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20614: Revert [SPARK-23094] Fix invalid character handling in J...

Posted by dongjoon-hyun <gi...@git.apache.org>.
Github user dongjoon-hyun commented on the issue:

    https://github.com/apache/spark/pull/20614
  
    Retest this please


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20614: Revert [SPARK-23094] Fix invalid character handling in J...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20614
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/testing-k8s-prb-make-spark-distribution/911/
    Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20614: Revert [SPARK-23094] Fix invalid character handling in J...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20614
  
    Merged build finished. Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20614: Revert [SPARK-23094] Fix invalid character handling in J...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/20614
  
    **[Test build #87456 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/87456/testReport)** for PR 20614 at commit [`d4015d0`](https://github.com/apache/spark/commit/d4015d0c3c9b5cae0309bd6b9486b4990c7f4479).


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20614: Revert [SPARK-23094] Fix invalid character handling in J...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20614
  
    Test FAILed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/87473/
    Test FAILed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20614: Revert [SPARK-23094] Fix invalid character handling in J...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/20614
  
    **[Test build #87470 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/87470/testReport)** for PR 20614 at commit [`86c88ae`](https://github.com/apache/spark/commit/86c88ae9ce02ff25b6b4c8cdac4fe73cb1b65b8b).


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20614: Revert [SPARK-23094] Fix invalid character handling in J...

Posted by gatorsmile <gi...@git.apache.org>.
Github user gatorsmile commented on the issue:

    https://github.com/apache/spark/pull/20614
  
    cc @brkyvz @MaxGekk @dongjoon-hyun @HyukjinKwon @hvanhovell 


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20614: Revert [SPARK-23094] Fix invalid character handling in J...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/20614
  
    **[Test build #87470 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/87470/testReport)** for PR 20614 at commit [`86c88ae`](https://github.com/apache/spark/commit/86c88ae9ce02ff25b6b4c8cdac4fe73cb1b65b8b).
     * This patch **fails due to an unknown error code, -9**.
     * This patch merges cleanly.
     * This patch adds no public classes.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20614: Revert [SPARK-23094] Fix invalid character handling in J...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20614
  
    Merged build finished. Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20614: Revert [SPARK-23094] Fix invalid character handling in J...

Posted by brkyvz <gi...@git.apache.org>.
Github user brkyvz commented on the issue:

    https://github.com/apache/spark/pull/20614
  
    LGTM, my initial assumption that files had to be UTF-8 encoded was a wrong one :(


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20614: Revert [SPARK-23094] Fix invalid character handling in J...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20614
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/testing-k8s-prb-make-spark-distribution/913/
    Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20614: Revert [SPARK-23094] Fix invalid character handling in J...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20614
  
    Merged build finished. Test FAILed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20614: Revert [SPARK-23094] Fix invalid character handling in J...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20614
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/testing-k8s-prb-make-spark-distribution/901/
    Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20614: Revert [SPARK-23094] Fix invalid character handling in J...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/20614
  
    **[Test build #87467 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/87467/testReport)** for PR 20614 at commit [`d4015d0`](https://github.com/apache/spark/commit/d4015d0c3c9b5cae0309bd6b9486b4990c7f4479).


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20614: Revert [SPARK-23094] Fix invalid character handling in J...

Posted by gatorsmile <gi...@git.apache.org>.
Github user gatorsmile commented on the issue:

    https://github.com/apache/spark/pull/20614
  
    Saw the history. Our UTF-16 support is pretty weak. Let me revert the test case.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20614: Revert [SPARK-23094] Fix invalid character handling in J...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20614
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/testing-k8s-prb-make-spark-distribution/908/
    Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20614: Revert [SPARK-23094] Fix invalid character handling in J...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/20614
  
    **[Test build #87473 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/87473/testReport)** for PR 20614 at commit [`0bb86c6`](https://github.com/apache/spark/commit/0bb86c6e45967de2fe92980422b5a3ccd83fb15f).
     * This patch **fails due to an unknown error code, -9**.
     * This patch merges cleanly.
     * This patch adds no public classes.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20614: Revert [SPARK-23094] Fix invalid character handling in J...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20614
  
    Merged build finished. Test FAILed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20614: Revert [SPARK-23094] Fix invalid character handling in J...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20614
  
    Test FAILed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/87470/
    Test FAILed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20614: Revert [SPARK-23094] Fix invalid character handling in J...

Posted by gatorsmile <gi...@git.apache.org>.
Github user gatorsmile commented on the issue:

    https://github.com/apache/spark/pull/20614
  
    BTW, the ongoing test will be killed by -9 at midnight 


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20614: Revert [SPARK-23094] Fix invalid character handling in J...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20614
  
    Merged build finished. Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20614: Revert [SPARK-23094] Fix invalid character handling in J...

Posted by dongjoon-hyun <gi...@git.apache.org>.
Github user dongjoon-hyun commented on the issue:

    https://github.com/apache/spark/pull/20614
  
    The failure seems to be irrelevant to this.
    ```
    org.apache.spark.sql.execution.streaming.RateSourceV2Suite.basic microbatch execution
    ```


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20614: Revert [SPARK-23094] Fix invalid character handling in J...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/20614
  
    **[Test build #87467 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/87467/testReport)** for PR 20614 at commit [`d4015d0`](https://github.com/apache/spark/commit/d4015d0c3c9b5cae0309bd6b9486b4990c7f4479).
     * This patch passes all tests.
     * This patch merges cleanly.
     * This patch adds no public classes.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #20614: Revert [SPARK-23094] Fix invalid character handling in J...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/20614
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/87467/
    Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #20614: Revert [SPARK-23094] Fix invalid character handli...

Posted by asfgit <gi...@git.apache.org>.
Github user asfgit closed the pull request at:

    https://github.com/apache/spark/pull/20614


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org