You are viewing a plain text version of this content. The canonical link for it is here.
Posted to reviews@spark.apache.org by wangyum <gi...@git.apache.org> on 2018/08/30 18:15:19 UTC
[GitHub] spark pull request #22287: [SPARK-25135][SQL] FileFormatWriter should respec...
GitHub user wangyum opened a pull request:
https://github.com/apache/spark/pull/22287
[SPARK-25135][SQL] FileFormatWriter should respect the schema of Hive
## What changes were proposed in this pull request?
This pr fix `FileFormatWriter's dataSchema` should respect the schema of Hive. Otherwise there will be two issues.
1. Throwing an exception(This can be reproduce by added test case):
```scala
java.util.NoSuchElementException: None.get
at scala.None$.get(Option.scala:347)
at scala.None$.get(Option.scala:345)
at org.apache.spark.sql.execution.datasources.FileFormatWriter$$anonfun$3$$anonfun$4.apply(FileFormatWriter.scala:87)
at org.apache.spark.sql.execution.datasources.FileFormatWriter$$anonfun$3$$anonfun$4.apply(FileFormatWriter.scala:87)
```
2. The schema of the Hive table is not the same as the schema of the parquet file.
## How was this patch tested?
- Unit tests for FileFormatWriter should respect the schema of Hive.
- Manual tests for didn't break UI issues fixed by [SPARK-22834](https://issues.apache.org/jira/browse/SPARK-22834):
![image](https://user-images.githubusercontent.com/5399861/44870021-94ce1700-acc1-11e8-8ef7-d7a8ba3c435d.png)
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/wangyum/spark SPARK-25135-view
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/spark/pull/22287.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #22287
----
commit b54953a8224aa0a7759289a83e876e3bfc166cb6
Author: Yuming Wang <yu...@...>
Date: 2018-08-30T17:46:02Z
FileFormatWriter should respect the input query schema in HIVE
----
---
---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org
[GitHub] spark issue #22287: [SPARK-25135][SQL] FileFormatWriter should respect the s...
Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/22287
Merged build finished. Test FAILed.
---
---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org
[GitHub] spark issue #22287: [SPARK-25135][SQL] FileFormatWriter should respect the s...
Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/22287
Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/testing-k8s-prb-make-spark-distribution-unified/2704/
Test PASSed.
---
---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org
[GitHub] spark issue #22287: [SPARK-25135][SQL] FileFormatWriter should respect the s...
Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/22287
**[Test build #95483 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/95483/testReport)** for PR 22287 at commit [`b54953a`](https://github.com/apache/spark/commit/b54953a8224aa0a7759289a83e876e3bfc166cb6).
---
---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org
[GitHub] spark issue #22287: [SPARK-25135][SQL] FileFormatWriter should respect the s...
Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/22287
Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/95483/
Test FAILed.
---
---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org
[GitHub] spark issue #22287: [SPARK-25135][SQL] FileFormatWriter should respect the s...
Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/22287
Merged build finished. Test PASSed.
---
---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org
[GitHub] spark pull request #22287: [SPARK-25135][SQL] FileFormatWriter should respec...
Posted by wangyum <gi...@git.apache.org>.
Github user wangyum closed the pull request at:
https://github.com/apache/spark/pull/22287
---
---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org
[GitHub] spark issue #22287: [SPARK-25135][SQL] FileFormatWriter should respect the s...
Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/22287
**[Test build #95483 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/95483/testReport)** for PR 22287 at commit [`b54953a`](https://github.com/apache/spark/commit/b54953a8224aa0a7759289a83e876e3bfc166cb6).
* This patch **fails Spark unit tests**.
* This patch merges cleanly.
* This patch adds no public classes.
---
---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org