You are viewing a plain text version of this content. The canonical link for it is here.
Posted to reviews@spark.apache.org by vinodkc <gi...@git.apache.org> on 2017/11/22 13:00:47 UTC

[GitHub] spark pull request #19795: [SPARK-17920][SPARK-19580][SPARK-19878][SQL] Back...

GitHub user vinodkc opened a pull request:

    https://github.com/apache/spark/pull/19795

    [SPARK-17920][SPARK-19580][SPARK-19878][SQL] Backport PR 19779 to branch-2.2 - Support writing to Hive table which uses Avro schema url 'avro.schema.url'

    ## What changes were proposed in this pull request?
    
    > Backport https://github.com/apache/spark/pull/19779 to branch-2.2
    
    SPARK-19580 Support for avro.schema.url while writing to hive table
    SPARK-19878 Add hive configuration when initialize hive serde in InsertIntoHiveTable.scala
    SPARK-17920 HiveWriterContainer passes null configuration to serde.initialize, causing NullPointerException in AvroSerde when using avro.schema.url
    
    Support writing to Hive table which uses Avro schema url 'avro.schema.url'
    For ex:
    create external table avro_in (a string) stored as avro location '/avro-in/' tblproperties ('avro.schema.url'='/avro-schema/avro.avsc');
    
    create external table avro_out (a string) stored as avro location '/avro-out/' tblproperties ('avro.schema.url'='/avro-schema/avro.avsc');
    
    insert overwrite table avro_out select * from avro_in; // fails with java.lang.NullPointerException
    
    WARN AvroSerDe: Encountered exception determining schema. Returning signal schema to indicate problem
    java.lang.NullPointerException
    at org.apache.hadoop.fs.FileSystem.getDefaultUri(FileSystem.java:182)
    at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:174)
    ## Changes proposed in this fix
    Currently 'null' value is passed to serializer, which causes NPE during insert operation, instead pass Hadoop configuration object
    ## How was this patch tested?
    Added new test case in VersionsSuite

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/vinodkc/spark br_Fix_SPARK-17920_branch-2.2

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/19795.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #19795
    
----
commit 63e40e866e8ad3307b91ea430c29938a0050e6f7
Author: vinodkc <vi...@gmail.com>
Date:   2017-11-22T12:47:47Z

    pass hadoop Configuration to serializer

----


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #19795: [SPARK-17920][SPARK-19580][SPARK-19878][SQL] Backport PR...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/19795
  
    **[Test build #84107 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/84107/testReport)** for PR 19795 at commit [`63e40e8`](https://github.com/apache/spark/commit/63e40e866e8ad3307b91ea430c29938a0050e6f7).


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #19795: [SPARK-17920][SPARK-19580][SPARK-19878][SQL] Backport PR...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/19795
  
    Merged build finished. Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #19795: [SPARK-17920][SPARK-19580][SPARK-19878][SQL] Backport PR...

Posted by vinodkc <gi...@git.apache.org>.
Github user vinodkc commented on the issue:

    https://github.com/apache/spark/pull/19795
  
    Thank you


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #19795: [SPARK-17920][SPARK-19580][SPARK-19878][SQL] Backport PR...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/19795
  
    **[Test build #84107 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/84107/testReport)** for PR 19795 at commit [`63e40e8`](https://github.com/apache/spark/commit/63e40e866e8ad3307b91ea430c29938a0050e6f7).
     * This patch passes all tests.
     * This patch merges cleanly.
     * This patch adds no public classes.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #19795: [SPARK-17920][SPARK-19580][SPARK-19878][SQL] Backport PR...

Posted by cloud-fan <gi...@git.apache.org>.
Github user cloud-fan commented on the issue:

    https://github.com/apache/spark/pull/19795
  
    LGTM


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #19795: [SPARK-17920][SPARK-19580][SPARK-19878][SQL] Back...

Posted by vinodkc <gi...@git.apache.org>.
Github user vinodkc closed the pull request at:

    https://github.com/apache/spark/pull/19795


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #19795: [SPARK-17920][SPARK-19580][SPARK-19878][SQL] Backport PR...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/19795
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/84107/
    Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #19795: [SPARK-17920][SPARK-19580][SPARK-19878][SQL] Backport PR...

Posted by gatorsmile <gi...@git.apache.org>.
Github user gatorsmile commented on the issue:

    https://github.com/apache/spark/pull/19795
  
    Thanks! Merged to 2.2
    
    Could you please close this?


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org