Posted to reviews@spark.apache.org by GitBox <gi...@apache.org> on 2021/02/02 22:01:23 UTC

[GitHub] [spark] baohe-zhang opened a new pull request #31446: [SPARK-34336] Use GenericData as Avro serialization data model

baohe-zhang opened a new pull request #31446:
URL: https://github.com/apache/spark/pull/31446


   This optimization was originally proposed by @msamirkhan in https://github.com/apache/spark/pull/29354.
   
   ### What changes were proposed in this pull request?
   Set "org.apache.avro.generic.GenericData" as the Avro serialization data model in the Avro output writer.
   
   ### Why are the changes needed?
   We found that using "org.apache.avro.generic.GenericData" as the Avro serialization data model in the Avro writer significantly improves Avro write benchmark performance and slightly improves Avro read benchmark performance. Full benchmark results can be found at https://issues.apache.org/jira/browse/SPARK-34336.
   
   Column chart comparison of average time:
   ![image](https://user-images.githubusercontent.com/26694233/106668248-a8a39480-656f-11eb-86a9-aa18d99f0942.png)
   ![image](https://user-images.githubusercontent.com/26694233/106668280-b48f5680-656f-11eb-882a-02028611c8f9.png)
   
   ### Does this PR introduce _any_ user-facing change?
   No.
   
   ### How was this patch tested?
   Ran the benchmarks manually.
   


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] [spark] baohe-zhang commented on pull request #31446: [SPARK-34336][SQL] Use GenericData as Avro serialization data model

Posted by GitBox <gi...@apache.org>.
baohe-zhang commented on pull request #31446:
URL: https://github.com/apache/spark/pull/31446#issuecomment-774188043


   We will use GenericData in our Spark cluster and see if it causes any loss of functionality.




[GitHub] [spark] AmplabJenkins removed a comment on pull request #31446: [SPARK-34336][SQL] Use GenericData as Avro serialization data model

Posted by GitBox <gi...@apache.org>.
AmplabJenkins removed a comment on pull request #31446:
URL: https://github.com/apache/spark/pull/31446#issuecomment-772884746


   
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/134847/
   




[GitHub] [spark] SparkQA commented on pull request #31446: [SPARK-34336][SQL] Use GenericData as Avro serialization data model

Posted by GitBox <gi...@apache.org>.
SparkQA commented on pull request #31446:
URL: https://github.com/apache/spark/pull/31446#issuecomment-772877881


   **[Test build #134847 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/134847/testReport)** for PR 31446 at commit [`8ccbf5c`](https://github.com/apache/spark/commit/8ccbf5cbfa3d30dffe8c5fa80a1a17580ac63615).
    * This patch passes all tests.
    * This patch merges cleanly.
    * This patch adds no public classes.




[GitHub] [spark] SparkQA commented on pull request #31446: [SPARK-34336] Use GenericData as Avro serialization data model

Posted by GitBox <gi...@apache.org>.
SparkQA commented on pull request #31446:
URL: https://github.com/apache/spark/pull/31446#issuecomment-772044343


   **[Test build #134802 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/134802/testReport)** for PR 31446 at commit [`e67395f`](https://github.com/apache/spark/commit/e67395f7024b8f6b8874ca3fa9f9d04d2d55d1f4).




[GitHub] [spark] AmplabJenkins removed a comment on pull request #31446: [SPARK-34336] Use GenericData as Avro serialization data model

Posted by GitBox <gi...@apache.org>.
AmplabJenkins removed a comment on pull request #31446:
URL: https://github.com/apache/spark/pull/31446#issuecomment-772070948


   
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/134802/
   




[GitHub] [spark] SparkQA commented on pull request #31446: [SPARK-34336] Use GenericData as Avro serialization data model

Posted by GitBox <gi...@apache.org>.
SparkQA commented on pull request #31446:
URL: https://github.com/apache/spark/pull/31446#issuecomment-772069275


   Kubernetes integration test starting
   URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/39390/
   




[GitHub] [spark] gengliangwang commented on pull request #31446: [SPARK-34336][SQL] Use GenericData as Avro serialization data model

Posted by GitBox <gi...@apache.org>.
gengliangwang commented on pull request #31446:
URL: https://github.com/apache/spark/pull/31446#issuecomment-772625841


   @baohe-zhang there are Avro test failures for this PR: https://github.com/apache/spark/runs/1819669303?check_suite_focus=true




[GitHub] [spark] SparkQA commented on pull request #31446: [SPARK-34336] Use GenericData as Avro serialization data model

Posted by GitBox <gi...@apache.org>.
SparkQA commented on pull request #31446:
URL: https://github.com/apache/spark/pull/31446#issuecomment-772082823


   Kubernetes integration test status success
   URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/39390/
   




[GitHub] [spark] SparkQA commented on pull request #31446: [SPARK-34336] Use GenericData as Avro serialization data model

Posted by GitBox <gi...@apache.org>.
SparkQA commented on pull request #31446:
URL: https://github.com/apache/spark/pull/31446#issuecomment-772055846


   **[Test build #134802 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/134802/testReport)** for PR 31446 at commit [`e67395f`](https://github.com/apache/spark/commit/e67395f7024b8f6b8874ca3fa9f9d04d2d55d1f4).
    * This patch **fails Spark unit tests**.
    * This patch merges cleanly.
    * This patch adds no public classes.




[GitHub] [spark] AmplabJenkins commented on pull request #31446: [SPARK-34336][SQL] Use GenericData as Avro serialization data model

Posted by GitBox <gi...@apache.org>.
AmplabJenkins commented on pull request #31446:
URL: https://github.com/apache/spark/pull/31446#issuecomment-772943020


   
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/39434/
   




[GitHub] [spark] xkrogen commented on pull request #31446: [SPARK-34336] Use GenericData as Avro serialization data model

Posted by GitBox <gi...@apache.org>.
xkrogen commented on pull request #31446:
URL: https://github.com/apache/spark/pull/31446#issuecomment-772064539


   @baohe-zhang can you comment on whether or not there is any loss of functionality as a result of this? It's been a while since I looked at `GenericData` vs. `ReflectData` but my recollection is that `ReflectData` has more ability to deduce types at runtime. I'm not sure if there is any need for this ability within the Spark codebase.
   
   Basically -- do we need to make this configurable, or is it strictly an improvement with no regressions?
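
To make the `GenericData` vs. `ReflectData` question concrete, here is a toy Python sketch. It is not Avro code: `IndexedRecord`, `GenericModel`, `ReflectModel`, and `User` are hypothetical stand-ins. It assumes, in line with the recollection above, that a reflect-style model can read arbitrary objects by looking fields up at runtime, while a generic-style model only reads schema-backed records by position.

```python
from dataclasses import dataclass


class IndexedRecord:
    """Hypothetical schema-backed record: values are stored by field position."""

    def __init__(self, values):
        self.values = list(values)

    def get(self, position):
        return self.values[position]


class GenericModel:
    """Toy 'generic' data model: reads fields by position from IndexedRecord only."""

    def get_field(self, record, name, position):
        if not isinstance(record, IndexedRecord):
            raise TypeError(f"generic model cannot read {type(record).__name__}")
        return record.get(position)


class ReflectModel(GenericModel):
    """Toy 'reflect' data model: falls back to a runtime lookup by field name."""

    def get_field(self, record, name, position):
        if isinstance(record, IndexedRecord):
            return super().get_field(record, name, position)
        # Arbitrary object: inspect it at runtime instead of relying on position.
        return getattr(record, name)


@dataclass
class User:
    """Plain application object with no schema-backed storage (illustrative only)."""

    name: str
    age: int


indexed = IndexedRecord(["ann", 30])
plain = User("ann", 30)
generic, reflect = GenericModel(), ReflectModel()

# Both models read a schema-backed record the same way.
same_result = generic.get_field(indexed, "age", 1) == reflect.get_field(indexed, "age", 1)

# Only the reflect-style model can read a plain object.
reflect_value = reflect.get_field(plain, "age", 1)
try:
    generic.get_field(plain, "age", 1)
    generic_handles_plain = True
except TypeError:
    generic_handles_plain = False
```

Under these assumptions, the switch is a pure improvement only if the writer never hands the data model anything other than schema-backed records; if it ever passes plain objects, the generic-style model would fail where the reflect-style one succeeds.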




[GitHub] [spark] SparkQA commented on pull request #31446: [SPARK-34336][SQL] Use GenericData as Avro serialization data model

Posted by GitBox <gi...@apache.org>.
SparkQA commented on pull request #31446:
URL: https://github.com/apache/spark/pull/31446#issuecomment-772866415


   **[Test build #134847 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/134847/testReport)** for PR 31446 at commit [`8ccbf5c`](https://github.com/apache/spark/commit/8ccbf5cbfa3d30dffe8c5fa80a1a17580ac63615).




[GitHub] [spark] AmplabJenkins removed a comment on pull request #31446: [SPARK-34336] Use GenericData as Avro serialization data model

Posted by GitBox <gi...@apache.org>.
AmplabJenkins removed a comment on pull request #31446:
URL: https://github.com/apache/spark/pull/31446#issuecomment-772097272


   
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/39390/
   




[GitHub] [spark] SparkQA removed a comment on pull request #31446: [SPARK-34336] Use GenericData as Avro serialization data model

Posted by GitBox <gi...@apache.org>.
SparkQA removed a comment on pull request #31446:
URL: https://github.com/apache/spark/pull/31446#issuecomment-772044343


   **[Test build #134802 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/134802/testReport)** for PR 31446 at commit [`e67395f`](https://github.com/apache/spark/commit/e67395f7024b8f6b8874ca3fa9f9d04d2d55d1f4).




[GitHub] [spark] baohe-zhang commented on pull request #31446: [SPARK-34336] Use GenericData as Avro serialization data model

Posted by GitBox <gi...@apache.org>.
baohe-zhang commented on pull request #31446:
URL: https://github.com/apache/spark/pull/31446#issuecomment-772102785


   @xkrogen We don't know that yet. We have only run the benchmark tests so far. We will consider enabling it in our Spark clusters to see if it causes any regressions.




[GitHub] [spark] github-actions[bot] closed pull request #31446: [SPARK-34336][SQL] Use GenericData as Avro serialization data model

Posted by GitBox <gi...@apache.org>.
github-actions[bot] closed pull request #31446:
URL: https://github.com/apache/spark/pull/31446


   




[GitHub] [spark] SparkQA commented on pull request #31446: [SPARK-34336][SQL] Use GenericData as Avro serialization data model

Posted by GitBox <gi...@apache.org>.
SparkQA commented on pull request #31446:
URL: https://github.com/apache/spark/pull/31446#issuecomment-772895765


   Kubernetes integration test starting
   URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/39434/
   




[GitHub] [spark] github-actions[bot] commented on pull request #31446: [SPARK-34336][SQL] Use GenericData as Avro serialization data model

Posted by GitBox <gi...@apache.org>.
github-actions[bot] commented on pull request #31446:
URL: https://github.com/apache/spark/pull/31446#issuecomment-841901140


   We're closing this PR because it hasn't been updated in a while. This isn't a judgement on the merit of the PR in any way. It's just a way of keeping the PR queue manageable.
   If you'd like to revive this PR, please reopen it and ask a committer to remove the Stale tag!




[GitHub] [spark] SparkQA commented on pull request #31446: [SPARK-34336][SQL] Use GenericData as Avro serialization data model

Posted by GitBox <gi...@apache.org>.
SparkQA commented on pull request #31446:
URL: https://github.com/apache/spark/pull/31446#issuecomment-772940883


   Kubernetes integration test status failure
   URL: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/39434/
   




[GitHub] [spark] SparkQA removed a comment on pull request #31446: [SPARK-34336][SQL] Use GenericData as Avro serialization data model

Posted by GitBox <gi...@apache.org>.
SparkQA removed a comment on pull request #31446:
URL: https://github.com/apache/spark/pull/31446#issuecomment-772866415


   **[Test build #134847 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/134847/testReport)** for PR 31446 at commit [`8ccbf5c`](https://github.com/apache/spark/commit/8ccbf5cbfa3d30dffe8c5fa80a1a17580ac63615).




[GitHub] [spark] AmplabJenkins commented on pull request #31446: [SPARK-34336] Use GenericData as Avro serialization data model

Posted by GitBox <gi...@apache.org>.
AmplabJenkins commented on pull request #31446:
URL: https://github.com/apache/spark/pull/31446#issuecomment-772097272


   
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/39390/
   




[GitHub] [spark] AmplabJenkins removed a comment on pull request #31446: [SPARK-34336][SQL] Use GenericData as Avro serialization data model

Posted by GitBox <gi...@apache.org>.
AmplabJenkins removed a comment on pull request #31446:
URL: https://github.com/apache/spark/pull/31446#issuecomment-772943020


   
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder-K8s/39434/
   




[GitHub] [spark] AmplabJenkins commented on pull request #31446: [SPARK-34336] Use GenericData as Avro serialization data model

Posted by GitBox <gi...@apache.org>.
AmplabJenkins commented on pull request #31446:
URL: https://github.com/apache/spark/pull/31446#issuecomment-772070948


   
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/134802/
   




[GitHub] [spark] AmplabJenkins commented on pull request #31446: [SPARK-34336][SQL] Use GenericData as Avro serialization data model

Posted by GitBox <gi...@apache.org>.
AmplabJenkins commented on pull request #31446:
URL: https://github.com/apache/spark/pull/31446#issuecomment-772884746


   
   Refer to this link for build results (access rights to CI server needed): 
   https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/134847/
   

