You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@spark.apache.org by 郑瑞峰 <ru...@foxmail.com> on 2021/05/19 02:23:54 UTC

回复: Apache Spark 3.1.2 Release?

late +1. thanks Dongjoon!



------------------&nbsp;原始邮件&nbsp;------------------
发件人:                                                                                                                        "Dongjoon Hyun"                                                                                    <dongjoon.hyun@gmail.com&gt;;
发送时间:&nbsp;2021年5月19日(星期三) 凌晨1:29
收件人:&nbsp;"Wenchen Fan"<cloud0fan@gmail.com&gt;;
抄送:&nbsp;"Xiao Li"<lixiao@databricks.com&gt;;"Kent Yao"<yaooqinn@gmail.com&gt;;"John Zhuge"<jzhuge@apache.org&gt;;"Hyukjin Kwon"<gurwls223@gmail.com&gt;;"Holden Karau"<holden@pigscanfly.ca&gt;;"Takeshi Yamamuro"<linguin.m.s@gmail.com&gt;;"dev"<dev@spark.apache.org&gt;;"Yuming Wang"<wgyumg@gmail.com&gt;;
主题:&nbsp;Re: Apache Spark 3.1.2 Release?



Thank you all! I'll start to prepare.

Bests,
Dongjoon.



On Tue, May 18, 2021 at 12:53 AM Wenchen Fan <cloud0fan@gmail.com&gt; wrote:

+1, thanks!


On Tue, May 18, 2021 at 1:37 PM Xiao Li <lixiao@databricks.com&gt; wrote:

+1 Thanks, Dongjoon!

Xiao






On Mon, May 17, 2021 at 8:45 PM Kent Yao <yaooqinn@gmail.com&gt; wrote:

            
              +1. thanks Dongjoon
                           
              
                                                                                              Kent Yao&nbsp;
@ Data Science Center, Hangzhou Research Institute, NetEase Corp.a spark&nbsp;enthusiast
kyuubiis a unified&nbsp;multi-tenant&nbsp;JDBC interface for large-scale data processing and analytics,&nbsp;built on top of&nbsp;Apache Spark.

spark-authorizerA Spark SQL extension which provides SQL Standard Authorization for&nbsp;Apache Spark.
spark-postgres&nbsp;A library for reading data from and transferring data to Postgres / Greenplum with Spark SQL and DataFrames, 10~100x faster.
itatchiA library&nbsp;that brings useful functions from various modern database management systems to&nbsp;Apache Spark.








 
 
 
 
 
 
 
 
         
     
      
     
 
     On 05/18/2021 10:57,John Zhuge<jzhuge@apache.org&gt; wrote: 
 
  +1, thanks Dongjoon!


On Mon, May 17, 2021 at 7:50 PM Yuming Wang <wgyumg@gmail.com&gt; wrote:

+1.


On Tue, May 18, 2021 at 9:06 AM Hyukjin Kwon <gurwls223@gmail.com&gt; wrote:

+1 thanks for driving me

On Tue, 18 May 2021, 09:33 Holden Karau, <holden@pigscanfly.ca&gt; wrote:

+1 and thanks for volunteering to be the RM :)

On Mon, May 17, 2021 at 4:09 PM Takeshi Yamamuro <linguin.m.s@gmail.com&gt; wrote:

Thank you, Dongjoon~ sgtm, too.

On Tue, May 18, 2021 at 7:34 AM Cheng Su <chengsu@fb.com.invalid&gt; wrote:

+1 for a new release, thanks Dongjoon!
 
 Cheng Su
 
 On 5/17/21, 2:44 PM, "Liang-Chi Hsieh" <viirya@gmail.com&gt; wrote:
 
 &nbsp; &nbsp; +1 sounds good. Thanks Dongjoon for volunteering on this!
 
 
 &nbsp; &nbsp; Liang-Chi
 
 
 &nbsp; &nbsp; Dongjoon Hyun-2 wrote
 &nbsp; &nbsp; &gt; Hi, All.
 &nbsp; &nbsp; &gt; 
 &nbsp; &nbsp; &gt; Since Apache Spark 3.1.1 tag creation (Feb 21),
 &nbsp; &nbsp; &gt; new 172 patches including 9 correctness patches and 4 K8s patches arrived
 &nbsp; &nbsp; &gt; at branch-3.1.
 &nbsp; &nbsp; &gt; 
 &nbsp; &nbsp; &gt; Shall we make a new release, Apache Spark 3.1.2, as the second release at
 &nbsp; &nbsp; &gt; 3.1 line?
 &nbsp; &nbsp; &gt; I'd like to volunteer for the release manager for Apache Spark 3.1.2.
 &nbsp; &nbsp; &gt; I'm thinking about starting the first RC next week.
 &nbsp; &nbsp; &gt; 
 &nbsp; &nbsp; &gt; $ git log --oneline v3.1.1..HEAD | wc -l
 &nbsp; &nbsp; &gt;&nbsp; &nbsp; &nbsp; 172
 &nbsp; &nbsp; &gt; 
 &nbsp; &nbsp; &gt; # Known correctness issues
 &nbsp; &nbsp; &gt; SPARK-34534&nbsp; &nbsp; &nbsp;New protocol FetchShuffleBlocks in OneForOneBlockFetcher
 &nbsp; &nbsp; &gt; lead to data loss or correctness
 &nbsp; &nbsp; &gt; SPARK-34545&nbsp; &nbsp; &nbsp;PySpark Python UDF return inconsistent results when
 &nbsp; &nbsp; &gt; applying 2 UDFs with different return type to 2 columns together
 &nbsp; &nbsp; &gt; SPARK-34681&nbsp; &nbsp; &nbsp;Full outer shuffled hash join when building left side
 &nbsp; &nbsp; &gt; produces wrong result
 &nbsp; &nbsp; &gt; SPARK-34719&nbsp; &nbsp; &nbsp;fail if the view query has duplicated column names
 &nbsp; &nbsp; &gt; SPARK-34794&nbsp; &nbsp; &nbsp;Nested higher-order functions broken in DSL
 &nbsp; &nbsp; &gt; SPARK-34829&nbsp; &nbsp; &nbsp;transform_values return identical values when it's used
 &nbsp; &nbsp; &gt; with udf that returns reference type
 &nbsp; &nbsp; &gt; SPARK-34833&nbsp; &nbsp; &nbsp;Apply right-padding correctly for correlated subqueries
 &nbsp; &nbsp; &gt; SPARK-35381&nbsp; &nbsp; &nbsp;Fix lambda variable name issues in nested DataFrame
 &nbsp; &nbsp; &gt; functions in R APIs
 &nbsp; &nbsp; &gt; SPARK-35382&nbsp; &nbsp; &nbsp;Fix lambda variable name issues in nested DataFrame
 &nbsp; &nbsp; &gt; functions in Python APIs
 &nbsp; &nbsp; &gt; 
 &nbsp; &nbsp; &gt; # Notable K8s patches since K8s GA
 &nbsp; &nbsp; &gt; SPARK-34674&nbsp; &nbsp; Close SparkContext after the Main method has finished
 &nbsp; &nbsp; &gt; SPARK-34948&nbsp; &nbsp; Add ownerReference to executor configmap to fix leakages
 &nbsp; &nbsp; &gt; SPARK-34820&nbsp; &nbsp; add apt-update before gnupg install
 &nbsp; &nbsp; &gt; SPARK-34361&nbsp; &nbsp; In case of downscaling avoid killing of executors already
 &nbsp; &nbsp; &gt; known by the scheduler backend in the pod allocator
 &nbsp; &nbsp; &gt; 
 &nbsp; &nbsp; &gt; Bests,
 &nbsp; &nbsp; &gt; Dongjoon.
 
 
 
 
 
 &nbsp; &nbsp; --
 &nbsp; &nbsp; Sent from: http://apache-spark-developers-list.1001551.n3.nabble.com/ 
 
 &nbsp; &nbsp; ---------------------------------------------------------------------
 &nbsp; &nbsp; To unsubscribe e-mail: dev-unsubscribe@spark.apache.org
 
 
 



-- 
---
Takeshi Yamamuro



 

-- 
Twitter:&nbsp;https://twitter.com/holdenkarau

Books (Learning Spark, High Performance Spark, etc.):&nbsp;https://amzn.to/2MaRAG9&nbsp;
YouTube Live Streams:&nbsp;https://www.youtube.com/user/holdenkarau









 
 
 



-- 
John Zhuge

  
 
 



--

Re: Apache Spark 3.1.2 Release?

Posted by Gengliang Wang <lt...@gmail.com>.
Late +1, thank you, Dongjoon!

> On May 19, 2021, at 10:47 AM, Jungtaek Lim <ka...@gmail.com> wrote:
> 
> Late +1 here as well, thanks for volunteering!
> 
> 2021년 5월 19일 (수) 오전 11:24, 郑瑞峰 <ruifengz@foxmail.com <ma...@foxmail.com>>님이 작성:
> late +1. thanks Dongjoon!
> 
> 
> ------------------ 原始邮件 ------------------
> 发件人: "Dongjoon Hyun" <dongjoon.hyun@gmail.com <ma...@gmail.com>>;
> 发送时间: 2021年5月19日(星期三) 凌晨1:29
> 收件人: "Wenchen Fan"<cloud0fan@gmail.com <ma...@gmail.com>>;
> 抄送: "Xiao Li"<lixiao@databricks.com <ma...@databricks.com>>;"Kent Yao"<yaooqinn@gmail.com <ma...@gmail.com>>;"John Zhuge"<jzhuge@apache.org <ma...@apache.org>>;"Hyukjin Kwon"<gurwls223@gmail.com <ma...@gmail.com>>;"Holden Karau"<holden@pigscanfly.ca <ma...@pigscanfly.ca>>;"Takeshi Yamamuro"<linguin.m.s@gmail.com <ma...@gmail.com>>;"dev"<dev@spark.apache.org <ma...@spark.apache.org>>;"Yuming Wang"<wgyumg@gmail.com <ma...@gmail.com>>;
> 主题: Re: Apache Spark 3.1.2 Release?
> 
> Thank you all! I'll start to prepare.
> 
> Bests,
> Dongjoon.
> 
> On Tue, May 18, 2021 at 12:53 AM Wenchen Fan <cloud0fan@gmail.com <ma...@gmail.com>> wrote:
> +1, thanks!
> 
> On Tue, May 18, 2021 at 1:37 PM Xiao Li <lixiao@databricks.com <ma...@databricks.com>> wrote:
> +1 Thanks, Dongjoon!
> 
> Xiao
> 
> 
> 
> On Mon, May 17, 2021 at 8:45 PM Kent Yao <yaooqinn@gmail.com <ma...@gmail.com>> wrote:
> +1. thanks Dongjoon
> 
> Kent Yao 
> @ Data Science Center, Hangzhou Research Institute, NetEase Corp.
> a spark enthusiast
> kyuubi <https://github.com/yaooqinn/kyuubi>is a unified multi-tenant JDBC interface for large-scale data processing and analytics, built on top of Apache Spark <http://spark.apache.org/>.
> spark-authorizer <https://github.com/yaooqinn/spark-authorizer>A Spark SQL extension which provides SQL Standard Authorization for Apache Spark <http://spark.apache.org/>.
> spark-postgres <https://github.com/yaooqinn/spark-postgres> A library for reading data from and transferring data to Postgres / Greenplum with Spark SQL and DataFrames, 10~100x faster.
> itatchi <https://github.com/yaooqinn/spark-func-extras>A library that brings useful functions from various modern database management systems to Apache Spark <http://spark.apache.org/>.
> 
> 
>      
> 
> On 05/18/2021 10:57,John Zhuge<jz...@apache.org> <ma...@apache.org> wrote:
> +1, thanks Dongjoon!
> 
> On Mon, May 17, 2021 at 7:50 PM Yuming Wang <wgyumg@gmail.com <ma...@gmail.com>> wrote:
> +1.
> 
> On Tue, May 18, 2021 at 9:06 AM Hyukjin Kwon <gurwls223@gmail.com <ma...@gmail.com>> wrote:
> +1 thanks for driving me
> 
> On Tue, 18 May 2021, 09:33 Holden Karau, <holden@pigscanfly.ca <ma...@pigscanfly.ca>> wrote:
> +1 and thanks for volunteering to be the RM :)
> 
> On Mon, May 17, 2021 at 4:09 PM Takeshi Yamamuro <linguin.m.s@gmail.com <ma...@gmail.com>> wrote:
> Thank you, Dongjoon~ sgtm, too.
> 
> On Tue, May 18, 2021 at 7:34 AM Cheng Su <ch...@fb.com.invalid> wrote:
> +1 for a new release, thanks Dongjoon!
> 
> Cheng Su
> 
> On 5/17/21, 2:44 PM, "Liang-Chi Hsieh" <viirya@gmail.com <ma...@gmail.com>> wrote:
> 
>     +1 sounds good. Thanks Dongjoon for volunteering on this!
> 
> 
>     Liang-Chi
> 
> 
>     Dongjoon Hyun-2 wrote
>     > Hi, All.
>     > 
>     > Since Apache Spark 3.1.1 tag creation (Feb 21),
>     > new 172 patches including 9 correctness patches and 4 K8s patches arrived
>     > at branch-3.1.
>     > 
>     > Shall we make a new release, Apache Spark 3.1.2, as the second release at
>     > 3.1 line?
>     > I'd like to volunteer for the release manager for Apache Spark 3.1.2.
>     > I'm thinking about starting the first RC next week.
>     > 
>     > $ git log --oneline v3.1.1..HEAD | wc -l
>     >      172
>     > 
>     > # Known correctness issues
>     > SPARK-34534     New protocol FetchShuffleBlocks in OneForOneBlockFetcher
>     > lead to data loss or correctness
>     > SPARK-34545     PySpark Python UDF return inconsistent results when
>     > applying 2 UDFs with different return type to 2 columns together
>     > SPARK-34681     Full outer shuffled hash join when building left side
>     > produces wrong result
>     > SPARK-34719     fail if the view query has duplicated column names
>     > SPARK-34794     Nested higher-order functions broken in DSL
>     > SPARK-34829     transform_values return identical values when it's used
>     > with udf that returns reference type
>     > SPARK-34833     Apply right-padding correctly for correlated subqueries
>     > SPARK-35381     Fix lambda variable name issues in nested DataFrame
>     > functions in R APIs
>     > SPARK-35382     Fix lambda variable name issues in nested DataFrame
>     > functions in Python APIs
>     > 
>     > # Notable K8s patches since K8s GA
>     > SPARK-34674    Close SparkContext after the Main method has finished
>     > SPARK-34948    Add ownerReference to executor configmap to fix leakages
>     > SPARK-34820    add apt-update before gnupg install
>     > SPARK-34361    In case of downscaling avoid killing of executors already
>     > known by the scheduler backend in the pod allocator
>     > 
>     > Bests,
>     > Dongjoon.
> 
> 
> 
> 
> 
>     --
>     Sent from: http://apache-spark-developers-list.1001551.n3.nabble.com/ <http://apache-spark-developers-list.1001551.n3.nabble.com/> 
> 
>     ---------------------------------------------------------------------
>     To unsubscribe e-mail: dev-unsubscribe@spark.apache.org <ma...@spark.apache.org>
> 
> 
> 
> 
> -- 
> ---
> Takeshi Yamamuro
> -- 
> Twitter: https://twitter.com/holdenkarau <https://twitter.com/holdenkarau>
> Books (Learning Spark, High Performance Spark, etc.): https://amzn.to/2MaRAG9  <https://amzn.to/2MaRAG9>
> YouTube Live Streams: https://www.youtube.com/user/holdenkarau <https://www.youtube.com/user/holdenkarau>
> 
> -- 
> John Zhuge
> 
> 
> -- 
> 


Re: Apache Spark 3.1.2 Release?

Posted by Jungtaek Lim <ka...@gmail.com>.
Late +1 here as well, thanks for volunteering!

2021년 5월 19일 (수) 오전 11:24, 郑瑞峰 <ru...@foxmail.com>님이 작성:

> late +1. thanks Dongjoon!
>
>
> ------------------ 原始邮件 ------------------
> *发件人:* "Dongjoon Hyun" <do...@gmail.com>;
> *发送时间:* 2021年5月19日(星期三) 凌晨1:29
> *收件人:* "Wenchen Fan"<cl...@gmail.com>;
> *抄送:* "Xiao Li"<li...@databricks.com>;"Kent Yao"<ya...@gmail.com>;"John
> Zhuge"<jz...@apache.org>;"Hyukjin Kwon"<gu...@gmail.com>;"Holden
> Karau"<ho...@pigscanfly.ca>;"Takeshi Yamamuro"<linguin.m.s@gmail.com
> >;"dev"<de...@spark.apache.org>;"Yuming Wang"<wg...@gmail.com>;
> *主题:* Re: Apache Spark 3.1.2 Release?
>
> Thank you all! I'll start to prepare.
>
> Bests,
> Dongjoon.
>
> On Tue, May 18, 2021 at 12:53 AM Wenchen Fan <cl...@gmail.com> wrote:
>
>> +1, thanks!
>>
>> On Tue, May 18, 2021 at 1:37 PM Xiao Li <li...@databricks.com> wrote:
>>
>>> +1 Thanks, Dongjoon!
>>>
>>> Xiao
>>>
>>>
>>>
>>> On Mon, May 17, 2021 at 8:45 PM Kent Yao <ya...@gmail.com> wrote:
>>>
>>>> +1. thanks Dongjoon
>>>>
>>>> *Kent Yao *
>>>> @ Data Science Center, Hangzhou Research Institute, NetEase Corp.
>>>> *a spark enthusiast*
>>>> *kyuubi <https://github.com/yaooqinn/kyuubi>is a
>>>> unified multi-tenant JDBC interface for large-scale data processing and
>>>> analytics, built on top of Apache Spark <http://spark.apache.org/>.*
>>>> *spark-authorizer <https://github.com/yaooqinn/spark-authorizer>A Spark
>>>> SQL extension which provides SQL Standard Authorization for **Apache
>>>> Spark <http://spark.apache.org/>.*
>>>> *spark-postgres <https://github.com/yaooqinn/spark-postgres> A library
>>>> for reading data from and transferring data to Postgres / Greenplum with
>>>> Spark SQL and DataFrames, 10~100x faster.*
>>>> *itatchi <https://github.com/yaooqinn/spark-func-extras>A** library t**hat
>>>> brings useful functions from various modern database management systems to **Apache
>>>> Spark <http://spark.apache.org/>.*
>>>>
>>>>
>>>>
>>>> On 05/18/2021 10:57,John Zhuge<jz...@apache.org> <jz...@apache.org>
>>>> wrote:
>>>>
>>>> +1, thanks Dongjoon!
>>>>
>>>> On Mon, May 17, 2021 at 7:50 PM Yuming Wang <wg...@gmail.com> wrote:
>>>>
>>>>> +1.
>>>>>
>>>>> On Tue, May 18, 2021 at 9:06 AM Hyukjin Kwon <gu...@gmail.com>
>>>>> wrote:
>>>>>
>>>>>> +1 thanks for driving me
>>>>>>
>>>>>> On Tue, 18 May 2021, 09:33 Holden Karau, <ho...@pigscanfly.ca>
>>>>>> wrote:
>>>>>>
>>>>>>> +1 and thanks for volunteering to be the RM :)
>>>>>>>
>>>>>>> On Mon, May 17, 2021 at 4:09 PM Takeshi Yamamuro <
>>>>>>> linguin.m.s@gmail.com> wrote:
>>>>>>>
>>>>>>>> Thank you, Dongjoon~ sgtm, too.
>>>>>>>>
>>>>>>>> On Tue, May 18, 2021 at 7:34 AM Cheng Su <ch...@fb.com.invalid>
>>>>>>>> wrote:
>>>>>>>>
>>>>>>>>> +1 for a new release, thanks Dongjoon!
>>>>>>>>>
>>>>>>>>> Cheng Su
>>>>>>>>>
>>>>>>>>> On 5/17/21, 2:44 PM, "Liang-Chi Hsieh" <vi...@gmail.com> wrote:
>>>>>>>>>
>>>>>>>>>     +1 sounds good. Thanks Dongjoon for volunteering on this!
>>>>>>>>>
>>>>>>>>>
>>>>>>>>>     Liang-Chi
>>>>>>>>>
>>>>>>>>>
>>>>>>>>>     Dongjoon Hyun-2 wrote
>>>>>>>>>     > Hi, All.
>>>>>>>>>     >
>>>>>>>>>     > Since Apache Spark 3.1.1 tag creation (Feb 21),
>>>>>>>>>     > new 172 patches including 9 correctness patches and 4 K8s
>>>>>>>>> patches arrived
>>>>>>>>>     > at branch-3.1.
>>>>>>>>>     >
>>>>>>>>>     > Shall we make a new release, Apache Spark 3.1.2, as the
>>>>>>>>> second release at
>>>>>>>>>     > 3.1 line?
>>>>>>>>>     > I'd like to volunteer for the release manager for Apache
>>>>>>>>> Spark 3.1.2.
>>>>>>>>>     > I'm thinking about starting the first RC next week.
>>>>>>>>>     >
>>>>>>>>>     > $ git log --oneline v3.1.1..HEAD | wc -l
>>>>>>>>>     >      172
>>>>>>>>>     >
>>>>>>>>>     > # Known correctness issues
>>>>>>>>>     > SPARK-34534     New protocol FetchShuffleBlocks in
>>>>>>>>> OneForOneBlockFetcher
>>>>>>>>>     > lead to data loss or correctness
>>>>>>>>>     > SPARK-34545     PySpark Python UDF return inconsistent
>>>>>>>>> results when
>>>>>>>>>     > applying 2 UDFs with different return type to 2 columns
>>>>>>>>> together
>>>>>>>>>     > SPARK-34681     Full outer shuffled hash join when building
>>>>>>>>> left side
>>>>>>>>>     > produces wrong result
>>>>>>>>>     > SPARK-34719     fail if the view query has duplicated column
>>>>>>>>> names
>>>>>>>>>     > SPARK-34794     Nested higher-order functions broken in DSL
>>>>>>>>>     > SPARK-34829     transform_values return identical values
>>>>>>>>> when it's used
>>>>>>>>>     > with udf that returns reference type
>>>>>>>>>     > SPARK-34833     Apply right-padding correctly for correlated
>>>>>>>>> subqueries
>>>>>>>>>     > SPARK-35381     Fix lambda variable name issues in nested
>>>>>>>>> DataFrame
>>>>>>>>>     > functions in R APIs
>>>>>>>>>     > SPARK-35382     Fix lambda variable name issues in nested
>>>>>>>>> DataFrame
>>>>>>>>>     > functions in Python APIs
>>>>>>>>>     >
>>>>>>>>>     > # Notable K8s patches since K8s GA
>>>>>>>>>     > SPARK-34674    Close SparkContext after the Main method has
>>>>>>>>> finished
>>>>>>>>>     > SPARK-34948    Add ownerReference to executor configmap to
>>>>>>>>> fix leakages
>>>>>>>>>     > SPARK-34820    add apt-update before gnupg install
>>>>>>>>>     > SPARK-34361    In case of downscaling avoid killing of
>>>>>>>>> executors already
>>>>>>>>>     > known by the scheduler backend in the pod allocator
>>>>>>>>>     >
>>>>>>>>>     > Bests,
>>>>>>>>>     > Dongjoon.
>>>>>>>>>
>>>>>>>>>
>>>>>>>>>
>>>>>>>>>
>>>>>>>>>
>>>>>>>>>     --
>>>>>>>>>     Sent from:
>>>>>>>>> http://apache-spark-developers-list.1001551.n3.nabble.com/
>>>>>>>>>
>>>>>>>>>
>>>>>>>>> ---------------------------------------------------------------------
>>>>>>>>>     To unsubscribe e-mail: dev-unsubscribe@spark.apache.org
>>>>>>>>>
>>>>>>>>>
>>>>>>>>>
>>>>>>>>
>>>>>>>> --
>>>>>>>> ---
>>>>>>>> Takeshi Yamamuro
>>>>>>>>
>>>>>>> --
>>>>>>> Twitter: https://twitter.com/holdenkarau
>>>>>>> Books (Learning Spark, High Performance Spark, etc.):
>>>>>>> https://amzn.to/2MaRAG9  <https://amzn.to/2MaRAG9>
>>>>>>> YouTube Live Streams: https://www.youtube.com/user/holdenkarau
>>>>>>>
>>>>>>
>>>>
>>>> --
>>>> John Zhuge
>>>>
>>>>
>>>
>>> --
>>>
>>>