You are viewing a plain text version of this content. The canonical link for it is here.

Posted to dev@spark.apache.org by Dongjoon Hyun <do...@gmail.com> on 2021/05/17 21:08:46 UTC

Apache Spark 3.1.2 Release?

Hi, All.

Since Apache Spark 3.1.1 tag creation (Feb 21),
new 172 patches including 9 correctness patches and 4 K8s patches arrived
at branch-3.1.

Shall we make a new release, Apache Spark 3.1.2, as the second release at
3.1 line?
I'd like to volunteer for the release manager for Apache Spark 3.1.2.
I'm thinking about starting the first RC next week.

$ git log --oneline v3.1.1..HEAD | wc -l
     172

# Known correctness issues
SPARK-34534     New protocol FetchShuffleBlocks in OneForOneBlockFetcher
lead to data loss or correctness
SPARK-34545     PySpark Python UDF return inconsistent results when
applying 2 UDFs with different return type to 2 columns together
SPARK-34681     Full outer shuffled hash join when building left side
produces wrong result
SPARK-34719     fail if the view query has duplicated column names
SPARK-34794     Nested higher-order functions broken in DSL
SPARK-34829     transform_values return identical values when it's used
with udf that returns reference type
SPARK-34833     Apply right-padding correctly for correlated subqueries
SPARK-35381     Fix lambda variable name issues in nested DataFrame
functions in R APIs
SPARK-35382     Fix lambda variable name issues in nested DataFrame
functions in Python APIs

# Notable K8s patches since K8s GA
SPARK-34674    Close SparkContext after the Main method has finished
SPARK-34948    Add ownerReference to executor configmap to fix leakages
SPARK-34820    add apt-update before gnupg install
SPARK-34361    In case of downscaling avoid killing of executors already
known by the scheduler backend in the pod allocator

Bests,
Dongjoon.

Re: Apache Spark 3.1.2 Release?

Posted by Gengliang Wang <lt...@gmail.com>.

Late +1, thank you, Dongjoon!

> On May 19, 2021, at 10:47 AM, Jungtaek Lim <ka...@gmail.com> wrote:
> 
> Late +1 here as well, thanks for volunteering!
> 
> 2021년 5월 19일 (수) 오전 11:24, 郑瑞峰 <ruifengz@foxmail.com <ma...@foxmail.com>>님이 작성:
> late +1. thanks Dongjoon!
> 
> 
> ------------------ 原始邮件 ------------------
> 发件人: "Dongjoon Hyun" <dongjoon.hyun@gmail.com <ma...@gmail.com>>;
> 发送时间: 2021年5月19日(星期三) 凌晨1:29
> 收件人: "Wenchen Fan"<cloud0fan@gmail.com <ma...@gmail.com>>;
> 抄送: "Xiao Li"<lixiao@databricks.com <ma...@databricks.com>>;"Kent Yao"<yaooqinn@gmail.com <ma...@gmail.com>>;"John Zhuge"<jzhuge@apache.org <ma...@apache.org>>;"Hyukjin Kwon"<gurwls223@gmail.com <ma...@gmail.com>>;"Holden Karau"<holden@pigscanfly.ca <ma...@pigscanfly.ca>>;"Takeshi Yamamuro"<linguin.m.s@gmail.com <ma...@gmail.com>>;"dev"<dev@spark.apache.org <ma...@spark.apache.org>>;"Yuming Wang"<wgyumg@gmail.com <ma...@gmail.com>>;
> 主题: Re: Apache Spark 3.1.2 Release?
> 
> Thank you all! I'll start to prepare.
> 
> Bests,
> Dongjoon.
> 
> On Tue, May 18, 2021 at 12:53 AM Wenchen Fan <cloud0fan@gmail.com <ma...@gmail.com>> wrote:
> +1, thanks!
> 
> On Tue, May 18, 2021 at 1:37 PM Xiao Li <lixiao@databricks.com <ma...@databricks.com>> wrote:
> +1 Thanks, Dongjoon!
> 
> Xiao
> 
> 
> 
> On Mon, May 17, 2021 at 8:45 PM Kent Yao <yaooqinn@gmail.com <ma...@gmail.com>> wrote:
> +1. thanks Dongjoon
> 
> Kent Yao 
> @ Data Science Center, Hangzhou Research Institute, NetEase Corp.
> a spark enthusiast
> kyuubi <https://github.com/yaooqinn/kyuubi>is a unified multi-tenant JDBC interface for large-scale data processing and analytics, built on top of Apache Spark <http://spark.apache.org/>.
> spark-authorizer <https://github.com/yaooqinn/spark-authorizer>A Spark SQL extension which provides SQL Standard Authorization for Apache Spark <http://spark.apache.org/>.
> spark-postgres <https://github.com/yaooqinn/spark-postgres> A library for reading data from and transferring data to Postgres / Greenplum with Spark SQL and DataFrames, 10~100x faster.
> itatchi <https://github.com/yaooqinn/spark-func-extras>A library that brings useful functions from various modern database management systems to Apache Spark <http://spark.apache.org/>.
> 
> 
>      
> 
> On 05/18/2021 10:57，John Zhuge<jz...@apache.org> <ma...@apache.org> wrote：
> +1, thanks Dongjoon!
> 
> On Mon, May 17, 2021 at 7:50 PM Yuming Wang <wgyumg@gmail.com <ma...@gmail.com>> wrote:
> +1.
> 
> On Tue, May 18, 2021 at 9:06 AM Hyukjin Kwon <gurwls223@gmail.com <ma...@gmail.com>> wrote:
> +1 thanks for driving me
> 
> On Tue, 18 May 2021, 09:33 Holden Karau, <holden@pigscanfly.ca <ma...@pigscanfly.ca>> wrote:
> +1 and thanks for volunteering to be the RM :)
> 
> On Mon, May 17, 2021 at 4:09 PM Takeshi Yamamuro <linguin.m.s@gmail.com <ma...@gmail.com>> wrote:
> Thank you, Dongjoon~ sgtm, too.
> 
> On Tue, May 18, 2021 at 7:34 AM Cheng Su <ch...@fb.com.invalid> wrote:
> +1 for a new release, thanks Dongjoon!
> 
> Cheng Su
> 
> On 5/17/21, 2:44 PM, "Liang-Chi Hsieh" <viirya@gmail.com <ma...@gmail.com>> wrote:
> 
>     +1 sounds good. Thanks Dongjoon for volunteering on this!
> 
> 
>     Liang-Chi
> 
> 
>     Dongjoon Hyun-2 wrote
>     > Hi, All.
>     > 
>     > Since Apache Spark 3.1.1 tag creation (Feb 21),
>     > new 172 patches including 9 correctness patches and 4 K8s patches arrived
>     > at branch-3.1.
>     > 
>     > Shall we make a new release, Apache Spark 3.1.2, as the second release at
>     > 3.1 line?
>     > I'd like to volunteer for the release manager for Apache Spark 3.1.2.
>     > I'm thinking about starting the first RC next week.
>     > 
>     > $ git log --oneline v3.1.1..HEAD | wc -l
>     >      172
>     > 
>     > # Known correctness issues
>     > SPARK-34534     New protocol FetchShuffleBlocks in OneForOneBlockFetcher
>     > lead to data loss or correctness
>     > SPARK-34545     PySpark Python UDF return inconsistent results when
>     > applying 2 UDFs with different return type to 2 columns together
>     > SPARK-34681     Full outer shuffled hash join when building left side
>     > produces wrong result
>     > SPARK-34719     fail if the view query has duplicated column names
>     > SPARK-34794     Nested higher-order functions broken in DSL
>     > SPARK-34829     transform_values return identical values when it's used
>     > with udf that returns reference type
>     > SPARK-34833     Apply right-padding correctly for correlated subqueries
>     > SPARK-35381     Fix lambda variable name issues in nested DataFrame
>     > functions in R APIs
>     > SPARK-35382     Fix lambda variable name issues in nested DataFrame
>     > functions in Python APIs
>     > 
>     > # Notable K8s patches since K8s GA
>     > SPARK-34674    Close SparkContext after the Main method has finished
>     > SPARK-34948    Add ownerReference to executor configmap to fix leakages
>     > SPARK-34820    add apt-update before gnupg install
>     > SPARK-34361    In case of downscaling avoid killing of executors already
>     > known by the scheduler backend in the pod allocator
>     > 
>     > Bests,
>     > Dongjoon.
> 
> 
> 
> 
> 
>     --
>     Sent from: http://apache-spark-developers-list.1001551.n3.nabble.com/ <http://apache-spark-developers-list.1001551.n3.nabble.com/> 
> 
>     ---------------------------------------------------------------------
>     To unsubscribe e-mail: dev-unsubscribe@spark.apache.org <ma...@spark.apache.org>
> 
> 
> 
> 
> -- 
> ---
> Takeshi Yamamuro
> -- 
> Twitter: https://twitter.com/holdenkarau <https://twitter.com/holdenkarau>
> Books (Learning Spark, High Performance Spark, etc.): https://amzn.to/2MaRAG9  <https://amzn.to/2MaRAG9>
> YouTube Live Streams: https://www.youtube.com/user/holdenkarau <https://www.youtube.com/user/holdenkarau>
> 
> -- 
> John Zhuge
> 
> 
> -- 
>

Re: Apache Spark 3.1.2 Release?

Posted by Jungtaek Lim <ka...@gmail.com>.

Late +1 here as well, thanks for volunteering!

2021년 5월 19일 (수) 오전 11:24, 郑瑞峰 <ru...@foxmail.com>님이 작성:

> late +1. thanks Dongjoon!
>
>
> ------------------ 原始邮件 ------------------
> *发件人:* "Dongjoon Hyun" <do...@gmail.com>;
> *发送时间:* 2021年5月19日(星期三) 凌晨1:29
> *收件人:* "Wenchen Fan"<cl...@gmail.com>;
> *抄送:* "Xiao Li"<li...@databricks.com>;"Kent Yao"<ya...@gmail.com>;"John
> Zhuge"<jz...@apache.org>;"Hyukjin Kwon"<gu...@gmail.com>;"Holden
> Karau"<ho...@pigscanfly.ca>;"Takeshi Yamamuro"<linguin.m.s@gmail.com
> >;"dev"<de...@spark.apache.org>;"Yuming Wang"<wg...@gmail.com>;
> *主题:* Re: Apache Spark 3.1.2 Release?
>
> Thank you all! I'll start to prepare.
>
> Bests,
> Dongjoon.
>
> On Tue, May 18, 2021 at 12:53 AM Wenchen Fan <cl...@gmail.com> wrote:
>
>> +1, thanks!
>>
>> On Tue, May 18, 2021 at 1:37 PM Xiao Li <li...@databricks.com> wrote:
>>
>>> +1 Thanks, Dongjoon!
>>>
>>> Xiao
>>>
>>>
>>>
>>> On Mon, May 17, 2021 at 8:45 PM Kent Yao <ya...@gmail.com> wrote:
>>>
>>>> +1. thanks Dongjoon
>>>>
>>>> *Kent Yao *
>>>> @ Data Science Center, Hangzhou Research Institute, NetEase Corp.
>>>> *a spark enthusiast*
>>>> *kyuubi <https://github.com/yaooqinn/kyuubi>is a
>>>> unified multi-tenant JDBC interface for large-scale data processing and
>>>> analytics, built on top of Apache Spark <http://spark.apache.org/>.*
>>>> *spark-authorizer <https://github.com/yaooqinn/spark-authorizer>A Spark
>>>> SQL extension which provides SQL Standard Authorization for **Apache
>>>> Spark <http://spark.apache.org/>.*
>>>> *spark-postgres <https://github.com/yaooqinn/spark-postgres> A library
>>>> for reading data from and transferring data to Postgres / Greenplum with
>>>> Spark SQL and DataFrames, 10~100x faster.*
>>>> *itatchi <https://github.com/yaooqinn/spark-func-extras>A** library t**hat
>>>> brings useful functions from various modern database management systems to **Apache
>>>> Spark <http://spark.apache.org/>.*
>>>>
>>>>
>>>>
>>>> On 05/18/2021 10:57，John Zhuge<jz...@apache.org> <jz...@apache.org>
>>>> wrote：
>>>>
>>>> +1, thanks Dongjoon!
>>>>
>>>> On Mon, May 17, 2021 at 7:50 PM Yuming Wang <wg...@gmail.com> wrote:
>>>>
>>>>> +1.
>>>>>
>>>>> On Tue, May 18, 2021 at 9:06 AM Hyukjin Kwon <gu...@gmail.com>
>>>>> wrote:
>>>>>
>>>>>> +1 thanks for driving me
>>>>>>
>>>>>> On Tue, 18 May 2021, 09:33 Holden Karau, <ho...@pigscanfly.ca>
>>>>>> wrote:
>>>>>>
>>>>>>> +1 and thanks for volunteering to be the RM :)
>>>>>>>
>>>>>>> On Mon, May 17, 2021 at 4:09 PM Takeshi Yamamuro <
>>>>>>> linguin.m.s@gmail.com> wrote:
>>>>>>>
>>>>>>>> Thank you, Dongjoon~ sgtm, too.
>>>>>>>>
>>>>>>>> On Tue, May 18, 2021 at 7:34 AM Cheng Su <ch...@fb.com.invalid>
>>>>>>>> wrote:
>>>>>>>>
>>>>>>>>> +1 for a new release, thanks Dongjoon!
>>>>>>>>>
>>>>>>>>> Cheng Su
>>>>>>>>>
>>>>>>>>> On 5/17/21, 2:44 PM, "Liang-Chi Hsieh" <vi...@gmail.com> wrote:
>>>>>>>>>
>>>>>>>>>     +1 sounds good. Thanks Dongjoon for volunteering on this!
>>>>>>>>>
>>>>>>>>>
>>>>>>>>>     Liang-Chi
>>>>>>>>>
>>>>>>>>>
>>>>>>>>>     Dongjoon Hyun-2 wrote
>>>>>>>>>     > Hi, All.
>>>>>>>>>     >
>>>>>>>>>     > Since Apache Spark 3.1.1 tag creation (Feb 21),
>>>>>>>>>     > new 172 patches including 9 correctness patches and 4 K8s
>>>>>>>>> patches arrived
>>>>>>>>>     > at branch-3.1.
>>>>>>>>>     >
>>>>>>>>>     > Shall we make a new release, Apache Spark 3.1.2, as the
>>>>>>>>> second release at
>>>>>>>>>     > 3.1 line?
>>>>>>>>>     > I'd like to volunteer for the release manager for Apache
>>>>>>>>> Spark 3.1.2.
>>>>>>>>>     > I'm thinking about starting the first RC next week.
>>>>>>>>>     >
>>>>>>>>>     > $ git log --oneline v3.1.1..HEAD | wc -l
>>>>>>>>>     >      172
>>>>>>>>>     >
>>>>>>>>>     > # Known correctness issues
>>>>>>>>>     > SPARK-34534     New protocol FetchShuffleBlocks in
>>>>>>>>> OneForOneBlockFetcher
>>>>>>>>>     > lead to data loss or correctness
>>>>>>>>>     > SPARK-34545     PySpark Python UDF return inconsistent
>>>>>>>>> results when
>>>>>>>>>     > applying 2 UDFs with different return type to 2 columns
>>>>>>>>> together
>>>>>>>>>     > SPARK-34681     Full outer shuffled hash join when building
>>>>>>>>> left side
>>>>>>>>>     > produces wrong result
>>>>>>>>>     > SPARK-34719     fail if the view query has duplicated column
>>>>>>>>> names
>>>>>>>>>     > SPARK-34794     Nested higher-order functions broken in DSL
>>>>>>>>>     > SPARK-34829     transform_values return identical values
>>>>>>>>> when it's used
>>>>>>>>>     > with udf that returns reference type
>>>>>>>>>     > SPARK-34833     Apply right-padding correctly for correlated
>>>>>>>>> subqueries
>>>>>>>>>     > SPARK-35381     Fix lambda variable name issues in nested
>>>>>>>>> DataFrame
>>>>>>>>>     > functions in R APIs
>>>>>>>>>     > SPARK-35382     Fix lambda variable name issues in nested
>>>>>>>>> DataFrame
>>>>>>>>>     > functions in Python APIs
>>>>>>>>>     >
>>>>>>>>>     > # Notable K8s patches since K8s GA
>>>>>>>>>     > SPARK-34674    Close SparkContext after the Main method has
>>>>>>>>> finished
>>>>>>>>>     > SPARK-34948    Add ownerReference to executor configmap to
>>>>>>>>> fix leakages
>>>>>>>>>     > SPARK-34820    add apt-update before gnupg install
>>>>>>>>>     > SPARK-34361    In case of downscaling avoid killing of
>>>>>>>>> executors already
>>>>>>>>>     > known by the scheduler backend in the pod allocator
>>>>>>>>>     >
>>>>>>>>>     > Bests,
>>>>>>>>>     > Dongjoon.
>>>>>>>>>
>>>>>>>>>
>>>>>>>>>
>>>>>>>>>
>>>>>>>>>
>>>>>>>>>     --
>>>>>>>>>     Sent from:
>>>>>>>>> http://apache-spark-developers-list.1001551.n3.nabble.com/
>>>>>>>>>
>>>>>>>>>
>>>>>>>>> ---------------------------------------------------------------------
>>>>>>>>>     To unsubscribe e-mail: dev-unsubscribe@spark.apache.org
>>>>>>>>>
>>>>>>>>>
>>>>>>>>>
>>>>>>>>
>>>>>>>> --
>>>>>>>> ---
>>>>>>>> Takeshi Yamamuro
>>>>>>>>
>>>>>>> --
>>>>>>> Twitter: https://twitter.com/holdenkarau
>>>>>>> Books (Learning Spark, High Performance Spark, etc.):
>>>>>>> https://amzn.to/2MaRAG9  <https://amzn.to/2MaRAG9>
>>>>>>> YouTube Live Streams: https://www.youtube.com/user/holdenkarau
>>>>>>>
>>>>>>
>>>>
>>>> --
>>>> John Zhuge
>>>>
>>>>
>>>
>>> --
>>>
>>>

回复： Apache Spark 3.1.2 Release?

Posted by 郑瑞峰 <ru...@foxmail.com>.

late +1. thanks Dongjoon!



------------------&nbsp;原始邮件&nbsp;------------------
发件人:                                                                                                                        "Dongjoon Hyun"                                                                                    <dongjoon.hyun@gmail.com&gt;;
发送时间:&nbsp;2021年5月19日(星期三) 凌晨1:29
收件人:&nbsp;"Wenchen Fan"<cloud0fan@gmail.com&gt;;
抄送:&nbsp;"Xiao Li"<lixiao@databricks.com&gt;;"Kent Yao"<yaooqinn@gmail.com&gt;;"John Zhuge"<jzhuge@apache.org&gt;;"Hyukjin Kwon"<gurwls223@gmail.com&gt;;"Holden Karau"<holden@pigscanfly.ca&gt;;"Takeshi Yamamuro"<linguin.m.s@gmail.com&gt;;"dev"<dev@spark.apache.org&gt;;"Yuming Wang"<wgyumg@gmail.com&gt;;
主题:&nbsp;Re: Apache Spark 3.1.2 Release?



Thank you all! I'll start to prepare.

Bests,
Dongjoon.



On Tue, May 18, 2021 at 12:53 AM Wenchen Fan <cloud0fan@gmail.com&gt; wrote:

+1, thanks!


On Tue, May 18, 2021 at 1:37 PM Xiao Li <lixiao@databricks.com&gt; wrote:

+1 Thanks, Dongjoon!

Xiao






On Mon, May 17, 2021 at 8:45 PM Kent Yao <yaooqinn@gmail.com&gt; wrote:

            
              +1. thanks Dongjoon
                           
              
                                                                                              Kent Yao&nbsp;
@ Data Science Center, Hangzhou Research Institute, NetEase Corp.a spark&nbsp;enthusiast
kyuubiis a unified&nbsp;multi-tenant&nbsp;JDBC interface for large-scale data processing and analytics,&nbsp;built on top of&nbsp;Apache Spark.

spark-authorizerA Spark SQL extension which provides SQL Standard Authorization for&nbsp;Apache Spark.
spark-postgres&nbsp;A library for reading data from and transferring data to Postgres / Greenplum with Spark SQL and DataFrames, 10~100x faster.
itatchiA library&nbsp;that brings useful functions from various modern database management systems to&nbsp;Apache Spark.








 
 
 
 
 
 
 
 
         
     
      
     
 
     On 05/18/2021 10:57，John Zhuge<jzhuge@apache.org&gt; wrote： 
 
  +1, thanks Dongjoon!


On Mon, May 17, 2021 at 7:50 PM Yuming Wang <wgyumg@gmail.com&gt; wrote:

+1.


On Tue, May 18, 2021 at 9:06 AM Hyukjin Kwon <gurwls223@gmail.com&gt; wrote:

+1 thanks for driving me

On Tue, 18 May 2021, 09:33 Holden Karau, <holden@pigscanfly.ca&gt; wrote:

+1 and thanks for volunteering to be the RM :)

On Mon, May 17, 2021 at 4:09 PM Takeshi Yamamuro <linguin.m.s@gmail.com&gt; wrote:

Thank you, Dongjoon~ sgtm, too.

On Tue, May 18, 2021 at 7:34 AM Cheng Su <chengsu@fb.com.invalid&gt; wrote:

+1 for a new release, thanks Dongjoon!
 
 Cheng Su
 
 On 5/17/21, 2:44 PM, "Liang-Chi Hsieh" <viirya@gmail.com&gt; wrote:
 
 &nbsp; &nbsp; +1 sounds good. Thanks Dongjoon for volunteering on this!
 
 
 &nbsp; &nbsp; Liang-Chi
 
 
 &nbsp; &nbsp; Dongjoon Hyun-2 wrote
 &nbsp; &nbsp; &gt; Hi, All.
 &nbsp; &nbsp; &gt; 
 &nbsp; &nbsp; &gt; Since Apache Spark 3.1.1 tag creation (Feb 21),
 &nbsp; &nbsp; &gt; new 172 patches including 9 correctness patches and 4 K8s patches arrived
 &nbsp; &nbsp; &gt; at branch-3.1.
 &nbsp; &nbsp; &gt; 
 &nbsp; &nbsp; &gt; Shall we make a new release, Apache Spark 3.1.2, as the second release at
 &nbsp; &nbsp; &gt; 3.1 line?
 &nbsp; &nbsp; &gt; I'd like to volunteer for the release manager for Apache Spark 3.1.2.
 &nbsp; &nbsp; &gt; I'm thinking about starting the first RC next week.
 &nbsp; &nbsp; &gt; 
 &nbsp; &nbsp; &gt; $ git log --oneline v3.1.1..HEAD | wc -l
 &nbsp; &nbsp; &gt;&nbsp; &nbsp; &nbsp; 172
 &nbsp; &nbsp; &gt; 
 &nbsp; &nbsp; &gt; # Known correctness issues
 &nbsp; &nbsp; &gt; SPARK-34534&nbsp; &nbsp; &nbsp;New protocol FetchShuffleBlocks in OneForOneBlockFetcher
 &nbsp; &nbsp; &gt; lead to data loss or correctness
 &nbsp; &nbsp; &gt; SPARK-34545&nbsp; &nbsp; &nbsp;PySpark Python UDF return inconsistent results when
 &nbsp; &nbsp; &gt; applying 2 UDFs with different return type to 2 columns together
 &nbsp; &nbsp; &gt; SPARK-34681&nbsp; &nbsp; &nbsp;Full outer shuffled hash join when building left side
 &nbsp; &nbsp; &gt; produces wrong result
 &nbsp; &nbsp; &gt; SPARK-34719&nbsp; &nbsp; &nbsp;fail if the view query has duplicated column names
 &nbsp; &nbsp; &gt; SPARK-34794&nbsp; &nbsp; &nbsp;Nested higher-order functions broken in DSL
 &nbsp; &nbsp; &gt; SPARK-34829&nbsp; &nbsp; &nbsp;transform_values return identical values when it's used
 &nbsp; &nbsp; &gt; with udf that returns reference type
 &nbsp; &nbsp; &gt; SPARK-34833&nbsp; &nbsp; &nbsp;Apply right-padding correctly for correlated subqueries
 &nbsp; &nbsp; &gt; SPARK-35381&nbsp; &nbsp; &nbsp;Fix lambda variable name issues in nested DataFrame
 &nbsp; &nbsp; &gt; functions in R APIs
 &nbsp; &nbsp; &gt; SPARK-35382&nbsp; &nbsp; &nbsp;Fix lambda variable name issues in nested DataFrame
 &nbsp; &nbsp; &gt; functions in Python APIs
 &nbsp; &nbsp; &gt; 
 &nbsp; &nbsp; &gt; # Notable K8s patches since K8s GA
 &nbsp; &nbsp; &gt; SPARK-34674&nbsp; &nbsp; Close SparkContext after the Main method has finished
 &nbsp; &nbsp; &gt; SPARK-34948&nbsp; &nbsp; Add ownerReference to executor configmap to fix leakages
 &nbsp; &nbsp; &gt; SPARK-34820&nbsp; &nbsp; add apt-update before gnupg install
 &nbsp; &nbsp; &gt; SPARK-34361&nbsp; &nbsp; In case of downscaling avoid killing of executors already
 &nbsp; &nbsp; &gt; known by the scheduler backend in the pod allocator
 &nbsp; &nbsp; &gt; 
 &nbsp; &nbsp; &gt; Bests,
 &nbsp; &nbsp; &gt; Dongjoon.
 
 
 
 
 
 &nbsp; &nbsp; --
 &nbsp; &nbsp; Sent from: http://apache-spark-developers-list.1001551.n3.nabble.com/ 
 
 &nbsp; &nbsp; ---------------------------------------------------------------------
 &nbsp; &nbsp; To unsubscribe e-mail: dev-unsubscribe@spark.apache.org
 
 
 



-- 
---
Takeshi Yamamuro



 

-- 
Twitter:&nbsp;https://twitter.com/holdenkarau

Books (Learning Spark, High Performance Spark, etc.):&nbsp;https://amzn.to/2MaRAG9&nbsp;
YouTube Live Streams:&nbsp;https://www.youtube.com/user/holdenkarau









 
 
 



-- 
John Zhuge

  
 
 



--

Re: Apache Spark 3.1.2 Release?

Posted by Dongjoon Hyun <do...@gmail.com>.

Thank you all! I'll start to prepare.

Bests,
Dongjoon.

On Tue, May 18, 2021 at 12:53 AM Wenchen Fan <cl...@gmail.com> wrote:

> +1, thanks!
>
> On Tue, May 18, 2021 at 1:37 PM Xiao Li <li...@databricks.com> wrote:
>
>> +1 Thanks, Dongjoon!
>>
>> Xiao
>>
>>
>>
>> On Mon, May 17, 2021 at 8:45 PM Kent Yao <ya...@gmail.com> wrote:
>>
>>> +1. thanks Dongjoon
>>>
>>> *Kent Yao *
>>> @ Data Science Center, Hangzhou Research Institute, NetEase Corp.
>>> *a spark enthusiast*
>>> *kyuubi <https://github.com/yaooqinn/kyuubi>is a
>>> unified multi-tenant JDBC interface for large-scale data processing and
>>> analytics, built on top of Apache Spark <http://spark.apache.org/>.*
>>> *spark-authorizer <https://github.com/yaooqinn/spark-authorizer>A Spark
>>> SQL extension which provides SQL Standard Authorization for **Apache
>>> Spark <http://spark.apache.org/>.*
>>> *spark-postgres <https://github.com/yaooqinn/spark-postgres> A library
>>> for reading data from and transferring data to Postgres / Greenplum with
>>> Spark SQL and DataFrames, 10~100x faster.*
>>> *itatchi <https://github.com/yaooqinn/spark-func-extras>A** library t**hat
>>> brings useful functions from various modern database management systems to **Apache
>>> Spark <http://spark.apache.org/>.*
>>>
>>>
>>>
>>> On 05/18/2021 10:57，John Zhuge<jz...@apache.org> <jz...@apache.org>
>>> wrote：
>>>
>>> +1, thanks Dongjoon!
>>>
>>> On Mon, May 17, 2021 at 7:50 PM Yuming Wang <wg...@gmail.com> wrote:
>>>
>>>> +1.
>>>>
>>>> On Tue, May 18, 2021 at 9:06 AM Hyukjin Kwon <gu...@gmail.com>
>>>> wrote:
>>>>
>>>>> +1 thanks for driving me
>>>>>
>>>>> On Tue, 18 May 2021, 09:33 Holden Karau, <ho...@pigscanfly.ca> wrote:
>>>>>
>>>>>> +1 and thanks for volunteering to be the RM :)
>>>>>>
>>>>>> On Mon, May 17, 2021 at 4:09 PM Takeshi Yamamuro <
>>>>>> linguin.m.s@gmail.com> wrote:
>>>>>>
>>>>>>> Thank you, Dongjoon~ sgtm, too.
>>>>>>>
>>>>>>> On Tue, May 18, 2021 at 7:34 AM Cheng Su <ch...@fb.com.invalid>
>>>>>>> wrote:
>>>>>>>
>>>>>>>> +1 for a new release, thanks Dongjoon!
>>>>>>>>
>>>>>>>> Cheng Su
>>>>>>>>
>>>>>>>> On 5/17/21, 2:44 PM, "Liang-Chi Hsieh" <vi...@gmail.com> wrote:
>>>>>>>>
>>>>>>>>     +1 sounds good. Thanks Dongjoon for volunteering on this!
>>>>>>>>
>>>>>>>>
>>>>>>>>     Liang-Chi
>>>>>>>>
>>>>>>>>
>>>>>>>>     Dongjoon Hyun-2 wrote
>>>>>>>>     > Hi, All.
>>>>>>>>     >
>>>>>>>>     > Since Apache Spark 3.1.1 tag creation (Feb 21),
>>>>>>>>     > new 172 patches including 9 correctness patches and 4 K8s
>>>>>>>> patches arrived
>>>>>>>>     > at branch-3.1.
>>>>>>>>     >
>>>>>>>>     > Shall we make a new release, Apache Spark 3.1.2, as the
>>>>>>>> second release at
>>>>>>>>     > 3.1 line?
>>>>>>>>     > I'd like to volunteer for the release manager for Apache
>>>>>>>> Spark 3.1.2.
>>>>>>>>     > I'm thinking about starting the first RC next week.
>>>>>>>>     >
>>>>>>>>     > $ git log --oneline v3.1.1..HEAD | wc -l
>>>>>>>>     >      172
>>>>>>>>     >
>>>>>>>>     > # Known correctness issues
>>>>>>>>     > SPARK-34534     New protocol FetchShuffleBlocks in
>>>>>>>> OneForOneBlockFetcher
>>>>>>>>     > lead to data loss or correctness
>>>>>>>>     > SPARK-34545     PySpark Python UDF return inconsistent
>>>>>>>> results when
>>>>>>>>     > applying 2 UDFs with different return type to 2 columns
>>>>>>>> together
>>>>>>>>     > SPARK-34681     Full outer shuffled hash join when building
>>>>>>>> left side
>>>>>>>>     > produces wrong result
>>>>>>>>     > SPARK-34719     fail if the view query has duplicated column
>>>>>>>> names
>>>>>>>>     > SPARK-34794     Nested higher-order functions broken in DSL
>>>>>>>>     > SPARK-34829     transform_values return identical values when
>>>>>>>> it's used
>>>>>>>>     > with udf that returns reference type
>>>>>>>>     > SPARK-34833     Apply right-padding correctly for correlated
>>>>>>>> subqueries
>>>>>>>>     > SPARK-35381     Fix lambda variable name issues in nested
>>>>>>>> DataFrame
>>>>>>>>     > functions in R APIs
>>>>>>>>     > SPARK-35382     Fix lambda variable name issues in nested
>>>>>>>> DataFrame
>>>>>>>>     > functions in Python APIs
>>>>>>>>     >
>>>>>>>>     > # Notable K8s patches since K8s GA
>>>>>>>>     > SPARK-34674    Close SparkContext after the Main method has
>>>>>>>> finished
>>>>>>>>     > SPARK-34948    Add ownerReference to executor configmap to
>>>>>>>> fix leakages
>>>>>>>>     > SPARK-34820    add apt-update before gnupg install
>>>>>>>>     > SPARK-34361    In case of downscaling avoid killing of
>>>>>>>> executors already
>>>>>>>>     > known by the scheduler backend in the pod allocator
>>>>>>>>     >
>>>>>>>>     > Bests,
>>>>>>>>     > Dongjoon.
>>>>>>>>
>>>>>>>>
>>>>>>>>
>>>>>>>>
>>>>>>>>
>>>>>>>>     --
>>>>>>>>     Sent from:
>>>>>>>> http://apache-spark-developers-list.1001551.n3.nabble.com/
>>>>>>>>
>>>>>>>>
>>>>>>>> ---------------------------------------------------------------------
>>>>>>>>     To unsubscribe e-mail: dev-unsubscribe@spark.apache.org
>>>>>>>>
>>>>>>>>
>>>>>>>>
>>>>>>>
>>>>>>> --
>>>>>>> ---
>>>>>>> Takeshi Yamamuro
>>>>>>>
>>>>>> --
>>>>>> Twitter: https://twitter.com/holdenkarau
>>>>>> Books (Learning Spark, High Performance Spark, etc.):
>>>>>> https://amzn.to/2MaRAG9  <https://amzn.to/2MaRAG9>
>>>>>> YouTube Live Streams: https://www.youtube.com/user/holdenkarau
>>>>>>
>>>>>
>>>
>>> --
>>> John Zhuge
>>>
>>>
>>
>> --
>>
>>

Re: Apache Spark 3.1.2 Release?

Posted by Wenchen Fan <cl...@gmail.com>.

+1, thanks!

On Tue, May 18, 2021 at 1:37 PM Xiao Li <li...@databricks.com> wrote:

> +1 Thanks, Dongjoon!
>
> Xiao
>
>
>
> On Mon, May 17, 2021 at 8:45 PM Kent Yao <ya...@gmail.com> wrote:
>
>> +1. thanks Dongjoon
>>
>> *Kent Yao *
>> @ Data Science Center, Hangzhou Research Institute, NetEase Corp.
>> *a spark enthusiast*
>> *kyuubi <https://github.com/yaooqinn/kyuubi>is a
>> unified multi-tenant JDBC interface for large-scale data processing and
>> analytics, built on top of Apache Spark <http://spark.apache.org/>.*
>> *spark-authorizer <https://github.com/yaooqinn/spark-authorizer>A Spark
>> SQL extension which provides SQL Standard Authorization for **Apache
>> Spark <http://spark.apache.org/>.*
>> *spark-postgres <https://github.com/yaooqinn/spark-postgres> A library
>> for reading data from and transferring data to Postgres / Greenplum with
>> Spark SQL and DataFrames, 10~100x faster.*
>> *itatchi <https://github.com/yaooqinn/spark-func-extras>A** library t**hat
>> brings useful functions from various modern database management systems to **Apache
>> Spark <http://spark.apache.org/>.*
>>
>>
>>
>> On 05/18/2021 10:57，John Zhuge<jz...@apache.org> <jz...@apache.org>
>> wrote：
>>
>> +1, thanks Dongjoon!
>>
>> On Mon, May 17, 2021 at 7:50 PM Yuming Wang <wg...@gmail.com> wrote:
>>
>>> +1.
>>>
>>> On Tue, May 18, 2021 at 9:06 AM Hyukjin Kwon <gu...@gmail.com>
>>> wrote:
>>>
>>>> +1 thanks for driving me
>>>>
>>>> On Tue, 18 May 2021, 09:33 Holden Karau, <ho...@pigscanfly.ca> wrote:
>>>>
>>>>> +1 and thanks for volunteering to be the RM :)
>>>>>
>>>>> On Mon, May 17, 2021 at 4:09 PM Takeshi Yamamuro <
>>>>> linguin.m.s@gmail.com> wrote:
>>>>>
>>>>>> Thank you, Dongjoon~ sgtm, too.
>>>>>>
>>>>>> On Tue, May 18, 2021 at 7:34 AM Cheng Su <ch...@fb.com.invalid>
>>>>>> wrote:
>>>>>>
>>>>>>> +1 for a new release, thanks Dongjoon!
>>>>>>>
>>>>>>> Cheng Su
>>>>>>>
>>>>>>> On 5/17/21, 2:44 PM, "Liang-Chi Hsieh" <vi...@gmail.com> wrote:
>>>>>>>
>>>>>>>     +1 sounds good. Thanks Dongjoon for volunteering on this!
>>>>>>>
>>>>>>>
>>>>>>>     Liang-Chi
>>>>>>>
>>>>>>>
>>>>>>>     Dongjoon Hyun-2 wrote
>>>>>>>     > Hi, All.
>>>>>>>     >
>>>>>>>     > Since Apache Spark 3.1.1 tag creation (Feb 21),
>>>>>>>     > new 172 patches including 9 correctness patches and 4 K8s
>>>>>>> patches arrived
>>>>>>>     > at branch-3.1.
>>>>>>>     >
>>>>>>>     > Shall we make a new release, Apache Spark 3.1.2, as the second
>>>>>>> release at
>>>>>>>     > 3.1 line?
>>>>>>>     > I'd like to volunteer for the release manager for Apache Spark
>>>>>>> 3.1.2.
>>>>>>>     > I'm thinking about starting the first RC next week.
>>>>>>>     >
>>>>>>>     > $ git log --oneline v3.1.1..HEAD | wc -l
>>>>>>>     >      172
>>>>>>>     >
>>>>>>>     > # Known correctness issues
>>>>>>>     > SPARK-34534     New protocol FetchShuffleBlocks in
>>>>>>> OneForOneBlockFetcher
>>>>>>>     > lead to data loss or correctness
>>>>>>>     > SPARK-34545     PySpark Python UDF return inconsistent results
>>>>>>> when
>>>>>>>     > applying 2 UDFs with different return type to 2 columns
>>>>>>> together
>>>>>>>     > SPARK-34681     Full outer shuffled hash join when building
>>>>>>> left side
>>>>>>>     > produces wrong result
>>>>>>>     > SPARK-34719     fail if the view query has duplicated column
>>>>>>> names
>>>>>>>     > SPARK-34794     Nested higher-order functions broken in DSL
>>>>>>>     > SPARK-34829     transform_values return identical values when
>>>>>>> it's used
>>>>>>>     > with udf that returns reference type
>>>>>>>     > SPARK-34833     Apply right-padding correctly for correlated
>>>>>>> subqueries
>>>>>>>     > SPARK-35381     Fix lambda variable name issues in nested
>>>>>>> DataFrame
>>>>>>>     > functions in R APIs
>>>>>>>     > SPARK-35382     Fix lambda variable name issues in nested
>>>>>>> DataFrame
>>>>>>>     > functions in Python APIs
>>>>>>>     >
>>>>>>>     > # Notable K8s patches since K8s GA
>>>>>>>     > SPARK-34674    Close SparkContext after the Main method has
>>>>>>> finished
>>>>>>>     > SPARK-34948    Add ownerReference to executor configmap to fix
>>>>>>> leakages
>>>>>>>     > SPARK-34820    add apt-update before gnupg install
>>>>>>>     > SPARK-34361    In case of downscaling avoid killing of
>>>>>>> executors already
>>>>>>>     > known by the scheduler backend in the pod allocator
>>>>>>>     >
>>>>>>>     > Bests,
>>>>>>>     > Dongjoon.
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>>     --
>>>>>>>     Sent from:
>>>>>>> http://apache-spark-developers-list.1001551.n3.nabble.com/
>>>>>>>
>>>>>>>
>>>>>>> ---------------------------------------------------------------------
>>>>>>>     To unsubscribe e-mail: dev-unsubscribe@spark.apache.org
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>
>>>>>> --
>>>>>> ---
>>>>>> Takeshi Yamamuro
>>>>>>
>>>>> --
>>>>> Twitter: https://twitter.com/holdenkarau
>>>>> Books (Learning Spark, High Performance Spark, etc.):
>>>>> https://amzn.to/2MaRAG9  <https://amzn.to/2MaRAG9>
>>>>> YouTube Live Streams: https://www.youtube.com/user/holdenkarau
>>>>>
>>>>
>>
>> --
>> John Zhuge
>>
>>
>
> --
>
>

Re: Apache Spark 3.1.2 Release?

Posted by Xiao Li <li...@databricks.com>.

+1 Thanks, Dongjoon!

Xiao



On Mon, May 17, 2021 at 8:45 PM Kent Yao <ya...@gmail.com> wrote:

> +1. thanks Dongjoon
>
> *Kent Yao *
> @ Data Science Center, Hangzhou Research Institute, NetEase Corp.
> *a spark enthusiast*
> *kyuubi <https://github.com/yaooqinn/kyuubi>is a unified multi-tenant JDBC
> interface for large-scale data processing and analytics, built on top
> of Apache Spark <http://spark.apache.org/>.*
> *spark-authorizer <https://github.com/yaooqinn/spark-authorizer>A Spark
> SQL extension which provides SQL Standard Authorization for **Apache
> Spark <http://spark.apache.org/>.*
> *spark-postgres <https://github.com/yaooqinn/spark-postgres> A library for
> reading data from and transferring data to Postgres / Greenplum with Spark
> SQL and DataFrames, 10~100x faster.*
> *itatchi <https://github.com/yaooqinn/spark-func-extras>A** library t**hat
> brings useful functions from various modern database management systems to **Apache
> Spark <http://spark.apache.org/>.*
>
>
>
> On 05/18/2021 10:57，John Zhuge<jz...@apache.org> <jz...@apache.org>
> wrote：
>
> +1, thanks Dongjoon!
>
> On Mon, May 17, 2021 at 7:50 PM Yuming Wang <wg...@gmail.com> wrote:
>
>> +1.
>>
>> On Tue, May 18, 2021 at 9:06 AM Hyukjin Kwon <gu...@gmail.com> wrote:
>>
>>> +1 thanks for driving me
>>>
>>> On Tue, 18 May 2021, 09:33 Holden Karau, <ho...@pigscanfly.ca> wrote:
>>>
>>>> +1 and thanks for volunteering to be the RM :)
>>>>
>>>> On Mon, May 17, 2021 at 4:09 PM Takeshi Yamamuro <li...@gmail.com>
>>>> wrote:
>>>>
>>>>> Thank you, Dongjoon~ sgtm, too.
>>>>>
>>>>> On Tue, May 18, 2021 at 7:34 AM Cheng Su <ch...@fb.com.invalid>
>>>>> wrote:
>>>>>
>>>>>> +1 for a new release, thanks Dongjoon!
>>>>>>
>>>>>> Cheng Su
>>>>>>
>>>>>> On 5/17/21, 2:44 PM, "Liang-Chi Hsieh" <vi...@gmail.com> wrote:
>>>>>>
>>>>>>     +1 sounds good. Thanks Dongjoon for volunteering on this!
>>>>>>
>>>>>>
>>>>>>     Liang-Chi
>>>>>>
>>>>>>
>>>>>>     Dongjoon Hyun-2 wrote
>>>>>>     > Hi, All.
>>>>>>     >
>>>>>>     > Since Apache Spark 3.1.1 tag creation (Feb 21),
>>>>>>     > new 172 patches including 9 correctness patches and 4 K8s
>>>>>> patches arrived
>>>>>>     > at branch-3.1.
>>>>>>     >
>>>>>>     > Shall we make a new release, Apache Spark 3.1.2, as the second
>>>>>> release at
>>>>>>     > 3.1 line?
>>>>>>     > I'd like to volunteer for the release manager for Apache Spark
>>>>>> 3.1.2.
>>>>>>     > I'm thinking about starting the first RC next week.
>>>>>>     >
>>>>>>     > $ git log --oneline v3.1.1..HEAD | wc -l
>>>>>>     >      172
>>>>>>     >
>>>>>>     > # Known correctness issues
>>>>>>     > SPARK-34534     New protocol FetchShuffleBlocks in
>>>>>> OneForOneBlockFetcher
>>>>>>     > lead to data loss or correctness
>>>>>>     > SPARK-34545     PySpark Python UDF return inconsistent results
>>>>>> when
>>>>>>     > applying 2 UDFs with different return type to 2 columns together
>>>>>>     > SPARK-34681     Full outer shuffled hash join when building
>>>>>> left side
>>>>>>     > produces wrong result
>>>>>>     > SPARK-34719     fail if the view query has duplicated column
>>>>>> names
>>>>>>     > SPARK-34794     Nested higher-order functions broken in DSL
>>>>>>     > SPARK-34829     transform_values return identical values when
>>>>>> it's used
>>>>>>     > with udf that returns reference type
>>>>>>     > SPARK-34833     Apply right-padding correctly for correlated
>>>>>> subqueries
>>>>>>     > SPARK-35381     Fix lambda variable name issues in nested
>>>>>> DataFrame
>>>>>>     > functions in R APIs
>>>>>>     > SPARK-35382     Fix lambda variable name issues in nested
>>>>>> DataFrame
>>>>>>     > functions in Python APIs
>>>>>>     >
>>>>>>     > # Notable K8s patches since K8s GA
>>>>>>     > SPARK-34674    Close SparkContext after the Main method has
>>>>>> finished
>>>>>>     > SPARK-34948    Add ownerReference to executor configmap to fix
>>>>>> leakages
>>>>>>     > SPARK-34820    add apt-update before gnupg install
>>>>>>     > SPARK-34361    In case of downscaling avoid killing of
>>>>>> executors already
>>>>>>     > known by the scheduler backend in the pod allocator
>>>>>>     >
>>>>>>     > Bests,
>>>>>>     > Dongjoon.
>>>>>>
>>>>>>
>>>>>>
>>>>>>
>>>>>>
>>>>>>     --
>>>>>>     Sent from:
>>>>>> http://apache-spark-developers-list.1001551.n3.nabble.com/
>>>>>>
>>>>>>
>>>>>> ---------------------------------------------------------------------
>>>>>>     To unsubscribe e-mail: dev-unsubscribe@spark.apache.org
>>>>>>
>>>>>>
>>>>>>
>>>>>
>>>>> --
>>>>> ---
>>>>> Takeshi Yamamuro
>>>>>
>>>> --
>>>> Twitter: https://twitter.com/holdenkarau
>>>> Books (Learning Spark, High Performance Spark, etc.):
>>>> https://amzn.to/2MaRAG9  <https://amzn.to/2MaRAG9>
>>>> YouTube Live Streams: https://www.youtube.com/user/holdenkarau
>>>>
>>>
>
> --
> John Zhuge
>
>

--

Re: Apache Spark 3.1.2 Release?

Posted by Kent Yao <ya...@gmail.com>.

+1. thanks Dongjoon

  

**Kent Yao  **

@ Data Science Center, Hangzhou Research Institute, NetEase Corp.

 ** _a s_ _park  enthusiast_**

 _[kyuubi](https://github.com/yaooqinn/kyuubi) is a unified multi-tenant JDBC
interface for large-scale data processing and analytics, built on top of
[Apache Spark](http://spark.apache.org/)._

#

 _[spark-authorizer](https://github.com/yaooqinn/spark-authorizer) A Spark SQL
extension which provides SQL Standard Authorization for _ _[Apache
Spark](http://spark.apache.org/)._

 _ _[spark-postgres](https://github.com/yaooqinn/spark-postgres)  A library
for reading data from and transferring data to Postgres / Greenplum with Spark
SQL and DataFrames, 10~100x faster.__

 _ _[itatchi](https://github.com/yaooqinn/spark-func-extras) A__ _library  t_
_hat brings useful functions from various modern database management systems
to  _ _ _[Apache Spark](http://spark.apache.org/).__

#

  

  

  

On 05/18/2021 10:57，[John Zhuge<jz...@apache.org>](mailto:jzhuge@apache.org)
wrote：

> +1, thanks Dongjoon!  
>

>

>  
>

>

> On Mon, May 17, 2021 at 7:50 PM Yuming Wang
<[wgyumg@gmail.com](mailto:wgyumg@gmail.com)> wrote:  
>

>

>> +1.  
>

>>

>>  
>

>>

>> On Tue, May 18, 2021 at 9:06 AM Hyukjin Kwon
<[gurwls223@gmail.com](mailto:gurwls223@gmail.com)> wrote:  
>

>>

>>> +1 thanks for driving me

>>>

>>>  
>

>>>

>>> On Tue, 18 May 2021, 09:33 Holden Karau,
<[holden@pigscanfly.ca](mailto:holden@pigscanfly.ca)> wrote:  
>

>>>

>>>> +1 and thanks for volunteering to be the RM :)

>>>>

>>>>  
>

>>>>

>>>> On Mon, May 17, 2021 at 4:09 PM Takeshi Yamamuro
<[linguin.m.s@gmail.com](mailto:linguin.m.s@gmail.com)> wrote:  
>

>>>>

>>>>> Thank you, Dongjoon~ sgtm, too.

>>>>>

>>>>>  
>

>>>>>

>>>>> On Tue, May 18, 2021 at 7:34 AM Cheng Su <ch...@fb.com.invalid> wrote:  
>

>>>>>

>>>>>> +1 for a new release, thanks Dongjoon!  
>  
>  Cheng Su  
>  
>  On 5/17/21, 2:44 PM, "Liang-Chi Hsieh"
<[viirya@gmail.com](mailto:viirya@gmail.com)> wrote:  
>  
>      +1 sounds good. Thanks Dongjoon for volunteering on this!  
>  
>  
>      Liang-Chi  
>  
>  
>      Dongjoon Hyun-2 wrote  
>      > Hi, All.  
>      >  
>      > Since Apache Spark 3.1.1 tag creation (Feb 21),  
>      > new 172 patches including 9 correctness patches and 4 K8s patches
arrived  
>      > at branch-3.1.  
>      >  
>      > Shall we make a new release, Apache Spark 3.1.2, as the second
release at  
>      > 3.1 line?  
>      > I'd like to volunteer for the release manager for Apache Spark 3.1.2.  
>      > I'm thinking about starting the first RC next week.  
>      >  
>      > $ git log --oneline v3.1.1..HEAD | wc -l  
>      >      172  
>      >  
>      > # Known correctness issues  
>      > SPARK-34534     New protocol FetchShuffleBlocks in
OneForOneBlockFetcher  
>      > lead to data loss or correctness  
>      > SPARK-34545     PySpark Python UDF return inconsistent results when  
>      > applying 2 UDFs with different return type to 2 columns together  
>      > SPARK-34681     Full outer shuffled hash join when building left side  
>      > produces wrong result  
>      > SPARK-34719     fail if the view query has duplicated column names  
>      > SPARK-34794     Nested higher-order functions broken in DSL  
>      > SPARK-34829     transform_values return identical values when it's
used  
>      > with udf that returns reference type  
>      > SPARK-34833     Apply right-padding correctly for correlated
subqueries  
>      > SPARK-35381     Fix lambda variable name issues in nested DataFrame  
>      > functions in R APIs  
>      > SPARK-35382     Fix lambda variable name issues in nested DataFrame  
>      > functions in Python APIs  
>      >  
>      > # Notable K8s patches since K8s GA  
>      > SPARK-34674    Close SparkContext after the Main method has finished  
>      > SPARK-34948    Add ownerReference to executor configmap to fix
leakages  
>      > SPARK-34820    add apt-update before gnupg install  
>      > SPARK-34361    In case of downscaling avoid killing of executors
already  
>      > known by the scheduler backend in the pod allocator  
>      >  
>      > Bests,  
>      > Dongjoon.  
>  
>  
>  
>  
>  
>      \--  
>      Sent from: <http://apache-spark-developers-list.1001551.n3.nabble.com/>  
>  
>      \---------------------------------------------------------------------  
>      To unsubscribe e-mail: [dev-unsubscribe@spark.apache.org](mailto:dev-
unsubscribe@spark.apache.org)  
>  
>  
>

>>>>>

>>>>>  
>

>>>>>

>>>>>  
>

>>>>>

>>>>> \--  
>

>>>>>

>>>>> \---  
> Takeshi Yamamuro  
>

>>>>

>>>> \--  
>

>>>>

>>>> Twitter: <https://twitter.com/holdenkarau>  
>

>>>>

>>>> Books (Learning Spark, High Performance Spark, etc.):
<https://amzn.to/2MaRAG9>[ ](https://amzn.to/2MaRAG9)

>>>>

>>>> YouTube Live Streams: <https://www.youtube.com/user/holdenkarau>

>

>  
>

>

>  
>

>

> \--  
>

>

> John Zhuge

Re: Apache Spark 3.1.2 Release?

Posted by Chao Sun <su...@apache.org>.

+1. Thanks Dongjoon for doing this!

On Mon, May 17, 2021 at 7:58 PM John Zhuge <jz...@apache.org> wrote:

> +1, thanks Dongjoon!
>
> On Mon, May 17, 2021 at 7:50 PM Yuming Wang <wg...@gmail.com> wrote:
>
>> +1.
>>
>> On Tue, May 18, 2021 at 9:06 AM Hyukjin Kwon <gu...@gmail.com> wrote:
>>
>>> +1 thanks for driving me
>>>
>>> On Tue, 18 May 2021, 09:33 Holden Karau, <ho...@pigscanfly.ca> wrote:
>>>
>>>> +1 and thanks for volunteering to be the RM :)
>>>>
>>>> On Mon, May 17, 2021 at 4:09 PM Takeshi Yamamuro <li...@gmail.com>
>>>> wrote:
>>>>
>>>>> Thank you, Dongjoon~ sgtm, too.
>>>>>
>>>>> On Tue, May 18, 2021 at 7:34 AM Cheng Su <ch...@fb.com.invalid>
>>>>> wrote:
>>>>>
>>>>>> +1 for a new release, thanks Dongjoon!
>>>>>>
>>>>>> Cheng Su
>>>>>>
>>>>>> On 5/17/21, 2:44 PM, "Liang-Chi Hsieh" <vi...@gmail.com> wrote:
>>>>>>
>>>>>>     +1 sounds good. Thanks Dongjoon for volunteering on this!
>>>>>>
>>>>>>
>>>>>>     Liang-Chi
>>>>>>
>>>>>>
>>>>>>     Dongjoon Hyun-2 wrote
>>>>>>     > Hi, All.
>>>>>>     >
>>>>>>     > Since Apache Spark 3.1.1 tag creation (Feb 21),
>>>>>>     > new 172 patches including 9 correctness patches and 4 K8s
>>>>>> patches arrived
>>>>>>     > at branch-3.1.
>>>>>>     >
>>>>>>     > Shall we make a new release, Apache Spark 3.1.2, as the second
>>>>>> release at
>>>>>>     > 3.1 line?
>>>>>>     > I'd like to volunteer for the release manager for Apache Spark
>>>>>> 3.1.2.
>>>>>>     > I'm thinking about starting the first RC next week.
>>>>>>     >
>>>>>>     > $ git log --oneline v3.1.1..HEAD | wc -l
>>>>>>     >      172
>>>>>>     >
>>>>>>     > # Known correctness issues
>>>>>>     > SPARK-34534     New protocol FetchShuffleBlocks in
>>>>>> OneForOneBlockFetcher
>>>>>>     > lead to data loss or correctness
>>>>>>     > SPARK-34545     PySpark Python UDF return inconsistent results
>>>>>> when
>>>>>>     > applying 2 UDFs with different return type to 2 columns together
>>>>>>     > SPARK-34681     Full outer shuffled hash join when building
>>>>>> left side
>>>>>>     > produces wrong result
>>>>>>     > SPARK-34719     fail if the view query has duplicated column
>>>>>> names
>>>>>>     > SPARK-34794     Nested higher-order functions broken in DSL
>>>>>>     > SPARK-34829     transform_values return identical values when
>>>>>> it's used
>>>>>>     > with udf that returns reference type
>>>>>>     > SPARK-34833     Apply right-padding correctly for correlated
>>>>>> subqueries
>>>>>>     > SPARK-35381     Fix lambda variable name issues in nested
>>>>>> DataFrame
>>>>>>     > functions in R APIs
>>>>>>     > SPARK-35382     Fix lambda variable name issues in nested
>>>>>> DataFrame
>>>>>>     > functions in Python APIs
>>>>>>     >
>>>>>>     > # Notable K8s patches since K8s GA
>>>>>>     > SPARK-34674    Close SparkContext after the Main method has
>>>>>> finished
>>>>>>     > SPARK-34948    Add ownerReference to executor configmap to fix
>>>>>> leakages
>>>>>>     > SPARK-34820    add apt-update before gnupg install
>>>>>>     > SPARK-34361    In case of downscaling avoid killing of
>>>>>> executors already
>>>>>>     > known by the scheduler backend in the pod allocator
>>>>>>     >
>>>>>>     > Bests,
>>>>>>     > Dongjoon.
>>>>>>
>>>>>>
>>>>>>
>>>>>>
>>>>>>
>>>>>>     --
>>>>>>     Sent from:
>>>>>> http://apache-spark-developers-list.1001551.n3.nabble.com/
>>>>>>
>>>>>>
>>>>>> ---------------------------------------------------------------------
>>>>>>     To unsubscribe e-mail: dev-unsubscribe@spark.apache.org
>>>>>>
>>>>>>
>>>>>>
>>>>>
>>>>> --
>>>>> ---
>>>>> Takeshi Yamamuro
>>>>>
>>>> --
>>>> Twitter: https://twitter.com/holdenkarau
>>>> Books (Learning Spark, High Performance Spark, etc.):
>>>> https://amzn.to/2MaRAG9  <https://amzn.to/2MaRAG9>
>>>> YouTube Live Streams: https://www.youtube.com/user/holdenkarau
>>>>
>>>
>
> --
> John Zhuge
>

Re: Apache Spark 3.1.2 Release?

Posted by John Zhuge <jz...@apache.org>.

+1, thanks Dongjoon!

On Mon, May 17, 2021 at 7:50 PM Yuming Wang <wg...@gmail.com> wrote:

> +1.
>
> On Tue, May 18, 2021 at 9:06 AM Hyukjin Kwon <gu...@gmail.com> wrote:
>
>> +1 thanks for driving me
>>
>> On Tue, 18 May 2021, 09:33 Holden Karau, <ho...@pigscanfly.ca> wrote:
>>
>>> +1 and thanks for volunteering to be the RM :)
>>>
>>> On Mon, May 17, 2021 at 4:09 PM Takeshi Yamamuro <li...@gmail.com>
>>> wrote:
>>>
>>>> Thank you, Dongjoon~ sgtm, too.
>>>>
>>>> On Tue, May 18, 2021 at 7:34 AM Cheng Su <ch...@fb.com.invalid>
>>>> wrote:
>>>>
>>>>> +1 for a new release, thanks Dongjoon!
>>>>>
>>>>> Cheng Su
>>>>>
>>>>> On 5/17/21, 2:44 PM, "Liang-Chi Hsieh" <vi...@gmail.com> wrote:
>>>>>
>>>>>     +1 sounds good. Thanks Dongjoon for volunteering on this!
>>>>>
>>>>>
>>>>>     Liang-Chi
>>>>>
>>>>>
>>>>>     Dongjoon Hyun-2 wrote
>>>>>     > Hi, All.
>>>>>     >
>>>>>     > Since Apache Spark 3.1.1 tag creation (Feb 21),
>>>>>     > new 172 patches including 9 correctness patches and 4 K8s
>>>>> patches arrived
>>>>>     > at branch-3.1.
>>>>>     >
>>>>>     > Shall we make a new release, Apache Spark 3.1.2, as the second
>>>>> release at
>>>>>     > 3.1 line?
>>>>>     > I'd like to volunteer for the release manager for Apache Spark
>>>>> 3.1.2.
>>>>>     > I'm thinking about starting the first RC next week.
>>>>>     >
>>>>>     > $ git log --oneline v3.1.1..HEAD | wc -l
>>>>>     >      172
>>>>>     >
>>>>>     > # Known correctness issues
>>>>>     > SPARK-34534     New protocol FetchShuffleBlocks in
>>>>> OneForOneBlockFetcher
>>>>>     > lead to data loss or correctness
>>>>>     > SPARK-34545     PySpark Python UDF return inconsistent results
>>>>> when
>>>>>     > applying 2 UDFs with different return type to 2 columns together
>>>>>     > SPARK-34681     Full outer shuffled hash join when building left
>>>>> side
>>>>>     > produces wrong result
>>>>>     > SPARK-34719     fail if the view query has duplicated column
>>>>> names
>>>>>     > SPARK-34794     Nested higher-order functions broken in DSL
>>>>>     > SPARK-34829     transform_values return identical values when
>>>>> it's used
>>>>>     > with udf that returns reference type
>>>>>     > SPARK-34833     Apply right-padding correctly for correlated
>>>>> subqueries
>>>>>     > SPARK-35381     Fix lambda variable name issues in nested
>>>>> DataFrame
>>>>>     > functions in R APIs
>>>>>     > SPARK-35382     Fix lambda variable name issues in nested
>>>>> DataFrame
>>>>>     > functions in Python APIs
>>>>>     >
>>>>>     > # Notable K8s patches since K8s GA
>>>>>     > SPARK-34674    Close SparkContext after the Main method has
>>>>> finished
>>>>>     > SPARK-34948    Add ownerReference to executor configmap to fix
>>>>> leakages
>>>>>     > SPARK-34820    add apt-update before gnupg install
>>>>>     > SPARK-34361    In case of downscaling avoid killing of executors
>>>>> already
>>>>>     > known by the scheduler backend in the pod allocator
>>>>>     >
>>>>>     > Bests,
>>>>>     > Dongjoon.
>>>>>
>>>>>
>>>>>
>>>>>
>>>>>
>>>>>     --
>>>>>     Sent from:
>>>>> http://apache-spark-developers-list.1001551.n3.nabble.com/
>>>>>
>>>>>
>>>>> ---------------------------------------------------------------------
>>>>>     To unsubscribe e-mail: dev-unsubscribe@spark.apache.org
>>>>>
>>>>>
>>>>>
>>>>
>>>> --
>>>> ---
>>>> Takeshi Yamamuro
>>>>
>>> --
>>> Twitter: https://twitter.com/holdenkarau
>>> Books (Learning Spark, High Performance Spark, etc.):
>>> https://amzn.to/2MaRAG9  <https://amzn.to/2MaRAG9>
>>> YouTube Live Streams: https://www.youtube.com/user/holdenkarau
>>>
>>

-- 
John Zhuge

Re: Apache Spark 3.1.2 Release?

Posted by Yuming Wang <wg...@gmail.com>.

+1.

On Tue, May 18, 2021 at 9:06 AM Hyukjin Kwon <gu...@gmail.com> wrote:

> +1 thanks for driving me
>
> On Tue, 18 May 2021, 09:33 Holden Karau, <ho...@pigscanfly.ca> wrote:
>
>> +1 and thanks for volunteering to be the RM :)
>>
>> On Mon, May 17, 2021 at 4:09 PM Takeshi Yamamuro <li...@gmail.com>
>> wrote:
>>
>>> Thank you, Dongjoon~ sgtm, too.
>>>
>>> On Tue, May 18, 2021 at 7:34 AM Cheng Su <ch...@fb.com.invalid> wrote:
>>>
>>>> +1 for a new release, thanks Dongjoon!
>>>>
>>>> Cheng Su
>>>>
>>>> On 5/17/21, 2:44 PM, "Liang-Chi Hsieh" <vi...@gmail.com> wrote:
>>>>
>>>>     +1 sounds good. Thanks Dongjoon for volunteering on this!
>>>>
>>>>
>>>>     Liang-Chi
>>>>
>>>>
>>>>     Dongjoon Hyun-2 wrote
>>>>     > Hi, All.
>>>>     >
>>>>     > Since Apache Spark 3.1.1 tag creation (Feb 21),
>>>>     > new 172 patches including 9 correctness patches and 4 K8s patches
>>>> arrived
>>>>     > at branch-3.1.
>>>>     >
>>>>     > Shall we make a new release, Apache Spark 3.1.2, as the second
>>>> release at
>>>>     > 3.1 line?
>>>>     > I'd like to volunteer for the release manager for Apache Spark
>>>> 3.1.2.
>>>>     > I'm thinking about starting the first RC next week.
>>>>     >
>>>>     > $ git log --oneline v3.1.1..HEAD | wc -l
>>>>     >      172
>>>>     >
>>>>     > # Known correctness issues
>>>>     > SPARK-34534     New protocol FetchShuffleBlocks in
>>>> OneForOneBlockFetcher
>>>>     > lead to data loss or correctness
>>>>     > SPARK-34545     PySpark Python UDF return inconsistent results
>>>> when
>>>>     > applying 2 UDFs with different return type to 2 columns together
>>>>     > SPARK-34681     Full outer shuffled hash join when building left
>>>> side
>>>>     > produces wrong result
>>>>     > SPARK-34719     fail if the view query has duplicated column names
>>>>     > SPARK-34794     Nested higher-order functions broken in DSL
>>>>     > SPARK-34829     transform_values return identical values when
>>>> it's used
>>>>     > with udf that returns reference type
>>>>     > SPARK-34833     Apply right-padding correctly for correlated
>>>> subqueries
>>>>     > SPARK-35381     Fix lambda variable name issues in nested
>>>> DataFrame
>>>>     > functions in R APIs
>>>>     > SPARK-35382     Fix lambda variable name issues in nested
>>>> DataFrame
>>>>     > functions in Python APIs
>>>>     >
>>>>     > # Notable K8s patches since K8s GA
>>>>     > SPARK-34674    Close SparkContext after the Main method has
>>>> finished
>>>>     > SPARK-34948    Add ownerReference to executor configmap to fix
>>>> leakages
>>>>     > SPARK-34820    add apt-update before gnupg install
>>>>     > SPARK-34361    In case of downscaling avoid killing of executors
>>>> already
>>>>     > known by the scheduler backend in the pod allocator
>>>>     >
>>>>     > Bests,
>>>>     > Dongjoon.
>>>>
>>>>
>>>>
>>>>
>>>>
>>>>     --
>>>>     Sent from:
>>>> http://apache-spark-developers-list.1001551.n3.nabble.com/
>>>>
>>>>
>>>> ---------------------------------------------------------------------
>>>>     To unsubscribe e-mail: dev-unsubscribe@spark.apache.org
>>>>
>>>>
>>>>
>>>
>>> --
>>> ---
>>> Takeshi Yamamuro
>>>
>> --
>> Twitter: https://twitter.com/holdenkarau
>> Books (Learning Spark, High Performance Spark, etc.):
>> https://amzn.to/2MaRAG9  <https://amzn.to/2MaRAG9>
>> YouTube Live Streams: https://www.youtube.com/user/holdenkarau
>>
>

Re: Apache Spark 3.1.2 Release?

Posted by Hyukjin Kwon <gu...@gmail.com>.

+1 thanks for driving me

On Tue, 18 May 2021, 09:33 Holden Karau, <ho...@pigscanfly.ca> wrote:

> +1 and thanks for volunteering to be the RM :)
>
> On Mon, May 17, 2021 at 4:09 PM Takeshi Yamamuro <li...@gmail.com>
> wrote:
>
>> Thank you, Dongjoon~ sgtm, too.
>>
>> On Tue, May 18, 2021 at 7:34 AM Cheng Su <ch...@fb.com.invalid> wrote:
>>
>>> +1 for a new release, thanks Dongjoon!
>>>
>>> Cheng Su
>>>
>>> On 5/17/21, 2:44 PM, "Liang-Chi Hsieh" <vi...@gmail.com> wrote:
>>>
>>>     +1 sounds good. Thanks Dongjoon for volunteering on this!
>>>
>>>
>>>     Liang-Chi
>>>
>>>
>>>     Dongjoon Hyun-2 wrote
>>>     > Hi, All.
>>>     >
>>>     > Since Apache Spark 3.1.1 tag creation (Feb 21),
>>>     > new 172 patches including 9 correctness patches and 4 K8s patches
>>> arrived
>>>     > at branch-3.1.
>>>     >
>>>     > Shall we make a new release, Apache Spark 3.1.2, as the second
>>> release at
>>>     > 3.1 line?
>>>     > I'd like to volunteer for the release manager for Apache Spark
>>> 3.1.2.
>>>     > I'm thinking about starting the first RC next week.
>>>     >
>>>     > $ git log --oneline v3.1.1..HEAD | wc -l
>>>     >      172
>>>     >
>>>     > # Known correctness issues
>>>     > SPARK-34534     New protocol FetchShuffleBlocks in
>>> OneForOneBlockFetcher
>>>     > lead to data loss or correctness
>>>     > SPARK-34545     PySpark Python UDF return inconsistent results when
>>>     > applying 2 UDFs with different return type to 2 columns together
>>>     > SPARK-34681     Full outer shuffled hash join when building left
>>> side
>>>     > produces wrong result
>>>     > SPARK-34719     fail if the view query has duplicated column names
>>>     > SPARK-34794     Nested higher-order functions broken in DSL
>>>     > SPARK-34829     transform_values return identical values when it's
>>> used
>>>     > with udf that returns reference type
>>>     > SPARK-34833     Apply right-padding correctly for correlated
>>> subqueries
>>>     > SPARK-35381     Fix lambda variable name issues in nested DataFrame
>>>     > functions in R APIs
>>>     > SPARK-35382     Fix lambda variable name issues in nested DataFrame
>>>     > functions in Python APIs
>>>     >
>>>     > # Notable K8s patches since K8s GA
>>>     > SPARK-34674    Close SparkContext after the Main method has
>>> finished
>>>     > SPARK-34948    Add ownerReference to executor configmap to fix
>>> leakages
>>>     > SPARK-34820    add apt-update before gnupg install
>>>     > SPARK-34361    In case of downscaling avoid killing of executors
>>> already
>>>     > known by the scheduler backend in the pod allocator
>>>     >
>>>     > Bests,
>>>     > Dongjoon.
>>>
>>>
>>>
>>>
>>>
>>>     --
>>>     Sent from:
>>> http://apache-spark-developers-list.1001551.n3.nabble.com/
>>>
>>>     ---------------------------------------------------------------------
>>>     To unsubscribe e-mail: dev-unsubscribe@spark.apache.org
>>>
>>>
>>>
>>
>> --
>> ---
>> Takeshi Yamamuro
>>
> --
> Twitter: https://twitter.com/holdenkarau
> Books (Learning Spark, High Performance Spark, etc.):
> https://amzn.to/2MaRAG9  <https://amzn.to/2MaRAG9>
> YouTube Live Streams: https://www.youtube.com/user/holdenkarau
>

Re: Apache Spark 3.1.2 Release?

Posted by Holden Karau <ho...@pigscanfly.ca>.

+1 and thanks for volunteering to be the RM :)

On Mon, May 17, 2021 at 4:09 PM Takeshi Yamamuro <li...@gmail.com>
wrote:

> Thank you, Dongjoon~ sgtm, too.
>
> On Tue, May 18, 2021 at 7:34 AM Cheng Su <ch...@fb.com.invalid> wrote:
>
>> +1 for a new release, thanks Dongjoon!
>>
>> Cheng Su
>>
>> On 5/17/21, 2:44 PM, "Liang-Chi Hsieh" <vi...@gmail.com> wrote:
>>
>>     +1 sounds good. Thanks Dongjoon for volunteering on this!
>>
>>
>>     Liang-Chi
>>
>>
>>     Dongjoon Hyun-2 wrote
>>     > Hi, All.
>>     >
>>     > Since Apache Spark 3.1.1 tag creation (Feb 21),
>>     > new 172 patches including 9 correctness patches and 4 K8s patches
>> arrived
>>     > at branch-3.1.
>>     >
>>     > Shall we make a new release, Apache Spark 3.1.2, as the second
>> release at
>>     > 3.1 line?
>>     > I'd like to volunteer for the release manager for Apache Spark
>> 3.1.2.
>>     > I'm thinking about starting the first RC next week.
>>     >
>>     > $ git log --oneline v3.1.1..HEAD | wc -l
>>     >      172
>>     >
>>     > # Known correctness issues
>>     > SPARK-34534     New protocol FetchShuffleBlocks in
>> OneForOneBlockFetcher
>>     > lead to data loss or correctness
>>     > SPARK-34545     PySpark Python UDF return inconsistent results when
>>     > applying 2 UDFs with different return type to 2 columns together
>>     > SPARK-34681     Full outer shuffled hash join when building left
>> side
>>     > produces wrong result
>>     > SPARK-34719     fail if the view query has duplicated column names
>>     > SPARK-34794     Nested higher-order functions broken in DSL
>>     > SPARK-34829     transform_values return identical values when it's
>> used
>>     > with udf that returns reference type
>>     > SPARK-34833     Apply right-padding correctly for correlated
>> subqueries
>>     > SPARK-35381     Fix lambda variable name issues in nested DataFrame
>>     > functions in R APIs
>>     > SPARK-35382     Fix lambda variable name issues in nested DataFrame
>>     > functions in Python APIs
>>     >
>>     > # Notable K8s patches since K8s GA
>>     > SPARK-34674    Close SparkContext after the Main method has finished
>>     > SPARK-34948    Add ownerReference to executor configmap to fix
>> leakages
>>     > SPARK-34820    add apt-update before gnupg install
>>     > SPARK-34361    In case of downscaling avoid killing of executors
>> already
>>     > known by the scheduler backend in the pod allocator
>>     >
>>     > Bests,
>>     > Dongjoon.
>>
>>
>>
>>
>>
>>     --
>>     Sent from: http://apache-spark-developers-list.1001551.n3.nabble.com/
>>
>>     ---------------------------------------------------------------------
>>     To unsubscribe e-mail: dev-unsubscribe@spark.apache.org
>>
>>
>>
>
> --
> ---
> Takeshi Yamamuro
>
-- 
Twitter: https://twitter.com/holdenkarau
Books (Learning Spark, High Performance Spark, etc.):
https://amzn.to/2MaRAG9  <https://amzn.to/2MaRAG9>
YouTube Live Streams: https://www.youtube.com/user/holdenkarau

Re: Apache Spark 3.1.2 Release?

Posted by Takeshi Yamamuro <li...@gmail.com>.

Thank you, Dongjoon~ sgtm, too.

On Tue, May 18, 2021 at 7:34 AM Cheng Su <ch...@fb.com.invalid> wrote:

> +1 for a new release, thanks Dongjoon!
>
> Cheng Su
>
> On 5/17/21, 2:44 PM, "Liang-Chi Hsieh" <vi...@gmail.com> wrote:
>
>     +1 sounds good. Thanks Dongjoon for volunteering on this!
>
>
>     Liang-Chi
>
>
>     Dongjoon Hyun-2 wrote
>     > Hi, All.
>     >
>     > Since Apache Spark 3.1.1 tag creation (Feb 21),
>     > new 172 patches including 9 correctness patches and 4 K8s patches
> arrived
>     > at branch-3.1.
>     >
>     > Shall we make a new release, Apache Spark 3.1.2, as the second
> release at
>     > 3.1 line?
>     > I'd like to volunteer for the release manager for Apache Spark 3.1.2.
>     > I'm thinking about starting the first RC next week.
>     >
>     > $ git log --oneline v3.1.1..HEAD | wc -l
>     >      172
>     >
>     > # Known correctness issues
>     > SPARK-34534     New protocol FetchShuffleBlocks in
> OneForOneBlockFetcher
>     > lead to data loss or correctness
>     > SPARK-34545     PySpark Python UDF return inconsistent results when
>     > applying 2 UDFs with different return type to 2 columns together
>     > SPARK-34681     Full outer shuffled hash join when building left side
>     > produces wrong result
>     > SPARK-34719     fail if the view query has duplicated column names
>     > SPARK-34794     Nested higher-order functions broken in DSL
>     > SPARK-34829     transform_values return identical values when it's
> used
>     > with udf that returns reference type
>     > SPARK-34833     Apply right-padding correctly for correlated
> subqueries
>     > SPARK-35381     Fix lambda variable name issues in nested DataFrame
>     > functions in R APIs
>     > SPARK-35382     Fix lambda variable name issues in nested DataFrame
>     > functions in Python APIs
>     >
>     > # Notable K8s patches since K8s GA
>     > SPARK-34674    Close SparkContext after the Main method has finished
>     > SPARK-34948    Add ownerReference to executor configmap to fix
> leakages
>     > SPARK-34820    add apt-update before gnupg install
>     > SPARK-34361    In case of downscaling avoid killing of executors
> already
>     > known by the scheduler backend in the pod allocator
>     >
>     > Bests,
>     > Dongjoon.
>
>
>
>
>
>     --
>     Sent from: http://apache-spark-developers-list.1001551.n3.nabble.com/
>
>     ---------------------------------------------------------------------
>     To unsubscribe e-mail: dev-unsubscribe@spark.apache.org
>
>
>

-- 
---
Takeshi Yamamuro

Re: Apache Spark 3.1.2 Release?

Posted by Cheng Su <ch...@fb.com.INVALID>.

+1 for a new release, thanks Dongjoon!

Cheng Su

On 5/17/21, 2:44 PM, "Liang-Chi Hsieh" <vi...@gmail.com> wrote:

    +1 sounds good. Thanks Dongjoon for volunteering on this!


    Liang-Chi


    Dongjoon Hyun-2 wrote
    > Hi, All.
    > 
    > Since Apache Spark 3.1.1 tag creation (Feb 21),
    > new 172 patches including 9 correctness patches and 4 K8s patches arrived
    > at branch-3.1.
    > 
    > Shall we make a new release, Apache Spark 3.1.2, as the second release at
    > 3.1 line?
    > I'd like to volunteer for the release manager for Apache Spark 3.1.2.
    > I'm thinking about starting the first RC next week.
    > 
    > $ git log --oneline v3.1.1..HEAD | wc -l
    >      172
    > 
    > # Known correctness issues
    > SPARK-34534     New protocol FetchShuffleBlocks in OneForOneBlockFetcher
    > lead to data loss or correctness
    > SPARK-34545     PySpark Python UDF return inconsistent results when
    > applying 2 UDFs with different return type to 2 columns together
    > SPARK-34681     Full outer shuffled hash join when building left side
    > produces wrong result
    > SPARK-34719     fail if the view query has duplicated column names
    > SPARK-34794     Nested higher-order functions broken in DSL
    > SPARK-34829     transform_values return identical values when it's used
    > with udf that returns reference type
    > SPARK-34833     Apply right-padding correctly for correlated subqueries
    > SPARK-35381     Fix lambda variable name issues in nested DataFrame
    > functions in R APIs
    > SPARK-35382     Fix lambda variable name issues in nested DataFrame
    > functions in Python APIs
    > 
    > # Notable K8s patches since K8s GA
    > SPARK-34674    Close SparkContext after the Main method has finished
    > SPARK-34948    Add ownerReference to executor configmap to fix leakages
    > SPARK-34820    add apt-update before gnupg install
    > SPARK-34361    In case of downscaling avoid killing of executors already
    > known by the scheduler backend in the pod allocator
    > 
    > Bests,
    > Dongjoon.





    --
    Sent from: http://apache-spark-developers-list.1001551.n3.nabble.com/ 

    ---------------------------------------------------------------------
    To unsubscribe e-mail: dev-unsubscribe@spark.apache.org

Re: Apache Spark 3.1.2 Release?

Posted by Liang-Chi Hsieh <vi...@gmail.com>.

+1 sounds good. Thanks Dongjoon for volunteering on this!


Liang-Chi


Dongjoon Hyun-2 wrote
> Hi, All.
> 
> Since Apache Spark 3.1.1 tag creation (Feb 21),
> new 172 patches including 9 correctness patches and 4 K8s patches arrived
> at branch-3.1.
> 
> Shall we make a new release, Apache Spark 3.1.2, as the second release at
> 3.1 line?
> I'd like to volunteer for the release manager for Apache Spark 3.1.2.
> I'm thinking about starting the first RC next week.
> 
> $ git log --oneline v3.1.1..HEAD | wc -l
>      172
> 
> # Known correctness issues
> SPARK-34534     New protocol FetchShuffleBlocks in OneForOneBlockFetcher
> lead to data loss or correctness
> SPARK-34545     PySpark Python UDF return inconsistent results when
> applying 2 UDFs with different return type to 2 columns together
> SPARK-34681     Full outer shuffled hash join when building left side
> produces wrong result
> SPARK-34719     fail if the view query has duplicated column names
> SPARK-34794     Nested higher-order functions broken in DSL
> SPARK-34829     transform_values return identical values when it's used
> with udf that returns reference type
> SPARK-34833     Apply right-padding correctly for correlated subqueries
> SPARK-35381     Fix lambda variable name issues in nested DataFrame
> functions in R APIs
> SPARK-35382     Fix lambda variable name issues in nested DataFrame
> functions in Python APIs
> 
> # Notable K8s patches since K8s GA
> SPARK-34674    Close SparkContext after the Main method has finished
> SPARK-34948    Add ownerReference to executor configmap to fix leakages
> SPARK-34820    add apt-update before gnupg install
> SPARK-34361    In case of downscaling avoid killing of executors already
> known by the scheduler backend in the pod allocator
> 
> Bests,
> Dongjoon.





--
Sent from: http://apache-spark-developers-list.1001551.n3.nabble.com/

---------------------------------------------------------------------
To unsubscribe e-mail: dev-unsubscribe@spark.apache.org