Posted to dev@spark.apache.org by Dongjoon Hyun <do...@gmail.com> on 2020/02/03 05:30:57 UTC

[VOTE] Release Apache Spark 2.4.5 (RC2)

Please vote on releasing the following candidate as Apache Spark version
2.4.5.

The vote is open until February 5th, 11 PM PST, and passes if a majority of +1
PMC votes are cast, with a minimum of 3 +1 votes.
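
As a rough illustration of the passing rule above, here is a minimal sketch. It assumes that "majority" means more binding +1 votes than binding -1 votes among PMC members; the function name and parameters are hypothetical and are not part of any Spark tooling.

```python
def vote_passes(binding_plus_ones: int, binding_minus_ones: int,
                minimum_plus_ones: int = 3) -> bool:
    """Return True when binding (PMC) votes satisfy the stated rule.

    Assumption: "majority" is read as strictly more +1 than -1 votes.
    """
    has_minimum = binding_plus_ones >= minimum_plus_ones
    has_majority = binding_plus_ones > binding_minus_ones
    return has_minimum and has_majority


# Example tally: four binding +1 votes and no binding -1 votes.
print(vote_passes(4, 0))
```

Non-binding votes are still welcome as a signal, but under this reading they do not enter the tally.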

[ ] +1 Release this package as Apache Spark 2.4.5
[ ] -1 Do not release this package because ...

To learn more about Apache Spark, please see http://spark.apache.org/

The tag to be voted on is v2.4.5-rc2 (commit
cee4ecbb16917fa85f02c635925e2687400aa56b):
https://github.com/apache/spark/tree/v2.4.5-rc2

The release files, including signatures, digests, etc. can be found at:
https://dist.apache.org/repos/dist/dev/spark/v2.4.5-rc2-bin/

Signatures used for Spark RCs can be found in this file:
https://dist.apache.org/repos/dist/dev/spark/KEYS
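
For anyone checking the digests of a downloaded release file, the following is a small sketch of a SHA-512 comparison. It assumes the published digest has already been copied out of the digest file as a hex string (the exact layout of that file may differ), and it does not replace the GPG signature check against the KEYS file. The function names are illustrative only.

```python
import hashlib
from pathlib import Path


def sha512_hex(path, chunk_size=1 << 20):
    """Compute the SHA-512 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha512()
    with Path(path).open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def digest_matches(path, published_hex):
    """Compare a file against a published hex digest.

    Whitespace and letter case in the published value are ignored, since
    digest files are sometimes wrapped or grouped (an assumption here).
    """
    normalized = "".join(published_hex.split()).lower()
    return sha512_hex(path) == normalized
```

A mismatch means the download is corrupt or is not the file that was staged, and is worth reporting on this thread.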

The staging repository for this release can be found at:
https://repository.apache.org/content/repositories/orgapachespark-1340/

The documentation corresponding to this release can be found at:
https://dist.apache.org/repos/dist/dev/spark/v2.4.5-rc2-docs/

The list of bug fixes going into 2.4.5 can be found at the following URL:
https://issues.apache.org/jira/projects/SPARK/versions/12346042

This release is using the release script of the tag v2.4.5-rc2.

FAQ

=========================
How can I help test this release?
=========================

If you are a Spark user, you can help us test this release by taking
an existing Spark workload, running it on this release candidate, and
reporting any regressions.

If you're working in PySpark, you can set up a virtual env, install
the current RC, and see if anything important breaks. In Java/Scala,
you can add the staging repository to your project's resolvers and test
with the RC (make sure to clean up the artifact cache before/after so
you don't end up building with an out-of-date RC going forward).
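
The PySpark route described above could be scripted roughly as follows. This is only a sketch: the artifact file name in RC_PYSPARK_URL is an assumption (check the listing of the release files directory for the real name), and the helper names are hypothetical.

```python
import subprocess
import sys
import venv
from pathlib import Path

# Assumed artifact name under the RC release directory; verify it before use.
RC_PYSPARK_URL = (
    "https://dist.apache.org/repos/dist/dev/spark/"
    "v2.4.5-rc2-bin/pyspark-2.4.5.tar.gz"
)


def pip_install_command(env_dir, package_url=RC_PYSPARK_URL):
    """Build the pip command that installs the RC into the given virtual env."""
    is_windows = sys.platform == "win32"
    python = (Path(env_dir)
              / ("Scripts" if is_windows else "bin")
              / ("python.exe" if is_windows else "python"))
    return [str(python), "-m", "pip", "install", package_url]


def install_rc(env_dir):
    """Create a fresh virtual env and install the RC into it (needs network)."""
    venv.create(env_dir, with_pip=True)
    subprocess.run(pip_install_command(env_dir), check=True)
```

Calling install_rc("spark-2.4.5-rc2-env") would create the environment and install the candidate. Deleting that directory afterwards keeps the RC from leaking into later work.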

===========================================
What should happen to JIRA tickets still targeting 2.4.5?
===========================================

The current list of open tickets targeted at 2.4.5 can be found by going to
https://issues.apache.org/jira/projects/SPARK and searching for "Target
Version/s" = 2.4.5
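
For convenience, that search can be turned into a direct link. The sketch below builds one; the "Target Version/s" field comes from the text above, while the /jira/issues/ search path and the "resolution = Unresolved" clause are assumptions about how to express "open tickets" in JQL.

```python
from urllib.parse import urlencode

# Assumed JIRA issue-search endpoint on the ASF instance.
JIRA_SEARCH = "https://issues.apache.org/jira/issues/"


def open_tickets_url(version, project="SPARK"):
    """Build a search URL for unresolved tickets targeting a version."""
    jql = (
        f'project = {project} AND "Target Version/s" = {version} '
        "AND resolution = Unresolved"
    )
    return JIRA_SEARCH + "?" + urlencode({"jql": jql})


print(open_tickets_url("2.4.5"))
```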

Committers should look at those and triage. Extremely important bug
fixes, documentation, and API tweaks that impact compatibility should
be worked on immediately. Please retarget everything else to an
appropriate release.

==================
But my bug isn't fixed?
==================

In order to make timely releases, we will typically not hold the
release unless the bug in question is a regression from the previous
release. That being said, if there is a regression that has not been
correctly targeted, please ping me or a committer to help target the
issue.

Re: [VOTE] Release Apache Spark 2.4.5 (RC2)

Posted by Maxim Gekk <ma...@databricks.com>.
+1
I re-ran some of the existing benchmarks in branch-2.4 on Linux/macOS and
haven't found any regressions compared to 2.4.4.

Maxim Gekk


On Tue, Feb 4, 2020 at 11:07 AM Takeshi Yamamuro <li...@gmail.com>
wrote:

> +1;
>  I run the tests with
> `-Pyarn -Phadoop-2.7 -Phive -Phive-thriftserver -Pmesos -Pkubernetes
> -Psparkr`
> on macOS (Java 8).
> All the things look fine in my env.
>
> Bests,
> Takeshi
>
> On Tue, Feb 4, 2020 at 12:35 PM Hyukjin Kwon <gu...@gmail.com> wrote:
>
>> +1 from me too.
>>
>> On Tue, Feb 4, 2020 at 12:26 PM, Wenchen Fan <cl...@gmail.com> wrote:
>>
>>> AFAIK there is no ongoing critical bug fixes, +1
>>>
>>> On Mon, Feb 3, 2020 at 11:46 PM Dongjoon Hyun <do...@gmail.com>
>>> wrote:
>>>
>>>> Yes, it does officially since 2.4.0.
>>>>
>>>> 2.4.5 is a maintenance release of 2.4.x line and the community didn't
>>>> support Hadoop 3.x on 'branch-2.4'. We didn't run test at all.
>>>>
>>>> Bests,
>>>> Dongjoon.
>>>>
>>>> On Sun, Feb 2, 2020 at 22:58 Ajith shetty <aj...@huawei.com>
>>>> wrote:
>>>>
>>>>> Is hadoop-3.1 profile supported for this release.? i see lot of UTs
>>>>> failing under this profile.
>>>>> https://github.com/apache/spark/blob/v2.4.5-rc2/pom.xml
>>>>>
>>>>> *Example:*
>>>>>  [INFO] Running org.apache.spark.sql.hive.JavaMetastoreDataSourcesSuite
>>>>> [ERROR] Tests run: 3, Failures: 0, Errors: 3, Skipped: 0, Time
>>>>> elapsed: 1.717 s <<< FAILURE! - in
>>>>> org.apache.spark.sql.hive.JavaMetastoreDataSourcesSuite
>>>>> [ERROR]
>>>>> saveExternalTableAndQueryIt(org.apache.spark.sql.hive.JavaMetastoreDataSourcesSuite)
>>>>> Time elapsed: 1.675 s  <<< ERROR!
>>>>> java.lang.ExceptionInInitializerError
>>>>> at
>>>>> org.apache.spark.sql.hive.JavaMetastoreDataSourcesSuite.setUp(JavaMetastoreDataSourcesSuite.java:66)
>>>>> Caused by: java.lang.IllegalArgumentException: *Unrecognized Hadoop
>>>>> major version number: 3.1.0*
>>>>> at
>>>>> org.apache.spark.sql.hive.JavaMetastoreDataSourcesSuite.setUp(JavaMetastoreDataSourcesSuite.java:66)
>>>>>
>>>>
>
> --
> ---
> Takeshi Yamamuro
>

Re: [VOTE] Release Apache Spark 2.4.5 (RC2)

Posted by Takeshi Yamamuro <li...@gmail.com>.
+1;
I ran the tests with
`-Pyarn -Phadoop-2.7 -Phive -Phive-thriftserver -Pmesos -Pkubernetes
-Psparkr`
on macOS (Java 8).
Everything looks fine in my env.

Bests,
Takeshi

On Tue, Feb 4, 2020 at 12:35 PM Hyukjin Kwon <gu...@gmail.com> wrote:

> +1 from me too.
>
> On Tue, Feb 4, 2020 at 12:26 PM, Wenchen Fan <cl...@gmail.com> wrote:
>
>> AFAIK there is no ongoing critical bug fixes, +1
>>
>> On Mon, Feb 3, 2020 at 11:46 PM Dongjoon Hyun <do...@gmail.com>
>> wrote:
>>
>>> Yes, it does officially since 2.4.0.
>>>
>>> 2.4.5 is a maintenance release of 2.4.x line and the community didn't
>>> support Hadoop 3.x on 'branch-2.4'. We didn't run test at all.
>>>
>>> Bests,
>>> Dongjoon.
>>>
>>> On Sun, Feb 2, 2020 at 22:58 Ajith shetty <aj...@huawei.com>
>>> wrote:
>>>
>>>> Is hadoop-3.1 profile supported for this release.? i see lot of UTs
>>>> failing under this profile.
>>>> https://github.com/apache/spark/blob/v2.4.5-rc2/pom.xml
>>>>
>>>> *Example:*
>>>>  [INFO] Running org.apache.spark.sql.hive.JavaMetastoreDataSourcesSuite
>>>> [ERROR] Tests run: 3, Failures: 0, Errors: 3, Skipped: 0, Time elapsed:
>>>> 1.717 s <<< FAILURE! - in
>>>> org.apache.spark.sql.hive.JavaMetastoreDataSourcesSuite
>>>> [ERROR]
>>>> saveExternalTableAndQueryIt(org.apache.spark.sql.hive.JavaMetastoreDataSourcesSuite)
>>>> Time elapsed: 1.675 s  <<< ERROR!
>>>> java.lang.ExceptionInInitializerError
>>>> at
>>>> org.apache.spark.sql.hive.JavaMetastoreDataSourcesSuite.setUp(JavaMetastoreDataSourcesSuite.java:66)
>>>> Caused by: java.lang.IllegalArgumentException: *Unrecognized Hadoop
>>>> major version number: 3.1.0*
>>>> at
>>>> org.apache.spark.sql.hive.JavaMetastoreDataSourcesSuite.setUp(JavaMetastoreDataSourcesSuite.java:66)
>>>>
>>>

-- 
---
Takeshi Yamamuro

Re: [VOTE] Release Apache Spark 2.4.5 (RC2)

Posted by Hyukjin Kwon <gu...@gmail.com>.
+1 from me too.

On Tue, Feb 4, 2020 at 12:26 PM, Wenchen Fan <cl...@gmail.com> wrote:

> AFAIK there is no ongoing critical bug fixes, +1
>
> On Mon, Feb 3, 2020 at 11:46 PM Dongjoon Hyun <do...@gmail.com>
> wrote:
>
>> Yes, it does officially since 2.4.0.
>>
>> 2.4.5 is a maintenance release of 2.4.x line and the community didn't
>> support Hadoop 3.x on 'branch-2.4'. We didn't run test at all.
>>
>> Bests,
>> Dongjoon.
>>
>> On Sun, Feb 2, 2020 at 22:58 Ajith shetty <aj...@huawei.com>
>> wrote:
>>
>>> Is hadoop-3.1 profile supported for this release.? i see lot of UTs
>>> failing under this profile.
>>> https://github.com/apache/spark/blob/v2.4.5-rc2/pom.xml
>>>
>>> *Example:*
>>>  [INFO] Running org.apache.spark.sql.hive.JavaMetastoreDataSourcesSuite
>>> [ERROR] Tests run: 3, Failures: 0, Errors: 3, Skipped: 0, Time elapsed:
>>> 1.717 s <<< FAILURE! - in
>>> org.apache.spark.sql.hive.JavaMetastoreDataSourcesSuite
>>> [ERROR]
>>> saveExternalTableAndQueryIt(org.apache.spark.sql.hive.JavaMetastoreDataSourcesSuite)
>>> Time elapsed: 1.675 s  <<< ERROR!
>>> java.lang.ExceptionInInitializerError
>>> at
>>> org.apache.spark.sql.hive.JavaMetastoreDataSourcesSuite.setUp(JavaMetastoreDataSourcesSuite.java:66)
>>> Caused by: java.lang.IllegalArgumentException: *Unrecognized Hadoop
>>> major version number: 3.1.0*
>>> at
>>> org.apache.spark.sql.hive.JavaMetastoreDataSourcesSuite.setUp(JavaMetastoreDataSourcesSuite.java:66)
>>>
>>

Re: [VOTE] Release Apache Spark 2.4.5 (RC2)

Posted by Wenchen Fan <cl...@gmail.com>.
AFAIK there are no ongoing critical bug fixes, +1

On Mon, Feb 3, 2020 at 11:46 PM Dongjoon Hyun <do...@gmail.com>
wrote:

> Yes, it does officially since 2.4.0.
>
> 2.4.5 is a maintenance release of 2.4.x line and the community didn't
> support Hadoop 3.x on 'branch-2.4'. We didn't run test at all.
>
> Bests,
> Dongjoon.
>
> On Sun, Feb 2, 2020 at 22:58 Ajith shetty <aj...@huawei.com> wrote:
>
>> Is hadoop-3.1 profile supported for this release.? i see lot of UTs
>> failing under this profile.
>> https://github.com/apache/spark/blob/v2.4.5-rc2/pom.xml
>>
>> *Example:*
>>  [INFO] Running org.apache.spark.sql.hive.JavaMetastoreDataSourcesSuite
>> [ERROR] Tests run: 3, Failures: 0, Errors: 3, Skipped: 0, Time elapsed:
>> 1.717 s <<< FAILURE! - in
>> org.apache.spark.sql.hive.JavaMetastoreDataSourcesSuite
>> [ERROR]
>> saveExternalTableAndQueryIt(org.apache.spark.sql.hive.JavaMetastoreDataSourcesSuite)
>> Time elapsed: 1.675 s  <<< ERROR!
>> java.lang.ExceptionInInitializerError
>> at
>> org.apache.spark.sql.hive.JavaMetastoreDataSourcesSuite.setUp(JavaMetastoreDataSourcesSuite.java:66)
>> Caused by: java.lang.IllegalArgumentException: *Unrecognized Hadoop
>> major version number: 3.1.0*
>> at
>> org.apache.spark.sql.hive.JavaMetastoreDataSourcesSuite.setUp(JavaMetastoreDataSourcesSuite.java:66)
>>
>

Re: [VOTE] Release Apache Spark 2.4.5 (RC2)

Posted by Dongjoon Hyun <do...@gmail.com>.
Yes, it does officially since 2.4.0.

2.4.5 is a maintenance release of the 2.4.x line, and the community didn't
support Hadoop 3.x on 'branch-2.4'. We didn't run the tests at all.

Bests,
Dongjoon.

On Sun, Feb 2, 2020 at 22:58 Ajith shetty <aj...@huawei.com> wrote:

> Is hadoop-3.1 profile supported for this release.? i see lot of UTs
> failing under this profile.
> https://github.com/apache/spark/blob/v2.4.5-rc2/pom.xml
>
> *Example:*
>  [INFO] Running org.apache.spark.sql.hive.JavaMetastoreDataSourcesSuite
> [ERROR] Tests run: 3, Failures: 0, Errors: 3, Skipped: 0, Time elapsed:
> 1.717 s <<< FAILURE! - in
> org.apache.spark.sql.hive.JavaMetastoreDataSourcesSuite
> [ERROR]
> saveExternalTableAndQueryIt(org.apache.spark.sql.hive.JavaMetastoreDataSourcesSuite)
> Time elapsed: 1.675 s  <<< ERROR!
> java.lang.ExceptionInInitializerError
> at
> org.apache.spark.sql.hive.JavaMetastoreDataSourcesSuite.setUp(JavaMetastoreDataSourcesSuite.java:66)
> Caused by: java.lang.IllegalArgumentException: *Unrecognized Hadoop major
> version number: 3.1.0*
> at
> org.apache.spark.sql.hive.JavaMetastoreDataSourcesSuite.setUp(JavaMetastoreDataSourcesSuite.java:66)
>

RE: [VOTE] Release Apache Spark 2.4.5 (RC2)

Posted by Ajith shetty <aj...@huawei.com>.
Is the hadoop-3.1 profile supported for this release? I see a lot of UTs failing under this profile.
https://github.com/apache/spark/blob/v2.4.5-rc2/pom.xml

Example:
 [INFO] Running org.apache.spark.sql.hive.JavaMetastoreDataSourcesSuite
[ERROR] Tests run: 3, Failures: 0, Errors: 3, Skipped: 0, Time elapsed: 1.717 s <<< FAILURE! - in org.apache.spark.sql.hive.JavaMetastoreDataSourcesSuite
[ERROR] saveExternalTableAndQueryIt(org.apache.spark.sql.hive.JavaMetastoreDataSourcesSuite)  Time elapsed: 1.675 s  <<< ERROR!
java.lang.ExceptionInInitializerError
at org.apache.spark.sql.hive.JavaMetastoreDataSourcesSuite.setUp(JavaMetastoreDataSourcesSuite.java:66)
Caused by: java.lang.IllegalArgumentException: Unrecognized Hadoop major version number: 3.1.0
at org.apache.spark.sql.hive.JavaMetastoreDataSourcesSuite.setUp(JavaMetastoreDataSourcesSuite.java:66)

Re: [VOTE] Release Apache Spark 2.4.5 (RC2)

Posted by Dongjoon Hyun <do...@gmail.com>.
I'll start with my +1.

Today, I verified the artifacts with GPG, and built and tested RC2 with the
following.

  - Profile: -Pyarn -Phadoop-2.7 -Pkubernetes -Pkinesis-asl -Phive -Phive-thriftserver
  - OS: CentOS (7.5.1804)
  - Java: OpenJDK 1.8.0_242
     * All Scala/Java UTs and JDBC IT passed.
  - Python 2.7.17 (with numpy 1.16.4, scipy 1.2.2, pandas 0.19.2, pyarrow 0.8.0)
     * All PySpark UTs passed.
  - Python 3.7.6 (with numpy 1.16.4, scipy 1.2.2, pandas 0.23.2, pyarrow 0.11.0)
     * All PySpark UTs passed.
  - Tested with Amazon EKS
     Client Version: v1.17.2
     Server Version: v1.14.9-eks-c0eccc

Bests,
Dongjoon.



Re: [VOTE] Release Apache Spark 2.4.5 (RC2)

Posted by Sean Owen <sr...@apache.org>.
+1 from me too. Same outcome as in RC1 for me.

