You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@livy.apache.org by larry mccay <la...@gmail.com> on 2022/10/14 18:40:01 UTC
Re: Proposal to Revive Apache Livy Community
+dev@livy.incubator.apache.org <de...@livy.incubator.apache.org>
On Fri, Oct 14, 2022 at 2:38 PM larry mccay <la...@gmail.com> wrote:
> Abstract
>
> Livy is a web service that exposes a REST interface for managing long
> running Apache Spark contexts in your cluster. With Livy, new applications
> can be built on top of Apache Spark that require fine grained interaction
> with many Spark contexts [1].
>
> While this project has been well regarded and used in many contexts as the
> defacto standard API to Spark environments, it has been incubating for over
> 5 years without graduation to a TLP and it has become difficult to
> impossible for fixes and improvements to be contributed as the current
> community seems to have moved on.
>
> There has been discussion regarding retirement of this podling where there
> seems to be some increasing interest in joining and reviving the community
> [2].
>
> The intent of this proposal is to avoid retiring a well regarded, actively
> used and rather mature project by reviving the PPMC and community with new
> folks that have a vested interest in the project and health of the
> community.
> Proposal
>
> We propose to revive the PPMC with a set of contributors and maintainers
> as mentors, PPMC members and committers.
>
> The retirement DISCUSS thread [2] has shown a growing interest in
> providing new committers and bringing improvements and fixes from
> organization’s internally maintained forks back to a revived community.
>
> General Approach to Revival:
>
> -
>
> Add new Mentors
> -
>
> Larry McCay, lmccay@apache.org , Cloudera
> -
>
> Sunil Govindan, sunilg@apache.org, Cloudera
> -
>
> Imran Rashid - irashid@apache.org, Cloudera
>
>
>
> -
>
> Add new Committers/PPMC
> -
>
> Larry McCay, lmccay@apache.org, Cloudera
> -
>
> Vinod Kumar Vavilapalli, vinodkv@cloudera.com, Cloudera
> -
>
> Gyorgy Gal, ggal ,gal.gyorgy@gmail.com, Cloudera
> -
>
> Wing Yew Poon, wypoon@cloudera.com, Cloudera
> -
>
> Xilang Yan, xilang.yan@gmail.com, Shopee
> -
>
> Jianzhen Wu, myjianzhen@gmail.com, Shopee
> -
>
> Nagella Jagadeewara Rao, jnagella@visa.com, Visa
> -
>
> Pralab Kumar, pralkuma@visa.com, Visa
> -
>
> Prasad Shrikant, shrikant.ism@gmail.com, Visa
> -
>
> Brahma Reddy Battula, brahma@apache.org, Visa
>
>
>
> -
>
> Invite existing PPMC members to opt-in or otherwise go emeritus
> -
>
> Jean-Baptiste Onofré, jbonofre@apache.org, Talend (opted-in via
> Retirement DISCUSS thread [2])
>
>
>
> -
>
> Invite existing Committers/PPMC members to opt-in or otherwise go
> emeritus
>
>
>
> -
>
> Establish Roadmap via follow up DISCUSS thread
> -
>
> Known Improvements from Forks which will need proposals and
> discussion:
> -
>
> Adding HA for Livy
> -
>
> Updating security capabilities (eg. kerberos for jdbc, fixing
> bugs in encryption)
> -
>
> Expanding the support for kubernetes
> -
>
> Responding to CVEs in dependencies (eg. log4j, thrift)
> -
>
> Livy rest cluster - IS THIS SAME AS HA for Livy ABOVE?
> -
>
> Support multi Spark versions
> -
>
> Implemented a metrics system for Livy
> -
>
> Support customize batch/interactive session lifecycle event
> handler, default log event with log4j, very helpful for trouble shooting
> -
>
> Optimize log to track which session id the log message came
> from, also very helpful for trouble shooting
> -
>
> Support customize Spark config optimization rules, can be used
> to optimize config for users’ job
> -
>
> A set of command line tool which can be used to replace Spark’s
> spark-submit, pyspark, spark-sql but actually submit application in Livy
> -
>
> We are planning to implement a JDBC state store, and allow multi
> Livy Thrift sessions to share one backend Spark application in the next few
> months.
> -
>
> These items and others that are brought to community may need
> consolidation or multiple configurable options and will need to be part of
> the discussion
> -
>
> One-pager Livy Improvement Proposals (LIP) may make sense to
> drive these discussions and convergence
> -
>
> Feature Branch Strategy for large changes
> -
>
> Large features are hard to review we will need to define a
> strategy
> -
>
> Determine the Improvements to be delivered across first 3 Releases
> with Target Release Dates
>
>
>
> -
>
> Ensure CVE and Dependency management hygiene is in place
>
>
> The above approach will usher the community back to an active status with
> a Roadmap of 3 or more release plans and security hygiene in place.
> Development Practices
>
> The Livy project follows a review before commit philosophy. Every commit
> automatically runs through the unit tests and generates coverage reports
> presented as a pull request comment. Our experience with this process leads
> us to believe that it helps ease new contributors into the project. They
> get feedback quickly on common mistakes, lowering the burden on reviewers.
> Those same reviewers get to lead by example, showing the new contributors
> that we value feedback within our community even when changes are done by
> more experienced folks. Taken from the original Apache Livy Proposal [1],
> this should continue to be true. As mentioned, Livy is a mature project and
> as such RTC is the most appropriate for continued quality and awareness.
>
>
>
> 1.
>
> Original Apache Livy Proposal
> https://cwiki.apache.org/confluence/display/incubator/LivyProposal
> 2.
>
> Retirement DISCUSS thread
> https://lists.apache.org/thread/gcstsrhbp91c5mm55htqn1l3djv8m7o0
>
>