You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hudi.apache.org by Vinoth Chandar <vi...@apache.org> on 2020/11/06 17:31:53 UTC

Fwd: [PMCs] Ramping up for Google Summer of Code 2021: invitation to participate

Hi all,

Any one interested in putting up some projects?

Thanks
Vinoth

---------- Forwarded message ---------
From: Sally Khudairi <sk...@apache.org>
Date: Mon, Nov 2, 2020 at 7:52 PM
Subject: [PMCs] Ramping up for Google Summer of Code 2021: invitation to
participate
To: ASF Marketing & Publicity <pr...@apache.org>


Hello PMCs --I hope you are all well.

ASF Community Development (ComDev) oversees our participation in Google
Summer of Code, for which the ASF has been a mentoring organization since
the program's inception 17 years ago.

ComDev is seeking individuals and PMCs interested in participating as
mentors on behalf of the ASF and Apache Projects.

The planning and preparation process begins now. The ComDev team are
collecting ideas for the Apache Project's participation in GSoC and want to
hear from you.

Get started by reviewing the program guidelines at
http://community.apache.org/gsoc.html and be sure to engage your
communities to get involved as well. Ping the ASF's GSoC team at
dev@community.apache.org with any questions.

Good luck and have a great program!

Best,
Sally

- - -
Vice President Marketing & Publicity
Vice President Sponsor Relations
The Apache Software Foundation

Tel +1 617 921 8656 | sk@apache.org

Re: [PMCs] Ramping up for Google Summer of Code 2021: invitation to participate

Posted by Vinoth Chandar <vi...@apache.org>.
Tickets look good. Could we also add descriptions with the goal of each
ticket!

On Fri, Nov 20, 2020 at 8:19 PM Sivabalan <n....@gmail.com> wrote:

> I have added some more tickets. Feel free to check it out.
>
>
> On Wed, Nov 11, 2020 at 7:46 PM Vinoth Chandar <vi...@apache.org> wrote:
>
> > For some reason, that link did not work.
> >
> > https://issues.apache.org/jira/issues/?jql=labels%20%3D%20gsoc2021
> >
> > This list looks promising to me. We can wait a bit for others' feedback
> and
> > go ahead and finalize?
> >
> > On Tue, Nov 10, 2020 at 1:01 PM Raymond Xu <xu...@gmail.com>
> > wrote:
> >
> > > Ok updated the list (more descriptions to be added later for those
> tasks,
> > > once we finalize it)
> > >
> > >
> > >
> >
> https://issues.apache.org/jira/browse/HUDI-1290?jql=labels%20%3D%20gsoc2021%20and%20project%20%3D%20HUDI
> > >
> > > On Tue, Nov 10, 2020 at 11:33 AM Vinoth Chandar <vi...@apache.org>
> > wrote:
> > >
> > > > Sounds good. I can make a pass as well, once you have the trimmed
> list.
> > > >
> > > > Thanks Raymond!
> > > >
> > > > On Mon, Nov 9, 2020 at 7:47 PM Raymond Xu <
> xu.shiyan.raymond@gmail.com
> > >
> > > > wrote:
> > > >
> > > > > yes, agreed to remove the refactoring tasks and reduce to a small
> > > number
> > > > of
> > > > > umbrella tasks and polish them for the program.
> > > > >
> > > > > On Mon, Nov 9, 2020 at 7:39 PM Vinoth Chandar <vi...@apache.org>
> > > wrote:
> > > > >
> > > > > > Good discussion!
> > > > > >
> > > > > > Given this is going to be almost like a summer internship, I
> > suggest
> > > we
> > > > > > limit ourselves to high quality, independent projects.
> > > > > > We have tons of ideas, but the need to ensure that we won't be
> > > picking
> > > > > them
> > > > > > up ourselves, is what makes this tricky.
> > > > > >
> > > > > > Here are some ideas top of my head. (maybe we can use this thread
> > to
> > > > > > collect ideas first)
> > > > > > Most of these are experimental.
> > > > > >
> > > > > > - Schema inference library, that infer a schema from vast
> > quantities
> > > of
> > > > > > unstructured data and help us bootstrap that into Hudi
> > > > > > - Survey indexing techniques, and implement a subset that can
> speed
> > > up
> > > > > > query performance (e.g bitmaps, tree indexes)
> > > > > > - Apache Beam integration (there is a JIRA for this) with a Hudi
> IO
> > > > > module.
> > > > > > - Apache Calcite implementation for querying Hudi datasets (we
> can
> > > pick
> > > > > any
> > > > > > other popular engine also here)
> > > > > > - Apache Pulsar/Kinensis source in Delta Streamer
> > > > > >
> > > > > > Raymond, the current labels have a bunch of refactoring/tasks
> also
> > > > > tagged.
> > > > > > If you also agree, can we untag and only put up say 5 or so,
> bigger
> > > > > > projects?
> > > > > >
> > > > > > Things around refactoring etc, for e.g, would probably get done
> > > before
> > > > > > summer.
> > > > > >
> > > > > > Thanks
> > > > > > Vinoth
> > > > > >
> > > > > >
> > > > > >
> > > > > >
> > > > > >
> > > > > >
> > > > > >
> > > > > >
> > > > > >
> > > > > > On Mon, Nov 9, 2020 at 4:58 PM Raymond Xu <
> > > xu.shiyan.raymond@gmail.com
> > > > >
> > > > > > wrote:
> > > > > >
> > > > > > > Hi Siva and all,
> > > > > > >
> > > > > > > On students' commitments
> > > > > > >
> > > > > > > During the 12 weeks of coding time, nothing should take
> > precedence
> > > > over
> > > > > > > > your project, and you should have no major distractions.
> > > > > > >
> > > > > > >
> > > > > > > Quote from this page
> > > > > > > <
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> https://opensource.googleblog.com/2011/03/dos-and-donts-of-google-summer-of-code.html
> > > > > > > >,
> > > > > > > the participating students will be working on the project as a
> > > > > full-time
> > > > > > > job for 12 weeks. So roughly we could expect 40 hours work per
> > > week.
> > > > > > >
> > > > > > > Also agree on your points of making the experience meaningful.
> I
> > > > > suppose
> > > > > > at
> > > > > > > this stage we are still collecting all sorts of potential
> tasks,
> > > > which
> > > > > > can
> > > > > > > be filtered later.
> > > > > > >
> > > > > > > I have started looking for issues in the backlog and labeled
> > some.
> > > > > Please
> > > > > > > check out this JIRA filter
> > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> https://issues.apache.org/jira/browse/HUDI-1290?jql=labels%20%3D%20gsoc2021%20and%20project%20%3D%20HUDI
> > > > > > >
> > > > > > > Note: those issues' descriptions are to be edited with more
> > > > > introduction
> > > > > > to
> > > > > > > be more newcomer-friendly. And there can be some new tasks
> > created,
> > > > > too.
> > > > > > >
> > > > > > > Any feedback on those labelled tasks? Also, anyone wants to
> bring
> > > in
> > > > > more
> > > > > > > ideas or tasks for this program?
> > > > > > > Please feel free to post JIRA issue links here so we can
> > > consolidate
> > > > > all
> > > > > > > and groom later.
> > > > > > >
> > > > > > >
> > > > > > > Thank you.
> > > > > > >
> > > > > > > Regards,
> > > > > > > Raymond
> > > > > > >
> > > > > > > On Fri, Nov 6, 2020 at 10:16 AM Sivabalan <n....@gmail.com>
> > > > wrote:
> > > > > > >
> > > > > > > > Sorry, just another point to remember is that, this might
> > happen
> > > by
> > > > > > June,
> > > > > > > > July, aug of 2021. So the proposal assumes that the community
> > may
> > > > not
> > > > > > > work
> > > > > > > > on these until then.
> > > > > > > >
> > > > > > > > On Fri, Nov 6, 2020 at 1:03 PM Sivabalan <n.siva.b@gmail.com
> >
> > > > wrote:
> > > > > > > >
> > > > > > > > > I am also interested and still trying to read more on what
> > kind
> > > > of
> > > > > > > > > projects we can propose(execution, design, documentation,
> > > > > > > > usability/tools,
> > > > > > > > > performance framework etc), how much efforts we can expect
> > from
> > > > > > > > > students (is it 10 hours per week or 20 hours per week,
> etc).
> > > One
> > > > > > thing
> > > > > > > > we
> > > > > > > > > should be mindful is that, we should try our best to think
> > how
> > > > best
> > > > > > we
> > > > > > > > can
> > > > > > > > > help students and ensure they get something meaningful out
> of
> > > > > working
> > > > > > > > with
> > > > > > > > > us and get a good sense of how open source projects work,
> > code
> > > > > > quality
> > > > > > > we
> > > > > > > > > expect etc. And not give some assorted 10 different tasks
> for
> > > > them
> > > > > to
> > > > > > > > > complete. We should try to have standalone projects or
> > cohesive
> > > > > work
> > > > > > > > items
> > > > > > > > > (like devX may be).
> > > > > > > > >
> > > > > > > > >
> > > > > > > > > On Fri, Nov 6, 2020 at 12:50 PM Raymond Xu <
> > > > > > > xu.shiyan.raymond@gmail.com>
> > > > > > > > > wrote:
> > > > > > > > >
> > > > > > > > >> I'm interested in being a mentor and would like to create
> > and
> > > > > submit
> > > > > > > > some
> > > > > > > > >> issues. (Sorry wanted to raise this earlier)
> > > > > > > > >>
> > > > > > > > >> To recap it for all:
> > > > > > > > >> There will be college students applying and, once
> accepted,
> > > > > working
> > > > > > on
> > > > > > > > >> some
> > > > > > > > >> JIRA issues of Apache projects in summer 2021. We are to
> > > create
> > > > > > issues
> > > > > > > > and
> > > > > > > > >> label them for this program. Those tasks will be assigned
> to
> > > > > > > > participants
> > > > > > > > >> and worked on around June 2021.
> > > > > > > > >>
> > > > > > > > >> To list some of the possible areas at high level
> > > > > > > > >> - DevX related: code style fix and alignment, nightly
> build
> > > > setup,
> > > > > > > > config
> > > > > > > > >> docs auto-generation
> > > > > > > > >> - New features: new indexing schemes, SQL querying of
> > metadata
> > > > > > > > >> - Utilities improvements: new delta streamer sources, a
> UI,
> > > > > > > integrations
> > > > > > > > >> with other system e.g Airflow operator/sensor to trigger
> > > > pipelines
> > > > > > > based
> > > > > > > > >> on
> > > > > > > > >> Hudi commits
> > > > > > > > >>
> > > > > > > > >>
> > > > > > > > >> On Fri, Nov 6, 2020 at 9:32 AM Vinoth Chandar <
> > > > vinoth@apache.org>
> > > > > > > > wrote:
> > > > > > > > >>
> > > > > > > > >> > Hi all,
> > > > > > > > >> >
> > > > > > > > >> > Any one interested in putting up some projects?
> > > > > > > > >> >
> > > > > > > > >> > Thanks
> > > > > > > > >> > Vinoth
> > > > > > > > >> >
> > > > > > > > >> > ---------- Forwarded message ---------
> > > > > > > > >> > From: Sally Khudairi <sk...@apache.org>
> > > > > > > > >> > Date: Mon, Nov 2, 2020 at 7:52 PM
> > > > > > > > >> > Subject: [PMCs] Ramping up for Google Summer of Code
> 2021:
> > > > > > > invitation
> > > > > > > > to
> > > > > > > > >> > participate
> > > > > > > > >> > To: ASF Marketing & Publicity <pr...@apache.org>
> > > > > > > > >> >
> > > > > > > > >> >
> > > > > > > > >> > Hello PMCs --I hope you are all well.
> > > > > > > > >> >
> > > > > > > > >> > ASF Community Development (ComDev) oversees our
> > > participation
> > > > in
> > > > > > > > Google
> > > > > > > > >> > Summer of Code, for which the ASF has been a mentoring
> > > > > > organization
> > > > > > > > >> since
> > > > > > > > >> > the program's inception 17 years ago.
> > > > > > > > >> >
> > > > > > > > >> > ComDev is seeking individuals and PMCs interested in
> > > > > participating
> > > > > > > as
> > > > > > > > >> > mentors on behalf of the ASF and Apache Projects.
> > > > > > > > >> >
> > > > > > > > >> > The planning and preparation process begins now. The
> > ComDev
> > > > team
> > > > > > are
> > > > > > > > >> > collecting ideas for the Apache Project's participation
> in
> > > > GSoC
> > > > > > and
> > > > > > > > >> want to
> > > > > > > > >> > hear from you.
> > > > > > > > >> >
> > > > > > > > >> > Get started by reviewing the program guidelines at
> > > > > > > > >> > http://community.apache.org/gsoc.html and be sure to
> > engage
> > > > > your
> > > > > > > > >> > communities to get involved as well. Ping the ASF's GSoC
> > > team
> > > > at
> > > > > > > > >> > dev@community.apache.org with any questions.
> > > > > > > > >> >
> > > > > > > > >> > Good luck and have a great program!
> > > > > > > > >> >
> > > > > > > > >> > Best,
> > > > > > > > >> > Sally
> > > > > > > > >> >
> > > > > > > > >> > - - -
> > > > > > > > >> > Vice President Marketing & Publicity
> > > > > > > > >> > Vice President Sponsor Relations
> > > > > > > > >> > The Apache Software Foundation
> > > > > > > > >> >
> > > > > > > > >> > Tel +1 617 921 8656 | sk@apache.org
> > > > > > > > >> >
> > > > > > > > >>
> > > > > > > > >
> > > > > > > > >
> > > > > > > > > --
> > > > > > > > > Regards,
> > > > > > > > > -Sivabalan
> > > > > > > > >
> > > > > > > > --
> > > > > > > > Regards,
> > > > > > > > -Sivabalan
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
>
>
> --
> Regards,
> -Sivabalan
>

Re: [PMCs] Ramping up for Google Summer of Code 2021: invitation to participate

Posted by Sivabalan <n....@gmail.com>.
I have added some more tickets. Feel free to check it out.


On Wed, Nov 11, 2020 at 7:46 PM Vinoth Chandar <vi...@apache.org> wrote:

> For some reason, that link did not work.
>
> https://issues.apache.org/jira/issues/?jql=labels%20%3D%20gsoc2021
>
> This list looks promising to me. We can wait a bit for others' feedback and
> go ahead and finalize?
>
> On Tue, Nov 10, 2020 at 1:01 PM Raymond Xu <xu...@gmail.com>
> wrote:
>
> > Ok updated the list (more descriptions to be added later for those tasks,
> > once we finalize it)
> >
> >
> >
> https://issues.apache.org/jira/browse/HUDI-1290?jql=labels%20%3D%20gsoc2021%20and%20project%20%3D%20HUDI
> >
> > On Tue, Nov 10, 2020 at 11:33 AM Vinoth Chandar <vi...@apache.org>
> wrote:
> >
> > > Sounds good. I can make a pass as well, once you have the trimmed list.
> > >
> > > Thanks Raymond!
> > >
> > > On Mon, Nov 9, 2020 at 7:47 PM Raymond Xu <xu.shiyan.raymond@gmail.com
> >
> > > wrote:
> > >
> > > > yes, agreed to remove the refactoring tasks and reduce to a small
> > number
> > > of
> > > > umbrella tasks and polish them for the program.
> > > >
> > > > On Mon, Nov 9, 2020 at 7:39 PM Vinoth Chandar <vi...@apache.org>
> > wrote:
> > > >
> > > > > Good discussion!
> > > > >
> > > > > Given this is going to be almost like a summer internship, I
> suggest
> > we
> > > > > limit ourselves to high quality, independent projects.
> > > > > We have tons of ideas, but the need to ensure that we won't be
> > picking
> > > > them
> > > > > up ourselves, is what makes this tricky.
> > > > >
> > > > > Here are some ideas top of my head. (maybe we can use this thread
> to
> > > > > collect ideas first)
> > > > > Most of these are experimental.
> > > > >
> > > > > - Schema inference library, that infer a schema from vast
> quantities
> > of
> > > > > unstructured data and help us bootstrap that into Hudi
> > > > > - Survey indexing techniques, and implement a subset that can speed
> > up
> > > > > query performance (e.g bitmaps, tree indexes)
> > > > > - Apache Beam integration (there is a JIRA for this) with a Hudi IO
> > > > module.
> > > > > - Apache Calcite implementation for querying Hudi datasets (we can
> > pick
> > > > any
> > > > > other popular engine also here)
> > > > > - Apache Pulsar/Kinensis source in Delta Streamer
> > > > >
> > > > > Raymond, the current labels have a bunch of refactoring/tasks also
> > > > tagged.
> > > > > If you also agree, can we untag and only put up say 5 or so, bigger
> > > > > projects?
> > > > >
> > > > > Things around refactoring etc, for e.g, would probably get done
> > before
> > > > > summer.
> > > > >
> > > > > Thanks
> > > > > Vinoth
> > > > >
> > > > >
> > > > >
> > > > >
> > > > >
> > > > >
> > > > >
> > > > >
> > > > >
> > > > > On Mon, Nov 9, 2020 at 4:58 PM Raymond Xu <
> > xu.shiyan.raymond@gmail.com
> > > >
> > > > > wrote:
> > > > >
> > > > > > Hi Siva and all,
> > > > > >
> > > > > > On students' commitments
> > > > > >
> > > > > > During the 12 weeks of coding time, nothing should take
> precedence
> > > over
> > > > > > > your project, and you should have no major distractions.
> > > > > >
> > > > > >
> > > > > > Quote from this page
> > > > > > <
> > > > > >
> > > > >
> > > >
> > >
> >
> https://opensource.googleblog.com/2011/03/dos-and-donts-of-google-summer-of-code.html
> > > > > > >,
> > > > > > the participating students will be working on the project as a
> > > > full-time
> > > > > > job for 12 weeks. So roughly we could expect 40 hours work per
> > week.
> > > > > >
> > > > > > Also agree on your points of making the experience meaningful. I
> > > > suppose
> > > > > at
> > > > > > this stage we are still collecting all sorts of potential tasks,
> > > which
> > > > > can
> > > > > > be filtered later.
> > > > > >
> > > > > > I have started looking for issues in the backlog and labeled
> some.
> > > > Please
> > > > > > check out this JIRA filter
> > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> https://issues.apache.org/jira/browse/HUDI-1290?jql=labels%20%3D%20gsoc2021%20and%20project%20%3D%20HUDI
> > > > > >
> > > > > > Note: those issues' descriptions are to be edited with more
> > > > introduction
> > > > > to
> > > > > > be more newcomer-friendly. And there can be some new tasks
> created,
> > > > too.
> > > > > >
> > > > > > Any feedback on those labelled tasks? Also, anyone wants to bring
> > in
> > > > more
> > > > > > ideas or tasks for this program?
> > > > > > Please feel free to post JIRA issue links here so we can
> > consolidate
> > > > all
> > > > > > and groom later.
> > > > > >
> > > > > >
> > > > > > Thank you.
> > > > > >
> > > > > > Regards,
> > > > > > Raymond
> > > > > >
> > > > > > On Fri, Nov 6, 2020 at 10:16 AM Sivabalan <n....@gmail.com>
> > > wrote:
> > > > > >
> > > > > > > Sorry, just another point to remember is that, this might
> happen
> > by
> > > > > June,
> > > > > > > July, aug of 2021. So the proposal assumes that the community
> may
> > > not
> > > > > > work
> > > > > > > on these until then.
> > > > > > >
> > > > > > > On Fri, Nov 6, 2020 at 1:03 PM Sivabalan <n....@gmail.com>
> > > wrote:
> > > > > > >
> > > > > > > > I am also interested and still trying to read more on what
> kind
> > > of
> > > > > > > > projects we can propose(execution, design, documentation,
> > > > > > > usability/tools,
> > > > > > > > performance framework etc), how much efforts we can expect
> from
> > > > > > > > students (is it 10 hours per week or 20 hours per week, etc).
> > One
> > > > > thing
> > > > > > > we
> > > > > > > > should be mindful is that, we should try our best to think
> how
> > > best
> > > > > we
> > > > > > > can
> > > > > > > > help students and ensure they get something meaningful out of
> > > > working
> > > > > > > with
> > > > > > > > us and get a good sense of how open source projects work,
> code
> > > > > quality
> > > > > > we
> > > > > > > > expect etc. And not give some assorted 10 different tasks for
> > > them
> > > > to
> > > > > > > > complete. We should try to have standalone projects or
> cohesive
> > > > work
> > > > > > > items
> > > > > > > > (like devX may be).
> > > > > > > >
> > > > > > > >
> > > > > > > > On Fri, Nov 6, 2020 at 12:50 PM Raymond Xu <
> > > > > > xu.shiyan.raymond@gmail.com>
> > > > > > > > wrote:
> > > > > > > >
> > > > > > > >> I'm interested in being a mentor and would like to create
> and
> > > > submit
> > > > > > > some
> > > > > > > >> issues. (Sorry wanted to raise this earlier)
> > > > > > > >>
> > > > > > > >> To recap it for all:
> > > > > > > >> There will be college students applying and, once accepted,
> > > > working
> > > > > on
> > > > > > > >> some
> > > > > > > >> JIRA issues of Apache projects in summer 2021. We are to
> > create
> > > > > issues
> > > > > > > and
> > > > > > > >> label them for this program. Those tasks will be assigned to
> > > > > > > participants
> > > > > > > >> and worked on around June 2021.
> > > > > > > >>
> > > > > > > >> To list some of the possible areas at high level
> > > > > > > >> - DevX related: code style fix and alignment, nightly build
> > > setup,
> > > > > > > config
> > > > > > > >> docs auto-generation
> > > > > > > >> - New features: new indexing schemes, SQL querying of
> metadata
> > > > > > > >> - Utilities improvements: new delta streamer sources, a UI,
> > > > > > integrations
> > > > > > > >> with other system e.g Airflow operator/sensor to trigger
> > > pipelines
> > > > > > based
> > > > > > > >> on
> > > > > > > >> Hudi commits
> > > > > > > >>
> > > > > > > >>
> > > > > > > >> On Fri, Nov 6, 2020 at 9:32 AM Vinoth Chandar <
> > > vinoth@apache.org>
> > > > > > > wrote:
> > > > > > > >>
> > > > > > > >> > Hi all,
> > > > > > > >> >
> > > > > > > >> > Any one interested in putting up some projects?
> > > > > > > >> >
> > > > > > > >> > Thanks
> > > > > > > >> > Vinoth
> > > > > > > >> >
> > > > > > > >> > ---------- Forwarded message ---------
> > > > > > > >> > From: Sally Khudairi <sk...@apache.org>
> > > > > > > >> > Date: Mon, Nov 2, 2020 at 7:52 PM
> > > > > > > >> > Subject: [PMCs] Ramping up for Google Summer of Code 2021:
> > > > > > invitation
> > > > > > > to
> > > > > > > >> > participate
> > > > > > > >> > To: ASF Marketing & Publicity <pr...@apache.org>
> > > > > > > >> >
> > > > > > > >> >
> > > > > > > >> > Hello PMCs --I hope you are all well.
> > > > > > > >> >
> > > > > > > >> > ASF Community Development (ComDev) oversees our
> > participation
> > > in
> > > > > > > Google
> > > > > > > >> > Summer of Code, for which the ASF has been a mentoring
> > > > > organization
> > > > > > > >> since
> > > > > > > >> > the program's inception 17 years ago.
> > > > > > > >> >
> > > > > > > >> > ComDev is seeking individuals and PMCs interested in
> > > > participating
> > > > > > as
> > > > > > > >> > mentors on behalf of the ASF and Apache Projects.
> > > > > > > >> >
> > > > > > > >> > The planning and preparation process begins now. The
> ComDev
> > > team
> > > > > are
> > > > > > > >> > collecting ideas for the Apache Project's participation in
> > > GSoC
> > > > > and
> > > > > > > >> want to
> > > > > > > >> > hear from you.
> > > > > > > >> >
> > > > > > > >> > Get started by reviewing the program guidelines at
> > > > > > > >> > http://community.apache.org/gsoc.html and be sure to
> engage
> > > > your
> > > > > > > >> > communities to get involved as well. Ping the ASF's GSoC
> > team
> > > at
> > > > > > > >> > dev@community.apache.org with any questions.
> > > > > > > >> >
> > > > > > > >> > Good luck and have a great program!
> > > > > > > >> >
> > > > > > > >> > Best,
> > > > > > > >> > Sally
> > > > > > > >> >
> > > > > > > >> > - - -
> > > > > > > >> > Vice President Marketing & Publicity
> > > > > > > >> > Vice President Sponsor Relations
> > > > > > > >> > The Apache Software Foundation
> > > > > > > >> >
> > > > > > > >> > Tel +1 617 921 8656 | sk@apache.org
> > > > > > > >> >
> > > > > > > >>
> > > > > > > >
> > > > > > > >
> > > > > > > > --
> > > > > > > > Regards,
> > > > > > > > -Sivabalan
> > > > > > > >
> > > > > > > --
> > > > > > > Regards,
> > > > > > > -Sivabalan
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
>


-- 
Regards,
-Sivabalan

Re: [PMCs] Ramping up for Google Summer of Code 2021: invitation to participate

Posted by Vinoth Chandar <vi...@apache.org>.
For some reason, that link did not work.

https://issues.apache.org/jira/issues/?jql=labels%20%3D%20gsoc2021

This list looks promising to me. We can wait a bit for others' feedback and
go ahead and finalize?

On Tue, Nov 10, 2020 at 1:01 PM Raymond Xu <xu...@gmail.com>
wrote:

> Ok updated the list (more descriptions to be added later for those tasks,
> once we finalize it)
>
>
> https://issues.apache.org/jira/browse/HUDI-1290?jql=labels%20%3D%20gsoc2021%20and%20project%20%3D%20HUDI
>
> On Tue, Nov 10, 2020 at 11:33 AM Vinoth Chandar <vi...@apache.org> wrote:
>
> > Sounds good. I can make a pass as well, once you have the trimmed list.
> >
> > Thanks Raymond!
> >
> > On Mon, Nov 9, 2020 at 7:47 PM Raymond Xu <xu...@gmail.com>
> > wrote:
> >
> > > yes, agreed to remove the refactoring tasks and reduce to a small
> number
> > of
> > > umbrella tasks and polish them for the program.
> > >
> > > On Mon, Nov 9, 2020 at 7:39 PM Vinoth Chandar <vi...@apache.org>
> wrote:
> > >
> > > > Good discussion!
> > > >
> > > > Given this is going to be almost like a summer internship, I suggest
> we
> > > > limit ourselves to high quality, independent projects.
> > > > We have tons of ideas, but the need to ensure that we won't be
> picking
> > > them
> > > > up ourselves, is what makes this tricky.
> > > >
> > > > Here are some ideas top of my head. (maybe we can use this thread to
> > > > collect ideas first)
> > > > Most of these are experimental.
> > > >
> > > > - Schema inference library, that infer a schema from vast quantities
> of
> > > > unstructured data and help us bootstrap that into Hudi
> > > > - Survey indexing techniques, and implement a subset that can speed
> up
> > > > query performance (e.g bitmaps, tree indexes)
> > > > - Apache Beam integration (there is a JIRA for this) with a Hudi IO
> > > module.
> > > > - Apache Calcite implementation for querying Hudi datasets (we can
> pick
> > > any
> > > > other popular engine also here)
> > > > - Apache Pulsar/Kinensis source in Delta Streamer
> > > >
> > > > Raymond, the current labels have a bunch of refactoring/tasks also
> > > tagged.
> > > > If you also agree, can we untag and only put up say 5 or so, bigger
> > > > projects?
> > > >
> > > > Things around refactoring etc, for e.g, would probably get done
> before
> > > > summer.
> > > >
> > > > Thanks
> > > > Vinoth
> > > >
> > > >
> > > >
> > > >
> > > >
> > > >
> > > >
> > > >
> > > >
> > > > On Mon, Nov 9, 2020 at 4:58 PM Raymond Xu <
> xu.shiyan.raymond@gmail.com
> > >
> > > > wrote:
> > > >
> > > > > Hi Siva and all,
> > > > >
> > > > > On students' commitments
> > > > >
> > > > > During the 12 weeks of coding time, nothing should take precedence
> > over
> > > > > > your project, and you should have no major distractions.
> > > > >
> > > > >
> > > > > Quote from this page
> > > > > <
> > > > >
> > > >
> > >
> >
> https://opensource.googleblog.com/2011/03/dos-and-donts-of-google-summer-of-code.html
> > > > > >,
> > > > > the participating students will be working on the project as a
> > > full-time
> > > > > job for 12 weeks. So roughly we could expect 40 hours work per
> week.
> > > > >
> > > > > Also agree on your points of making the experience meaningful. I
> > > suppose
> > > > at
> > > > > this stage we are still collecting all sorts of potential tasks,
> > which
> > > > can
> > > > > be filtered later.
> > > > >
> > > > > I have started looking for issues in the backlog and labeled some.
> > > Please
> > > > > check out this JIRA filter
> > > > >
> > > > >
> > > >
> > >
> >
> https://issues.apache.org/jira/browse/HUDI-1290?jql=labels%20%3D%20gsoc2021%20and%20project%20%3D%20HUDI
> > > > >
> > > > > Note: those issues' descriptions are to be edited with more
> > > introduction
> > > > to
> > > > > be more newcomer-friendly. And there can be some new tasks created,
> > > too.
> > > > >
> > > > > Any feedback on those labelled tasks? Also, anyone wants to bring
> in
> > > more
> > > > > ideas or tasks for this program?
> > > > > Please feel free to post JIRA issue links here so we can
> consolidate
> > > all
> > > > > and groom later.
> > > > >
> > > > >
> > > > > Thank you.
> > > > >
> > > > > Regards,
> > > > > Raymond
> > > > >
> > > > > On Fri, Nov 6, 2020 at 10:16 AM Sivabalan <n....@gmail.com>
> > wrote:
> > > > >
> > > > > > Sorry, just another point to remember is that, this might happen
> by
> > > > June,
> > > > > > July, aug of 2021. So the proposal assumes that the community may
> > not
> > > > > work
> > > > > > on these until then.
> > > > > >
> > > > > > On Fri, Nov 6, 2020 at 1:03 PM Sivabalan <n....@gmail.com>
> > wrote:
> > > > > >
> > > > > > > I am also interested and still trying to read more on what kind
> > of
> > > > > > > projects we can propose(execution, design, documentation,
> > > > > > usability/tools,
> > > > > > > performance framework etc), how much efforts we can expect from
> > > > > > > students (is it 10 hours per week or 20 hours per week, etc).
> One
> > > > thing
> > > > > > we
> > > > > > > should be mindful is that, we should try our best to think how
> > best
> > > > we
> > > > > > can
> > > > > > > help students and ensure they get something meaningful out of
> > > working
> > > > > > with
> > > > > > > us and get a good sense of how open source projects work, code
> > > > quality
> > > > > we
> > > > > > > expect etc. And not give some assorted 10 different tasks for
> > them
> > > to
> > > > > > > complete. We should try to have standalone projects or cohesive
> > > work
> > > > > > items
> > > > > > > (like devX may be).
> > > > > > >
> > > > > > >
> > > > > > > On Fri, Nov 6, 2020 at 12:50 PM Raymond Xu <
> > > > > xu.shiyan.raymond@gmail.com>
> > > > > > > wrote:
> > > > > > >
> > > > > > >> I'm interested in being a mentor and would like to create and
> > > submit
> > > > > > some
> > > > > > >> issues. (Sorry wanted to raise this earlier)
> > > > > > >>
> > > > > > >> To recap it for all:
> > > > > > >> There will be college students applying and, once accepted,
> > > working
> > > > on
> > > > > > >> some
> > > > > > >> JIRA issues of Apache projects in summer 2021. We are to
> create
> > > > issues
> > > > > > and
> > > > > > >> label them for this program. Those tasks will be assigned to
> > > > > > participants
> > > > > > >> and worked on around June 2021.
> > > > > > >>
> > > > > > >> To list some of the possible areas at high level
> > > > > > >> - DevX related: code style fix and alignment, nightly build
> > setup,
> > > > > > config
> > > > > > >> docs auto-generation
> > > > > > >> - New features: new indexing schemes, SQL querying of metadata
> > > > > > >> - Utilities improvements: new delta streamer sources, a UI,
> > > > > integrations
> > > > > > >> with other system e.g Airflow operator/sensor to trigger
> > pipelines
> > > > > based
> > > > > > >> on
> > > > > > >> Hudi commits
> > > > > > >>
> > > > > > >>
> > > > > > >> On Fri, Nov 6, 2020 at 9:32 AM Vinoth Chandar <
> > vinoth@apache.org>
> > > > > > wrote:
> > > > > > >>
> > > > > > >> > Hi all,
> > > > > > >> >
> > > > > > >> > Any one interested in putting up some projects?
> > > > > > >> >
> > > > > > >> > Thanks
> > > > > > >> > Vinoth
> > > > > > >> >
> > > > > > >> > ---------- Forwarded message ---------
> > > > > > >> > From: Sally Khudairi <sk...@apache.org>
> > > > > > >> > Date: Mon, Nov 2, 2020 at 7:52 PM
> > > > > > >> > Subject: [PMCs] Ramping up for Google Summer of Code 2021:
> > > > > invitation
> > > > > > to
> > > > > > >> > participate
> > > > > > >> > To: ASF Marketing & Publicity <pr...@apache.org>
> > > > > > >> >
> > > > > > >> >
> > > > > > >> > Hello PMCs --I hope you are all well.
> > > > > > >> >
> > > > > > >> > ASF Community Development (ComDev) oversees our
> participation
> > in
> > > > > > Google
> > > > > > >> > Summer of Code, for which the ASF has been a mentoring
> > > > organization
> > > > > > >> since
> > > > > > >> > the program's inception 17 years ago.
> > > > > > >> >
> > > > > > >> > ComDev is seeking individuals and PMCs interested in
> > > participating
> > > > > as
> > > > > > >> > mentors on behalf of the ASF and Apache Projects.
> > > > > > >> >
> > > > > > >> > The planning and preparation process begins now. The ComDev
> > team
> > > > are
> > > > > > >> > collecting ideas for the Apache Project's participation in
> > GSoC
> > > > and
> > > > > > >> want to
> > > > > > >> > hear from you.
> > > > > > >> >
> > > > > > >> > Get started by reviewing the program guidelines at
> > > > > > >> > http://community.apache.org/gsoc.html and be sure to engage
> > > your
> > > > > > >> > communities to get involved as well. Ping the ASF's GSoC
> team
> > at
> > > > > > >> > dev@community.apache.org with any questions.
> > > > > > >> >
> > > > > > >> > Good luck and have a great program!
> > > > > > >> >
> > > > > > >> > Best,
> > > > > > >> > Sally
> > > > > > >> >
> > > > > > >> > - - -
> > > > > > >> > Vice President Marketing & Publicity
> > > > > > >> > Vice President Sponsor Relations
> > > > > > >> > The Apache Software Foundation
> > > > > > >> >
> > > > > > >> > Tel +1 617 921 8656 | sk@apache.org
> > > > > > >> >
> > > > > > >>
> > > > > > >
> > > > > > >
> > > > > > > --
> > > > > > > Regards,
> > > > > > > -Sivabalan
> > > > > > >
> > > > > > --
> > > > > > Regards,
> > > > > > -Sivabalan
> > > > > >
> > > > >
> > > >
> > >
> >
>

Re: [PMCs] Ramping up for Google Summer of Code 2021: invitation to participate

Posted by Raymond Xu <xu...@gmail.com>.
Ok updated the list (more descriptions to be added later for those tasks,
once we finalize it)

https://issues.apache.org/jira/browse/HUDI-1290?jql=labels%20%3D%20gsoc2021%20and%20project%20%3D%20HUDI

On Tue, Nov 10, 2020 at 11:33 AM Vinoth Chandar <vi...@apache.org> wrote:

> Sounds good. I can make a pass as well, once you have the trimmed list.
>
> Thanks Raymond!
>
> On Mon, Nov 9, 2020 at 7:47 PM Raymond Xu <xu...@gmail.com>
> wrote:
>
> > yes, agreed to remove the refactoring tasks and reduce to a small number
> of
> > umbrella tasks and polish them for the program.
> >
> > On Mon, Nov 9, 2020 at 7:39 PM Vinoth Chandar <vi...@apache.org> wrote:
> >
> > > Good discussion!
> > >
> > > Given this is going to be almost like a summer internship, I suggest we
> > > limit ourselves to high quality, independent projects.
> > > We have tons of ideas, but the need to ensure that we won't be picking
> > them
> > > up ourselves, is what makes this tricky.
> > >
> > > Here are some ideas top of my head. (maybe we can use this thread to
> > > collect ideas first)
> > > Most of these are experimental.
> > >
> > > - Schema inference library, that infer a schema from vast quantities of
> > > unstructured data and help us bootstrap that into Hudi
> > > - Survey indexing techniques, and implement a subset that can speed up
> > > query performance (e.g bitmaps, tree indexes)
> > > - Apache Beam integration (there is a JIRA for this) with a Hudi IO
> > module.
> > > - Apache Calcite implementation for querying Hudi datasets (we can pick
> > any
> > > other popular engine also here)
> > > - Apache Pulsar/Kinensis source in Delta Streamer
> > >
> > > Raymond, the current labels have a bunch of refactoring/tasks also
> > tagged.
> > > If you also agree, can we untag and only put up say 5 or so, bigger
> > > projects?
> > >
> > > Things around refactoring etc, for e.g, would probably get done before
> > > summer.
> > >
> > > Thanks
> > > Vinoth
> > >
> > >
> > >
> > >
> > >
> > >
> > >
> > >
> > >
> > > On Mon, Nov 9, 2020 at 4:58 PM Raymond Xu <xu.shiyan.raymond@gmail.com
> >
> > > wrote:
> > >
> > > > Hi Siva and all,
> > > >
> > > > On students' commitments
> > > >
> > > > During the 12 weeks of coding time, nothing should take precedence
> over
> > > > > your project, and you should have no major distractions.
> > > >
> > > >
> > > > Quote from this page
> > > > <
> > > >
> > >
> >
> https://opensource.googleblog.com/2011/03/dos-and-donts-of-google-summer-of-code.html
> > > > >,
> > > > the participating students will be working on the project as a
> > full-time
> > > > job for 12 weeks. So roughly we could expect 40 hours work per week.
> > > >
> > > > Also agree on your points of making the experience meaningful. I
> > suppose
> > > at
> > > > this stage we are still collecting all sorts of potential tasks,
> which
> > > can
> > > > be filtered later.
> > > >
> > > > I have started looking for issues in the backlog and labeled some.
> > Please
> > > > check out this JIRA filter
> > > >
> > > >
> > >
> >
> https://issues.apache.org/jira/browse/HUDI-1290?jql=labels%20%3D%20gsoc2021%20and%20project%20%3D%20HUDI
> > > >
> > > > Note: those issues' descriptions are to be edited with more
> > introduction
> > > to
> > > > be more newcomer-friendly. And there can be some new tasks created,
> > too.
> > > >
> > > > Any feedback on those labelled tasks? Also, anyone wants to bring in
> > more
> > > > ideas or tasks for this program?
> > > > Please feel free to post JIRA issue links here so we can consolidate
> > all
> > > > and groom later.
> > > >
> > > >
> > > > Thank you.
> > > >
> > > > Regards,
> > > > Raymond
> > > >
> > > > On Fri, Nov 6, 2020 at 10:16 AM Sivabalan <n....@gmail.com>
> wrote:
> > > >
> > > > > Sorry, just another point to remember is that, this might happen by
> > > June,
> > > > > July, aug of 2021. So the proposal assumes that the community may
> not
> > > > work
> > > > > on these until then.
> > > > >
> > > > > On Fri, Nov 6, 2020 at 1:03 PM Sivabalan <n....@gmail.com>
> wrote:
> > > > >
> > > > > > I am also interested and still trying to read more on what kind
> of
> > > > > > projects we can propose(execution, design, documentation,
> > > > > usability/tools,
> > > > > > performance framework etc), how much efforts we can expect from
> > > > > > students (is it 10 hours per week or 20 hours per week, etc). One
> > > thing
> > > > > we
> > > > > > should be mindful is that, we should try our best to think how
> best
> > > we
> > > > > can
> > > > > > help students and ensure they get something meaningful out of
> > working
> > > > > with
> > > > > > us and get a good sense of how open source projects work, code
> > > quality
> > > > we
> > > > > > expect etc. And not give some assorted 10 different tasks for
> them
> > to
> > > > > > complete. We should try to have standalone projects or cohesive
> > work
> > > > > items
> > > > > > (like devX may be).
> > > > > >
> > > > > >
> > > > > > On Fri, Nov 6, 2020 at 12:50 PM Raymond Xu <
> > > > xu.shiyan.raymond@gmail.com>
> > > > > > wrote:
> > > > > >
> > > > > >> I'm interested in being a mentor and would like to create and
> > submit
> > > > > some
> > > > > >> issues. (Sorry wanted to raise this earlier)
> > > > > >>
> > > > > >> To recap it for all:
> > > > > >> There will be college students applying and, once accepted,
> > working
> > > on
> > > > > >> some
> > > > > >> JIRA issues of Apache projects in summer 2021. We are to create
> > > issues
> > > > > and
> > > > > >> label them for this program. Those tasks will be assigned to
> > > > > participants
> > > > > >> and worked on around June 2021.
> > > > > >>
> > > > > >> To list some of the possible areas at high level
> > > > > >> - DevX related: code style fix and alignment, nightly build
> setup,
> > > > > config
> > > > > >> docs auto-generation
> > > > > >> - New features: new indexing schemes, SQL querying of metadata
> > > > > >> - Utilities improvements: new delta streamer sources, a UI,
> > > > integrations
> > > > > >> with other system e.g Airflow operator/sensor to trigger
> pipelines
> > > > based
> > > > > >> on
> > > > > >> Hudi commits
> > > > > >>
> > > > > >>
> > > > > >> On Fri, Nov 6, 2020 at 9:32 AM Vinoth Chandar <
> vinoth@apache.org>
> > > > > wrote:
> > > > > >>
> > > > > >> > Hi all,
> > > > > >> >
> > > > > >> > Any one interested in putting up some projects?
> > > > > >> >
> > > > > >> > Thanks
> > > > > >> > Vinoth
> > > > > >> >
> > > > > >> > ---------- Forwarded message ---------
> > > > > >> > From: Sally Khudairi <sk...@apache.org>
> > > > > >> > Date: Mon, Nov 2, 2020 at 7:52 PM
> > > > > >> > Subject: [PMCs] Ramping up for Google Summer of Code 2021:
> > > > invitation
> > > > > to
> > > > > >> > participate
> > > > > >> > To: ASF Marketing & Publicity <pr...@apache.org>
> > > > > >> >
> > > > > >> >
> > > > > >> > Hello PMCs --I hope you are all well.
> > > > > >> >
> > > > > >> > ASF Community Development (ComDev) oversees our participation
> in
> > > > > Google
> > > > > >> > Summer of Code, for which the ASF has been a mentoring
> > > organization
> > > > > >> since
> > > > > >> > the program's inception 17 years ago.
> > > > > >> >
> > > > > >> > ComDev is seeking individuals and PMCs interested in
> > participating
> > > > as
> > > > > >> > mentors on behalf of the ASF and Apache Projects.
> > > > > >> >
> > > > > >> > The planning and preparation process begins now. The ComDev
> team
> > > are
> > > > > >> > collecting ideas for the Apache Project's participation in
> GSoC
> > > and
> > > > > >> want to
> > > > > >> > hear from you.
> > > > > >> >
> > > > > >> > Get started by reviewing the program guidelines at
> > > > > >> > http://community.apache.org/gsoc.html and be sure to engage
> > your
> > > > > >> > communities to get involved as well. Ping the ASF's GSoC team
> at
> > > > > >> > dev@community.apache.org with any questions.
> > > > > >> >
> > > > > >> > Good luck and have a great program!
> > > > > >> >
> > > > > >> > Best,
> > > > > >> > Sally
> > > > > >> >
> > > > > >> > - - -
> > > > > >> > Vice President Marketing & Publicity
> > > > > >> > Vice President Sponsor Relations
> > > > > >> > The Apache Software Foundation
> > > > > >> >
> > > > > >> > Tel +1 617 921 8656 | sk@apache.org
> > > > > >> >
> > > > > >>
> > > > > >
> > > > > >
> > > > > > --
> > > > > > Regards,
> > > > > > -Sivabalan
> > > > > >
> > > > > --
> > > > > Regards,
> > > > > -Sivabalan
> > > > >
> > > >
> > >
> >
>

Re: [PMCs] Ramping up for Google Summer of Code 2021: invitation to participate

Posted by Vinoth Chandar <vi...@apache.org>.
Sounds good. I can make a pass as well, once you have the trimmed list.

Thanks Raymond!

On Mon, Nov 9, 2020 at 7:47 PM Raymond Xu <xu...@gmail.com>
wrote:

> yes, agreed to remove the refactoring tasks and reduce to a small number of
> umbrella tasks and polish them for the program.
>
> On Mon, Nov 9, 2020 at 7:39 PM Vinoth Chandar <vi...@apache.org> wrote:
>
> > Good discussion!
> >
> > Given this is going to be almost like a summer internship, I suggest we
> > limit ourselves to high quality, independent projects.
> > We have tons of ideas, but the need to ensure that we won't be picking
> them
> > up ourselves, is what makes this tricky.
> >
> > Here are some ideas top of my head. (maybe we can use this thread to
> > collect ideas first)
> > Most of these are experimental.
> >
> > - Schema inference library, that infer a schema from vast quantities of
> > unstructured data and help us bootstrap that into Hudi
> > - Survey indexing techniques, and implement a subset that can speed up
> > query performance (e.g bitmaps, tree indexes)
> > - Apache Beam integration (there is a JIRA for this) with a Hudi IO
> module.
> > - Apache Calcite implementation for querying Hudi datasets (we can pick
> any
> > other popular engine also here)
> > - Apache Pulsar/Kinensis source in Delta Streamer
> >
> > Raymond, the current labels have a bunch of refactoring/tasks also
> tagged.
> > If you also agree, can we untag and only put up say 5 or so, bigger
> > projects?
> >
> > Things around refactoring etc, for e.g, would probably get done before
> > summer.
> >
> > Thanks
> > Vinoth
> >
> >
> >
> >
> >
> >
> >
> >
> >
> > On Mon, Nov 9, 2020 at 4:58 PM Raymond Xu <xu...@gmail.com>
> > wrote:
> >
> > > Hi Siva and all,
> > >
> > > On students' commitments
> > >
> > > During the 12 weeks of coding time, nothing should take precedence over
> > > > your project, and you should have no major distractions.
> > >
> > >
> > > Quote from this page
> > > <
> > >
> >
> https://opensource.googleblog.com/2011/03/dos-and-donts-of-google-summer-of-code.html
> > > >,
> > > the participating students will be working on the project as a
> full-time
> > > job for 12 weeks. So roughly we could expect 40 hours work per week.
> > >
> > > Also agree on your points of making the experience meaningful. I
> suppose
> > at
> > > this stage we are still collecting all sorts of potential tasks, which
> > can
> > > be filtered later.
> > >
> > > I have started looking for issues in the backlog and labeled some.
> Please
> > > check out this JIRA filter
> > >
> > >
> >
> https://issues.apache.org/jira/browse/HUDI-1290?jql=labels%20%3D%20gsoc2021%20and%20project%20%3D%20HUDI
> > >
> > > Note: those issues' descriptions are to be edited with more
> introduction
> > to
> > > be more newcomer-friendly. And there can be some new tasks created,
> too.
> > >
> > > Any feedback on those labelled tasks? Also, anyone wants to bring in
> more
> > > ideas or tasks for this program?
> > > Please feel free to post JIRA issue links here so we can consolidate
> all
> > > and groom later.
> > >
> > >
> > > Thank you.
> > >
> > > Regards,
> > > Raymond
> > >
> > > On Fri, Nov 6, 2020 at 10:16 AM Sivabalan <n....@gmail.com> wrote:
> > >
> > > > Sorry, just another point to remember is that, this might happen by
> > June,
> > > > July, aug of 2021. So the proposal assumes that the community may not
> > > work
> > > > on these until then.
> > > >
> > > > On Fri, Nov 6, 2020 at 1:03 PM Sivabalan <n....@gmail.com> wrote:
> > > >
> > > > > I am also interested and still trying to read more on what kind of
> > > > > projects we can propose(execution, design, documentation,
> > > > usability/tools,
> > > > > performance framework etc), how much efforts we can expect from
> > > > > students (is it 10 hours per week or 20 hours per week, etc). One
> > thing
> > > > we
> > > > > should be mindful is that, we should try our best to think how best
> > we
> > > > can
> > > > > help students and ensure they get something meaningful out of
> working
> > > > with
> > > > > us and get a good sense of how open source projects work, code
> > quality
> > > we
> > > > > expect etc. And not give some assorted 10 different tasks for them
> to
> > > > > complete. We should try to have standalone projects or cohesive
> work
> > > > items
> > > > > (like devX may be).
> > > > >
> > > > >
> > > > > On Fri, Nov 6, 2020 at 12:50 PM Raymond Xu <
> > > xu.shiyan.raymond@gmail.com>
> > > > > wrote:
> > > > >
> > > > >> I'm interested in being a mentor and would like to create and
> submit
> > > > some
> > > > >> issues. (Sorry wanted to raise this earlier)
> > > > >>
> > > > >> To recap it for all:
> > > > >> There will be college students applying and, once accepted,
> working
> > on
> > > > >> some
> > > > >> JIRA issues of Apache projects in summer 2021. We are to create
> > issues
> > > > and
> > > > >> label them for this program. Those tasks will be assigned to
> > > > participants
> > > > >> and worked on around June 2021.
> > > > >>
> > > > >> To list some of the possible areas at high level
> > > > >> - DevX related: code style fix and alignment, nightly build setup,
> > > > config
> > > > >> docs auto-generation
> > > > >> - New features: new indexing schemes, SQL querying of metadata
> > > > >> - Utilities improvements: new delta streamer sources, a UI,
> > > integrations
> > > > >> with other system e.g Airflow operator/sensor to trigger pipelines
> > > based
> > > > >> on
> > > > >> Hudi commits
> > > > >>
> > > > >>
> > > > >> On Fri, Nov 6, 2020 at 9:32 AM Vinoth Chandar <vi...@apache.org>
> > > > wrote:
> > > > >>
> > > > >> > Hi all,
> > > > >> >
> > > > >> > Any one interested in putting up some projects?
> > > > >> >
> > > > >> > Thanks
> > > > >> > Vinoth
> > > > >> >
> > > > >> > ---------- Forwarded message ---------
> > > > >> > From: Sally Khudairi <sk...@apache.org>
> > > > >> > Date: Mon, Nov 2, 2020 at 7:52 PM
> > > > >> > Subject: [PMCs] Ramping up for Google Summer of Code 2021:
> > > invitation
> > > > to
> > > > >> > participate
> > > > >> > To: ASF Marketing & Publicity <pr...@apache.org>
> > > > >> >
> > > > >> >
> > > > >> > Hello PMCs --I hope you are all well.
> > > > >> >
> > > > >> > ASF Community Development (ComDev) oversees our participation in
> > > > Google
> > > > >> > Summer of Code, for which the ASF has been a mentoring
> > organization
> > > > >> since
> > > > >> > the program's inception 17 years ago.
> > > > >> >
> > > > >> > ComDev is seeking individuals and PMCs interested in
> participating
> > > as
> > > > >> > mentors on behalf of the ASF and Apache Projects.
> > > > >> >
> > > > >> > The planning and preparation process begins now. The ComDev team
> > are
> > > > >> > collecting ideas for the Apache Project's participation in GSoC
> > and
> > > > >> want to
> > > > >> > hear from you.
> > > > >> >
> > > > >> > Get started by reviewing the program guidelines at
> > > > >> > http://community.apache.org/gsoc.html and be sure to engage
> your
> > > > >> > communities to get involved as well. Ping the ASF's GSoC team at
> > > > >> > dev@community.apache.org with any questions.
> > > > >> >
> > > > >> > Good luck and have a great program!
> > > > >> >
> > > > >> > Best,
> > > > >> > Sally
> > > > >> >
> > > > >> > - - -
> > > > >> > Vice President Marketing & Publicity
> > > > >> > Vice President Sponsor Relations
> > > > >> > The Apache Software Foundation
> > > > >> >
> > > > >> > Tel +1 617 921 8656 | sk@apache.org
> > > > >> >
> > > > >>
> > > > >
> > > > >
> > > > > --
> > > > > Regards,
> > > > > -Sivabalan
> > > > >
> > > > --
> > > > Regards,
> > > > -Sivabalan
> > > >
> > >
> >
>

Re: [PMCs] Ramping up for Google Summer of Code 2021: invitation to participate

Posted by Raymond Xu <xu...@gmail.com>.
yes, agreed to remove the refactoring tasks and reduce to a small number of
umbrella tasks and polish them for the program.

On Mon, Nov 9, 2020 at 7:39 PM Vinoth Chandar <vi...@apache.org> wrote:

> Good discussion!
>
> Given this is going to be almost like a summer internship, I suggest we
> limit ourselves to high quality, independent projects.
> We have tons of ideas, but the need to ensure that we won't be picking them
> up ourselves, is what makes this tricky.
>
> Here are some ideas top of my head. (maybe we can use this thread to
> collect ideas first)
> Most of these are experimental.
>
> - Schema inference library, that infer a schema from vast quantities of
> unstructured data and help us bootstrap that into Hudi
> - Survey indexing techniques, and implement a subset that can speed up
> query performance (e.g bitmaps, tree indexes)
> - Apache Beam integration (there is a JIRA for this) with a Hudi IO module.
> - Apache Calcite implementation for querying Hudi datasets (we can pick any
> other popular engine also here)
> - Apache Pulsar/Kinensis source in Delta Streamer
>
> Raymond, the current labels have a bunch of refactoring/tasks also tagged.
> If you also agree, can we untag and only put up say 5 or so, bigger
> projects?
>
> Things around refactoring etc, for e.g, would probably get done before
> summer.
>
> Thanks
> Vinoth
>
>
>
>
>
>
>
>
>
> On Mon, Nov 9, 2020 at 4:58 PM Raymond Xu <xu...@gmail.com>
> wrote:
>
> > Hi Siva and all,
> >
> > On students' commitments
> >
> > During the 12 weeks of coding time, nothing should take precedence over
> > > your project, and you should have no major distractions.
> >
> >
> > Quote from this page
> > <
> >
> https://opensource.googleblog.com/2011/03/dos-and-donts-of-google-summer-of-code.html
> > >,
> > the participating students will be working on the project as a full-time
> > job for 12 weeks. So roughly we could expect 40 hours work per week.
> >
> > Also agree on your points of making the experience meaningful. I suppose
> at
> > this stage we are still collecting all sorts of potential tasks, which
> can
> > be filtered later.
> >
> > I have started looking for issues in the backlog and labeled some. Please
> > check out this JIRA filter
> >
> >
> https://issues.apache.org/jira/browse/HUDI-1290?jql=labels%20%3D%20gsoc2021%20and%20project%20%3D%20HUDI
> >
> > Note: those issues' descriptions are to be edited with more introduction
> to
> > be more newcomer-friendly. And there can be some new tasks created, too.
> >
> > Any feedback on those labelled tasks? Also, anyone wants to bring in more
> > ideas or tasks for this program?
> > Please feel free to post JIRA issue links here so we can consolidate all
> > and groom later.
> >
> >
> > Thank you.
> >
> > Regards,
> > Raymond
> >
> > On Fri, Nov 6, 2020 at 10:16 AM Sivabalan <n....@gmail.com> wrote:
> >
> > > Sorry, just another point to remember is that, this might happen by
> June,
> > > July, aug of 2021. So the proposal assumes that the community may not
> > work
> > > on these until then.
> > >
> > > On Fri, Nov 6, 2020 at 1:03 PM Sivabalan <n....@gmail.com> wrote:
> > >
> > > > I am also interested and still trying to read more on what kind of
> > > > projects we can propose(execution, design, documentation,
> > > usability/tools,
> > > > performance framework etc), how much efforts we can expect from
> > > > students (is it 10 hours per week or 20 hours per week, etc). One
> thing
> > > we
> > > > should be mindful is that, we should try our best to think how best
> we
> > > can
> > > > help students and ensure they get something meaningful out of working
> > > with
> > > > us and get a good sense of how open source projects work, code
> quality
> > we
> > > > expect etc. And not give some assorted 10 different tasks for them to
> > > > complete. We should try to have standalone projects or cohesive work
> > > items
> > > > (like devX may be).
> > > >
> > > >
> > > > On Fri, Nov 6, 2020 at 12:50 PM Raymond Xu <
> > xu.shiyan.raymond@gmail.com>
> > > > wrote:
> > > >
> > > >> I'm interested in being a mentor and would like to create and submit
> > > some
> > > >> issues. (Sorry wanted to raise this earlier)
> > > >>
> > > >> To recap it for all:
> > > >> There will be college students applying and, once accepted, working
> on
> > > >> some
> > > >> JIRA issues of Apache projects in summer 2021. We are to create
> issues
> > > and
> > > >> label them for this program. Those tasks will be assigned to
> > > participants
> > > >> and worked on around June 2021.
> > > >>
> > > >> To list some of the possible areas at high level
> > > >> - DevX related: code style fix and alignment, nightly build setup,
> > > config
> > > >> docs auto-generation
> > > >> - New features: new indexing schemes, SQL querying of metadata
> > > >> - Utilities improvements: new delta streamer sources, a UI,
> > integrations
> > > >> with other system e.g Airflow operator/sensor to trigger pipelines
> > based
> > > >> on
> > > >> Hudi commits
> > > >>
> > > >>
> > > >> On Fri, Nov 6, 2020 at 9:32 AM Vinoth Chandar <vi...@apache.org>
> > > wrote:
> > > >>
> > > >> > Hi all,
> > > >> >
> > > >> > Any one interested in putting up some projects?
> > > >> >
> > > >> > Thanks
> > > >> > Vinoth
> > > >> >
> > > >> > ---------- Forwarded message ---------
> > > >> > From: Sally Khudairi <sk...@apache.org>
> > > >> > Date: Mon, Nov 2, 2020 at 7:52 PM
> > > >> > Subject: [PMCs] Ramping up for Google Summer of Code 2021:
> > invitation
> > > to
> > > >> > participate
> > > >> > To: ASF Marketing & Publicity <pr...@apache.org>
> > > >> >
> > > >> >
> > > >> > Hello PMCs --I hope you are all well.
> > > >> >
> > > >> > ASF Community Development (ComDev) oversees our participation in
> > > Google
> > > >> > Summer of Code, for which the ASF has been a mentoring
> organization
> > > >> since
> > > >> > the program's inception 17 years ago.
> > > >> >
> > > >> > ComDev is seeking individuals and PMCs interested in participating
> > as
> > > >> > mentors on behalf of the ASF and Apache Projects.
> > > >> >
> > > >> > The planning and preparation process begins now. The ComDev team
> are
> > > >> > collecting ideas for the Apache Project's participation in GSoC
> and
> > > >> want to
> > > >> > hear from you.
> > > >> >
> > > >> > Get started by reviewing the program guidelines at
> > > >> > http://community.apache.org/gsoc.html and be sure to engage your
> > > >> > communities to get involved as well. Ping the ASF's GSoC team at
> > > >> > dev@community.apache.org with any questions.
> > > >> >
> > > >> > Good luck and have a great program!
> > > >> >
> > > >> > Best,
> > > >> > Sally
> > > >> >
> > > >> > - - -
> > > >> > Vice President Marketing & Publicity
> > > >> > Vice President Sponsor Relations
> > > >> > The Apache Software Foundation
> > > >> >
> > > >> > Tel +1 617 921 8656 | sk@apache.org
> > > >> >
> > > >>
> > > >
> > > >
> > > > --
> > > > Regards,
> > > > -Sivabalan
> > > >
> > > --
> > > Regards,
> > > -Sivabalan
> > >
> >
>

Re: [PMCs] Ramping up for Google Summer of Code 2021: invitation to participate

Posted by Vinoth Chandar <vi...@apache.org>.
Good discussion!

Given this is going to be almost like a summer internship, I suggest we
limit ourselves to high quality, independent projects.
We have tons of ideas, but the need to ensure that we won't be picking them
up ourselves, is what makes this tricky.

Here are some ideas top of my head. (maybe we can use this thread to
collect ideas first)
Most of these are experimental.

- Schema inference library, that infer a schema from vast quantities of
unstructured data and help us bootstrap that into Hudi
- Survey indexing techniques, and implement a subset that can speed up
query performance (e.g bitmaps, tree indexes)
- Apache Beam integration (there is a JIRA for this) with a Hudi IO module.
- Apache Calcite implementation for querying Hudi datasets (we can pick any
other popular engine also here)
- Apache Pulsar/Kinensis source in Delta Streamer

Raymond, the current labels have a bunch of refactoring/tasks also tagged.
If you also agree, can we untag and only put up say 5 or so, bigger
projects?

Things around refactoring etc, for e.g, would probably get done before
summer.

Thanks
Vinoth









On Mon, Nov 9, 2020 at 4:58 PM Raymond Xu <xu...@gmail.com>
wrote:

> Hi Siva and all,
>
> On students' commitments
>
> During the 12 weeks of coding time, nothing should take precedence over
> > your project, and you should have no major distractions.
>
>
> Quote from this page
> <
> https://opensource.googleblog.com/2011/03/dos-and-donts-of-google-summer-of-code.html
> >,
> the participating students will be working on the project as a full-time
> job for 12 weeks. So roughly we could expect 40 hours work per week.
>
> Also agree on your points of making the experience meaningful. I suppose at
> this stage we are still collecting all sorts of potential tasks, which can
> be filtered later.
>
> I have started looking for issues in the backlog and labeled some. Please
> check out this JIRA filter
>
> https://issues.apache.org/jira/browse/HUDI-1290?jql=labels%20%3D%20gsoc2021%20and%20project%20%3D%20HUDI
>
> Note: those issues' descriptions are to be edited with more introduction to
> be more newcomer-friendly. And there can be some new tasks created, too.
>
> Any feedback on those labelled tasks? Also, anyone wants to bring in more
> ideas or tasks for this program?
> Please feel free to post JIRA issue links here so we can consolidate all
> and groom later.
>
>
> Thank you.
>
> Regards,
> Raymond
>
> On Fri, Nov 6, 2020 at 10:16 AM Sivabalan <n....@gmail.com> wrote:
>
> > Sorry, just another point to remember is that, this might happen by June,
> > July, aug of 2021. So the proposal assumes that the community may not
> work
> > on these until then.
> >
> > On Fri, Nov 6, 2020 at 1:03 PM Sivabalan <n....@gmail.com> wrote:
> >
> > > I am also interested and still trying to read more on what kind of
> > > projects we can propose(execution, design, documentation,
> > usability/tools,
> > > performance framework etc), how much efforts we can expect from
> > > students (is it 10 hours per week or 20 hours per week, etc). One thing
> > we
> > > should be mindful is that, we should try our best to think how best we
> > can
> > > help students and ensure they get something meaningful out of working
> > with
> > > us and get a good sense of how open source projects work, code quality
> we
> > > expect etc. And not give some assorted 10 different tasks for them to
> > > complete. We should try to have standalone projects or cohesive work
> > items
> > > (like devX may be).
> > >
> > >
> > > On Fri, Nov 6, 2020 at 12:50 PM Raymond Xu <
> xu.shiyan.raymond@gmail.com>
> > > wrote:
> > >
> > >> I'm interested in being a mentor and would like to create and submit
> > some
> > >> issues. (Sorry wanted to raise this earlier)
> > >>
> > >> To recap it for all:
> > >> There will be college students applying and, once accepted, working on
> > >> some
> > >> JIRA issues of Apache projects in summer 2021. We are to create issues
> > and
> > >> label them for this program. Those tasks will be assigned to
> > participants
> > >> and worked on around June 2021.
> > >>
> > >> To list some of the possible areas at high level
> > >> - DevX related: code style fix and alignment, nightly build setup,
> > config
> > >> docs auto-generation
> > >> - New features: new indexing schemes, SQL querying of metadata
> > >> - Utilities improvements: new delta streamer sources, a UI,
> integrations
> > >> with other system e.g Airflow operator/sensor to trigger pipelines
> based
> > >> on
> > >> Hudi commits
> > >>
> > >>
> > >> On Fri, Nov 6, 2020 at 9:32 AM Vinoth Chandar <vi...@apache.org>
> > wrote:
> > >>
> > >> > Hi all,
> > >> >
> > >> > Any one interested in putting up some projects?
> > >> >
> > >> > Thanks
> > >> > Vinoth
> > >> >
> > >> > ---------- Forwarded message ---------
> > >> > From: Sally Khudairi <sk...@apache.org>
> > >> > Date: Mon, Nov 2, 2020 at 7:52 PM
> > >> > Subject: [PMCs] Ramping up for Google Summer of Code 2021:
> invitation
> > to
> > >> > participate
> > >> > To: ASF Marketing & Publicity <pr...@apache.org>
> > >> >
> > >> >
> > >> > Hello PMCs --I hope you are all well.
> > >> >
> > >> > ASF Community Development (ComDev) oversees our participation in
> > Google
> > >> > Summer of Code, for which the ASF has been a mentoring organization
> > >> since
> > >> > the program's inception 17 years ago.
> > >> >
> > >> > ComDev is seeking individuals and PMCs interested in participating
> as
> > >> > mentors on behalf of the ASF and Apache Projects.
> > >> >
> > >> > The planning and preparation process begins now. The ComDev team are
> > >> > collecting ideas for the Apache Project's participation in GSoC and
> > >> want to
> > >> > hear from you.
> > >> >
> > >> > Get started by reviewing the program guidelines at
> > >> > http://community.apache.org/gsoc.html and be sure to engage your
> > >> > communities to get involved as well. Ping the ASF's GSoC team at
> > >> > dev@community.apache.org with any questions.
> > >> >
> > >> > Good luck and have a great program!
> > >> >
> > >> > Best,
> > >> > Sally
> > >> >
> > >> > - - -
> > >> > Vice President Marketing & Publicity
> > >> > Vice President Sponsor Relations
> > >> > The Apache Software Foundation
> > >> >
> > >> > Tel +1 617 921 8656 | sk@apache.org
> > >> >
> > >>
> > >
> > >
> > > --
> > > Regards,
> > > -Sivabalan
> > >
> > --
> > Regards,
> > -Sivabalan
> >
>

Re: [PMCs] Ramping up for Google Summer of Code 2021: invitation to participate

Posted by Raymond Xu <xu...@gmail.com>.
Hi Siva and all,

On students' commitments

During the 12 weeks of coding time, nothing should take precedence over
> your project, and you should have no major distractions.


Quote from this page
<https://opensource.googleblog.com/2011/03/dos-and-donts-of-google-summer-of-code.html>,
the participating students will be working on the project as a full-time
job for 12 weeks. So roughly we could expect 40 hours work per week.

Also agree on your points of making the experience meaningful. I suppose at
this stage we are still collecting all sorts of potential tasks, which can
be filtered later.

I have started looking for issues in the backlog and labeled some. Please
check out this JIRA filter
https://issues.apache.org/jira/browse/HUDI-1290?jql=labels%20%3D%20gsoc2021%20and%20project%20%3D%20HUDI

Note: those issues' descriptions are to be edited with more introduction to
be more newcomer-friendly. And there can be some new tasks created, too.

Any feedback on those labelled tasks? Also, anyone wants to bring in more
ideas or tasks for this program?
Please feel free to post JIRA issue links here so we can consolidate all
and groom later.


Thank you.

Regards,
Raymond

On Fri, Nov 6, 2020 at 10:16 AM Sivabalan <n....@gmail.com> wrote:

> Sorry, just another point to remember is that, this might happen by June,
> July, aug of 2021. So the proposal assumes that the community may not work
> on these until then.
>
> On Fri, Nov 6, 2020 at 1:03 PM Sivabalan <n....@gmail.com> wrote:
>
> > I am also interested and still trying to read more on what kind of
> > projects we can propose(execution, design, documentation,
> usability/tools,
> > performance framework etc), how much efforts we can expect from
> > students (is it 10 hours per week or 20 hours per week, etc). One thing
> we
> > should be mindful is that, we should try our best to think how best we
> can
> > help students and ensure they get something meaningful out of working
> with
> > us and get a good sense of how open source projects work, code quality we
> > expect etc. And not give some assorted 10 different tasks for them to
> > complete. We should try to have standalone projects or cohesive work
> items
> > (like devX may be).
> >
> >
> > On Fri, Nov 6, 2020 at 12:50 PM Raymond Xu <xu...@gmail.com>
> > wrote:
> >
> >> I'm interested in being a mentor and would like to create and submit
> some
> >> issues. (Sorry wanted to raise this earlier)
> >>
> >> To recap it for all:
> >> There will be college students applying and, once accepted, working on
> >> some
> >> JIRA issues of Apache projects in summer 2021. We are to create issues
> and
> >> label them for this program. Those tasks will be assigned to
> participants
> >> and worked on around June 2021.
> >>
> >> To list some of the possible areas at high level
> >> - DevX related: code style fix and alignment, nightly build setup,
> config
> >> docs auto-generation
> >> - New features: new indexing schemes, SQL querying of metadata
> >> - Utilities improvements: new delta streamer sources, a UI, integrations
> >> with other system e.g Airflow operator/sensor to trigger pipelines based
> >> on
> >> Hudi commits
> >>
> >>
> >> On Fri, Nov 6, 2020 at 9:32 AM Vinoth Chandar <vi...@apache.org>
> wrote:
> >>
> >> > Hi all,
> >> >
> >> > Any one interested in putting up some projects?
> >> >
> >> > Thanks
> >> > Vinoth
> >> >
> >> > ---------- Forwarded message ---------
> >> > From: Sally Khudairi <sk...@apache.org>
> >> > Date: Mon, Nov 2, 2020 at 7:52 PM
> >> > Subject: [PMCs] Ramping up for Google Summer of Code 2021: invitation
> to
> >> > participate
> >> > To: ASF Marketing & Publicity <pr...@apache.org>
> >> >
> >> >
> >> > Hello PMCs --I hope you are all well.
> >> >
> >> > ASF Community Development (ComDev) oversees our participation in
> Google
> >> > Summer of Code, for which the ASF has been a mentoring organization
> >> since
> >> > the program's inception 17 years ago.
> >> >
> >> > ComDev is seeking individuals and PMCs interested in participating as
> >> > mentors on behalf of the ASF and Apache Projects.
> >> >
> >> > The planning and preparation process begins now. The ComDev team are
> >> > collecting ideas for the Apache Project's participation in GSoC and
> >> want to
> >> > hear from you.
> >> >
> >> > Get started by reviewing the program guidelines at
> >> > http://community.apache.org/gsoc.html and be sure to engage your
> >> > communities to get involved as well. Ping the ASF's GSoC team at
> >> > dev@community.apache.org with any questions.
> >> >
> >> > Good luck and have a great program!
> >> >
> >> > Best,
> >> > Sally
> >> >
> >> > - - -
> >> > Vice President Marketing & Publicity
> >> > Vice President Sponsor Relations
> >> > The Apache Software Foundation
> >> >
> >> > Tel +1 617 921 8656 | sk@apache.org
> >> >
> >>
> >
> >
> > --
> > Regards,
> > -Sivabalan
> >
> --
> Regards,
> -Sivabalan
>

Re: [PMCs] Ramping up for Google Summer of Code 2021: invitation to participate

Posted by Sivabalan <n....@gmail.com>.
Sorry, just another point to remember is that, this might happen by June,
July, aug of 2021. So the proposal assumes that the community may not work
on these until then.

On Fri, Nov 6, 2020 at 1:03 PM Sivabalan <n....@gmail.com> wrote:

> I am also interested and still trying to read more on what kind of
> projects we can propose(execution, design, documentation, usability/tools,
> performance framework etc), how much efforts we can expect from
> students (is it 10 hours per week or 20 hours per week, etc). One thing we
> should be mindful is that, we should try our best to think how best we can
> help students and ensure they get something meaningful out of working with
> us and get a good sense of how open source projects work, code quality we
> expect etc. And not give some assorted 10 different tasks for them to
> complete. We should try to have standalone projects or cohesive work items
> (like devX may be).
>
>
> On Fri, Nov 6, 2020 at 12:50 PM Raymond Xu <xu...@gmail.com>
> wrote:
>
>> I'm interested in being a mentor and would like to create and submit some
>> issues. (Sorry wanted to raise this earlier)
>>
>> To recap it for all:
>> There will be college students applying and, once accepted, working on
>> some
>> JIRA issues of Apache projects in summer 2021. We are to create issues and
>> label them for this program. Those tasks will be assigned to participants
>> and worked on around June 2021.
>>
>> To list some of the possible areas at high level
>> - DevX related: code style fix and alignment, nightly build setup, config
>> docs auto-generation
>> - New features: new indexing schemes, SQL querying of metadata
>> - Utilities improvements: new delta streamer sources, a UI, integrations
>> with other system e.g Airflow operator/sensor to trigger pipelines based
>> on
>> Hudi commits
>>
>>
>> On Fri, Nov 6, 2020 at 9:32 AM Vinoth Chandar <vi...@apache.org> wrote:
>>
>> > Hi all,
>> >
>> > Any one interested in putting up some projects?
>> >
>> > Thanks
>> > Vinoth
>> >
>> > ---------- Forwarded message ---------
>> > From: Sally Khudairi <sk...@apache.org>
>> > Date: Mon, Nov 2, 2020 at 7:52 PM
>> > Subject: [PMCs] Ramping up for Google Summer of Code 2021: invitation to
>> > participate
>> > To: ASF Marketing & Publicity <pr...@apache.org>
>> >
>> >
>> > Hello PMCs --I hope you are all well.
>> >
>> > ASF Community Development (ComDev) oversees our participation in Google
>> > Summer of Code, for which the ASF has been a mentoring organization
>> since
>> > the program's inception 17 years ago.
>> >
>> > ComDev is seeking individuals and PMCs interested in participating as
>> > mentors on behalf of the ASF and Apache Projects.
>> >
>> > The planning and preparation process begins now. The ComDev team are
>> > collecting ideas for the Apache Project's participation in GSoC and
>> want to
>> > hear from you.
>> >
>> > Get started by reviewing the program guidelines at
>> > http://community.apache.org/gsoc.html and be sure to engage your
>> > communities to get involved as well. Ping the ASF's GSoC team at
>> > dev@community.apache.org with any questions.
>> >
>> > Good luck and have a great program!
>> >
>> > Best,
>> > Sally
>> >
>> > - - -
>> > Vice President Marketing & Publicity
>> > Vice President Sponsor Relations
>> > The Apache Software Foundation
>> >
>> > Tel +1 617 921 8656 | sk@apache.org
>> >
>>
>
>
> --
> Regards,
> -Sivabalan
>
-- 
Regards,
-Sivabalan

Re: [PMCs] Ramping up for Google Summer of Code 2021: invitation to participate

Posted by Sivabalan <n....@gmail.com>.
I am also interested and still trying to read more on what kind of projects
we can propose(execution, design, documentation, usability/tools,
performance framework etc), how much efforts we can expect from
students (is it 10 hours per week or 20 hours per week, etc). One thing we
should be mindful is that, we should try our best to think how best we can
help students and ensure they get something meaningful out of working with
us and get a good sense of how open source projects work, code quality we
expect etc. And not give some assorted 10 different tasks for them to
complete. We should try to have standalone projects or cohesive work items
(like devX may be).


On Fri, Nov 6, 2020 at 12:50 PM Raymond Xu <xu...@gmail.com>
wrote:

> I'm interested in being a mentor and would like to create and submit some
> issues. (Sorry wanted to raise this earlier)
>
> To recap it for all:
> There will be college students applying and, once accepted, working on some
> JIRA issues of Apache projects in summer 2021. We are to create issues and
> label them for this program. Those tasks will be assigned to participants
> and worked on around June 2021.
>
> To list some of the possible areas at high level
> - DevX related: code style fix and alignment, nightly build setup, config
> docs auto-generation
> - New features: new indexing schemes, SQL querying of metadata
> - Utilities improvements: new delta streamer sources, a UI, integrations
> with other system e.g Airflow operator/sensor to trigger pipelines based on
> Hudi commits
>
>
> On Fri, Nov 6, 2020 at 9:32 AM Vinoth Chandar <vi...@apache.org> wrote:
>
> > Hi all,
> >
> > Any one interested in putting up some projects?
> >
> > Thanks
> > Vinoth
> >
> > ---------- Forwarded message ---------
> > From: Sally Khudairi <sk...@apache.org>
> > Date: Mon, Nov 2, 2020 at 7:52 PM
> > Subject: [PMCs] Ramping up for Google Summer of Code 2021: invitation to
> > participate
> > To: ASF Marketing & Publicity <pr...@apache.org>
> >
> >
> > Hello PMCs --I hope you are all well.
> >
> > ASF Community Development (ComDev) oversees our participation in Google
> > Summer of Code, for which the ASF has been a mentoring organization since
> > the program's inception 17 years ago.
> >
> > ComDev is seeking individuals and PMCs interested in participating as
> > mentors on behalf of the ASF and Apache Projects.
> >
> > The planning and preparation process begins now. The ComDev team are
> > collecting ideas for the Apache Project's participation in GSoC and want
> to
> > hear from you.
> >
> > Get started by reviewing the program guidelines at
> > http://community.apache.org/gsoc.html and be sure to engage your
> > communities to get involved as well. Ping the ASF's GSoC team at
> > dev@community.apache.org with any questions.
> >
> > Good luck and have a great program!
> >
> > Best,
> > Sally
> >
> > - - -
> > Vice President Marketing & Publicity
> > Vice President Sponsor Relations
> > The Apache Software Foundation
> >
> > Tel +1 617 921 8656 | sk@apache.org
> >
>


-- 
Regards,
-Sivabalan

Re: [PMCs] Ramping up for Google Summer of Code 2021: invitation to participate

Posted by Raymond Xu <xu...@gmail.com>.
I'm interested in being a mentor and would like to create and submit some
issues. (Sorry wanted to raise this earlier)

To recap it for all:
There will be college students applying and, once accepted, working on some
JIRA issues of Apache projects in summer 2021. We are to create issues and
label them for this program. Those tasks will be assigned to participants
and worked on around June 2021.

To list some of the possible areas at high level
- DevX related: code style fix and alignment, nightly build setup, config
docs auto-generation
- New features: new indexing schemes, SQL querying of metadata
- Utilities improvements: new delta streamer sources, a UI, integrations
with other system e.g Airflow operator/sensor to trigger pipelines based on
Hudi commits


On Fri, Nov 6, 2020 at 9:32 AM Vinoth Chandar <vi...@apache.org> wrote:

> Hi all,
>
> Any one interested in putting up some projects?
>
> Thanks
> Vinoth
>
> ---------- Forwarded message ---------
> From: Sally Khudairi <sk...@apache.org>
> Date: Mon, Nov 2, 2020 at 7:52 PM
> Subject: [PMCs] Ramping up for Google Summer of Code 2021: invitation to
> participate
> To: ASF Marketing & Publicity <pr...@apache.org>
>
>
> Hello PMCs --I hope you are all well.
>
> ASF Community Development (ComDev) oversees our participation in Google
> Summer of Code, for which the ASF has been a mentoring organization since
> the program's inception 17 years ago.
>
> ComDev is seeking individuals and PMCs interested in participating as
> mentors on behalf of the ASF and Apache Projects.
>
> The planning and preparation process begins now. The ComDev team are
> collecting ideas for the Apache Project's participation in GSoC and want to
> hear from you.
>
> Get started by reviewing the program guidelines at
> http://community.apache.org/gsoc.html and be sure to engage your
> communities to get involved as well. Ping the ASF's GSoC team at
> dev@community.apache.org with any questions.
>
> Good luck and have a great program!
>
> Best,
> Sally
>
> - - -
> Vice President Marketing & Publicity
> Vice President Sponsor Relations
> The Apache Software Foundation
>
> Tel +1 617 921 8656 | sk@apache.org
>