Posted to general@incubator.apache.org by Daniel Dai <da...@gmail.com> on 2014/12/11 19:02:03 UTC

[VOTE] accept SAMOA into incubator

Following the discussion earlier, I'm calling a vote to accept SAMOA as a
new Incubator project.

[ ] +1 Accept SAMOA into the Incubator
[ ] +0 Indifferent to the acceptance of SAMOA
[ ] -1 Do not accept SAMOA because ...

The vote will be open for at least 72h and closes at the earliest on Dec 14
19:00 GMT.

https://wiki.apache.org/incubator/SAMOAProposal

Thanks,
Daniel

= SAMOA =
== Abstract ==
SAMOA is an open-source platform for mining big data streams.

== Proposal ==
SAMOA provides a collection of distributed streaming algorithms for the
most common data mining and machine learning tasks such as classification,
clustering, and regression, as well as programming abstractions to develop
new algorithms that run on top of distributed stream processing engines
(DSPEs). It features a pluggable architecture that allows it to run on
several DSPEs such as Apache Storm, Apache S4, and Apache Samza.

== Background ==
Hadoop and its ecosystem have changed the way data are processed by
allowing algorithms to be pushed to unprecedented scale. As an example,
Mahout allows data mining and machine learning algorithms to run on very
large datasets. However, Hadoop and Mahout are not suited to handling
streaming data. Simply put, the goal of SAMOA is to provide a streaming
counterpart to Mahout.

== Rationale ==
SAMOA aims to fill the current gap in tools for mining large-scale streams.
Many organizations can benefit from a scalable stream mining platform such
as SAMOA.

SAMOA is a natural fit for the Apache Software Foundation. It is licensed
under the ASL v2.0. It already interoperates with several existing Apache
projects such as Storm, S4, and Samza. Furthermore, it is complementary to
existing Apache projects such as Mahout. The initial committers are
familiar with the Apache process and subscribe to the Apache mission.
Indeed, the team includes multiple Apache committers. Finally, joining
Apache will help coordinate the development effort of the growing number of
organizations which contribute to SAMOA.

== Initial Goals ==
* Move the existing codebase to Apache
* Integrate with the Apache development process
* Incremental development and releases per Apache guidelines

== Current Status ==
SAMOA started as a research project at Yahoo Labs in 2013 and was
open-sourced in October the same year. It has been under development on
Yahoo's public GitHub repository since being open-sourced. It has undergone
two releases (0.1, 0.2).

=== Meritocracy ===
The SAMOA project already operates on meritocratic principles. Today, SAMOA
has several developers and has accepted multiple patches from outside of
Yahoo Labs. However, our intent with this incubator proposal is to start
building a more diverse developer community around SAMOA that follows the
Apache meritocracy model. We will identify all committers and PPMC members
for the project operating under the ASF meritocratic principles. We plan to
continue support for new contributors and work with those who contribute
significantly to the project to make them committers.

=== Community ===
SAMOA is currently being used internally at Yahoo. Acceptance into the
Apache foundation would bolster the existing user and developer community
around SAMOA. That community includes contributors from several
institutions, active mostly on GitHub's pages. SAMOA has been starred more
than 300 times and forked more than 50 times on GitHub as of November 2014.

=== Core Developers ===
The core developers are a diverse group, many of whom are already very
experienced with open source. There are two existing Apache committers,
along with people from various companies and universities.

=== Alignment ===
The ASF is the natural choice to host SAMOA. First, its goal of encouraging
community-driven open-source projects fits with our vision for SAMOA.
Additionally, many other projects that SAMOA is based on, such as Apache
Storm, S4, Samza, and HDFS, are hosted by the ASF. Close proximity of SAMOA
to these projects within the ASF will provide mutual benefit.

== Known Risks ==
=== Orphaned Products ===
Given the current level of investment in SAMOA, the risk of the project
being abandoned is minimal. There are several constituents who are highly
incentivized to continue development, and Yahoo Labs relies on SAMOA as a
platform for a large number of long-term research projects. However, the
small number of initial committers might be a concern. We plan to address
this issue during incubation by growing the community and the number of
committers.

=== Inexperience with Open Source ===
SAMOA has existed as a healthy open source project for one year. During
this time, we have curated an open-source community successfully,
attracting developers from a diverse group of universities and companies
including Huawei, Yahoo, University of Porto, and Universitat Politecnica
de Catalunya.

Gianmarco is a committer for Apache Pig, Matthieu for Apache S4. Albert is
one of the lead developers of MOA, an open-source tool for streaming
machine learning.

=== Homogenous Developers ===
The initial list of committers includes developers from several
institutions, both academic and industrial. The committers are
geographically distributed across Europe, America, and Asia.

=== Reliance on Salaried Developers ===
Like most open source projects, SAMOA receives substantial support from
salaried developers. In addition, those working from within corporations
often devote “after hours” or spare time to the project, and these
developers come from several organizations. We will work to ensure that the
project can continue to be stewarded and move forward independently of
salaried developers.

=== Relationship with Other Apache Products ===
SAMOA interoperates with several existing Apache projects, mainly by using
them as stream processing engines: Apache Storm, Apache S4, and Apache
Samza. It is a counterpart of Apache Mahout for streaming. It also uses
several other Apache components, including Apache Maven and several Apache
Commons libraries.

=== An Excessive Fascination with the Apache Brand ===
SAMOA is already a healthy and relatively well-known open source project.
This proposal is not for the purpose of generating publicity. Rather, the
primary benefits to joining Apache are those outlined in the Rationale
section. We are more interested in establishing a strong community that can
drive the project independently of Yahoo.

== Documentation ==
The reader will find these websites relevant:

* SAMOA website: http://samoa-project.net/
* SAMOA documentation: https://github.com/yahoo/samoa/wiki/
* Issue tracking: https://github.com/yahoo/samoa/issues
* Codebase: https://github.com/yahoo/samoa
* User group: http://groups.google.com/group/samoa-user

== Initial Source ==
The SAMOA codebase is currently hosted on GitHub:
https://github.com/yahoo/samoa. This is the exact codebase that we would
migrate to the Apache foundation.

== Source and Intellectual Property Submission Plan ==
Currently, the SAMOA codebase is distributed under the Apache License v2.0.
The vast majority of code has copyright held by Yahoo. Upon entering the
Incubator, Yahoo will grant a license to the Apache foundation. In certain
cases where individuals or organizations hold copyright, we will ensure
they grant a license to the Apache foundation. Going forward, all commits
will be licensed directly to the Apache foundation through signed
Individual Contributor License Agreements from all committers on the project.

== Cryptography ==
We do not expect SAMOA to be a controlled export item due to the use of
encryption.

== External Dependencies ==
To the best of our knowledge, all dependencies of SAMOA are distributed
under Apache compatible licenses. Upon acceptance to the incubator, we
would begin a thorough analysis of all transitive dependencies to verify
this fact and introduce license checking into the build and release process
(for instance, by integrating Apache Rat).

== Required Resources ==
=== Mailing Lists ===
We will migrate the existing SAMOA mailing lists as follows:

* samoa-users@googlegroups --> users@samoa.incubator.apache.org
* samoa-developers@googlegroups --> dev@samoa.incubator.apache.org

SAMOA commits are hosted on GitHub, so we would request the following
mailing list:

* commits@samoa.incubator.apache.org

We would also request the following mailing list:

* private@samoa.incubator.apache.org (with moderated subscription)

=== Source control ===
The SAMOA team would like to use Git for source control, due to our current
use of Git. We request a writeable Git repo for SAMOA, and mirroring to be
set up to GitHub through INFRA.

* https://git-wip-us.apache.org/repos/asf/incubator-samoa.git

=== Issue Tracking ===
SAMOA currently uses GitHub for issue tracking. We will migrate to the
Apache JIRA instance. http://issues.apache.org/jira/browse/SAMOA

== Initial Committers & Affiliations ==
* Albert Bifet, Huawei, <abifet at waikato dot ac dot nz>
* Gianmarco De Francisci Morales, Yahoo Labs, <gdfm at apache dot org>
* Nicolas Kourtellis, Yahoo Labs, <nkourtellis at gmail dot com>
* Matthieu Morel, Yahoo Labs, <mmorel at apache dot org>
* Arinto Murdopo, Living Analytics Research Centre, <arintom at smu dot edu dot sg>
* Olivier Van Laere, BlueShift Labs, <olivier at getblueshift dot com>

== Sponsors ==
=== Champion ===
* Daniel Dai <daijy at apache dot org>

=== Nominated Mentors ===
* Alan Gates <gates at apache dot org>
* Ted Dunning <tdunning at apache dot org>
* Ashutosh Chauhan <hashutosh at apache dot org>
* Enis Soztutar <enis at apache dot org>

=== Sponsoring Entity ===
The Apache Incubator

Re: [VOTE] accept SAMOA into incubator

Posted by Roman Shaposhnik <ro...@shaposhnik.org>.
On Thu, Dec 11, 2014 at 10:02 AM, Daniel Dai <da...@gmail.com> wrote:
> Following the discussion earlier, I'm calling a vote to accept SAMOA as a
> new Incubator project.
>
> [ ] +1 Accept SAMOA into the Incubator
> [ ] +0 Indifferent to the acceptance of SAMOA
> [ ] -1 Do not accept SAMOA because ...

+1 (binding)

Thanks,
Roman.

---------------------------------------------------------------------
To unsubscribe, e-mail: general-unsubscribe@incubator.apache.org
For additional commands, e-mail: general-help@incubator.apache.org


Re: [VOTE] accept SAMOA into incubator

Posted by "Alan D. Cabrera" <li...@toolazydogs.com>.
+1 binding


Regards,
Alan

On Dec 11, 2014, at 10:02 AM, Daniel Dai <da...@gmail.com> wrote:

> Following the discussion earlier, I'm calling a vote to accept SAMOA as a
> new Incubator project.
> 
> [ ] +1 Accept SAMOA into the Incubator
> [ ] +0 Indifferent to the acceptance of SAMOA
> [ ] -1 Do not accept SAMOA because ...




Re: [VOTE] accept SAMOA into incubator

Posted by Andrew Purtell <ap...@apache.org>.
+1 (binding)

On Thu, Dec 11, 2014 at 10:02 AM, Daniel Dai <da...@gmail.com> wrote:

> Following the discussion earlier, I'm calling a vote to accept SAMOA as a
> new Incubator project.
>
> [ ] +1 Accept SAMOA into the Incubator
> [ ] +0 Indifferent to the acceptance of SAMOA
> [ ] -1 Do not accept SAMOA because ...



-- 
Best regards,

   - Andy

Problems worthy of attack prove their worth by hitting back. - Piet Hein
(via Tom White)

Re: [VOTE] accept SAMOA into incubator

Posted by Enis Söztutar <en...@apache.org>.
+1 (binding)

Enis

On Thu, Dec 11, 2014 at 10:07 AM, Henry Saputra <he...@gmail.com>
wrote:

> +1 (binding)
>
>
> - Henry
>
> On Thu, Dec 11, 2014 at 10:02 AM, Daniel Dai <da...@gmail.com> wrote:
> > Following the discussion earlier, I'm calling a vote to accept SAMOA as a
> > new Incubator project.
> >
> > [ ] +1 Accept SAMOA into the Incubator
> > [ ] +0 Indifferent to the acceptance of SAMOA
> > [ ] -1 Do not accept SAMOA because ...

Re: [VOTE] accept SAMOA into incubator

Posted by Henry Saputra <he...@gmail.com>.
+1 (binding)


- Henry

---------------------------------------------------------------------
To unsubscribe, e-mail: general-unsubscribe@incubator.apache.org
For additional commands, e-mail: general-help@incubator.apache.org


Re: [VOTE] accept SAMOA into incubator

Posted by Daniel Dai <da...@gmail.com>.
With 8 binding +1s (Henry Saputra, Enis Söztutar, Roman Shaposhnik,
Konstantin Boudnik, Jakob Homan, Andrew Purtell, John D. Ament and Daniel
Dai) and 2 non-binding +1s (P. Taylor Goetz, Naresh Agarwal), the vote
passes. Thanks, everyone, for taking the time to vote. I will proceed with
the next steps.

Thanks,
Daniel

Re: [VOTE] accept SAMOA into incubator

Posted by Naresh Agarwal <na...@inmobi.com>.
+1 (non-binding)

Thanks
Naresh

On Fri, Dec 12, 2014 at 7:22 AM, John D. Ament <jo...@gmail.com>
wrote:
>
> +1 binding
>
> On Thu Dec 11 2014 at 5:10:50 PM Konstantin Boudnik <co...@apache.org>
> wrote:
>
> > +1 (binding).
> >
> > A small comment: we don't do users@ lists for podlings, do we? If so,
> >  samoa-users@googlegroups --> users@samoa.incubator.apache.org
> > will need to be merged into dev@.
> >
> >
> Not all podlings use a users@, but they can if they like.  Usually if it's
> coming from an established community there will be one.
>
>
> > Cos
> >
> > On Thu, Dec 11, 2014 at 10:02AM, Daniel Dai wrote:
> > > Following the discussion earlier, I'm calling a vote to accept SAMOA
> as a
> > > new Incubator project.
> > >
> > > [ ] +1 Accept SAMOA into the Incubator
> > > [ ] +0 Indifferent to the acceptance of SAMOA
> > > [ ] -1 Do not accept SAMOA because ...
> > >
> > > The vote will be open for at least 72h and closes at the earliest on
> Dec
> > 14
> > > 19:00 GMT.
> > >
> > > https://wiki.apache.org/incubator/SAMOAProposal
> > >
> > > Thanks,
> > > Daniel
> > >
> > > = SAMOA =
> > > == Abstract ==
> > > SAMOA is an an open-source platform for mining big data streams.
> > >
> > > == Proposal ==
> > > SAMOA provides a collection of distributed streaming algorithms for the
> > > most common data mining and machine learning tasks such as
> > classification,
> > > clustering, and regression, as well as programming abstractions to
> > develop
> > > new algorithms that run on top of distributed stream processing engines
> > > (DSPEs). It features a pluggable architecture that allows it to run on
> > > several DSPEs such as Apache Storm, Apache S4, and Apache Samza.
> > >
> > > == Background ==
> > > Hadoop and its ecosystem have changed the way data are processed by
> > > allowing to push algorithms to unprecedented scale. As an example,
> Mahout
> > > allows to run data mining and machine learning algorithms on very large
> > > datasets. However, Hadoop and Mahout are not suited to handle streaming
> > > data. Simply put, the goal of SAMOA is to provide a streaming
> counterpart
> > > to Mahout.
> > >
> > > == Rationale ==
> > > SAMOA aims to fill the current gap in tools for mining large scale
> > streams.
> > > Many organizations can benefit from a scalable stream mining platform
> > > system such as SAMOA.
> > >
> > > SAMOA is a natural fit for the Apache Software Foundation. It is
> licensed
> > > under the ASL v2.0. It already interoperates with several existing
> Apache
> > > projects such as Storm, S4, and Samza. Furthermore, it is complementary
> > to
> > > existing Apache projects such as Mahout. The initial committers are
> > > familiar with the Apache process and subscribes to the Apache mission.
> > > Indeed, the team includes multiple Apache committers. Finally, joining
> > > Apache will help coordinate the development effort of the growing
> number
> > of
> > > organizations which contribute to SAMOA.
> > >
> > > == Initial Goals ==
> > > * Move the existing codebase to Apache
> > > * Integrate with the Apache development process
> > > * Incremental development and releases per Apache guidelines
> > >
> > > == Current Status ==
> > > SAMOA started as a research project at Yahoo Labs in 2013 and was
> > > open-sourced in October the same year. It has been under development on
> > > Yahoo's public GitHub repository since being open-sourced. It has
> > undergone
> > > two releases (0.1, 0.2).
> > >
> > > === Meritocracy ===
> > > The SAMOA project already operates on meritocratic principles. Today,
> > SAMOA
> > > has several developers and has accepted multiple patches from outside
> of
> > > Yahoo Labs. However, our intent with this incubator proposal is to
> start
> > > building a more diverse developer community around SAMOA that follows
> the
> > > Apache meritocracy model. We will identify all committers and PPMC
> > members
> > > for the project operating under the ASF meritocratic principles. We
> plan
> > to
> > > continue support for new contributors and work with those who
> contribute
> > > significantly to the project to make them committers.
> > >
> > > === Community ===
> > > SAMOA is currently being used internally at Yahoo. Acceptance into the
> > > Apache foundation would bolster the existing user and developer
> community
> > > around SAMOA. That community includes contributors from several
> > > institutions, active mostly on GitHub's pages. SAMOA has been starred
> > more
> > > than 300 times and forked more than 50 times on GitHub as of November
> > 2014.
> > >
> > > === Core Developers ===
> > > The core developers are a diverse group, many of which already very
> > > experienced with open source. There are two existing Apache committers,
> > > along with people from various companies and universities.
> > >
> > > === Alignment ===
> > > The ASF is the natural choice to host SAMOA. First, its goal of
> > encouraging
> > > community-driven open-source projects fits with our vision for SAMOA.
> > > Additionally, many other projects that SAMOA is based on, such as
> Apache
> > > Storm, S4, Samza, and HDFS, are hosted by the ASF. Close proximity of
> > SAMOA
> > > to these projects within the ASF will provide mutual benefit.
> > >
> > > == Known Risks ==
> > > === Orphaned Products ===
> > > Given the current level of investment in SAMOA the risk of the project
> > > being abandoned is minimal. There are several constituents who are
> highly
> > > incentivized to continue development, and Yahoo Labs relies on SAMOA
> as a
> > > platform for a large number of long-term research projects. However,
> the
> > > small number of initial committers might be a concern. We plan to
> address
> > > this issue during incubation by growing the community and the number of
> > > committers.
> > >
> > > === Inexperience with Open Source ===
> > > SAMOA has existed as a healthy open source project for one year. During
> > > this time, we have curated an open-source community successfully,
> > > attracting developers from a diverse group of universities and
> companies
> > > including Huawei, Yahoo, University of Porto, and Universitat
> Politecnica
> > > de Catalunya.
> > >
> > > Gianmarco is a committer for Apache Pig, Matthieu for Apache S4. Albert
> > is
> > > one of the lead developers of MOA, an open-source tool for streaming
> > > machine learning.
> > >
> > > === Homogenous Developers ===
> > > The initial list of committers includes developers from several
> > > institutions, both academic and industrial. The committers are
> > > geographically distributed across Europe, America, and Asia.
> > >
> > > === Reliance on Salaried Developers ===
> > > Like most open source projects, SAMOA receives substantial support from
> > > salaried developers. In addition, those working from within corporations
> > > often devote “after hours” or spare time to the project - and these come
> > > from several organizations. We will work to ensure the ability for the
> > > project to continuously be stewarded and to proceed forward independently
> > > of salaried developers.
> > >
> > > === Relationship with Other Apache Products ===
> > > SAMOA interoperates with several existing Apache projects, mainly by using
> > > them as stream processing engines: Apache Storm, Apache S4, and Apache
> > > Samza. It is a counterpart of Apache Mahout for streaming. It also uses
> > > several other Apache components, including Apache Maven and several Apache
> > > Commons libraries.
> > >
> > > === An Excessive Fascination with the Apache Brand ===
> > > SAMOA is already a healthy and relatively well known open source project.
> > > This proposal is not for the purpose of generating publicity. Rather, the
> > > primary benefits to joining Apache are those outlined in the Rationale
> > > section. We are more interested in establishing a strong community that can
> > > drive the project independently of Yahoo.
> > >
> > > == Documentation ==
> > > The reader will find these websites relevant:
> > >
> > > * SAMOA website: http://samoa-project.net/
> > > * SAMOA documentation: https://github.com/yahoo/samoa/wiki/
> > > * Issue tracking: https://github.com/yahoo/samoa/issues
> > > * Codebase: https://github.com/yahoo/samoa
> > > * User group: http://groups.google.com/group/samoa-user
> > >
> > > == Initial Source ==
> > > The SAMOA codebase is currently hosted on GitHub:
> > > https://github.com/yahoo/samoa. This is the exact codebase that we would
> > > migrate to the Apache foundation.
> > >
> > > == Source and Intellectual Property Submission Plan ==
> > > Currently, the SAMOA codebase is distributed under an Apache license v2.0.
> > > The vast majority of code has copyright held by Yahoo. Upon entering the
> > > Incubator, Yahoo will grant a license to the Apache foundation. In certain
> > > cases where individuals or organizations hold copyright, we will ensure
> > > they grant a license to the Apache foundation. Going forward, all commits
> > > will be licensed directly to the Apache foundation through our signed
> > > Individual Contributor License Agreements for all committers on the project.
> > >
> > > == Cryptography ==
> > > We do not expect SAMOA to be a controlled export item due to the use of
> > > encryption.
> > >
> > > == External Dependencies ==
> > > To the best of our knowledge, all dependencies of SAMOA are distributed
> > > under Apache compatible licenses. Upon acceptance to the incubator, we
> > > would begin a thorough analysis of all transitive dependencies to verify
> > > this fact and introduce license checking into the build and release process
> > > (for instance integrating Apache Rat).
> > >
> > > == Required Resources ==
> > > === Mailing Lists ===
> > > We will migrate the existing SAMOA mailing lists as follows:
> > >
> > > * samoa-users@googlegroups --> users@samoa.incubator.apache.org
> > > * samoa-developers@googlegroups --> dev@samoa.incubator.apache.org
> > >
> > > SAMOA commits are hosted on GitHub, so we would request the following
> > > mailing list:
> > >
> > > * commits@samoa.incubator.apache.org
> > >
> > > We would also request the following mailing list:
> > >
> > > * private@samoa.incubator.apache.org (with moderated subscription)
> > >
> > > === Source control ===
> > > The SAMOA team would like to use Git for source control, due to our current
> > > use of Git. We request a writeable Git repo for SAMOA, and mirroring to be
> > > set up to GitHub through INFRA.
> > >
> > > * https://git-wip-us.apache.org/repos/asf/incubator-samoa.git
> > >
> > > === Issue Tracking ===
> > > SAMOA currently uses GitHub for issue tracking. We will migrate to the
> > > Apache JIRA instance. http://issues.apache.org/jira/browse/SAMOA
> > >
> > > == Initial Committers & Affiliations ==
> > > * Albert Bifet, Huawei, <abifet at waikato dot ac dot nz>
> > > * Gianmarco De Francisci Morales, Yahoo Labs, <gdfm at apache dot org>
> > > * Nicolas Kourtellis, Yahoo Labs, <nkourtellis at gmail dot com>
> > > * Matthieu Morel, Yahoo Labs, <mmorel at apache dot org>
> > > * Arinto Murdopo, Living Analytics Research Centre, <arintom at smu dot edu dot sg>
> > > * Olivier Van Laere, BlueShift Labs, <olivier at getblueshift dot com>
> > >
> > > == Sponsors ==
> > > === Champion ===
> > > * Daniel Dai <daijy at apache dot org>
> > >
> > > === Nominated Mentors ===
> > > * Alan Gates <gates at apache dot org>
> > > * Ted Dunning <tdunning at apache dot org>
> > > * Ashutosh Chauhan <hashutosh at apache dot org>
> > > * Enis Soztutar <enis at apache dot org>
> > >
> > > === Sponsoring Entity ===
> > > The Apache Incubator
> >
> > ---------------------------------------------------------------------
> > To unsubscribe, e-mail: general-unsubscribe@incubator.apache.org
> > For additional commands, e-mail: general-help@incubator.apache.org
> >
> >
>

Re: [VOTE] accept SAMOA into incubator

Posted by "John D. Ament" <jo...@gmail.com>.
+1 binding

On Thu Dec 11 2014 at 5:10:50 PM Konstantin Boudnik <co...@apache.org> wrote:

> +1 (binding).
>
> A small comment: we don't do users@ list of podlings, do we? If so
>  samoa-users@googlegroups --> users@samoa.incubator.apache.org
> will need to be converged into dev@.
>
>
Not all podlings use a users@, but they can if they like.  Usually if it's
coming from an established community there will be one.


> Cos

Re: [VOTE] accept SAMOA into incubator

Posted by Jakob Homan <jg...@gmail.com>.
+1 (binding)

On Thu, Dec 11, 2014 at 2:08 PM, Konstantin Boudnik <co...@apache.org> wrote:
> +1 (binding).
>
> A small comment: we don't do users@ list of podlings, do we? If so
>  samoa-users@googlegroups --> users@samoa.incubator.apache.org
> will need to be converged into dev@.
>
> Cos


Re: [VOTE] accept SAMOA into incubator

Posted by Konstantin Boudnik <co...@apache.org>.
+1 (binding).

A small comment: we don't do users@ list of podlings, do we? If so
 samoa-users@googlegroups --> users@samoa.incubator.apache.org 
will need to be converged into dev@.

Cos

On Thu, Dec 11, 2014 at 10:02AM, Daniel Dai wrote:
> Following the discussion earlier, I'm calling a vote to accept SAMOA as a
> new Incubator project.
> 
> [ ] +1 Accept SAMOA into the Incubator
> [ ] +0 Indifferent to the acceptance of SAMOA
> [ ] -1 Do not accept SAMOA because ...
> 
> The vote will be open for at least 72h and closes at the earliest on Dec 14
> 19:00 GMT.
> 
> https://wiki.apache.org/incubator/SAMOAProposal
> 
> Thanks,
> Daniel
> 
> = SAMOA =
> == Abstract ==
> SAMOA is an open-source platform for mining big data streams.
> 
> == Proposal ==
> SAMOA provides a collection of distributed streaming algorithms for the
> most common data mining and machine learning tasks such as classification,
> clustering, and regression, as well as programming abstractions to develop
> new algorithms that run on top of distributed stream processing engines
> (DSPEs). It features a pluggable architecture that allows it to run on
> several DSPEs such as Apache Storm, Apache S4, and Apache Samza.
> 
> == Background ==
> Hadoop and its ecosystem have changed the way data are processed by
> allowing to push algorithms to unprecedented scale. As an example, Mahout
> allows to run data mining and machine learning algorithms on very large
> datasets. However, Hadoop and Mahout are not suited to handle streaming
> data. Simply put, the goal of SAMOA is to provide a streaming counterpart
> to Mahout.
> 
> == Rationale ==
> SAMOA aims to fill the current gap in tools for mining large scale streams.
> Many organizations can benefit from a scalable stream mining platform
> system such as SAMOA.
> 
> SAMOA is a natural fit for the Apache Software Foundation. It is licensed
> under the ASL v2.0. It already interoperates with several existing Apache
> projects such as Storm, S4, and Samza. Furthermore, it is complementary to
> existing Apache projects such as Mahout. The initial committers are
> familiar with the Apache process and subscribe to the Apache mission.
> Indeed, the team includes multiple Apache committers. Finally, joining
> Apache will help coordinate the development effort of the growing number of
> organizations which contribute to SAMOA.
> 
> == Initial Goals ==
> * Move the existing codebase to Apache
> * Integrate with the Apache development process
> * Incremental development and releases per Apache guidelines
> 
> == Current Status ==
> SAMOA started as a research project at Yahoo Labs in 2013 and was
> open-sourced in October the same year. It has been under development on
> Yahoo's public GitHub repository since being open-sourced. It has undergone
> two releases (0.1, 0.2).
> 
> === Meritocracy ===
> The SAMOA project already operates on meritocratic principles. Today, SAMOA
> has several developers and has accepted multiple patches from outside of
> Yahoo Labs. However, our intent with this incubator proposal is to start
> building a more diverse developer community around SAMOA that follows the
> Apache meritocracy model. We will identify all committers and PPMC members
> for the project operating under the ASF meritocratic principles. We plan to
> continue support for new contributors and work with those who contribute
> significantly to the project to make them committers.
> 
> === Community ===
> SAMOA is currently being used internally at Yahoo. Acceptance into the
> Apache foundation would bolster the existing user and developer community
> around SAMOA. That community includes contributors from several
> institutions, active mostly on GitHub's pages. SAMOA has been starred more
> than 300 times and forked more than 50 times on GitHub as of November 2014.
> 
> === Core Developers ===
> The core developers are a diverse group, many of whom are already very
> experienced with open source. There are two existing Apache committers,
> along with people from various companies and universities.
> 
> === Alignment ===
> The ASF is the natural choice to host SAMOA. First, its goal of encouraging
> community-driven open-source projects fits with our vision for SAMOA.
> Additionally, many other projects that SAMOA is based on, such as Apache
> Storm, S4, Samza, and HDFS, are hosted by the ASF. Close proximity of SAMOA
> to these projects within the ASF will provide mutual benefit.
> 
> == Known Risks ==
> === Orphaned Products ===
> Given the current level of investment in SAMOA, the risk of the project
> being abandoned is minimal. There are several constituents who are highly
> incentivized to continue development, and Yahoo Labs relies on SAMOA as a
> platform for a large number of long-term research projects. However, the
> small number of initial committers might be a concern. We plan to address
> this issue during incubation by growing the community and the number of
> committers.
> 
> === Inexperience with Open Source ===
> SAMOA has existed as a healthy open source project for one year. During
> this time, we have curated an open-source community successfully,
> attracting developers from a diverse group of universities and companies
> including Huawei, Yahoo, University of Porto, and Universitat Politecnica
> de Catalunya.
> 
> Gianmarco is a committer for Apache Pig, Matthieu for Apache S4. Albert is
> one of the lead developers of MOA, an open-source tool for streaming
> machine learning.
> 
> === Homogenous Developers ===
> The initial list of committers includes developers from several
> institutions, both academic and industrial. The committers are
> geographically distributed across Europe, America, and Asia.
> 
> === Reliance on Salaried Developers ===
> Like most open source projects, SAMOA receives substantial support from
> salaried developers. In addition, those working from within corporations
> often devote “after hours” or spare time to the project, and these
> developers come from several organizations. We will work to ensure that the
> project can be stewarded continuously and can move forward independently
> of salaried developers.
> 
> === Relationship with Other Apache Products ===
> SAMOA interoperates with several existing Apache projects, mainly by using
> them as stream processing engines: Apache Storm, Apache S4, and Apache
> Samza. It is a counterpart of Apache Mahout for streaming. It also uses
> several other Apache components, including Apache Maven and several Apache
> Commons libraries.
> 
> === An Excessive Fascination with the Apache Brand ===
> SAMOA is already a healthy and relatively well known open source project.
> This proposal is not for the purpose of generating publicity. Rather, the
> primary benefits to joining Apache are those outlined in the Rationale
> section. We are more interested in establishing a strong community that can
> drive the project independently of Yahoo.
> 
> == Documentation ==
> The reader will find these websites relevant:
> 
> * SAMOA website: http://samoa-project.net/
> * SAMOA documentation: https://github.com/yahoo/samoa/wiki/
> * Issue tracking: https://github.com/yahoo/samoa/issues
> * Codebase: https://github.com/yahoo/samoa
> * User group: http://groups.google.com/group/samoa-user
> 
> == Initial Source ==
> The SAMOA codebase is currently hosted on GitHub:
> https://github.com/yahoo/samoa. This is the exact codebase that we would
> migrate to the Apache foundation.
> 
> == Source and Intellectual Property Submission Plan ==
> Currently, the SAMOA codebase is distributed under the Apache License v2.0.
> The vast majority of code has copyright held by Yahoo. Upon entering the
> Incubator, Yahoo will grant a license to the Apache foundation. In certain
> cases where individuals or organizations hold copyright, we will ensure
> they grant a license to the Apache foundation. Going forward, all commits
> will be licensed directly to the Apache foundation through our signed
> Individual Contributor License Agreements for all committers on the project.
> 
> == Cryptography ==
> We do not expect SAMOA to be a controlled export item due to the use of
> encryption.
> 
> == External Dependencies ==
> To the best of our knowledge, all dependencies of SAMOA are distributed
> under Apache compatible licenses. Upon acceptance to the incubator, we
> would begin a thorough analysis of all transitive dependencies to verify
> this fact and introduce license checking into the build and release process
> (for instance integrating Apache Rat).
> 
> == Required Resources ==
> === Mailing Lists ===
> We will migrate the existing SAMOA mailing lists as follows:
> 
> * samoa-users@googlegroups --> users@samoa.incubator.apache.org
> * samoa-developers@googlegroups --> dev@samoa.incubator.apache.org
> 
> SAMOA commits are hosted on GitHub, so we would request the following
> mailing list:
> 
> * commits@samoa.incubator.apache.org
> 
> We would also request the following mailing list:
> 
> * private@samoa.incubator.apache.org (with moderated subscription)
> 
> === Source control ===
> The SAMOA team would like to use Git for source control, due to our current
> use of Git. We request a writeable Git repo for SAMOA, and mirroring to be
> set up to GitHub through INFRA.
> 
> * https://git-wip-us.apache.org/repos/asf/incubator-samoa.git
> 
> === Issue Tracking ===
> SAMOA currently uses GitHub for issue tracking. We will migrate to the
> Apache JIRA instance. http://issues.apache.org/jira/browse/SAMOA
> 
> == Initial Committers & Affiliations ==
> * Albert Bifet, Huawei, <abifet at waikato dot ac dot nz>
> * Gianmarco De Francisci Morales, Yahoo Labs, <gdfm at apache dot org>
> * Nicolas Kourtellis, Yahoo Labs, <nkourtellis at gmail dot com>
> * Matthieu Morel, Yahoo Labs, <mmorel at apache dot org>
> * Arinto Murdopo, Living Analytics Research Centre, <arintom at smu dot edu dot sg>
> * Olivier Van Laere, BlueShift Labs, <olivier at getblueshift dot com>
> 
> == Sponsors ==
> === Champion ===
> * Daniel Dai <daijy at apache dot org>
> 
> === Nominated Mentors ===
> * Alan Gates <gates at apache dot org>
> * Ted Dunning <tdunning at apache dot org>
> * Ashutosh Chauhan <hashutosh at apache dot org>
> * Enis Soztutar <enis at apache dot org>
> 
> === Sponsoring Entity ===
> The Apache Incubator



Re: [VOTE] accept SAMOA into incubator

Posted by "P. Taylor Goetz" <pt...@gmail.com>.
+1 (non-binding)

I’d be happy to help if SAMOA needs anything from the Storm project.

-Taylor
