Posted to general@hadoop.apache.org by Nigel Daley <nd...@mac.com> on 2011/02/11 08:08:46 UTC

[VOTE] Abandon mrunit MapReduce contrib

I think the PMC should abandon the mrunit MapReduce contrib component.  The originator of mrunit and primary maintainer (Aaron Kimball) is moving the active development elsewhere. 

There are 2 unresolved contrib/mrunit issues in Jira, neither of them Patch Available.

Here is my +1.

Nige

Re: [VOTE] Abandon mrunit MapReduce contrib

Posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov>.
Hi All,

> Votes to remove code should be because the PMC doesn't think the code is
> worth maintaining any more. I don't think that applies in this case. Aaron
> is a committer in Hadoop now and I'd strongly encourage you both to give
> your changes back to Apache instead of forking it into GitHub.

+1.

Cheers,
Chris

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Senior Computer Scientist
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 171-266B, Mailstop: 171-246
Email: chris.a.mattmann@nasa.gov
WWW:   http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Adjunct Assistant Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++


Re: [VOTE] Abandon mrunit MapReduce contrib

Posted by Steve Loughran <st...@apache.org>.
On 11/02/11 22:26, Eric Sammer wrote:
> Just to add to the option of going to incubator, I'm fine with that as well.
> Github was an easy thing to get started and I was under the impression we
> needed some greater degree of committer diversity and, frankly, a bigger
> project. If mrunit is a candidate, keeping this under the ASF umbrella is
> more than fine with me.
>

There is a git repository coming up at Apache, with global read and 
LDAP-authenticated write:
https://issues.apache.org/jira/browse/INFRA-3165

I know a lot of the in-house Hadoop teams use Git for SCM, so I think 
having the big Hadoop projects work on Git too makes sense; it would 
certainly help Owen and team get their changes over if their branches 
could be merged.

Starting off with a small but active Hadoop family project would be the 
way to start this, and MRUnit does appeal as a first step:
  - active development
  - fast and decoupled release cycle
  - off the critical path

I'll volunteer to go down as a committer, with the usual caveat that I am 
over-committed and, unless I have a direct personal need to touch the 
code, I'll be effectively offline. Tom White can vouch for that, given my 
engagement in Whirr.

Here, then, are my votes:

  +1 to move MRUnit into incubator
  +1 to experiment with MRUnit living in Apache's Git repository.

-steve

Re: [VOTE] Abandon mrunit MapReduce contrib

Posted by Arun C Murthy <ac...@yahoo-inc.com>.
On Feb 11, 2011, at 9:57 PM, Nigel Daley wrote:

> This is great!  So we'll leave mrunit in contrib until it can be  
> moved to incubator.
>

+1



Re: [VOTE] Abandon mrunit MapReduce contrib

Posted by Nigel Daley <nd...@mac.com>.
mrunit has been moved to the incubator:
https://svn.apache.org/repos/asf/incubator/mrunit/

I'll file an issue to remove the code from Hadoop SVN.

Nige



Re: [VOTE] Abandon mrunit MapReduce contrib

Posted by Nigel Daley <nd...@mac.com>.
This is great!  So we'll leave mrunit in contrib until it can be moved to incubator.

Nige



Re: [VOTE] Abandon mrunit MapReduce contrib

Posted by Eric Sammer <es...@cloudera.com>.
Just to add to the option of going to incubator, I'm fine with that as well.
Github was an easy thing to get started and I was under the impression we
needed some greater degree of committer diversity and, frankly, a bigger
project. If mrunit is a candidate, keeping this under the ASF umbrella is
more than fine with me.

On Fri, Feb 11, 2011 at 5:10 PM, Aaron Kimball <ak...@gmail.com> wrote:

> The main reason I am interested in removing MRUnit from Hadoop is that I
> believe that MRUnit deserves its own release cycle. I think this is in the
> best interest of its users.
>
> MRUnit is valuable to users of several different versions of Hadoop. But
> MRUnit has only ever been committed to version 0.21 and above -- even though
> in practice, the majority (dare I say--all) of its users are running on
> 0.20. The only place today to get a version of MRUnit compatible with 0.20
> has been through a Cloudera release, which backported the entire MRUnit
> patchset.
>
> My thoughts on MRUnit in 0.20.100 resonate with Eric's. There will be
> further fixes to MRUnit and its lightweight codebase can be released far
> more rapidly than whenever the next 0.20.1xx release of Hadoop would occur.
> Given that MRUnit has already been in the repository since April 2009 (see
> https://issues.apache.org/jira/browse/HADOOP-5518) and has yet to see an
> Apache 0.20-based release, I do not think it is in the best interest of the
> library's userbase to couple MRUnit's release cycle to that of Hadoop
> itself.
>
> Perhaps more importantly, access to new features in MRUnit should not
> require upgrading one's entire Hadoop deployment; this is a client library
> that depends only on Hadoop's public APIs.
>
> My primary concern is to move MRUnit to a place where the community can
> derive the most benefit from it. The Apache Incubator could fulfill this
> role; given the presence of individuals willing to mentor this project, I
> believe this would be a successful way to release MRUnit more quickly and
> continue to work to grow the MRUnit community.
>
> Regards,
> - Aaron
>
>
> On Fri, Feb 11, 2011 at 11:57 AM, Mattmann, Chris A (388J) <
> chris.a.mattmann@jpl.nasa.gov> wrote:
>
> > Awesome Patrick, we'd probably need one more active mentor. Any takers?
> >
> > After we get that, then we cook up a proposal on the Incubator wiki here
> > [1], and follow the process here [2] to get started...
> >
> > Cheers,
> > Chris
> >
> > [1] http://wiki.apache.org/incubator/MRUnitProposal
> > [2] http://incubator.apache.org/guides/proposal.html
> >
> > On Feb 11, 2011, at 11:52 AM, Patrick Hunt wrote:
> >
> > > On Fri, Feb 11, 2011 at 9:44 AM, Mattmann, Chris A (388J)
> > > <ch...@jpl.nasa.gov> wrote:
> > >> Guys, BTW, if you need help or a mentor in Apache Incubator-ville for
> > >> MRUnit, I would be happy to help.
> > >
> > > I was going to suggest the same thing (mrunit to incubator). I would
> > > also be happy to be a mentor.
> > >
> > > Patrick
> > >
> > >>
> > >> On Feb 11, 2011, at 9:04 AM, Eric Sammer wrote:
> > >>
> > >>> On Fri, Feb 11, 2011 at 11:48 AM, Owen O'Malley <om...@apache.org> wrote:
> > >>>
> > >>>> On Feb 11, 2011, at 8:02 AM, Eric Sammer wrote:
> > >>>>
> > >>>>> - allow mrunit to have its own release cycle. This is, I think, the most
> > >>>>> important.
> > >>>>
> > >>>> If you submit your work to Apache we can evaluate it for inclusion in the
> > >>>> 0.20.100 branch to get your changes released in a timely manner.
> > >>>
> > >>>
> > >>> I'm thinking in general (beyond the next immediate release). Independent of
> > >>> where mrunit goes, I think it should leave the contrib tree to facilitate
> > >>> light weight releases (the dependency on Hadoop proper is a public facing
> > >>> API - a pure client). I think most projects could benefit from this with the
> > >>> exception of things that are tightly coupled to Hadoop releases or touch
> > >>> non-public APIs.
> > >>>
> > >>>
> > >>>>> I would actually prefer to move it to Extras or Incubator and leave this
> > >>>>> within the ASF.
> > >>>>
> > >>>> Extras is **NOT** inside of the ASF. Extras is a source hosting system for
> > >>>> non-Apache projects that are related to Apache projects.
> > >>>
> > >>>
> > >>> Got it. Thanks for correcting me. I only mentioned it because someone
> > >>> suggested it to me initially.
> > >>>
> > >>>
> > >>>>> Right now, I picked github because of the ability to easily
> > >>>>> collaborate with others (and to use git).
> > >>>>
> > >>>> I agree that it is unfortunate that Apache doesn't yet support read-write
> > >>>> git access. However, you'll find that building a community is easier at
> > >>>> Apache than at github.
> > >>>>
> > >>>> -- Owen
> > >>>>
> > >>>
> > >>>
> > >>>
> > >>> --
> > >>> Eric Sammer
> > >>> twitter: esammer
> > >>> data: www.cloudera.com
>



-- 
Eric Sammer
twitter: esammer
data: www.cloudera.com

Re: [VOTE] Abandon mrunit MapReduce contrib

Posted by Aaron Kimball <ak...@gmail.com>.
I have made some revisions and improvements here too.
- Aaron

On Sat, Feb 19, 2011 at 6:52 PM, Mattmann, Chris A (388J) <
chris.a.mattmann@jpl.nasa.gov> wrote:

> Hi Guys,
>
> I took a pass at the MRUnit proposal [1]. It's far from complete, but just
> letting you know I took a swipe :)
>
> Cheers,
> Chris
>
> [1] http://wiki.apache.org/incubator/MRUnitProposal
>

Re: [VOTE] Abandon mrunit MapReduce contrib

Posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov>.
Hi Guys,

I took a pass at the MRUnit proposal [1]. It's far from complete, but just letting you know I took a swipe :)

Cheers,
Chris

[1] http://wiki.apache.org/incubator/MRUnitProposal

On Feb 17, 2011, at 1:52 PM, Mattmann, Chris A (388J) wrote:

> Thanks Patrick.
> 
> Sounds good. Besides mentors, we need a champion, who must be an ASF member. So, either me, you or Nigel can do it. I'm already championing Gora, and would be happy to champion MRUnit, but am open to either one of you guys doing it too (or even another ASF member). Just let me know. I'll head over to the wiki and add my info. Thanks for getting this started Eric+Patrick!
> 
> Cheers,
> Chris
> 
> On Feb 17, 2011, at 1:00 PM, Patrick Hunt wrote:
> 
>> Chris, a page is up (still being created by Eric afaict):
>> http://wiki.apache.org/incubator/MRUnitProposal
>> 
>> I took the liberty of listing us as mentors.
>> 
>> Patrick
>> 
>> On Thu, Feb 17, 2011 at 11:31 AM, Mattmann, Chris A (388J)
>> <ch...@jpl.nasa.gov> wrote:
>>> Hey Guys,
>>> 
>>> FYI on this: Eric has mentioned he is going to start the Incubator proposal for MRUnit. Let's start small and then grow big (as needed). It seems like we've achieved enough consensus for the required mentors and critical mass to make an MRUnit Incubator proposal and then to have the Incubator community weigh in. If that expands to include other testing projects/etc., we can address that over the Incubation process, and as needed.
>>> 
>>> Eric: as soon as that wiki page is up, I'd be happy to add my name to it as a mentor and /kick the can on this.
>>> 
>>> Cheers,
>>> Chris
>>> 
>>> On Feb 17, 2011, at 11:11 AM, Aaron Kimball wrote:
>>> 
>>>> The MRUnit community is a specific subset of the Hadoop community: Engineers
>>>> writing Java code running on Hadoop. The Hadoop community also includes
>>>> IT/ops staff who maintain Hadoop clusters, data scientists who use tools
>>>> such as Pig & Hive, as well as those written by the aforementioned
>>>> engineers, etc.
>>>> 
>>>> The Hadoop project has long recognized that tools aimed at a specific subset
>>>> of the Hadoop community, with separate release cycles, can more successfully
>>>> reach their aims by splitting into incubator projects. Hive, Pig, and HBase,
>>>> for example, have all gone this path.
>>>> 
>>>> A "current" version of MRUnit would need to compile against multiple
>>>> versions of Hadoop itself. This is not possible if it is in the same source
>>>> tree as Hadoop.
>>>> 
>>>> - Aaron
>>>> 
>>>> On Thu, Feb 17, 2011 at 5:31 AM, Bernd Fondermann <
>>>> bernd.fondermann@googlemail.com> wrote:
>>>> 
>>>>> On Fri, Feb 11, 2011 at 23:10, Aaron Kimball <ak...@gmail.com> wrote:
>>>>>> The main reason I am interested in removing MRUnit from Hadoop is that I
>>>>>> believe that MRUnit deserves its own release cycle. I think this is in the
>>>>>> best interest of its users.
>>>>> 
>>>>> Not in mine, at least. (I'm writing MR unit tests.)
>>>>> Many projects release more than one product. I'd rather get MRUnit
>>>>> from the same source where I get my MR from.
>>>>> Separate release cycles would be ok for me, though.
>>>>> 
>>>>>> Perhaps more importantly, access to new features in MRUnit should not
>>>>>> require upgrading one's entire Hadoop deployment; this is a client library
>>>>>> that depends only on Hadoop's public APIs.
>>>>> 
>>>>> +1.
>>>>> 
>>>>>> My primary concern is to move MRUnit to a place where the community can
>>>>>> derive the most benefit from it. The Apache Incubator could fulfill this
>>>>>> role; given the presence of individuals willing to mentor this project, I
>>>>>> believe this would be a successful way to release MRUnit more quickly and
>>>>>> continue to work to grow the MRUnit community.
>>>>> 
>>>>> What are your expectations what MRUnit would become, software-wise?
>>>>> Wouldn't the MRUnit community be largely the same as the Hadoop-MR
>>>>> community?
>>>>> 
>>>>> Bernd
>>>>> 


++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Senior Computer Scientist
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 171-266B, Mailstop: 171-246
Email: chris.a.mattmann@nasa.gov
WWW:   http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Adjunct Assistant Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++


Re: [VOTE] Abandon mrunit MapReduce contrib

Posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov>.
Thanks Patrick.

Sounds good. Besides mentors, we need a champion, who must be an ASF member. So, either me, you or Nigel can do it. I'm already championing Gora, and would be happy to champion MRUnit, but am open to either one of you guys doing it too (or even another ASF member). Just let me know. I'll head over to the wiki and add my info. Thanks for getting this started Eric+Patrick!

Cheers,
Chris

On Feb 17, 2011, at 1:00 PM, Patrick Hunt wrote:

> Chris a page is up (still being created by Eric afaict):
> http://wiki.apache.org/incubator/MRUnitProposal
> 
> I took the liberty of listing us as mentors.
> 
> Patrick
> 
> On Thu, Feb 17, 2011 at 11:31 AM, Mattmann, Chris A (388J)
> <ch...@jpl.nasa.gov> wrote:
>> Hey Guys,
>> 
>> FYI on this: Eric has mentioned he is going to start the Incubator proposal for MRUnit. Let's start small and then grow big (as needed). It seems like we've achieved enough consensus for the required mentors and critical mass to make an MRUnit Incubator proposal and then to have the Incubator community weigh in. If that expands to include other testing projects/etc., we can address that over the Incubation process, and as needed.
>> 
>> Eric: as soon as that wiki page is up, I'd be happy to add my name to it as a mentor and /kick the can on this.
>> 
>> Cheers,
>> Chris
>> 
>> On Feb 17, 2011, at 11:11 AM, Aaron Kimball wrote:
>> 
>>> The MRUnit community is a specific subset of the Hadoop community: Engineers
>>> writing Java code running on Hadoop. The Hadoop community also includes
>>> IT/ops staff who maintain Hadoop clusters, data scientists who use tools
>>> such as Pig & Hive, as well as those written by the aforementioned
>>> engineers, etc.
>>> 
>>> The Hadoop project has long recognized that tools aimed at a specific subset
>>> of the Hadoop community, with separate release cycles, can more successfully
>>> reach their aims by splitting into incubator projects. Hive, Pig, and HBase,
>>> for example, have all gone this path.
>>> 
>>> A "current" version of MRUnit would need to compile against multiple
>>> versions of Hadoop itself. This is not possible if it is in the same source
>>> tree as Hadoop.
>>> 
>>> - Aaron
>>> 
>>> On Thu, Feb 17, 2011 at 5:31 AM, Bernd Fondermann <
>>> bernd.fondermann@googlemail.com> wrote:
>>> 
>>>> On Fri, Feb 11, 2011 at 23:10, Aaron Kimball <ak...@gmail.com> wrote:
>>>>> The main reason I am interested in removing MRUnit from Hadoop is that I
>>>>> believe that MRUnit deserves its own release cycle. I think this is in the
>>>>> best interest of its users.
>>>> 
>>>> Not in mine, at least. (I'm writing MR unit tests.)
>>>> Many projects release more than one product. I'd rather get MRUnit
>>>> from the same source where I get my MR from.
>>>> Separate release cycles would be ok for me, though.
>>>> 
>>>>> Perhaps more importantly, access to new features in MRUnit should not
>>>>> require upgrading one's entire Hadoop deployment; this is a client library
>>>>> that depends only on Hadoop's public APIs.
>>>> 
>>>> +1.
>>>> 
>>>>> My primary concern is to move MRUnit to a place where the community can
>>>>> derive the most benefit from it. The Apache Incubator could fulfill this
>>>>> role; given the presence of individuals willing to mentor this project, I
>>>>> believe this would be a successful way to release MRUnit more quickly and
>>>>> continue to work to grow the MRUnit community.
>>>> 
>>>> What are your expectations what MRUnit would become, software-wise?
>>>> Wouldn't the MRUnit community be largely the same as the Hadoop-MR
>>>> community?
>>>> 
>>>> Bernd
>>>> 


++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Senior Computer Scientist
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 171-266B, Mailstop: 171-246
Email: chris.a.mattmann@nasa.gov
WWW:   http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Adjunct Assistant Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++


Re: [VOTE] Abandon mrunit MapReduce contrib

Posted by Patrick Hunt <ph...@apache.org>.
Chris a page is up (still being created by Eric afaict):
http://wiki.apache.org/incubator/MRUnitProposal

I took the liberty of listing us as mentors.

Patrick

On Thu, Feb 17, 2011 at 11:31 AM, Mattmann, Chris A (388J)
<ch...@jpl.nasa.gov> wrote:
> Hey Guys,
>
> FYI on this: Eric has mentioned he is going to start the Incubator proposal for MRUnit. Let's start small and then grow big (as needed). It seems like we've achieved enough consensus for the required mentors and critical mass to make an MRUnit Incubator proposal and then to have the Incubator community weigh in. If that expands to include other testing projects/etc., we can address that over the Incubation process, and as needed.
>
> Eric: as soon as that wiki page is up, I'd be happy to add my name to it as a mentor and /kick the can on this.
>
> Cheers,
> Chris
>
> On Feb 17, 2011, at 11:11 AM, Aaron Kimball wrote:
>
>> The MRUnit community is a specific subset of the Hadoop community: Engineers
>> writing Java code running on Hadoop. The Hadoop community also includes
>> IT/ops staff who maintain Hadoop clusters, data scientists who use tools
>> such as Pig & Hive, as well as those written by the aforementioned
>> engineers, etc.
>>
>> The Hadoop project has long recognized that tools aimed at a specific subset
>> of the Hadoop community, with separate release cycles, can more successfully
>> reach their aims by splitting into incubator projects. Hive, Pig, and HBase,
>> for example, have all gone this path.
>>
>> A "current" version of MRUnit would need to compile against multiple
>> versions of Hadoop itself. This is not possible if it is in the same source
>> tree as Hadoop.
>>
>> - Aaron
>>
>> On Thu, Feb 17, 2011 at 5:31 AM, Bernd Fondermann <
>> bernd.fondermann@googlemail.com> wrote:
>>
>>> On Fri, Feb 11, 2011 at 23:10, Aaron Kimball <ak...@gmail.com> wrote:
>>>> The main reason I am interested in removing MRUnit from Hadoop is that I
>>>> believe that MRUnit deserves its own release cycle. I think this is in the
>>>> best interest of its users.
>>>
>>> Not in mine, at least. (I'm writing MR unit tests.)
>>> Many projects release more than one product. I'd rather get MRUnit
>>> from the same source where I get my MR from.
>>> Separate release cycles would be ok for me, though.
>>>
>>>> Perhaps more importantly, access to new features in MRUnit should not
>>>> require upgrading one's entire Hadoop deployment; this is a client library
>>>> that depends only on Hadoop's public APIs.
>>>
>>> +1.
>>>
>>>> My primary concern is to move MRUnit to a place where the community can
>>>> derive the most benefit from it. The Apache Incubator could fulfill this
>>>> role; given the presence of individuals willing to mentor this project, I
>>>> believe this would be a successful way to release MRUnit more quickly and
>>>> continue to work to grow the MRUnit community.
>>>
>>> What are your expectations what MRUnit would become, software-wise?
>>> Wouldn't the MRUnit community be largely the same as the Hadoop-MR
>>> community?
>>>
>>> Bernd
>>>

Re: [VOTE] Abandon mrunit MapReduce contrib

Posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov>.
Hey Guys,

FYI on this: Eric has mentioned he is going to start the Incubator proposal for MRUnit. Let's start small and then grow big (as needed). It seems like we've achieved enough consensus for the required mentors and critical mass to make an MRUnit Incubator proposal and then to have the Incubator community weigh in. If that expands to include other testing projects/etc., we can address that over the Incubation process, and as needed. 

Eric: as soon as that wiki page is up, I'd be happy to add my name to it as a mentor and /kick the can on this.

Cheers,
Chris

On Feb 17, 2011, at 11:11 AM, Aaron Kimball wrote:

> The MRUnit community is a specific subset of the Hadoop community: Engineers
> writing Java code running on Hadoop. The Hadoop community also includes
> IT/ops staff who maintain Hadoop clusters, data scientists who use tools
> such as Pig & Hive, as well as those written by the aforementioned
> engineers, etc.
> 
> The Hadoop project has long recognized that tools aimed at a specific subset
> of the Hadoop community, with separate release cycles, can more successfully
> reach their aims by splitting into incubator projects. Hive, Pig, and HBase,
> for example, have all gone this path.
> 
> A "current" version of MRUnit would need to compile against multiple
> versions of Hadoop itself. This is not possible if it is in the same source
> tree as Hadoop.
> 
> - Aaron
> 
> On Thu, Feb 17, 2011 at 5:31 AM, Bernd Fondermann <
> bernd.fondermann@googlemail.com> wrote:
> 
>> On Fri, Feb 11, 2011 at 23:10, Aaron Kimball <ak...@gmail.com> wrote:
>>> The main reason I am interested in removing MRUnit from Hadoop is that I
>>> believe that MRUnit deserves its own release cycle. I think this is in the
>>> best interest of its users.
>> 
>> Not in mine, at least. (I'm writing MR unit tests.)
>> Many projects release more than one product. I'd rather get MRUnit
>> from the same source where I get my MR from.
>> Separate release cycles would be ok for me, though.
>> 
>>> Perhaps more importantly, access to new features in MRUnit should not
>>> require upgrading one's entire Hadoop deployment; this is a client library
>>> that depends only on Hadoop's public APIs.
>> 
>> +1.
>> 
>>> My primary concern is to move MRUnit to a place where the community can
>>> derive the most benefit from it. The Apache Incubator could fulfill this
>>> role; given the presence of individuals willing to mentor this project, I
>>> believe this would be a successful way to release MRUnit more quickly and
>>> continue to work to grow the MRUnit community.
>> 
>> What are your expectations what MRUnit would become, software-wise?
>> Wouldn't the MRUnit community be largely the same as the Hadoop-MR
>> community?
>> 
>> Bernd
>> 


++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Senior Computer Scientist
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 171-266B, Mailstop: 171-246
Email: chris.a.mattmann@nasa.gov
WWW:   http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Adjunct Assistant Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++


Re: [VOTE] Abandon mrunit MapReduce contrib

Posted by Aaron Kimball <ak...@gmail.com>.
The MRUnit community is a specific subset of the Hadoop community: Engineers
writing Java code running on Hadoop. The Hadoop community also includes
IT/ops staff who maintain Hadoop clusters, data scientists who use tools
such as Pig & Hive, as well as those written by the aforementioned
engineers, etc.

The Hadoop project has long recognized that tools aimed at a specific subset
of the Hadoop community, with separate release cycles, can more successfully
reach their aims by splitting into incubator projects. Hive, Pig, and HBase,
for example, have all gone this path.

A "current" version of MRUnit would need to compile against multiple
versions of Hadoop itself. This is not possible if it is in the same source
tree as Hadoop.

- Aaron

On Thu, Feb 17, 2011 at 5:31 AM, Bernd Fondermann <
bernd.fondermann@googlemail.com> wrote:

> On Fri, Feb 11, 2011 at 23:10, Aaron Kimball <ak...@gmail.com> wrote:
> > The main reason I am interested in removing MRUnit from Hadoop is that I
> > believe that MRUnit deserves its own release cycle. I think this is in the
> > best interest of its users.
>
> Not in mine, at least. (I'm writing MR unit tests.)
> Many projects release more than one product. I'd rather get MRUnit
> from the same source where I get my MR from.
> Separate release cycles would be ok for me, though.
>
> > Perhaps more importantly, access to new features in MRUnit should not
> > require upgrading one's entire Hadoop deployment; this is a client library
> > that depends only on Hadoop's public APIs.
>
> +1.
>
> > My primary concern is to move MRUnit to a place where the community can
> > derive the most benefit from it. The Apache Incubator could fulfill this
> > role; given the presence of individuals willing to mentor this project, I
> > believe this would be a successful way to release MRUnit more quickly and
> > continue to work to grow the MRUnit community.
>
> What are your expectations what MRUnit would become, software-wise?
> Wouldn't the MRUnit community be largely the same as the Hadoop-MR
> community?
>
>  Bernd
>

Re: [VOTE] Abandon mrunit MapReduce contrib

Posted by Bernd Fondermann <be...@googlemail.com>.
On Fri, Feb 11, 2011 at 23:10, Aaron Kimball <ak...@gmail.com> wrote:
> The main reason I am interested in removing MRUnit from Hadoop is that I
> believe that MRUnit deserves its own release cycle. I think this is in the
> best interest of its users.

Not in mine, at least. (I'm writing MR unit tests.)
Many projects release more than one product. I'd rather get MRUnit
from the same source where I get my MR from.
Separate release cycles would be ok for me, though.

> Perhaps more importantly, access to new features in MRUnit should not
> require upgrading one's entire Hadoop deployment; this is a client library
> that depends only on Hadoop's public APIs.

+1.

> My primary concern is to move MRUnit to a place where the community can
> derive the most benefit from it. The Apache Incubator could fulfill this
> role; given the presence of individuals willing to mentor this project, I
> believe this would be a successful way to release MRUnit more quickly and
> continue to work to grow the MRUnit community.

What are your expectations what MRUnit would become, software-wise?
Wouldn't the MRUnit community be largely the same as the Hadoop-MR
community?

  Bernd

Re: [VOTE] Abandon mrunit MapReduce contrib

Posted by Aaron Kimball <ak...@gmail.com>.
The main reason I am interested in removing MRUnit from Hadoop is that I
believe that MRUnit deserves its own release cycle. I think this is in the
best interest of its users.

MRUnit is valuable to users of several different versions of Hadoop. But
MRUnit has only ever been committed to version 0.21 and above -- even though
in practice, the majority (dare I say--all) of its users are running on
0.20. The only place today to get a version of MRUnit compatible with 0.20
has been through a Cloudera release, which backported the entire MRUnit
patchset.

My thoughts on MRUnit in 0.20.100 resonate with Eric's. There will be
further fixes to MRUnit and its lightweight codebase can be released far
more rapidly than whenever the next 0.20.1xx release of Hadoop would occur.
Given that MRUnit has already been in the repository since April 2009 (see
https://issues.apache.org/jira/browse/HADOOP-5518) and has yet to see an
Apache 0.20-based release, I do not think it is in the best interest of the
library's userbase to couple MRUnit's release cycle to that of Hadoop
itself.

Perhaps more importantly, access to new features in MRUnit should not
require upgrading one's entire Hadoop deployment; this is a client library
that depends only on Hadoop's public APIs.

My primary concern is to move MRUnit to a place where the community can
derive the most benefit from it. The Apache Incubator could fulfill this
role; given the presence of individuals willing to mentor this project, I
believe this would be a successful way to release MRUnit more quickly and
continue to work to grow the MRUnit community.

Regards,
- Aaron


On Fri, Feb 11, 2011 at 11:57 AM, Mattmann, Chris A (388J) <
chris.a.mattmann@jpl.nasa.gov> wrote:

> Awesome Patrick, we'd probably need one more active mentor. Any takers?
>
> After we get that, then we cook up a proposal on the Incubator wiki here
> [1], and follow the process here [2] to get started...
>
> Cheers,
> Chris
>
> [1] http://wiki.apache.org/incubator/MRUnitProposal
> [2] http://incubator.apache.org/guides/proposal.html
>
> On Feb 11, 2011, at 11:52 AM, Patrick Hunt wrote:
>
> > On Fri, Feb 11, 2011 at 9:44 AM, Mattmann, Chris A (388J)
> > <ch...@jpl.nasa.gov> wrote:
> >> Guys, BTW, if you need help or a mentor in Apache Incubator-ville for
> >> MRUnit, I would be happy to help.
> >
> > I was going to suggest the same thing (mrunit to incubator). I would
> > also be happy to be a mentor.
> >
> > Patrick
> >
> >>
> >> On Feb 11, 2011, at 9:04 AM, Eric Sammer wrote:
> >>
> >>> On Fri, Feb 11, 2011 at 11:48 AM, Owen O'Malley <om...@apache.org> wrote:
> >>>
> >>>> On Feb 11, 2011, at 8:02 AM, Eric Sammer wrote:
> >>>>
> >>>>> - allow mrunit to have its own release cycle. This is, I think, the most
> >>>>> important.
> >>>>
> >>>> If you submit your work to Apache we can evaluate it for inclusion in the
> >>>> 0.20.100 branch to get your changes released in a timely manner.
> >>>
> >>>
> >>> I'm thinking in general (beyond the next immediate release). Independent of
> >>> where mrunit goes, I think it should leave the contrib tree to facilitate
> >>> light weight releases (the dependency on Hadoop proper is a public facing
> >>> API - a pure client). I think most projects could benefit from this with the
> >>> exception of things that are tightly coupled to Hadoop releases or touch
> >>> non-public APIs.
> >>>
> >>>
> >>>>> I would actually prefer to move it to Extras or Incubator and leave this
> >>>>> within the ASF.
> >>>>
> >>>> Extras is **NOT** inside of the ASF. Extras is a source hosting system for
> >>>> non-Apache projects that are related to Apache projects.
> >>>
> >>>
> >>> Got it. Thanks for correcting me. I only mentioned it because someone
> >>> suggested it to me initially.
> >>>
> >>>
> >>>>> Right now, I picked github because of the ability to easily
> >>>>> collaborate with others (and to use git).
> >>>>
> >>>> I agree that it is unfortunate that Apache doesn't yet support read-write
> >>>> git access. However, you'll find that building a community is easier at
> >>>> Apache than at github.
> >>>>
> >>>> -- Owen
> >>>
> >>>
> >>>
> >>> --
> >>> Eric Sammer
> >>> twitter: esammer
> >>> data: www.cloudera.com

Re: [VOTE] Abandon mrunit MapReduce contrib

Posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov>.
Awesome Patrick, we'd probably need one more active mentor. Any takers?

After we get that, then we cook up a proposal on the Incubator wiki here [1], and follow the process here [2] to get started...

Cheers,
Chris

[1] http://wiki.apache.org/incubator/MRUnitProposal
[2] http://incubator.apache.org/guides/proposal.html

On Feb 11, 2011, at 11:52 AM, Patrick Hunt wrote:

> On Fri, Feb 11, 2011 at 9:44 AM, Mattmann, Chris A (388J)
> <ch...@jpl.nasa.gov> wrote:
>> Guys, BTW, if you need help or a mentor in Apache Incubator-ville for MRUnit, I would be happy to help.
> 
> I was going to suggest the same thing (mrunit to incubator). I would
> also be happy to be a mentor.
> 
> Patrick
> 
>> 
>> On Feb 11, 2011, at 9:04 AM, Eric Sammer wrote:
>> 
>>> On Fri, Feb 11, 2011 at 11:48 AM, Owen O'Malley <om...@apache.org> wrote:
>>> 
>>>> On Feb 11, 2011, at 8:02 AM, Eric Sammer wrote:
>>>>
>>>>> - allow mrunit to have its own release cycle. This is, I think, the most
>>>>> important.
>>>>
>>>> If you submit your work to Apache we can evaluate it for inclusion in the
>>>> 0.20.100 branch to get your changes released in a timely manner.
>>>
>>>
>>> I'm thinking in general (beyond the next immediate release). Independent of
>>> where mrunit goes, I think it should leave the contrib tree to facilitate
>>> light weight releases (the dependency on Hadoop proper is a public facing
>>> API - a pure client). I think most projects could benefit from this with the
>>> exception of things that are tightly coupled to Hadoop releases or touch
>>> non-public APIs.
>>>
>>>
>>>>> I would actually prefer to move it to Extras or Incubator and leave this
>>>>> within the ASF.
>>>>
>>>> Extras is **NOT** inside of the ASF. Extras is a source hosting system for
>>>> non-Apache projects that are related to Apache projects.
>>>
>>>
>>> Got it. Thanks for correcting me. I only mentioned it because someone
>>> suggested it to me initially.
>>>
>>>
>>>>> Right now, I picked github because of the ability to easily
>>>>> collaborate with others (and to use git).
>>>>
>>>> I agree that it is unfortunate that Apache doesn't yet support read-write
>>>> git access. However, you'll find that building a community is easier at
>>>> Apache than at github.
>>>>
>>>> -- Owen
>>> 
>>> 
>>> 
>>> --
>>> Eric Sammer
>>> twitter: esammer
>>> data: www.cloudera.com


++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Senior Computer Scientist
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 171-266B, Mailstop: 171-246
Email: chris.a.mattmann@nasa.gov
WWW:   http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Adjunct Assistant Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++


Re: [VOTE] Abandon mrunit MapReduce contrib

Posted by Eric Sammer <es...@cloudera.com>.
I've started the Incubator proposal wiki page for mrunit. I'll ping
people off list for mentoring. Thanks for all the help!

On Tue, Feb 15, 2011 at 1:36 PM, Nigel Daley <nd...@mac.com> wrote:

> I'm happy to help mentor as well.
>
> Cheers,
> Nige
>
> On Feb 11, 2011, at 11:52 AM, Patrick Hunt wrote:
>
> > On Fri, Feb 11, 2011 at 9:44 AM, Mattmann, Chris A (388J)
> > <ch...@jpl.nasa.gov> wrote:
> >> Guys, BTW, if you need help or a mentor in Apache Incubator-ville for
> >> MRUnit, I would be happy to help.
> >
> > I was going to suggest the same thing (mrunit to incubator). I would
> > also be happy to be a mentor.
> >
> > Patrick
> >
> >>
> >> On Feb 11, 2011, at 9:04 AM, Eric Sammer wrote:
> >>
> >>> On Fri, Feb 11, 2011 at 11:48 AM, Owen O'Malley <om...@apache.org> wrote:
> >>>
> >>>> On Feb 11, 2011, at 8:02 AM, Eric Sammer wrote:
> >>>>
> >>>>> - allow mrunit to have its own release cycle. This is, I think, the most
> >>>>> important.
> >>>>
> >>>> If you submit your work to Apache we can evaluate it for inclusion in the
> >>>> 0.20.100 branch to get your changes released in a timely manner.
> >>>
> >>>
> >>> I'm thinking in general (beyond the next immediate release). Independent of
> >>> where mrunit goes, I think it should leave the contrib tree to facilitate
> >>> light weight releases (the dependency on Hadoop proper is a public facing
> >>> API - a pure client). I think most projects could benefit from this with the
> >>> exception of things that are tightly coupled to Hadoop releases or touch
> >>> non-public APIs.
> >>>
> >>>
> >>>>> I would actually prefer to move it to Extras or Incubator and leave this
> >>>>> within the ASF.
> >>>>
> >>>> Extras is **NOT** inside of the ASF. Extras is a source hosting system for
> >>>> non-Apache projects that are related to Apache projects.
> >>>
> >>>
> >>> Got it. Thanks for correcting me. I only mentioned it because someone
> >>> suggested it to me initially.
> >>>
> >>>
> >>>>> Right now, I picked github because of the ability to easily
> >>>>> collaborate with others (and to use git).
> >>>>
> >>>> I agree that it is unfortunate that Apache doesn't yet support read-write
> >>>> git access. However, you'll find that building a community is easier at
> >>>> Apache than at github.
> >>>>
> >>>> -- Owen
> >>>
> >>>
> >>>
> >>> --
> >>> Eric Sammer
> >>> twitter: esammer
> >>> data: www.cloudera.com


-- 
Eric Sammer
twitter: esammer
data: www.cloudera.com

Re: [VOTE] Abandon mrunit MapReduce contrib

Posted by Konstantin Boudnik <co...@apache.org>.
I am up to help as a committer on this project.
--
  Take care,
Konstantin (Cos) Boudnik


On Tue, Feb 15, 2011 at 10:36, Nigel Daley <nd...@mac.com> wrote:
> I'm happy to help mentor as well.
>
> Cheers,
> Nige
>
> On Feb 11, 2011, at 11:52 AM, Patrick Hunt wrote:
>
>> On Fri, Feb 11, 2011 at 9:44 AM, Mattmann, Chris A (388J)
>> <ch...@jpl.nasa.gov> wrote:
>>> Guys, BTW, if you need help or a mentor in Apache Incubator-ville for MRUnit, I would be happy to help.
>>
>> I was going to suggest the same thing (mrunit to incubator). I would
>> also be happy to be a mentor.
>>
>> Patrick
>>
>>>
>>> On Feb 11, 2011, at 9:04 AM, Eric Sammer wrote:
>>>
>>>> On Fri, Feb 11, 2011 at 11:48 AM, Owen O'Malley <om...@apache.org> wrote:
>>>>
>>>>> On Feb 11, 2011, at 8:02 AM, Eric Sammer wrote:
>>>>>
>>>>> - allow mrunit to have its own release cycle. This is, I think, the most
>>>>>>
>>>>>
>>>>> important.
>>>>>>
>>>>>
>>>>> If you submit your work to Apache we can evaluate it for inclusion in the
>>>>> 0.20.100 branch to get your changes released in a timely manner.
>>>>
>>>>
>>>> I'm thinking in general (beyond the next immediate release). Independent of
>>>> where mrunit goes, I think it should leave the contrib tree to facilitate
>>>> light weight releases (the dependency on Hadoop proper is a public facing
>>>> API - a pure client). I think most projects could benefit from this with the
>>>> exception of things that are tightly coupled to Hadoop releases or touch
>>>> non-public APIs.
>>>>
>>>>
> >>>>>> I would actually prefer to move it to Extras or Incubator and leave this
> >>>>>> within the ASF.
> >>>>>
> >>>>> Extras is **NOT** inside of the ASF. Extras is a source hosting system for
> >>>>> non-Apache projects that are related to Apache projects.
>>>>
>>>>
>>>> Got it. Thanks for correcting me. I only mentioned it because someone
>>>> suggested it to me initially.
>>>>
>>>>
> >>>>>> Right now, I picked github because of the ability to easily
> >>>>>> collaborate with others (and to use git).
> >>>>>
> >>>>> I agree that it is unfortunate that Apache doesn't yet support read-write
> >>>>> git access. However, you'll find that building a community is easier at
> >>>>> Apache than at github.
> >>>>>
> >>>>> -- Owen
> >>>>>
>>>>
>>>>
>>>>
>>>> --
>>>> Eric Sammer
>>>> twitter: esammer
>>>> data: www.cloudera.com
>>>
>>>
>>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>> Chris Mattmann, Ph.D.
>>> Senior Computer Scientist
>>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>>> Office: 171-266B, Mailstop: 171-246
>>> Email: chris.a.mattmann@nasa.gov
>>> WWW:   http://sunset.usc.edu/~mattmann/
>>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>> Adjunct Assistant Professor, Computer Science Department
>>> University of Southern California, Los Angeles, CA 90089 USA
>>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>>>
>>>
>
>

Re: [VOTE] Abandon mrunit MapReduce contrib

Posted by Nigel Daley <nd...@mac.com>.
I'm happy to help mentor as well.

Cheers,
Nige

On Feb 11, 2011, at 11:52 AM, Patrick Hunt wrote:

> On Fri, Feb 11, 2011 at 9:44 AM, Mattmann, Chris A (388J)
> <ch...@jpl.nasa.gov> wrote:
>> Guys, BTW, if you need help or a mentor in Apache Incubator-ville for MRUnit, I would be happy to help.
> 
> I was going to suggest the same thing (mrunit to incubator). I would
> also be happy to be a mentor.
> 
> Patrick
> 
>> 
>> On Feb 11, 2011, at 9:04 AM, Eric Sammer wrote:
>> 
>>> On Fri, Feb 11, 2011 at 11:48 AM, Owen O'Malley <om...@apache.org> wrote:
>>> 
>>>> On Feb 11, 2011, at 8:02 AM, Eric Sammer wrote:
>>>>
>>>>> - allow mrunit to have its own release cycle. This is, I think, the most
>>>>> important.
>>>>
>>>> If you submit your work to Apache we can evaluate it for inclusion in the
>>>> 0.20.100 branch to get your changes released in a timely manner.
>>> 
>>> 
>>> I'm thinking in general (beyond the next immediate release). Independent of
>>> where mrunit goes, I think it should leave the contrib tree to facilitate
>>> light weight releases (the dependency on Hadoop proper is a public facing
>>> API - a pure client). I think most projects could benefit from this with the
>>> exception of things that are tightly coupled to Hadoop releases or touch
>>> non-public APIs.
>>> 
>>> 
>>>>> I would actually prefer to move it to Extras or Incubator and leave this
>>>>> within the ASF.
>>>>
>>>> Extras is **NOT** inside of the ASF. Extras is a source hosting system for
>>>> non-Apache projects that are related to Apache projects.
>>> 
>>> 
>>> Got it. Thanks for correcting me. I only mentioned it because someone
>>> suggested it to me initially.
>>> 
>>> 
>>>>> Right now, I picked github because of the ability to easily
>>>>> collaborate with others (and to use git).
>>>>
>>>> I agree that it is unfortunate that Apache doesn't yet support read-write
>>>> git access. However, you'll find that building a community is easier at
>>>> Apache than at github.
>>>>
>>>> -- Owen
>>>>
>>> 
>>> 
>>> 
>>> --
>>> Eric Sammer
>>> twitter: esammer
>>> data: www.cloudera.com
>> 
>> 
>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> Chris Mattmann, Ph.D.
>> Senior Computer Scientist
>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
>> Office: 171-266B, Mailstop: 171-246
>> Email: chris.a.mattmann@nasa.gov
>> WWW:   http://sunset.usc.edu/~mattmann/
>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> Adjunct Assistant Professor, Computer Science Department
>> University of Southern California, Los Angeles, CA 90089 USA
>> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>> 
>> 


Re: [VOTE] Abandon mrunit MapReduce contrib

Posted by Patrick Hunt <ph...@apache.org>.
On Fri, Feb 11, 2011 at 9:44 AM, Mattmann, Chris A (388J)
<ch...@jpl.nasa.gov> wrote:
> Guys, BTW, if you need help or a mentor in Apache Incubator-ville for MRUnit, I would be happy to help.

I was going to suggest the same thing (mrunit to incubator). I would
also be happy to be a mentor.

Patrick

>
> On Feb 11, 2011, at 9:04 AM, Eric Sammer wrote:
>
>> On Fri, Feb 11, 2011 at 11:48 AM, Owen O'Malley <om...@apache.org> wrote:
>>
>>> On Feb 11, 2011, at 8:02 AM, Eric Sammer wrote:
>>>
>>>> - allow mrunit to have its own release cycle. This is, I think, the most
>>>> important.
>>>
>>> If you submit your work to Apache we can evaluate it for inclusion in the
>>> 0.20.100 branch to get your changes released in a timely manner.
>>
>>
>> I'm thinking in general (beyond the next immediate release). Independent of
>> where mrunit goes, I think it should leave the contrib tree to facilitate
>> light weight releases (the dependency on Hadoop proper is a public facing
>> API - a pure client). I think most projects could benefit from this with the
>> exception of things that are tightly coupled to Hadoop releases or touch
>> non-public APIs.
>>
>>
>>>> I would actually prefer to move it to Extras or Incubator and leave this
>>>> within the ASF.
>>>
>>> Extras is **NOT** inside of the ASF. Extras is a source hosting system for
>>> non-Apache projects that are related to Apache projects.
>>
>>
>> Got it. Thanks for correcting me. I only mentioned it because someone
>> suggested it to me initially.
>>
>>
>>>> Right now, I picked github because of the ability to easily
>>>> collaborate with others (and to use git).
>>>
>>> I agree that it is unfortunate that Apache doesn't yet support read-write
>>> git access. However, you'll find that building a community is easier at
>>> Apache than at github.
>>>
>>> -- Owen
>>>
>>
>>
>>
>> --
>> Eric Sammer
>> twitter: esammer
>> data: www.cloudera.com
>
>
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Chris Mattmann, Ph.D.
> Senior Computer Scientist
> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> Office: 171-266B, Mailstop: 171-246
> Email: chris.a.mattmann@nasa.gov
> WWW:   http://sunset.usc.edu/~mattmann/
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Adjunct Assistant Professor, Computer Science Department
> University of Southern California, Los Angeles, CA 90089 USA
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>
>

Re: [VOTE] Abandon mrunit MapReduce contrib

Posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov>.
Guys, BTW, if you need help or a mentor in Apache Incubator-ville for MRUnit, I would be happy to help.

Cheers,
Chris

On Feb 11, 2011, at 9:04 AM, Eric Sammer wrote:

> On Fri, Feb 11, 2011 at 11:48 AM, Owen O'Malley <om...@apache.org> wrote:
> 
>> On Feb 11, 2011, at 8:02 AM, Eric Sammer wrote:
>>
>>> - allow mrunit to have its own release cycle. This is, I think, the most
>>> important.
>>
>> If you submit your work to Apache we can evaluate it for inclusion in the
>> 0.20.100 branch to get your changes released in a timely manner.
> 
> 
> I'm thinking in general (beyond the next immediate release). Independent of
> where mrunit goes, I think it should leave the contrib tree to facilitate
> light weight releases (the dependency on Hadoop proper is a public facing
> API - a pure client). I think most projects could benefit from this with the
> exception of things that are tightly coupled to Hadoop releases or touch
> non-public APIs.
> 
> 
>>> I would actually prefer to move it to Extras or Incubator and leave this
>>> within the ASF.
>>
>> Extras is **NOT** inside of the ASF. Extras is a source hosting system for
>> non-Apache projects that are related to Apache projects.
> 
> 
> Got it. Thanks for correcting me. I only mentioned it because someone
> suggested it to me initially.
> 
> 
>>> Right now, I picked github because of the ability to easily
>>> collaborate with others (and to use git).
>>
>> I agree that it is unfortunate that Apache doesn't yet support read-write
>> git access. However, you'll find that building a community is easier at
>> Apache than at github.
>>
>> -- Owen
>>
> 
> 
> 
> -- 
> Eric Sammer
> twitter: esammer
> data: www.cloudera.com


++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Senior Computer Scientist
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 171-266B, Mailstop: 171-246
Email: chris.a.mattmann@nasa.gov
WWW:   http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Adjunct Assistant Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++


Re: [VOTE] Abandon mrunit MapReduce contrib

Posted by Garrett Wu <wu...@gmail.com>.
On Fri, Feb 11, 2011 at 9:04 AM, Eric Sammer <es...@cloudera.com> wrote:

> On Fri, Feb 11, 2011 at 11:48 AM, Owen O'Malley <om...@apache.org> wrote:
>
> > On Feb 11, 2011, at 8:02 AM, Eric Sammer wrote:
> >
> >> - allow mrunit to have its own release cycle. This is, I think, the most
> >> important.
> >
> > If you submit your work to Apache we can evaluate it for inclusion in the
> > 0.20.100 branch to get your changes released in a timely manner.
>
>
> I'm thinking in general (beyond the next immediate release). Independent of
> where mrunit goes, I think it should leave the contrib tree to facilitate
> light weight releases (the dependency on Hadoop proper is a public facing
> API - a pure client). I think most projects could benefit from this with the
> exception of things that are tightly coupled to Hadoop releases or touch
> non-public APIs.
>
>
+1 for a faster release cycle and using git.  I have a couple of patches for
mrunit, and it would be nice to get those in separately from hadoop
releases.


>
> >> I would actually prefer to move it to Extras or Incubator and leave this
> >> within the ASF.
> >
> > Extras is **NOT** inside of the ASF. Extras is a source hosting system for
> > non-Apache projects that are related to Apache projects.
>
>
> Got it. Thanks for correcting me. I only mentioned it because someone
> suggested it to me initially.
>
>
> >> Right now, I picked github because of the ability to easily
> >> collaborate with others (and to use git).
> >
> > I agree that it is unfortunate that Apache doesn't yet support read-write
> > git access. However, you'll find that building a community is easier at
> > Apache than at github.
> >
> > -- Owen
> >
>
>
>
> --
> Eric Sammer
> twitter: esammer
> data: www.cloudera.com
>

Re: [VOTE] Abandon mrunit MapReduce contrib

Posted by Eric Sammer <es...@cloudera.com>.
On Fri, Feb 11, 2011 at 11:48 AM, Owen O'Malley <om...@apache.org> wrote:

> On Feb 11, 2011, at 8:02 AM, Eric Sammer wrote:
>
>> - allow mrunit to have its own release cycle. This is, I think, the most
>> important.
>
> If you submit your work to Apache we can evaluate it for inclusion in the
> 0.20.100 branch to get your changes released in a timely manner.


I'm thinking in general (beyond the next immediate release). Independent of
where mrunit goes, I think it should leave the contrib tree to facilitate
light weight releases (the dependency on Hadoop proper is a public facing
API - a pure client). I think most projects could benefit from this with the
exception of things that are tightly coupled to Hadoop releases or touch
non-public APIs.


>> I would actually prefer to move it to Extras or Incubator and leave this
>> within the ASF.
>
> Extras is **NOT** inside of the ASF. Extras is a source hosting system for
> non-Apache projects that are related to Apache projects.


Got it. Thanks for correcting me. I only mentioned it because someone
suggested it to me initially.


>> Right now, I picked github because of the ability to easily
>> collaborate with others (and to use git).
>
> I agree that it is unfortunate that Apache doesn't yet support read-write
> git access. However, you'll find that building a community is easier at
> Apache than at github.
>
> -- Owen
>



-- 
Eric Sammer
twitter: esammer
data: www.cloudera.com

Re: [VOTE] Abandon mrunit MapReduce contrib

Posted by Owen O'Malley <om...@apache.org>.
On Feb 11, 2011, at 8:02 AM, Eric Sammer wrote:

> - allow mrunit to have its own release cycle. This is, I think, the most
> important.

If you submit your work to Apache we can evaluate it for inclusion in  
the 0.20.100 branch to get your changes released in a timely manner.

> I would actually prefer to move it to Extras or Incubator and leave this
> within the ASF.

Extras is **NOT** inside of the ASF. Extras is a source hosting system  
for non-Apache projects that are related to Apache projects.

> Right now, I picked github because of the ability to easily
> collaborate with others (and to use git).

I agree that it is unfortunate that Apache doesn't yet support read-write
git access. However, you'll find that building a community is easier at
Apache than at github.

-- Owen

Re: [VOTE] Abandon mrunit MapReduce contrib

Posted by Eric Sammer <es...@cloudera.com>.
Owen:

I think you make a fair point. The reason I think it still makes sense to
bring mrunit out of Hadoop contrib is to:

- start to simplify the build by breaking projects that are only clients of
Hadoop libs out of contrib.
- allow mrunit to have its own release cycle. This is, I think, the most
important.

I would actually prefer to move it to Extras or Incubator and leave this
within the ASF. Right now, I picked github because of the ability to easily
collaborate with others (and to use git).

Thanks!

On Fri, Feb 11, 2011 at 2:52 AM, Owen O'Malley <om...@apache.org> wrote:

> On Thu, Feb 10, 2011 at 11:36 PM, Aaron Kimball <akimball83@gmail.com> wrote:
>
> > +1. Eric Sammer and I will be working on this via github. (Come join us!)
>
>
> Votes to remove code should be because the PMC doesn't think the code is
> worth maintaining any more. I don't think that applies in this case. Aaron
> is a committer in Hadoop now and I'd strongly encourage you both to give
> your changes back to Apache instead of forking it into GitHub.
>
> -- Owen
>



-- 
Eric Sammer
twitter: esammer
data: www.cloudera.com

Re: [VOTE] Abandon mrunit MapReduce contrib

Posted by Owen O'Malley <om...@apache.org>.
On Thu, Feb 10, 2011 at 11:36 PM, Aaron Kimball <ak...@gmail.com> wrote:

> +1. Eric Sammer and I will be working on this via github. (Come join us!)


Votes to remove code should be because the PMC doesn't think the code is
worth maintaining any more. I don't think that applies in this case. Aaron
is a committer in Hadoop now and I'd strongly encourage you both to give
your changes back to Apache instead of forking it into GitHub.

-- Owen

Re: [VOTE] Abandon mrunit MapReduce contrib

Posted by Aaron Kimball <ak...@gmail.com>.
+1. Eric Sammer and I will be working on this via github. (Come join us!)

- Aaron

On Thu, Feb 10, 2011 at 11:08 PM, Nigel Daley <nd...@mac.com> wrote:

> I think the PMC should abandon the mrunit MapReduce contrib component.  The
> originator of mrunit and primary maintainer (Aaron Kimball) is moving the
> active development elsewhere.
>
> There are 2 unresolved contrib/mrunit issues in Jira, none of them Patch
> Available.
>
> Here is my +1.
>
> Nige
>