Posted to dev@mahout.apache.org by Sean Owen <sr...@gmail.com> on 2011/03/25 16:12:26 UTC

Think about 0.5 release?

Maybe it's the first flush of springtime that's reinvigorated activity, but
I do see significantly more activity in the last week or two. That's great.
Especially since a lot of the activity is going into older issues too. The
number of open issues is declining; it was over 70 a month or two ago and is
now down to just over 50. That's a good trend.

Our 0.4 release was Oct 31 2010. Given a general pattern of releasing every
6 months, that would suggest end of April for 0.5. I see no reason to rush,
but reason to start asking those release questions:

- What do we want out of 0.5 as a release? I suggest it be viewed as nearly
a release candidate for 1.0; the APIs and functionality should be
substantially set by 0.5, to be polished for a 1.0 release in Q4.
- What is marked for 0.5 that just isn't realistically going to be done by
anyone in a few weeks?
- What isn't in JIRA for 0.5 that would be nice for 0.5?

In particular, there are 24 issues still marked as targeted for 0.5. Here's
an open invitation to punt issues to unscheduled, and for fixing/resolving
issues.

I think 0.5 will also be a good milestone to ask if anyone wants to become
an emeritus committer.

Best,
Sean



(Each key links to https://issues.apache.org/jira/browse/<key>)

Type | Key | Summary | Assignee | Reporter | Priority | Status | Resolution | Created | Updated | Due
Improvement | MAHOUT-293 | Add more tunable parameters to PFPGrowth implementation | Robin Anil | Robin Anil | Major | Open | Unresolved | 15/Feb/10 | 01/Mar/11 |
Improvement | MAHOUT-294 | Uniform API behavior for Jobs | Unassigned | Robin Anil | Major | Open | Unresolved | 16/Feb/10 | 31/Jan/11 |
Improvement | MAHOUT-308 | Improve Lanczos to handle extremely large feature sets (without hashing) | Jake Mannix | Jake Mannix | Major | Patch Available | Unresolved | 24/Feb/10 | 01/Mar/11 |
Improvement | MAHOUT-319 | SVD solvers should be gracefully stoppable/restartable | Jake Mannix | Jake Mannix | Major | Open | Unresolved | 01/Mar/10 | 03/Nov/10 |
Bug | MAHOUT-369 | Issues with DistributedLanczosSolver output | Jake Mannix | Danny Leshem | Major | Open | Unresolved | 07/Apr/10 | 01/Mar/11 |
New Feature | MAHOUT-384 | Implement of AVF algorithm | Robin Anil | tony cui | Major | Open | Unresolved | 22/Apr/10 | 26/Jan/11 |
Bug | MAHOUT-399 | LDA on Mahout 0.3 does not converge to correct solution for overlapping pyramids toy problem. | Ted Dunning | Michael Lazarus | Major | Open | Unresolved | 24/May/10 | 08/Feb/11 | 25/Mar/11
Improvement | MAHOUT-479 | Streamline classification/ clustering data structures | Isabel Drost | Isabel Drost | Major | Open | Unresolved | 13/Aug/10 | 08/Mar/11 |
Bug | MAHOUT-487 | Issues with memory use and inconsistent or state-influenced results when using CBayesAlgorithm | Robin Anil | Drew Farris | Minor | Open | Unresolved | 24/Aug/10 | 03/Feb/11 | 25/Feb/11
Improvement | MAHOUT-499 | Implement LSMR in-memory | Ted Dunning | Ted Dunning | Major | Open | Unresolved | 09/Sep/10 | 14/Oct/10 |
Task | MAHOUT-510 | Standardize serialization mechanisms | Sean Owen | Sean Owen | Major | Patch Available | Unresolved | 22/Sep/10 | 08/Feb/11 |
Improvement | MAHOUT-517 | Eigencuts needs an output format | Jeff Eastman | Jeff Eastman | Minor | Open | Unresolved | 30/Sep/10 | 04/Feb/11 | 25/Feb/11
Improvement | MAHOUT-518 | Implement Affinity Preprocessing for Eigencuts and Spectral KMeans | Unassigned | Jeff Eastman | Major | Open | Unresolved | 30/Sep/10 | 06/Oct/10 |
Bug | MAHOUT-524 | DisplaySpectralKMeans example fails | Unassigned | Jeff Eastman | Major | Open | Unresolved | 12/Oct/10 | 21/Jan/11 |
New Feature | MAHOUT-525 | Implement LatentFactorLogLinear models | Unassigned | Ted Dunning | Major | Open | Unresolved | 14/Oct/10 | 18/Mar/11 |
New Feature | MAHOUT-529 | Implement LinearRegression | Ted Dunning | Frank Wang | Major | Open | Unresolved | 21/Oct/10 | 19/Jan/11 |
New Feature | MAHOUT-550 | Add RandomVector and RandomMatrix | Unassigned | Lance Norskog | Major | Open | Unresolved | 21/Nov/10 | 26/Nov/10 |
Bug | MAHOUT-552 | AbstractCluster eliminates NamedVectors by replacing them with RandomAccessSparseVector always | Jeff Eastman | Pere Ferrera Bertran | Major | Reopened | Unresolved | 24/Nov/10 | 27/Nov/10 |
Improvement | MAHOUT-586 | Redo RecommenderEvaluator for modularity | Sean Owen | Lance Norskog | Major | Reopened | Unresolved | 17/Jan/11 | 24/Mar/11 |
Bug | MAHOUT-605 | Array returned by classifier.bayes.algorithm.CBayesAlgorithm.classifyDocument is sorted ascendant | Robin Anil | Robin Swezey | Minor | Reopened | Unresolved | 03/Feb/11 | 06/Mar/11 | 11/Feb/11
Improvement | MAHOUT-612 | Simplify configuring and running Mahout MapReduce jobs from Java using Java bean configuration | Unassigned | Frank Scholten | Major | Patch Available | Unresolved | 20/Feb/11 | 24/Mar/11 |
Improvement | MAHOUT-622 | Mahout dependencies are unified under dependency management in parent pom | Dmitriy Lyubimov | Dmitriy Lyubimov | Minor | Open | Unresolved | 10/Mar/11 | 25/Mar/11 |
Improvement | MAHOUT-626 | T1 and T2 Values in Canopy (& MeanShift) | Jeff Eastman | Jeff Eastman | Major | Open | Unresolved | 13/Mar/11 | 20/Mar/11 |
Improvement | MAHOUT-633 | Add SequenceFileIterable; put Iterable stuff in one place | Sean Owen | Sean Owen | Minor | Open | Unresolved | 23/Mar/11 | 24/Mar/11 | 31/Mar/11

Re: Think about 0.5 release?

Posted by Ted Dunning <te...@gmail.com>.
A release happens at Apache any time a committer offers to be release
manager.

There is no need for it to be a big deal.

With Mahout, the virtue of a release is that it decreases the number of
questions of the form "I used the latest release and couldn't find what you
guys are talking about".  At least, it decreases the questions for a
time.

On Sat, Mar 26, 2011 at 3:37 PM, Lance Norskog <go...@gmail.com> wrote:

> A major milepost is a "release" that people should use instead of trunk?
> Should Mahout push for that amount of fit&finish?
> Keep plowing ahead on new algorithms?
> Keep plowing ahead on tools to handle new problems?
>
> Lance
>
> On Sat, Mar 26, 2011 at 6:51 AM, Robin Anil <ro...@gmail.com> wrote:
> > On Fri, Mar 25, 2011 at 8:42 PM, Sean Owen <sr...@gmail.com> wrote:
> >
> >> Maybe it's the first flush of springtime that's reinvigorated activity, but
> >> I do see significantly more activity in the last week or two. That's great.
> >> Especially since a lot of the activity is going into older issues too. The
> >> number of open issues is declining; it was over 70 a month or two ago and is
> >> now down to just over 50. That's a good trend.
> >>
> >> Our 0.4 release was Oct 31 2010. Given a general pattern of releasing every
> >> 6 months, that would suggest end of April for 0.5. I see no reason to rush,
> >> but reason to start asking those release questions:
> >>
> > +1, Faster releases matter a lot
> >
> >>
> >> - What do we want out of 0.5 as a release? I suggest it be viewed as nearly
> >> a release candidate for 1.0; the APIs and functionality should be
> >> substantially set by 0.5, to be polished for a 1.0 release in Q4.
> >>
> > Let's focus first on getting the progress we made since 0.4 out. Polishing
> > can continue in the summer
> >
> >
> >> - What is marked for 0.5 that just isn't realistically going to be done by
> >> anyone in a few weeks?
> >> - What isn't in JIRA for 0.5 that would be nice for 0.5?
> >
> >  Let me take a shot at scrubbing issues as well.
> >
>
>
>
> --
> Lance Norskog
> goksron@gmail.com
>

Re: Think about 0.5 release?

Posted by Lance Norskog <go...@gmail.com>.
A major milepost is a "release" that people should use instead of trunk?
Should Mahout push for that amount of fit&finish?
Keep plowing ahead on new algorithms?
Keep plowing ahead on tools to handle new problems?

Lance

On Sat, Mar 26, 2011 at 6:51 AM, Robin Anil <ro...@gmail.com> wrote:
> On Fri, Mar 25, 2011 at 8:42 PM, Sean Owen <sr...@gmail.com> wrote:
>
>> Maybe it's the first flush of springtime that's reinvigorated activity, but
>> I do see significantly more activity in the last week or two. That's great.
>> Especially since a lot of the activity is going into older issues too. The
>> number of open issues is declining; it was over 70 a month or two ago and is
>> now down to just over 50. That's a good trend.
>>
>> Our 0.4 release was Oct 31 2010. Given a general pattern of releasing every
>> 6 months, that would suggest end of April for 0.5. I see no reason to rush,
>> but reason to start asking those release questions:
>>
> +1, Faster releases matter a lot
>
>>
>> - What do we want out of 0.5 as a release? I suggest it be viewed as nearly
>> a release candidate for 1.0; the APIs and functionality should be
>> substantially set by 0.5, to be polished for a 1.0 release in Q4.
>>
> Let's focus first on getting the progress we made since 0.4 out. Polishing
> can continue in the summer
>
>
>> - What is marked for 0.5 that just isn't realistically going to be done by
>> anyone in a few weeks?
>> - What isn't in JIRA for 0.5 that would be nice for 0.5?
>
>  Let me take a shot at scrubbing issues as well.
>



-- 
Lance Norskog
goksron@gmail.com

Re: Think about 0.5 release?

Posted by Isabel Drost <is...@apache.org>.
On Sat, 26 Mar 11 Robin Anil wrote:

> On Fri, Mar 25, 2011 at 8:42 PM, Sean Owen <sr...@gmail.com> wrote:
> 
> > Maybe it's the first flush of springtime that's reinvigorated
> > activity, but I do see significantly more activity in the last week
> > or two. That's great. Especially since a lot of the activity is
> > going into older issues too. The number of open issues is
> > declining; it was over 70 a month or two ago and is
> > now down to just over 50. That's a good trend.
> >
> > Our 0.4 release was Oct 31 2010. Given a general pattern of
> > releasing every 6 months, that would suggest end of April for 0.5.
> > I see no reason to rush, but reason to start asking those release
> > questions:
> >
> +1, Faster releases matter a lot

+1 as well for releasing before GSoC starts.


> > - What is marked for 0.5 that just isn't realistically going to be
> > done by anyone in a few weeks?

http://tinyurl.com/4rnggnf - if I managed to correctly set the filters
there should be 19 issues left in the pool right now:

MAHOUT-293	Add more tunable parameters to PFPGrowth
MAHOUT-369	Issues with DistributedLanczosSolver
MAHOUT-294	Uniform API behavior for Jobs
MAHOUT-499	Implement LSMR in-memory
MAHOUT-319	SVD solvers should be gracefully stoppable/restartable
MAHOUT-518	Implement Affinity Preprocessing for Eigencuts and Spectral KMeans
MAHOUT-529	Implement LinearRegression
MAHOUT-550	Add RandomVector and RandomMatrix
MAHOUT-524	DisplaySpectralKMeans example fails
MAHOUT-384	Implement of AVF algorithm
MAHOUT-479	Streamline classification/ clustering data structures
MAHOUT-525	Implement LatentFactorLogLinear models
MAHOUT-626	T1 and T2 Values in Canopy (& MeanShift)
MAHOUT-552	AbstractCluster eliminates NamedVectors by replacing them with
		RandomAccessSparseVector
MAHOUT-399 	LDA on Mahout 0.3 does not converge to correct
		solution for overlapping pyramids toy problem.
MAHOUT-517	Eigencuts needs an output format
MAHOUT-487 	Issues with memory use and inconsistent or
		state-influenced results when using CBayesAlgorithm
MAHOUT-640 	Implementation of refresh in SVDRecommender
MAHOUT-622 	Mahout dependencies are unified under dependency
		management in parent pom

Isabel

Re: Think about 0.5 release?

Posted by Robin Anil <ro...@gmail.com>.
On Fri, Mar 25, 2011 at 8:42 PM, Sean Owen <sr...@gmail.com> wrote:

> Maybe it's the first flush of springtime that's reinvigorated activity, but
> I do see significantly more activity in the last week or two. That's great.
> Especially since a lot of the activity is going into older issues too. The
> number of open issues is declining; it was over 70 a month or two ago and is
> now down to just over 50. That's a good trend.
>
> Our 0.4 release was Oct 31 2010. Given a general pattern of releasing every
> 6 months, that would suggest end of April for 0.5. I see no reason to rush,
> but reason to start asking those release questions:
>
+1, Faster releases matter a lot

>
> - What do we want out of 0.5 as a release? I suggest it be viewed as nearly
> a release candidate for 1.0; the APIs and functionality should be
> substantially set by 0.5, to be polished for a 1.0 release in Q4.
>
Let's focus first on getting the progress we made since 0.4 out. Polishing
can continue in the summer.


> - What is marked for 0.5 that just isn't realistically going to be done by
> anyone in a few weeks?
> - What isn't in JIRA for 0.5 that would be nice for 0.5?

 Let me take a shot at scrubbing issues as well.

Re: Think about 0.5 release?

Posted by Grant Ingersoll <gs...@apache.org>.
Wow, you read my mind.  I was just thinking about this exact thing this morning.

On Mar 25, 2011, at 11:12 AM, Sean Owen wrote:

> Maybe it's the first flush of springtime that's reinvigorated activity, but
> I do see significantly more activity in the last week or two. That's great.
> Especially since a lot of the activity is going into older issues too. The
> number of open issues is declining; it was over 70 a month or two ago and is
> now down to just over 50. That's a good trend.
> 
> Our 0.4 release was Oct 31 2010. Given a general pattern of releasing every
> 6 months, that would suggest end of April for 0.5. I see no reason to rush,
> but reason to start asking those release questions:
> 
> - What do we want out of 0.5 as a release? I suggest it be viewed as nearly
> a release candidate for 1.0; the APIs and functionality should be
> substantially set by 0.5, to be polished for a 1.0 release in Q4.

+1.  I think it would also be good to get a release out before GSOC starts.

> - What is marked for 0.5 that just isn't realistically going to be done by
> anyone in a few weeks?
> - What isn't in JIRA for 0.5 that would be nice for 0.5?

I haven't looked yet, but will try to in the coming days.

> 
> In particular, there are 24 issues still marked as targeted for 0.5. Here's
> an open invitation to punt issues to unscheduled, and for fixing/resolving
> issues.
> 
> I think 0.5 will also be a good milestone to ask if anyone wants to become
> an emeritus committer.
> 
> Best,
> Sean

--------------------------
Grant Ingersoll
http://www.lucidimagination.com