Posted to dev@mahout.apache.org by Jeff Eastman <jd...@windwardsolutions.com> on 2012/01/08 18:10:58 UTC

Release 0.6 Outstanding Issues

Here's the latest JIRA report for 0.6. In an effort to close the door on 
issues for code freeze, I suggest we triage:

- Bug 598: move to 0.7?
- Feature 897: looks like this can be resolved as fixed
- Bug 794: move to 0.7
- Bug 826: fix in 0.6
- Improvements 845, 899: patch is ready but there is some overlap to be 
resolved. Complete for 0.6 asap or move to 0.7
- Improvements 768, 941: move to 0.7

We also have the showstopper build problem with 
SequentialOutOfCoreSvdTest. I suggest we create a JIRA for this.

ASF JIRA 
<https://issues.apache.org/jira/secure/IssueNavigator.jspa?reset=true&jqlQuery=project+%3D+MAHOUT+AND+resolution+%3D+Unresolved+AND+fixVersion+%3D+%220.6%22+ORDER+BY+priority+DESC> 

Displaying 8 issues at 08/Jan/12 16:54.
Issue Type	Key	Summary	Assignee	Reporter	Priority	Status	Resolution	Created	Updated	Due Date
Bug	MAHOUT-598 <https://issues.apache.org/jira/browse/MAHOUT-598>	Downstream steps in the seq2sparse job flow looking in wrong location for output from previous steps when running in Elastic MapReduce (EMR) cluster	Robin Anil	Timothy Potter	Major	Open	Unresolved	1/27/11 16:20	1/4/12 16:33	
New Feature	MAHOUT-897 <https://issues.apache.org/jira/browse/MAHOUT-897>	New implementation for LDA: Collapsed Variational Bayes (0th derivative approximation), with map-side model caching	Jake Mannix	Jake Mannix	Major	Patch Available	Unresolved	11/27/11 7:34	12/3/11 5:16	
Bug	MAHOUT-794 <https://issues.apache.org/jira/browse/MAHOUT-794>	Eigencuts produces unexpected results, part 2	Shannon Quinn	Sean Owen	Major	Open	Unresolved	8/21/11 19:37	8/21/11 19:37	
Bug	MAHOUT-826 <https://issues.apache.org/jira/browse/MAHOUT-826>	Bayes/CBayes classification on a non-existing feature	Robin Anil	Andre-Philippe Paquet	Minor	Open	Unresolved	10/3/11 19:09	10/15/11 10:02	
Improvement	MAHOUT-845 <https://issues.apache.org/jira/browse/MAHOUT-845>	Make cluster top terms code more reusable	Unassigned	Frank Scholten	Minor	Patch Available	Unresolved	10/19/11 14:06	1/4/12 16:27	
Improvement	MAHOUT-899 <https://issues.apache.org/jira/browse/MAHOUT-899>	Add Point Sampling, Color coding to ClusterDumper	Grant Ingersoll	Grant Ingersoll	Minor	Open	Unresolved	11/27/11 21:59	11/29/11 4:34	
Improvement	MAHOUT-768 <https://issues.apache.org/jira/browse/MAHOUT-768>	Duplicated DoubleFunction in mahout and mahout-collections (mahout.math package).	Ted Dunning	Dawid Weiss	Minor	Open	Unresolved	7/24/11 7:38	1/4/12 21:39	
Improvement	MAHOUT-941 <https://issues.apache.org/jira/browse/MAHOUT-941>	Improve ConfusionMatrix statistics	Grant Ingersoll	Lance Norskog	Minor	Open	Unresolved	1/5/12 6:46	1/8/12 2:59	
Generated at Sun Jan 08 16:54:22 UTC 2012 using JIRA 4.4.1#660-r161644.



Re: Release 0.6 Outstanding Issues

Posted by Lance Norskog <go...@gmail.com>.
MAHOUT-941 is 0.7. Cross it off, please.

On Sun, Jan 8, 2012 at 2:46 PM, Shannon Quinn <sq...@gatech.edu> wrote:
>
>>> - Bug 794: move to 0.7
>>
>> Hmm, not sure on this.  Seems like it is important, but doesn't have much
>> for info.  Shannon?
>
>
> I'm not sure on this, either. I'll need to poke around and find what the bug
> was :P For now let's say 0.7.



-- 
Lance Norskog
goksron@gmail.com

Re: Release 0.6 Outstanding Issues

Posted by Shannon Quinn <sq...@gatech.edu>.
>> - Bug 794: move to 0.7
> Hmm, not sure on this.  Seems like it is important, but doesn't have much for info.  Shannon?

I'm not sure on this, either. I'll need to poke around and find what the 
bug was :P For now let's say 0.7.

Re: Release 0.6 Outstanding Issues

Posted by Grant Ingersoll <gs...@apache.org>.
On Jan 8, 2012, at 12:10 PM, Jeff Eastman wrote:

> Here's the latest JIRA report for 0.6. In an effort to close the door on issues for code freeze, I suggest we triage:
> 
> - Bug 598: move to 0.7?

+1

> - Feature 897: looks like this can be resolved fixed

That was my assessment, too.  Jake?

> - Bug 794: move to 0.7

Hmm, not sure on this.  Seems like it is important, but doesn't have much for info.  Shannon?

> - Bug 826: fix in 0.6
> - Improvements 845, 899: patch is ready but there is some overlap to be resolved. Complete for 0.6 asap or move to 0.7

I'm testing 899 right now.

> - Improvements 768, 941: move to 0.7

941 can move.

> 
> We also have the showstopper build problem with SequentialOutOfCoreSvdTest. I suggest we create a JIRA for this.
> 

--------------------------------------------
Grant Ingersoll
http://www.lucidimagination.com




Re: Release 0.6 Outstanding Issues

Posted by Grant Ingersoll <gs...@apache.org>.
Benson, 

You did most of this work way back when, what say you?

-Grant

On Jan 8, 2012, at 2:22 PM, Ted Dunning wrote:

> That is a fine solution.  I don't think that anybody will be affected
> negatively by this change and it makes the solution very simple.
> 
> On Sun, Jan 8, 2012 at 10:05 AM, Grant Ingersoll <gs...@apache.org> wrote:
> 
>> I think we should discuss M-768 a bit more.  I'd propose we just fold
>> Collections back into Mahout proper and then we can release it with this
>> release and take care of the duplication by pushing the *Function classes
>> into Collections.
>> 
>> 
>> On Jan 8, 2012, at 12:10 PM, Jeff Eastman wrote:
>> 
>>> Here's the latest JIRA report for 0.6. In an effort to close the door on
>> issues for code freeze, I suggest we triage:
>>> 
>>> - Bug 598: move to 0.7?
>>> - Feature 897: looks like this can be resolved fixed
>>> - Bug 794: move to 0.7
>>> - Bug 826: fix in 0.6
>>> - Improvements 845, 899: patch is ready but there is some overlap to be
>> resolved. Complete for 0.6 asap or move to 0.7
>>> - Improvements 768, 941: move to 0.7
>>> 
>>> We also have the showstopper build problem with
>> SequentialOutOfCoreSvdTest. I suggest we create a JIRA for this.
>>> 
>> 
>> --------------------------------------------
>> Grant Ingersoll
>> http://www.lucidimagination.com
>> 
>> 
>> 
>> 

--------------------------------------------
Grant Ingersoll
http://www.lucidimagination.com




Re: Release 0.6 Outstanding Issues

Posted by Ted Dunning <te...@gmail.com>.
That is a fine solution.  I don't think that anybody will be affected
negatively by this change and it makes the solution very simple.

On Sun, Jan 8, 2012 at 10:05 AM, Grant Ingersoll <gs...@apache.org> wrote:

> I think we should discuss M-768 a bit more.  I'd propose we just fold
> Collections back into Mahout proper and then we can release it with this
> release and take care of the duplication by pushing the *Function classes
> into Collections.
>
>
> On Jan 8, 2012, at 12:10 PM, Jeff Eastman wrote:
>
> > Here's the latest JIRA report for 0.6. In an effort to close the door on
> issues for code freeze, I suggest we triage:
> >
> > - Bug 598: move to 0.7?
> > - Feature 897: looks like this can be resolved fixed
> > - Bug 794: move to 0.7
> > - Bug 826: fix in 0.6
> > - Improvements 845, 899: patch is ready but there is some overlap to be
> resolved. Complete for 0.6 asap or move to 0.7
> > - Improvements 768, 941: move to 0.7
> >
> > We also have the showstopper build problem with
> SequentialOutOfCoreSvdTest. I suggest we create a JIRA for this.
> >
>
> --------------------------------------------
> Grant Ingersoll
> http://www.lucidimagination.com
>
>
>
>

Re: Release 0.6 Outstanding Issues

Posted by Grant Ingersoll <gs...@apache.org>.
I think we should discuss M-768 a bit more.  I'd propose we just fold Collections back into Mahout proper and then we can release it with this release and take care of the duplication by pushing the *Function classes into Collections.


On Jan 8, 2012, at 12:10 PM, Jeff Eastman wrote:

> Here's the latest JIRA report for 0.6. In an effort to close the door on issues for code freeze, I suggest we triage:
> 
> - Bug 598: move to 0.7?
> - Feature 897: looks like this can be resolved fixed
> - Bug 794: move to 0.7
> - Bug 826: fix in 0.6
> - Improvements 845, 899: patch is ready but there is some overlap to be resolved. Complete for 0.6 asap or move to 0.7
> - Improvements 768, 941: move to 0.7
> 
> We also have the showstopper build problem with SequentialOutOfCoreSvdTest. I suggest we create a JIRA for this.
> 

--------------------------------------------
Grant Ingersoll
http://www.lucidimagination.com