You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@mahout.apache.org by "Grant Ingersoll (Created) (JIRA)" <ji...@apache.org> on 2011/11/01 16:33:33 UTC

[jira] [Created] (MAHOUT-857) Rework 20 NewsGroup shell script example to include SGD Example

Rework 20 NewsGroup shell script example to include SGD Example
---------------------------------------------------------------

                 Key: MAHOUT-857
                 URL: https://issues.apache.org/jira/browse/MAHOUT-857
             Project: Mahout
          Issue Type: Improvement
            Reporter: Grant Ingersoll


We have build-20news-bayes.sh that runs our NB stuff on 20 news groups.  We also have an SGD example that works on 20 news groups, but no script to run it.  I'm going to rename build-20news-bayes.sh to classify-20news.sh and incorporate the two.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Commented] (MAHOUT-857) Rework 20 NewsGroup shell script example to include SGD Example

Posted by "Grant Ingersoll (Commented) (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/MAHOUT-857?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13141464#comment-13141464 ] 

Grant Ingersoll commented on MAHOUT-857:
----------------------------------------

I committed the last patch, plus some formatting.  Since this is an example, I figured we can just iterate on a committed version.
                
> Rework 20 NewsGroup shell script example to include SGD Example
> ---------------------------------------------------------------
>
>                 Key: MAHOUT-857
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-857
>             Project: Mahout
>          Issue Type: Improvement
>            Reporter: Grant Ingersoll
>         Attachments: MAHOUT-857.patch, MAHOUT-857.patch, MAHOUT-857.patch
>
>
> We have build-20news-bayes.sh that runs our NB stuff on 20 news groups.  We also have an SGD example that works on 20 news groups, but no script to run it.  I'm going to rename build-20news-bayes.sh to classify-20news.sh and incorporate the two.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Commented] (MAHOUT-857) Rework 20 NewsGroup shell script example to include SGD Example

Posted by "Hudson (Commented) (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/MAHOUT-857?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13141621#comment-13141621 ] 

Hudson commented on MAHOUT-857:
-------------------------------

Integrated in Mahout-Quality #1130 (See [https://builds.apache.org/job/Mahout-Quality/1130/])
    MAHOUT-857: minor formatting
MAHOUT-857: hook in support for SGD to the 20 newsgroups

gsingers : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1196207
Files : 
* /mahout/trunk/examples/src/main/java/org/apache/mahout/classifier/sgd/NewsgroupHelper.java
* /mahout/trunk/examples/src/main/java/org/apache/mahout/classifier/sgd/TestNewsGroups.java

gsingers : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1196206
Files : 
* /mahout/trunk/examples/bin/build-20news-bayes.sh
* /mahout/trunk/examples/bin/classify-20newsgroups.sh
* /mahout/trunk/examples/src/main/java/org/apache/mahout/classifier/sgd/NewsgroupHelper.java
* /mahout/trunk/examples/src/main/java/org/apache/mahout/classifier/sgd/TestNewsGroups.java
* /mahout/trunk/examples/src/main/java/org/apache/mahout/classifier/sgd/TrainNewsGroups.java

                
> Rework 20 NewsGroup shell script example to include SGD Example
> ---------------------------------------------------------------
>
>                 Key: MAHOUT-857
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-857
>             Project: Mahout
>          Issue Type: Improvement
>            Reporter: Grant Ingersoll
>         Attachments: MAHOUT-857-ll.patch, MAHOUT-857.patch, MAHOUT-857.patch, MAHOUT-857.patch
>
>
> We have build-20news-bayes.sh that runs our NB stuff on 20 news groups.  We also have an SGD example that works on 20 news groups, but no script to run it.  I'm going to rename build-20news-bayes.sh to classify-20news.sh and incorporate the two.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Commented] (MAHOUT-857) Rework 20 NewsGroup shell script example to include SGD Example

Posted by "Grant Ingersoll (Commented) (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/MAHOUT-857?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13141321#comment-13141321 ] 

Grant Ingersoll commented on MAHOUT-857:
----------------------------------------

Here's the conf. matrix I'm getting, which clearly points to some idiocy on my part:
{quote}

7532 test files
=======================================================
Summary
-------------------------------------------------------
Correctly Classified Instances          :        374	    4.9655%
Incorrectly Classified Instances        :       7158	   95.0345%
Total Classified Instances              :       7532

=======================================================
Confusion Matrix
-------------------------------------------------------
a    	b    	c    	d    	e    	f    	g    	h    	i    	j    	k    	l    	m    	n    	o    	p    	q    	r    	s    	t    	u    	<--Classified as
123  	0    	1    	1    	1    	2    	6    	19   	2    	2    	5    	23   	27   	8    	53   	3    	14   	17   	12   	0    	0    	 |  319   	a     = alt.atheism
55   	16   	28   	14   	80   	24   	3    	8    	4    	3    	8    	86   	27   	28   	0    	2    	3    	0    	0    	0    	0    	 |  389   	b     = comp.graphics
38   	171  	57   	14   	49   	5    	3    	6    	2    	4    	3    	25   	7    	6    	1    	1    	0    	2    	0    	0    	0    	 |  394   	c     = comp.os.ms-windows.misc
10   	14   	237  	18   	17   	15   	2    	7    	4    	0    	2    	54   	7    	4    	0    	0    	0    	1    	0    	0    	0    	 |  392   	d     = comp.sys.ibm.pc.hardware
20   	10   	55   	159  	17   	20   	7    	11   	5    	0    	1    	63   	13   	2    	0    	1    	0    	1    	0    	0    	0    	 |  385   	e     = comp.sys.mac.hardware
11   	25   	5    	0    	306  	13   	3    	1    	0    	5    	2    	13   	5    	6    	0    	0    	0    	0    	0    	0    	0    	 |  395   	f     = comp.windows.x
2    	1    	23   	14   	6    	310  	1    	3    	3    	1    	1    	10   	6    	5    	0    	3    	0    	1    	0    	0    	0    	 |  390   	g     = misc.forsale
8    	1    	6    	2    	9    	11   	270  	15   	10   	3    	3    	37   	11   	4    	0    	2    	0    	4    	0    	0    	0    	 |  396   	h     = rec.autos
7    	0    	1    	1    	8    	6    	14   	326  	1    	0    	1    	12   	17   	3    	1    	0    	0    	0    	0    	0    	0    	 |  398   	i     = rec.motorcycles
17   	1    	2    	1    	2    	5    	2    	7    	295  	26   	1    	16   	12   	2    	0    	2    	3    	3    	0    	0    	0    	 |  397   	j     = rec.sport.baseball
6    	1    	0    	0    	1    	3    	3    	6    	55   	291  	1    	7    	4    	14   	2    	4    	1    	0    	0    	0    	0    	 |  399   	k     = rec.sport.hockey
22   	2    	0    	3    	5    	3    	0    	3    	2    	1    	293  	24   	12   	7    	0    	4    	2    	13   	0    	0    	0    	 |  396   	l     = sci.crypt
25   	6    	23   	13   	15   	11   	10   	18   	4    	3    	13   	212  	18   	16   	2    	1    	1    	2    	0    	0    	0    	 |  393   	m     = sci.electronics
14   	4    	5    	2    	5    	7    	2    	17   	7    	3    	0    	38   	268  	11   	4    	3    	4    	2    	0    	0    	0    	 |  396   	n     = sci.med
22   	1    	0    	1    	3    	4    	0    	8    	1    	4    	2    	34   	26   	279  	0    	2    	2    	5    	0    	0    	0    	 |  394   	o     = sci.space
43   	1    	2    	4    	0    	4    	1    	11   	4    	1    	0    	9    	33   	8    	249  	2    	5    	14   	7    	0    	0    	 |  398   	p     = soc.religion.christian
21   	0    	0    	1    	3    	3    	2    	12   	6    	2    	3    	10   	16   	5    	1    	235  	4    	40   	0    	0    	0    	 |  364   	q     = talk.politics.guns
41   	0    	0    	2    	1    	1    	5    	3    	3    	7    	0    	10   	12   	5    	1    	8    	250  	27   	0    	0    	0    	 |  376   	r     = talk.politics.mideast
34   	0    	0    	1    	2    	4    	3    	16   	2    	1    	5    	14   	12   	6    	4    	67   	8    	131  	0    	0    	0    	 |  310   	s     = talk.politics.misc
50   	0    	0    	1    	2    	0    	1    	15   	7    	0    	3    	11   	21   	7    	53   	17   	6    	19   	38   	0    	0    	 |  251   	t     = talk.religion.misc
0    	0    	0    	0    	0    	0    	0    	0    	0    	0    	0    	0    	0    	0    	0    	0    	0    	0    	0    	0    	0    	 |  0     	u     = DEFAULT
Default Category: DEFAULT: 20
{quote}
                
> Rework 20 NewsGroup shell script example to include SGD Example
> ---------------------------------------------------------------
>
>                 Key: MAHOUT-857
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-857
>             Project: Mahout
>          Issue Type: Improvement
>            Reporter: Grant Ingersoll
>         Attachments: MAHOUT-857.patch
>
>
> We have build-20news-bayes.sh that runs our NB stuff on 20 news groups.  We also have an SGD example that works on 20 news groups, but no script to run it.  I'm going to rename build-20news-bayes.sh to classify-20news.sh and incorporate the two.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Commented] (MAHOUT-857) Rework 20 NewsGroup shell script example to include SGD Example

Posted by "Grant Ingersoll (Commented) (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/MAHOUT-857?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13141353#comment-13141353 ] 

Grant Ingersoll commented on MAHOUT-857:
----------------------------------------

Here's the new confusion matrix:
{quote}

=======================================================
Summary
-------------------------------------------------------
Correctly Classified Instances          :       5137	   68.2023%
Incorrectly Classified Instances        :       2395	   31.7977%
Total Classified Instances              :       7532

=======================================================
Confusion Matrix
-------------------------------------------------------
a    	b    	c    	d    	e    	f    	g    	h    	i    	j    	k    	l    	m    	n    	o    	p    	q    	r    	s    	t    	u    	<--Classified as
0    	76   	2    	0    	0    	1    	3    	0    	3    	0    	1    	4    	4    	19   	16   	70   	3    	6    	28   	83   	0    	 |  319   	a     = alt.atheism
0    	45   	55   	25   	20   	92   	24   	5    	6    	2    	0    	5    	71   	11   	13   	7    	1    	0    	5    	2    	0    	 |  389   	b     = comp.graphics
0    	12   	281  	25   	16   	20   	2    	2    	4    	0    	0    	2    	8    	3    	5    	4    	0    	0    	4    	6    	0    	 |  394   	c     = comp.os.ms-windows.misc
0    	4    	34   	246  	18   	15   	11   	2    	4    	0    	0    	1    	47   	3    	2    	2    	0    	0    	3    	0    	0    	 |  392   	d     = comp.sys.ibm.pc.hardware
0    	4    	10   	39   	253  	4    	12   	3    	5    	3    	0    	0    	29   	9    	8    	2    	0    	0    	4    	0    	0    	 |  385   	e     = comp.sys.mac.hardware
0    	4    	33   	1    	2    	314  	8    	0    	3    	2    	1    	4    	5    	3    	7    	5    	0    	0    	2    	1    	0    	 |  395   	f     = comp.windows.x
0    	1    	2    	14   	9    	1    	336  	1    	2    	0    	0    	1    	15   	1    	4    	3    	0    	0    	0    	0    	0    	 |  390   	g     = misc.forsale
0    	2    	1    	1    	2    	4    	18   	259  	37   	6    	0    	0    	28   	5    	15   	4    	2    	0    	11   	1    	0    	 |  396   	h     = rec.autos
0    	2    	3    	3    	0    	1    	6    	13   	344  	1    	0    	2    	6    	0    	6    	6    	1    	0    	3    	1    	0    	 |  398   	i     = rec.motorcycles
0    	2    	0    	0    	0    	6    	6    	1    	1    	338  	16   	0    	11   	5    	2    	2    	0    	0    	5    	2    	0    	 |  397   	j     = rec.sport.baseball
0    	2    	3    	0    	1    	0    	3    	1    	4    	22   	348  	0    	2    	0    	2    	4    	0    	0    	4    	3    	0    	 |  399   	k     = rec.sport.hockey
0    	9    	3    	0    	1    	3    	2    	0    	4    	3    	1    	310  	15   	1    	11   	13   	5    	1    	14   	0    	0    	 |  396   	l     = sci.crypt
0    	12   	6    	17   	12   	8    	9    	7    	6    	2    	0    	9    	269  	2    	19   	10   	1    	0    	4    	0    	0    	 |  393   	m     = sci.electronics
0    	4    	0    	1    	1    	5    	16   	9    	9    	2    	0    	1    	29   	270  	12   	9    	2    	0    	19   	7    	0    	 |  396   	n     = sci.med
0    	4    	1    	0    	2    	8    	4    	0    	1    	1    	0    	1    	9    	5    	342  	5    	1    	1    	6    	3    	0    	 |  394   	o     = sci.space
0    	4    	3    	1    	0    	2    	3    	2    	5    	0    	0    	1    	5    	13   	6    	301  	0    	1    	6    	45   	0    	 |  398   	p     = soc.religion.christian
0    	4    	2    	0    	0    	0    	2    	2    	5    	2    	1    	6    	6    	4    	5    	7    	283  	3    	24   	8    	0    	 |  364   	q     = talk.politics.guns
0    	1    	0    	0    	0    	2    	1    	0    	5    	4    	0    	2    	3    	1    	5    	14   	6    	286  	39   	7    	0    	 |  376   	r     = talk.politics.mideast
0    	0    	1    	0    	0    	1    	1    	0    	1    	1    	1    	4    	1    	8    	5    	10   	91   	0    	179  	6    	0    	 |  310   	s     = talk.politics.misc
0    	13   	1    	1    	0    	0    	3    	1    	1    	1    	0    	0    	1    	6    	15   	46   	16   	1    	12   	133  	0    	 |  251   	t     = talk.religion.misc
0    	0    	0    	0    	0    	0    	0    	0    	0    	0    	0    	0    	0    	0    	0    	0    	0    	0    	0    	0    	0    	 |  0     	u     = DEFAULT
Default Category: DEFAULT: 20

{quote}
                
> Rework 20 NewsGroup shell script example to include SGD Example
> ---------------------------------------------------------------
>
>                 Key: MAHOUT-857
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-857
>             Project: Mahout
>          Issue Type: Improvement
>            Reporter: Grant Ingersoll
>         Attachments: MAHOUT-857.patch, MAHOUT-857.patch
>
>
> We have build-20news-bayes.sh that runs our NB stuff on 20 news groups.  We also have an SGD example that works on 20 news groups, but no script to run it.  I'm going to rename build-20news-bayes.sh to classify-20news.sh and incorporate the two.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Updated] (MAHOUT-857) Rework 20 NewsGroup shell script example to include SGD Example

Posted by "Grant Ingersoll (Updated) (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/MAHOUT-857?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Grant Ingersoll updated MAHOUT-857:
-----------------------------------

    Attachment: MAHOUT-857.patch

Here's a patch.  It isn't correct yet for running the test, as it gives terrible results, but I am hoping a second set of eyes will help.
                
> Rework 20 NewsGroup shell script example to include SGD Example
> ---------------------------------------------------------------
>
>                 Key: MAHOUT-857
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-857
>             Project: Mahout
>          Issue Type: Improvement
>            Reporter: Grant Ingersoll
>         Attachments: MAHOUT-857.patch
>
>
> We have build-20news-bayes.sh that runs our NB stuff on 20 news groups.  We also have an SGD example that works on 20 news groups, but no script to run it.  I'm going to rename build-20news-bayes.sh to classify-20news.sh and incorporate the two.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Commented] (MAHOUT-857) Rework 20 NewsGroup shell script example to include SGD Example

Posted by "Grant Ingersoll (Commented) (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/MAHOUT-857?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13141371#comment-13141371 ] 

Grant Ingersoll commented on MAHOUT-857:
----------------------------------------

Working through some more of this, AUC doesn't make sense here.
                
> Rework 20 NewsGroup shell script example to include SGD Example
> ---------------------------------------------------------------
>
>                 Key: MAHOUT-857
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-857
>             Project: Mahout
>          Issue Type: Improvement
>            Reporter: Grant Ingersoll
>         Attachments: MAHOUT-857.patch, MAHOUT-857.patch
>
>
> We have build-20news-bayes.sh that runs our NB stuff on 20 news groups.  We also have an SGD example that works on 20 news groups, but no script to run it.  I'm going to rename build-20news-bayes.sh to classify-20news.sh and incorporate the two.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Commented] (MAHOUT-857) Rework 20 NewsGroup shell script example to include SGD Example

Posted by "Ted Dunning (Commented) (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/MAHOUT-857?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13141479#comment-13141479 ] 

Ted Dunning commented on MAHOUT-857:
------------------------------------

That looks like off by one somewhere in the categorization.  Training often has this problem.
                
> Rework 20 NewsGroup shell script example to include SGD Example
> ---------------------------------------------------------------
>
>                 Key: MAHOUT-857
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-857
>             Project: Mahout
>          Issue Type: Improvement
>            Reporter: Grant Ingersoll
>         Attachments: MAHOUT-857.patch, MAHOUT-857.patch, MAHOUT-857.patch
>
>
> We have build-20news-bayes.sh that runs our NB stuff on 20 news groups.  We also have an SGD example that works on 20 news groups, but no script to run it.  I'm going to rename build-20news-bayes.sh to classify-20news.sh and incorporate the two.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Updated] (MAHOUT-857) Rework 20 NewsGroup shell script example to include SGD Example

Posted by "Grant Ingersoll (Updated) (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/MAHOUT-857?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Grant Ingersoll updated MAHOUT-857:
-----------------------------------

    Attachment: MAHOUT-857.patch

Much better looking patch.  Cleaned up the code, dropped the changes to RunLogistic, put in some smarter handling of the temporary data directories.
                
> Rework 20 NewsGroup shell script example to include SGD Example
> ---------------------------------------------------------------
>
>                 Key: MAHOUT-857
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-857
>             Project: Mahout
>          Issue Type: Improvement
>            Reporter: Grant Ingersoll
>         Attachments: MAHOUT-857.patch, MAHOUT-857.patch, MAHOUT-857.patch
>
>
> We have build-20news-bayes.sh that runs our NB stuff on 20 news groups.  We also have an SGD example that works on 20 news groups, but no script to run it.  I'm going to rename build-20news-bayes.sh to classify-20news.sh and incorporate the two.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Commented] (MAHOUT-857) Rework 20 NewsGroup shell script example to include SGD Example

Posted by "Hudson (Commented) (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/MAHOUT-857?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13141863#comment-13141863 ] 

Hudson commented on MAHOUT-857:
-------------------------------

Integrated in Mahout-Quality #1132 (See [https://builds.apache.org/job/Mahout-Quality/1132/])
    MAHOUT-857: add in LogLikelihood and OnlineSummarizer to ResultAnalyzer

gsingers : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1196420
Files : 
* /mahout/trunk/core/src/main/java/org/apache/mahout/classifier/ClassifierResult.java
* /mahout/trunk/core/src/main/java/org/apache/mahout/classifier/ResultAnalyzer.java
* /mahout/trunk/examples/src/main/java/org/apache/mahout/classifier/sgd/TestNewsGroups.java
* /mahout/trunk/examples/src/main/java/org/apache/mahout/classifier/sgd/TrainNewsGroups.java

                
> Rework 20 NewsGroup shell script example to include SGD Example
> ---------------------------------------------------------------
>
>                 Key: MAHOUT-857
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-857
>             Project: Mahout
>          Issue Type: Improvement
>            Reporter: Grant Ingersoll
>         Attachments: MAHOUT-857-ll.patch, MAHOUT-857.patch, MAHOUT-857.patch, MAHOUT-857.patch
>
>
> We have build-20news-bayes.sh that runs our NB stuff on 20 news groups.  We also have an SGD example that works on 20 news groups, but no script to run it.  I'm going to rename build-20news-bayes.sh to classify-20news.sh and incorporate the two.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Resolved] (MAHOUT-857) Rework 20 NewsGroup shell script example to include SGD Example

Posted by "Grant Ingersoll (Resolved) (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/MAHOUT-857?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Grant Ingersoll resolved MAHOUT-857.
------------------------------------

       Resolution: Fixed
    Fix Version/s: 0.6
    
> Rework 20 NewsGroup shell script example to include SGD Example
> ---------------------------------------------------------------
>
>                 Key: MAHOUT-857
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-857
>             Project: Mahout
>          Issue Type: Improvement
>            Reporter: Grant Ingersoll
>             Fix For: 0.6
>
>         Attachments: MAHOUT-857-ll.patch, MAHOUT-857.patch, MAHOUT-857.patch, MAHOUT-857.patch
>
>
> We have build-20news-bayes.sh that runs our NB stuff on 20 news groups.  We also have an SGD example that works on 20 news groups, but no script to run it.  I'm going to rename build-20news-bayes.sh to classify-20news.sh and incorporate the two.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Updated] (MAHOUT-857) Rework 20 NewsGroup shell script example to include SGD Example

Posted by "Grant Ingersoll (Updated) (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/MAHOUT-857?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Grant Ingersoll updated MAHOUT-857:
-----------------------------------

    Attachment: MAHOUT-857.patch

Looks like it was an off by one error due to the use of classify versus classifyFull
                
> Rework 20 NewsGroup shell script example to include SGD Example
> ---------------------------------------------------------------
>
>                 Key: MAHOUT-857
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-857
>             Project: Mahout
>          Issue Type: Improvement
>            Reporter: Grant Ingersoll
>         Attachments: MAHOUT-857.patch, MAHOUT-857.patch
>
>
> We have build-20news-bayes.sh that runs our NB stuff on 20 news groups.  We also have an SGD example that works on 20 news groups, but no script to run it.  I'm going to rename build-20news-bayes.sh to classify-20news.sh and incorporate the two.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Updated] (MAHOUT-857) Rework 20 NewsGroup shell script example to include SGD Example

Posted by "Grant Ingersoll (Updated) (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/MAHOUT-857?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Grant Ingersoll updated MAHOUT-857:
-----------------------------------

    Attachment: MAHOUT-857-ll.patch

Add support for log-likelihood capture in the results
                
> Rework 20 NewsGroup shell script example to include SGD Example
> ---------------------------------------------------------------
>
>                 Key: MAHOUT-857
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-857
>             Project: Mahout
>          Issue Type: Improvement
>            Reporter: Grant Ingersoll
>         Attachments: MAHOUT-857-ll.patch, MAHOUT-857.patch, MAHOUT-857.patch, MAHOUT-857.patch
>
>
> We have build-20news-bayes.sh that runs our NB stuff on 20 news groups.  We also have an SGD example that works on 20 news groups, but no script to run it.  I'm going to rename build-20news-bayes.sh to classify-20news.sh and incorporate the two.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira