You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@mahout.apache.org by "Ted Dunning (JIRA)" <ji...@apache.org> on 2011/05/21 23:33:51 UTC

[jira] [Commented] (MAHOUT-668) Adding knn support to Mahout classifiers

    [ https://issues.apache.org/jira/browse/MAHOUT-668?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13037458#comment-13037458 ] 

Ted Dunning commented on MAHOUT-668:
------------------------------------

Style comment: Please add javadocs and remove @author tags

Real comment: Metrics like CityBlock already exist in Mahout.  If you need to re-implement them, you probably will have better
results if you use matrix/vector operations instead of explicit loops.  This is especially true when the assumptions that led
you to your loop structure are violated.  

> Adding knn support to Mahout classifiers
> ----------------------------------------
>
>                 Key: MAHOUT-668
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-668
>             Project: Mahout
>          Issue Type: Improvement
>          Components: Classification
>    Affects Versions: 0.6
>            Reporter: Daniel McEnnis
>              Labels: classification, knn
>         Attachments: MAHOUT-668.pat, Mahout-668-2.patch, Mahout-668-3.patch, Mahout-668.pat
>
>   Original Estimate: 672h
>  Remaining Estimate: 672h
>
> Initial implementation of the knn.  This is a minimum base set with many more possible add-ons including support for text and weka input as well as a classify only (no confusion matrix) back end.  The system was tested on the 20 newsgroup data set.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira