Posted to dev@mahout.apache.org by "Hudson (JIRA)" <ji...@apache.org> on 2011/05/09 05:33:03 UTC

[jira] [Commented] (MAHOUT-479) Streamline classification/ clustering data structures

    [ https://issues.apache.org/jira/browse/MAHOUT-479?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13030592#comment-13030592 ] 

Hudson commented on MAHOUT-479:
-------------------------------

Integrated in Mahout-Quality #805 (See [https://builds.apache.org/hudson/job/Mahout-Quality/805/])
    MAHOUT-479: added a new iterate method to ClusterIterator. The method accepts three Hadoop Paths (input, prior, and output) plus the desired number of iterations. All algorithm data is pulled from and pushed to SequenceFiles. Added a unit test and improved the examples DisplayKMeans, DisplayFuzzyKMeans and DisplayDirichlet to use the new file-based implementation. Check out Dirichlet.


> Streamline classification/ clustering data structures
> -----------------------------------------------------
>
>                 Key: MAHOUT-479
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-479
>             Project: Mahout
>          Issue Type: Improvement
>          Components: Classification, Clustering
>    Affects Versions: 0.1, 0.2, 0.3, 0.4
>            Reporter: Isabel Drost
>            Assignee: Isabel Drost
>
> Opening this JIRA issue to collect ideas on how to streamline our classification and clustering algorithms and make integration easier for users, as per the mailing list thread http://markmail.org/message/pnzvrqpv5226twfs
> {quote}
> Jake and Robin and I were talking the other evening and a common lament was that our classification (and clustering) stuff was all over the map in terms of data structures.  Driving that to rest and getting those comments even vaguely as plug and play as our much more advanced recommendation components would be very, very helpful.
> {quote}
> This issue probably also relates to MAHOUT-287 (the intention there is to make naive Bayes run on vectors as input).
> Ted, Jake, Robin: It would be great if one of you could add a comment on some of the issues you discussed "the other evening" and (if applicable) any minor or major changes you think could help solve this issue.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira