Posted to commits@mahout.apache.org by co...@apache.org on 2009/06/17 00:06:01 UTC

[CONF] Apache Lucene Mahout: ClusteringYourData (page created)

ClusteringYourData (MAHOUT) created by Grant Ingersoll
   http://cwiki.apache.org/confluence/display/MAHOUT/ClusteringYourData

Content:
---------------------------------------------------------------------

+*Mahout_0.2*+

After you've done the [QuickStart] and are familiar with the basics of Mahout, it is time to cluster your own data. 

The following pieces *may* be useful in getting started:

h1. Input

For starters, you will need your data in an appropriate Vector format. Note that the Vector format has changed since Mahout 0.1.

h2. Text Preparation

* See [Creating Vectors from Text] 
* http://www.lucidimagination.com/search/document/4a0e528982b2dac3/document_clustering
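
As a rough illustration of what "data in a Vector format" means for text, the sketch below turns each document into sparse (index, term frequency) entries. It is *not* Mahout's API: it uses only the JDK, and the names {{TermVectorSketch}}, {{buildDictionary}}, {{toSparseVector}} and {{tokenize}} are hypothetical, chosen for this example. The tokenizer is deliberately naive (lower-casing and splitting on non-alphanumeric characters), whereas a real pipeline would more likely follow [Creating Vectors from Text].

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical helper, not part of Mahout: shows the general idea of
// mapping text to sparse term-frequency vectors.
class TermVectorSketch {

    // Assigns each distinct token an integer index, in order of first appearance.
    static Map<String, Integer> buildDictionary(String[] documents) {
        Map<String, Integer> dictionary = new LinkedHashMap<>();
        for (String doc : documents) {
            for (String token : tokenize(doc)) {
                if (!dictionary.containsKey(token)) {
                    dictionary.put(token, dictionary.size());
                }
            }
        }
        return dictionary;
    }

    // Maps one document to sparse (index -> term frequency) entries.
    // Tokens missing from the dictionary are ignored.
    static Map<Integer, Double> toSparseVector(String doc, Map<String, Integer> dictionary) {
        Map<Integer, Double> vector = new TreeMap<>();
        for (String token : tokenize(doc)) {
            Integer index = dictionary.get(token);
            if (index != null) {
                vector.merge(index, 1.0, Double::sum);
            }
        }
        return vector;
    }

    // Naive tokenizer (an assumption of this sketch): lower-case the text,
    // split on runs of non-alphanumeric characters, drop empty tokens.
    static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        for (String token : text.toLowerCase().split("[^a-z0-9]+")) {
            if (!token.isEmpty()) {
                tokens.add(token);
            }
        }
        return tokens;
    }

    public static void main(String[] args) {
        String[] documents = {"the cat sat", "the cat and the dog"};
        Map<String, Integer> dictionary = buildDictionary(documents);
        for (String doc : documents) {
            System.out.println(doc + " -> " + toSparseVector(doc, dictionary));
        }
    }
}
```

A sparse map is used here because most documents contain only a small fraction of the overall vocabulary, so storing just the non-zero entries keeps each vector small.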

h1. Running the Process

+*TODO*+ FILL ME IN

h1. Validating the Output

+*TODO*+ FILL ME IN

h1. References

* [Mahout archive references|http://www.lucidimagination.com/search/p:mahout?q=clustering]

---------------------------------------------------------------------