You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@mahout.apache.org by co...@apache.org on 2009/10/09 12:26:00 UTC

[CONF] Apache Lucene Mahout > TasteCommandLine

Space: Apache Lucene Mahout (http://cwiki.apache.org/confluence/display/MAHOUT)
Page: TasteCommandLine (http://cwiki.apache.org/confluence/display/MAHOUT/TasteCommandLine)

Added by Isabel Drost:
---------------------------------------------------------------------
h1. Introduction 

This quick start page describes how to run the hadoop based recommendation jobs of Mahout Taste on a Hadoop cluster. 

h1. Steps 

h2. Testing it on one single machine w/o cluster 

In the examples directory type: 
{code} 
mvn -q exec:java -Dexec.mainClass="org.apache.mahout.cf.taste.hadoop.SlopeOneDiffsAveragesJob" -Dexec.args="<OPTIONS>" 
mvn -q exec:java -Dexec.mainClass="org.apache.mahout.cf.taste.hadoop.SlopeOnePrefsToDiffsJob" -Dexec.args="<OPTIONS>" 
mvn -q exec:java -Dexec.mainClass="org.apache.mahout.cf.taste.hadoop.RecommenderJob" -Dexec.args="<OPTIONS>" 
{code} 

h2. Running it on the cluster 

* In $MAHOUT_HOME/, build the jar containing the job (mvn install) The job will be generated in $MAHOUT_HOME/core/target/ and it's name will contain the Mahout version number. For example, when using Mahout 0.1 release, the job will be mahout-core-0.1.jar 
* (Optional) 1 Start up Hadoop: $HADOOP_HOME/bin/start-all.sh 
* Put the data: $HADOOP_HOME/bin/hadoop fs -put <PATH TO DATA> testdata 
* Run the Job: $HADOOP_HOME/bin/hadoop jar $MAHOUT_HOME/core/target/mahout-core-<MAHOUT VERSION>.job org.apache.mahout.cf.taste.hadoop.<JOB> <OPTIONS> 
* Get the data out of HDFS and have a look. Use bin/hadoop fs -lsr output to view all outputs. 

h1. Command line options 
{code} 
Usage:
SlopeOneDiffsAveragesJob and SlopeOnePrefsToDiffsJob [--input <input> --output <output> --help]                                     
Options                                                                         
  --input (-i) input      The Path for input preferences file.                  
  --output (-o) output    The directory pathname for output.             
  --help (-h)             Print out help.
{code}

{code}

Usage:                                                                          
Recommender [--recommenderClassName <recommenderClassName> --userRec <userRec> --userIdFile <userIdFile> --dataModelFile <dataModelFile> --output <output> --help]                                                                         
Options                                                                         
  --recommenderClassName (-r) recommenderClassName    Name of recommender class to use.
  --userRec (-n) userRec                              Desired number of recommendations per user.
  --userIdFile (-f) userIdFile                        File containing user ids.
  --dataModelFile (-m) dataModelFile                  File containing data model.
  --output (-o) output                                The directory pathname for output.
  --help (-h)                                         Print out help.
{code}

Change your notification preferences: http://cwiki.apache.org/confluence/users/viewnotifications.action