You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@mahout.apache.org by co...@apache.org on 2010/03/23 19:36:00 UTC
[CONF] Apache Lucene Mahout > TasteCommandLine
Space: Apache Lucene Mahout (http://cwiki.apache.org/confluence/display/MAHOUT)
Page: TasteCommandLine (http://cwiki.apache.org/confluence/display/MAHOUT/TasteCommandLine)
Edited by Sean Owen:
---------------------------------------------------------------------
h1. Introduction
This quick start page describes how to run the hadoop based recommendation jobs of Mahout Taste on a Hadoop cluster.
h1. Steps
h2. Testing it on one single machine w/o cluster
In the examples directory type, for example:
{code}
mvn -q exec:java -Dexec.mainClass="org.apache.mahout.cf.taste.hadoop.pseudo.RecommenderJob" -Dexec.args="<OPTIONS>"
{code}
h2. Running it on the cluster
* In $MAHOUT_HOME/, build the jar containing the job (mvn install) The job will be generated in $MAHOUT_HOME/core/target/ and it's name will contain the Mahout version number. For example, when using Mahout 0.3 release, the job will be mahout-core-0.3.jar
* (Optional) 1 Start up Hadoop: $HADOOP_HOME/bin/start-all.sh
* Put the data: $HADOOP_HOME/bin/hadoop fs -put <PATH TO DATA> testdata
* Run the Job: $HADOOP_HOME/bin/hadoop jar $MAHOUT_HOME/core/target/mahout-core-<MAHOUT VERSION>.job org.apache.mahout.cf.taste.hadoop.<JOB> <OPTIONS>
* Get the data out of HDFS and have a look. Use bin/hadoop fs -lsr output to view all outputs.
h1. Command line options
Specify only the command line option "--help" for a complete summary of available command line options. Or, refer to the javadoc for the "Job" class being run.
Change your notification preferences: http://cwiki.apache.org/confluence/users/viewnotifications.action