You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@mahout.apache.org by beneo_7 <be...@163.com> on 2010/11/25 10:40:25 UTC

how to make mahout work in hadoop

i made a pseudo-distributed hadoop node by http://hadoop.apache.org/common/docs/r0.21.0/single_node_setup.html#Pseudo-Distributed Operation, and run the hadoop-examples successfully.


however, i use


./mahout canopy -i /home/space/lucene_clustering/vector/vector -o /home/space/lucene_clustering/canopy/ -dm org.apache.mahout.common.distance.EuclideanDistanceMeasure -t1 0.8 -t2 0.2 -ow


to start the mahout, there is no reponse after the log


0/11/25 17:19:18 INFO common.AbstractJob: Command line arguments: {--distanceMeasure=org.apache.mahout.common.distance.EuclideanDistanceMeasure, --endPhase=2147483647, --input=/home/hadoop/vector/vector, --method=mapreduce, --output=/home/hadoop/canopy/, --overwrite=null, --startPhase=0, --t1=0.8, --t2=0.2, --tempDir=temp}
10/11/25 17:19:18 INFO canopy.CanopyDriver: Build Clusters Input: /home/hadoop/vector/vector Out: /home/hadoop/canopy Measure: org.apache.mahout.common.distance.EuclideanDistanceMeasure@2e716cb7 t1: 0.8 t2: 0.2

and i test the
https://cwiki.apache.org/confluence/display/MAHOUT/Clustering+of+synthetic+control+data
unfortunately, it did not work for me.

after the log
10/11/25 17:42:24 WARN driver.MahoutDriver: No org.apache.mahout.clustering.syntheticcontrol.canopy.Job.props found on classpath, will use command-line arguments only
10/11/25 17:42:24 INFO canopy.Job: Running with default arguments
and there is no more response


Re: how to make mahout work in hadoop

Posted by Ted Dunning <te...@gmail.com>.
Do you have a working hadoop cluster?

2010/11/25 beneo_7 <be...@163.com>

> i made a pseudo-distributed hadoop node by
> http://hadoop.apache.org/common/docs/r0.21.0/single_node_setup.html#Pseudo-DistributedOperation, and run the hadoop-examples successfully.
>
>
> however, i use
>
>
> ./mahout canopy -i /home/space/lucene_clustering/vector/vector -o
> /home/space/lucene_clustering/canopy/ -dm
> org.apache.mahout.common.distance.EuclideanDistanceMeasure -t1 0.8 -t2 0.2
> -ow
>
>
> to start the mahout, there is no reponse after the log
>
>
> 0/11/25 17:19:18 INFO common.AbstractJob: Command line arguments:
> {--distanceMeasure=org.apache.mahout.common.distance.EuclideanDistanceMeasure,
> --endPhase=2147483647, --input=/home/hadoop/vector/vector,
> --method=mapreduce, --output=/home/hadoop/canopy/, --overwrite=null,
> --startPhase=0, --t1=0.8, --t2=0.2, --tempDir=temp}
> 10/11/25 17:19:18 INFO canopy.CanopyDriver: Build Clusters Input:
> /home/hadoop/vector/vector Out: /home/hadoop/canopy Measure:
> org.apache.mahout.common.distance.EuclideanDistanceMeasure@2e716cb7 t1:
> 0.8 t2: 0.2
>
> and i test the
>
> https://cwiki.apache.org/confluence/display/MAHOUT/Clustering+of+synthetic+control+data
> unfortunately, it did not work for me.
>
> after the log
> 10/11/25 17:42:24 WARN driver.MahoutDriver: No
> org.apache.mahout.clustering.syntheticcontrol.canopy.Job.props found on
> classpath, will use command-line arguments only
> 10/11/25 17:42:24 INFO canopy.Job: Running with default arguments
> and there is no more response
>
>