You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@mahout.apache.org by Wasim <wa...@gmail.com> on 2011/03/01 21:25:17 UTC

Help with Cluster Dumper - no output shown

Hi,

First I ran k-means algorithm using the article Introduction to Apache
Mahout<https://www.ibm.com/developerworks/java/library/j-mahout/index.html#resources>with
following arguments:

<java classname="org.apache.mahout.clustering.kmeans.KMeansDriver"
          fork="true" maxmemory="738m">
      <classpath refid="runtime.classpath"/>
      <arg value="--input"/>
      <arg value="${wiki.dir}/n2/part-full.txt"/>
      <arg value="--clusters"/>
      <arg value="${wiki.dir}/n2/k-output/clusters-in"/>
      <arg value="--k"/>
      <arg value="10"/>
      <arg value="--output"/>
      <arg value="${wiki.dir}/n2/k-output"/>
      <arg value="--distance"/>
      <arg value="org.apache.mahout.utils.CosineDistanceMeasure"/>
      <arg value="--convergence"/>
      <arg value="0.01"/>
      <arg value="--overwrite"/>
    </java>

Now i have the following directories in my "k-output" folder:
clusters-0,clusters-1,clusters-2,clusters-3,clusters-4,clusters-in and
points

Then when i am trying to run cluster-dumper utility using Standalone Java
program:

$ bin/mahout clusterdump --seqFileDir
/cygdrive/c/users/wasim/Downloads/apache-mahout-examples/wikipedia/n2/k-output/clusters-10/
--pointsDir
/cygdrive/c/users/wasim/Downloads/apache-mahout-examples/wikipedia/n2/k-output/points/

i got the following output:

no HADOOP_HOME set, running locally

Mar 1, 2011 8:57:49 PM org.slf4j.impl.JCLLoggerAdapter info

INFO: Command line arguments: {--dictionaryType=text, --endPhase=2147483647,
--pointsDir=/cygdrive/c/users/wasim/Downloads/apache-mahout-examples/wikipedia/n2/k
-output/points/,
--seqFileDir=/cygdrive/c/users/wasim/Downloads/apache-mahout-examples/wikipedia/n2/k-output/clusters-10/,
--startPhase=0, --tempDir=temp}

Mar 1, 2011 8:57:49 PM org.slf4j.impl.JCLLoggerAdapter info
INFO: Program took 332 ms

Why i am not getting clustering data as output???

I am running this commands through cygwin installed on windows machine.


-- 
Thank you & Regards
Muhammad Wasimullah Khan
Mobile:+46 72 03 29 205
Alt.Telephone: +92 345 21 98 451
Email: mwkhan@kth.se
Skype: muhammad.wasim.khan