You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@mahout.apache.org by bu...@apache.org on 2013/11/21 12:13:34 UTC

svn commit: r887494 - in /websites/staging/mahout/trunk/content: ./ users/clustering/fuzzy-k-means-commandline.html

Author: buildbot
Date: Thu Nov 21 11:13:34 2013
New Revision: 887494

Log:
Staging update by buildbot for mahout

Modified:
    websites/staging/mahout/trunk/content/   (props changed)
    websites/staging/mahout/trunk/content/users/clustering/fuzzy-k-means-commandline.html

Propchange: websites/staging/mahout/trunk/content/
------------------------------------------------------------------------------
--- cms:source-revision (original)
+++ cms:source-revision Thu Nov 21 11:13:34 2013
@@ -1 +1 @@
-1544113
+1544114

Modified: websites/staging/mahout/trunk/content/users/clustering/fuzzy-k-means-commandline.html
==============================================================================
--- websites/staging/mahout/trunk/content/users/clustering/fuzzy-k-means-commandline.html (original)
+++ websites/staging/mahout/trunk/content/users/clustering/fuzzy-k-means-commandline.html Thu Nov 21 11:13:34 2013
@@ -430,61 +430,46 @@ to view all outputs.</p>
 <div class="codehilite"><pre>  <span class="o">--</span><span class="n">input</span> <span class="p">(</span><span class="o">-</span><span class="nb">i</span><span class="p">)</span> <span class="n">input</span>                   <span class="n">Path</span> <span class="n">to</span> <span class="n">job</span> <span class="n">input</span> <span class="n">directory</span><span class="p">.</span> 
                            <span class="n">Must</span> <span class="n">be</span> <span class="n">a</span> <span class="n">SequenceFile</span> <span class="n">of</span>    
                            <span class="n">VectorWritable</span>           
-  <span class="o">--</span><span class="n">clusters</span> <span class="p">(</span><span class="o">-</span><span class="n">c</span><span class="p">)</span> <span class="n">clusters</span>             <span class="n">The</span> <span class="n">input</span> <span class="n">centroids</span><span class="p">,</span> <span class="n">as</span>
+  <span class="o">--</span><span class="n">clusters</span> <span class="p">(</span><span class="o">-</span><span class="n">c</span><span class="p">)</span> <span class="n">clusters</span>             <span class="n">The</span> <span class="n">input</span> <span class="n">centroids</span><span class="p">,</span> <span class="n">as</span> <span class="n">Vectors</span><span class="p">.</span> 
+                           <span class="n">Must</span> <span class="n">be</span> <span class="n">a</span> <span class="n">SequenceFile</span> <span class="n">of</span>    
+                           <span class="n">Writable</span><span class="p">,</span> <span class="n">Cluster</span><span class="o">/</span><span class="n">Canopy</span><span class="p">.</span> <span class="n">If</span> <span class="n">k</span>  
+                           <span class="n">is</span> <span class="n">also</span> <span class="n">specified</span><span class="p">,</span> <span class="n">then</span> <span class="n">a</span> <span class="n">random</span> 
+                           <span class="n">set</span> <span class="n">of</span> <span class="n">vectors</span> <span class="n">will</span> <span class="n">be</span> <span class="n">selected</span>  
+                           <span class="n">and</span> <span class="n">written</span> <span class="n">out</span> <span class="n">to</span> <span class="n">this</span> <span class="n">path</span> 
+                           <span class="n">first</span>                
+  <span class="o">--</span><span class="n">output</span> <span class="p">(</span><span class="o">-</span><span class="n">o</span><span class="p">)</span> <span class="n">output</span>                 <span class="n">The</span> <span class="n">directory</span> <span class="n">pathname</span> <span class="k">for</span>   
+                           <span class="n">output</span><span class="p">.</span>              
+  <span class="o">--</span><span class="n">distanceMeasure</span> <span class="p">(</span><span class="o">-</span><span class="n">dm</span><span class="p">)</span> <span class="n">distanceMeasure</span>      <span class="n">The</span> <span class="n">classname</span> <span class="n">of</span> <span class="n">the</span>     
+                           <span class="n">DistanceMeasure</span><span class="p">.</span> <span class="n">Default</span> <span class="n">is</span>  
+                           <span class="n">SquaredEuclidean</span>         
+  <span class="o">--</span><span class="n">convergenceDelta</span> <span class="p">(</span><span class="o">-</span><span class="n">cd</span><span class="p">)</span> <span class="n">convergenceDelta</span>    <span class="n">The</span> <span class="n">convergence</span> <span class="n">delta</span> <span class="n">value</span><span class="p">.</span> 
+                           <span class="n">Default</span> <span class="n">is</span> 0<span class="p">.</span>5           
+  <span class="o">--</span><span class="n">maxIter</span> <span class="p">(</span><span class="o">-</span><span class="n">x</span><span class="p">)</span> <span class="n">maxIter</span>               <span class="n">The</span> <span class="n">maximum</span> <span class="n">number</span> <span class="n">of</span>        
+                           <span class="n">iterations</span><span class="p">.</span>          
+  <span class="o">--</span><span class="n">k</span> <span class="p">(</span><span class="o">-</span><span class="n">k</span><span class="p">)</span> <span class="n">k</span>                       <span class="n">The</span> <span class="n">k</span> <span class="n">in</span> <span class="n">k</span><span class="o">-</span><span class="n">Means</span><span class="p">.</span>  <span class="n">If</span> <span class="n">specified</span><span class="p">,</span> 
+                           <span class="n">then</span> <span class="n">a</span> <span class="n">random</span> <span class="n">selection</span> <span class="n">of</span> <span class="n">k</span> 
+                           <span class="n">Vectors</span> <span class="n">will</span> <span class="n">be</span> <span class="n">chosen</span> <span class="n">as</span> <span class="n">the</span>
+                               <span class="n">Centroid</span> <span class="n">and</span> <span class="n">written</span> <span class="n">to</span> <span class="n">the</span>  
+                           <span class="n">clusters</span> <span class="n">input</span> <span class="n">path</span><span class="p">.</span>     
+  <span class="o">--</span><span class="n">m</span> <span class="p">(</span><span class="o">-</span><span class="n">m</span><span class="p">)</span> <span class="n">m</span>                       <span class="n">coefficient</span> <span class="n">normalization</span>    
+                           <span class="nb">factor</span><span class="p">,</span> <span class="n">must</span> <span class="n">be</span> <span class="n">greater</span> <span class="n">than</span> 1   
+  <span class="o">--</span><span class="n">overwrite</span> <span class="p">(</span><span class="o">-</span><span class="n">ow</span><span class="p">)</span>                <span class="n">If</span> <span class="n">present</span><span class="p">,</span> <span class="n">overwrite</span> <span class="n">the</span> <span class="n">output</span> 
+                           <span class="n">directory</span> <span class="n">before</span> <span class="n">running</span> <span class="n">job</span> 
+  <span class="o">--</span><span class="n">help</span> <span class="p">(</span><span class="o">-</span><span class="n">h</span><span class="p">)</span>                      <span class="n">Print</span> <span class="n">out</span> <span class="n">help</span>           
+  <span class="o">--</span><span class="n">numMap</span> <span class="p">(</span><span class="o">-</span><span class="n">u</span><span class="p">)</span> <span class="n">numMap</span>                 <span class="n">The</span> <span class="n">number</span> <span class="n">of</span> <span class="n">map</span> <span class="n">tasks</span><span class="p">.</span>     
+                           <span class="n">Defaults</span> <span class="n">to</span> 10           
+  <span class="o">--</span><span class="n">maxRed</span> <span class="p">(</span><span class="o">-</span><span class="n">r</span><span class="p">)</span> <span class="n">maxRed</span>                 <span class="n">The</span> <span class="n">number</span> <span class="n">of</span> <span class="n">reduce</span> <span class="n">tasks</span><span class="p">.</span>  
+                           <span class="n">Defaults</span> <span class="n">to</span> 2            
+  <span class="o">--</span><span class="n">emitMostLikely</span> <span class="p">(</span><span class="o">-</span><span class="n">e</span><span class="p">)</span> <span class="n">emitMostLikely</span>         <span class="n">True</span> <span class="k">if</span> <span class="n">clustering</span> <span class="n">should</span> <span class="n">emit</span>   
+                           <span class="n">the</span> <span class="n">most</span> <span class="n">likely</span> <span class="n">point</span> <span class="n">only</span><span class="p">,</span>  
+                           <span class="n">false</span> <span class="k">for</span> <span class="n">threshold</span> <span class="n">clustering</span><span class="p">.</span>  
+                           <span class="n">Default</span> <span class="n">is</span> <span class="n">true</span>          
+  <span class="o">--</span><span class="n">threshold</span> <span class="p">(</span><span class="o">-</span><span class="n">t</span><span class="p">)</span> <span class="n">threshold</span>               <span class="n">The</span> <span class="n">pdf</span> <span class="n">threshold</span> <span class="n">used</span> <span class="k">for</span>   
+                           <span class="n">cluster</span> <span class="n">determination</span><span class="p">.</span> <span class="n">Default</span>   
+                           <span class="n">is</span> 0 
+  <span class="o">--</span><span class="n">clustering</span> <span class="p">(</span><span class="o">-</span><span class="n">cl</span><span class="p">)</span>                   <span class="n">If</span> <span class="n">present</span><span class="p">,</span> <span class="n">run</span> <span class="n">clustering</span> <span class="n">after</span> 
+                           <span class="n">the</span> <span class="n">iterations</span> <span class="n">have</span> <span class="n">taken</span> <span class="n">place</span>
 </pre></div>
-
-
-<p>Vectors. 
-                               Must be a SequenceFile of  <br />
-                               Writable, Cluster/Canopy. 
-If k<br />
-                               is also specified, then a
-random 
-                               set of vectors will be
-selected<br />
-                               and written out to this path 
-                               first              <br />
-      --output (-o) output                 The directory pathname for <br />
-                               output.            <br />
-      --distanceMeasure (-dm) distanceMeasure      The classname of the   <br />
-                               DistanceMeasure. Default is<br />
-                               SquaredEuclidean       <br />
-      --convergenceDelta (-cd) convergenceDelta    The convergence delta value. 
-                               Default is 0.5         <br />
-      --maxIter (-x) maxIter               The maximum number of      <br />
-                               iterations.        <br />
-      --k (-k) k                       The k in k-Means.  If
-specified, 
-                               then a random selection of k 
-                               Vectors will be chosen as
-the  <br />
-                               Centroid and written to the<br />
-                               clusters input path.   <br />
-      --m (-m) m                       coefficient normalization  <br />
-                               factor, must be greater than
-1 <br />
-      --overwrite (-ow)                If present, overwrite the
-output 
-                               directory before running job 
-      --help (-h)                      Print out help         <br />
-      --numMap (-u) numMap                 The number of map tasks.   <br />
-                               Defaults to 10         <br />
-      --maxRed (-r) maxRed                 The number of reduce tasks.<br />
-                               Defaults to 2          <br />
-      --emitMostLikely (-e) emitMostLikely         True if clustering should
-emit <br />
-                               the most likely point only,<br />
-                               false for threshold
-clustering.<br />
-                               Default is true        <br />
-      --threshold (-t) threshold               The pdf threshold used for <br />
-                               cluster determination.
-Default <br />
-                               is 0 
-      --clustering (-cl)                   If present, run clustering
-after 
-                               the iterations have taken
-place  </p>
    </div>
   </div>     
 </div>