You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@mahout.apache.org by ap...@apache.org on 2015/04/04 22:24:31 UTC
svn commit: r1671312 -
/mahout/site/mahout_cms/trunk/content/users/algorithms/spark-naive-bayes.mdtext
Author: apalumbo
Date: Sat Apr 4 20:24:30 2015
New Revision: 1671312
URL: http://svn.apache.org/r1671312
Log:
added examples to spark naive bayes page
Modified:
mahout/site/mahout_cms/trunk/content/users/algorithms/spark-naive-bayes.mdtext
Modified: mahout/site/mahout_cms/trunk/content/users/algorithms/spark-naive-bayes.mdtext
URL: http://svn.apache.org/viewvc/mahout/site/mahout_cms/trunk/content/users/algorithms/spark-naive-bayes.mdtext?rev=1671312&r1=1671311&r2=1671312&view=diff
==============================================================================
--- mahout/site/mahout_cms/trunk/content/users/algorithms/spark-naive-bayes.mdtext (original)
+++ mahout/site/mahout_cms/trunk/content/users/algorithms/spark-naive-bayes.mdtext Sat Apr 4 20:24:30 2015
@@ -94,8 +94,6 @@ Classification and testing on a holdout
$ mahout spark-trainnb
--input (-i) input Path to job input directory.
--output (-o) output The directory pathname for output.
- --alphaI (-a) alphaI Smoothing parameter. Default is 1.0
- --overwrite (-ow) If present, overwrite the output directory. Default is false.
--trainComplementary (-c) Train complementary? Default is false.
--help (-h) Print out help
@@ -103,15 +101,18 @@ Classification and testing on a holdout
$ mahout spark-testnb
--input (-i) input Path to job input directory.
- --output (-o) output The directory pathname for output.
--model (-m) model The path to the model built during training.
- --overwrite (-ow) If present, overwrite the output directory
--testComplementary (-c) Test complementary? Default is false.
--help (-h) Print out help
+## Examples
+1. [20 Newsgroups classification](https://github.com/apache/mahout/blob/master/examples/bin/classify-20newsgroups.sh)
+2. [Document classification with Naive Bayes in the Mahout shell](https://github.com/apache/mahout/blob/master/examples/bin/spark-document-classifier.mscala)
+
## References
[1]: Jason D. M. Rennie, Lawerence Shih, Jamie Teevan, David Karger (2003). [Tackling the Poor Assumptions of Naive Bayes Text Classifiers](http://people.csail.mit.edu/jrennie/papers/icml03-nb.pdf). Proceedings of the Twentieth International Conference on Machine Learning (ICML-2003).
+