You are viewing a plain text version of this content. The canonical link for it is here.
- Re: questions on the results of running lda and ldatopics, thanks - posted by Lance Norskog <go...@gmail.com> on 2011/07/01 05:28:44 UTC, 3 replies.
- Dimensional Reduction via Random Projection: investigations - posted by Lance Norskog <go...@gmail.com> on 2011/07/01 11:44:29 UTC, 22 replies.
- Sequence / Temporal Learning using Mahout's classifier - posted by Svetlomir Kasabov <sk...@smail.inf.fh-brs.de> on 2011/07/01 12:26:05 UTC, 0 replies.
- Hadoop version compatibility. - posted by Xiaobo Gu <gu...@gmail.com> on 2011/07/01 14:35:15 UTC, 5 replies.
- Question on PFP numGroups option - posted by Mark <st...@gmail.com> on 2011/07/01 18:12:56 UTC, 0 replies.
- Exclude by RuleSet - posted by Em <ma...@yahoo.de> on 2011/07/01 18:56:25 UTC, 8 replies.
- Introducing randomness into my results - posted by Salil Apte <sa...@offlinelabs.com> on 2011/07/01 20:42:46 UTC, 12 replies.
- fuzzy kmeans - all cluster with the same top terms - posted by Paulo Magalhaes <pa...@gmail.com> on 2011/07/01 23:37:18 UTC, 0 replies.
- Re: Similarity between users' groups - posted by Radek Maciaszek <ra...@maciaszek.co.uk> on 2011/07/02 12:47:14 UTC, 3 replies.
- StackOverflowError on running bin/mahout with HADOOP_CONF_DIR specified - posted by Sergey Bartunov <sb...@gmail.com> on 2011/07/02 21:01:37 UTC, 4 replies.
- Re: Using with seq2spars org.apache.lucene.analysis.Analyzer - posted by rmx <ru...@hotmail.com> on 2011/07/03 19:54:13 UTC, 1 replies.
- 20news - posted by Vijay Santhanam <vi...@gmail.com> on 2011/07/04 10:52:15 UTC, 21 replies.
- Using naive bayes classification with continuous, categorical and word-like features - posted by Vijay Santhanam <vi...@gmail.com> on 2011/07/04 17:01:53 UTC, 6 replies.
- MySQLJDBCDataModel vs FileDataModel - posted by Mark <st...@gmail.com> on 2011/07/04 19:05:26 UTC, 7 replies.
- how do I choose appropriate OnlineLogisticRegression parameters for modelling this? - posted by Vijay Santhanam <vi...@gmail.com> on 2011/07/04 19:30:10 UTC, 2 replies.
- How could I use bayse model with my C++ online classifier - posted by 刘逸哲 <zh...@alibaba-inc.com> on 2011/07/05 04:54:29 UTC, 2 replies.
- 回复: How could I use bayse model with my C++ online classifier - posted by beneo_7 <be...@163.com> on 2011/07/05 04:57:50 UTC, 0 replies.
- Lanczos SVD scalability - posted by agnonchik <gl...@inm.ras.ru> on 2011/07/05 08:27:07 UTC, 2 replies.
- Re: Using with seq2spars org.apache.lucene.analysis.Analyzer - posted by Sean Owen <sr...@gmail.com> on 2011/07/05 14:34:32 UTC, 0 replies.
- Re: Tranforming data for k-means analysis - posted by Radek Maciaszek <ra...@maciaszek.co.uk> on 2011/07/05 16:09:40 UTC, 1 replies.
- [JOBS] Meebo Machine Learning Opportunities - posted by Jim Dullaghan <ji...@gmail.com> on 2011/07/06 01:47:13 UTC, 0 replies.
- 答复: How could I use bayse model with my C++ online classifier - posted by 刘逸哲 <zh...@alibaba-inc.com> on 2011/07/06 05:16:48 UTC, 0 replies.
- Weighted "features" using naive bayes classifier - posted by Vijay Santhanam <vi...@gmail.com> on 2011/07/06 07:02:56 UTC, 0 replies.
- File format question when write map-reduce applications - posted by Xiaobo Gu <gu...@gmail.com> on 2011/07/06 12:03:19 UTC, 7 replies.
- What's the difference between classic decision tree and Mahout Decision forest algorithm? - posted by Xiaobo Gu <gu...@gmail.com> on 2011/07/06 12:11:03 UTC, 5 replies.
- ParallelALSFactorizationJob: Exception InvalidInputException - posted by Kumar Kandasami <ku...@gmail.com> on 2011/07/06 19:14:59 UTC, 1 replies.
- Generic Recommender algorithm questions (using Mahout 0.4) - posted by Carlos Seminario <re...@gmail.com> on 2011/07/06 23:02:10 UTC, 2 replies.
- Mahout LDA docTopics - posted by Xavier Stevens <xs...@mozilla.com> on 2011/07/06 23:49:17 UTC, 1 replies.
- how to transfer the sequence file into readable format - posted by wine lover <wi...@gmail.com> on 2011/07/07 19:53:10 UTC, 7 replies.
- Logistic Regression: poor results on small data set - posted by hakeem <to...@indeed.com> on 2011/07/07 23:20:57 UTC, 3 replies.
- Available datasets for recommendations - posted by Lance Norskog <go...@gmail.com> on 2011/07/08 05:05:56 UTC, 12 replies.
- document similarity - posted by Luca Natti <il...@gmail.com> on 2011/07/08 10:47:57 UTC, 4 replies.
- Re: Query over Apache Mahout. Need help in deciding whether to go for it or not. - posted by Robin Anil <ro...@gmail.com> on 2011/07/08 12:17:19 UTC, 1 replies.
- Broken links - posted by Maël Thomas <ma...@telecom-bretagne.eu> on 2011/07/08 14:46:38 UTC, 3 replies.
- Failing unit test: testStartParallelFPGrowth - posted by Marc Millstone <mi...@gmail.com> on 2011/07/09 02:41:24 UTC, 19 replies.
- Hadoop library question. - posted by Xiaobo Gu <gu...@gmail.com> on 2011/07/09 03:32:19 UTC, 1 replies.
- Singular vectors of a recommendation Item-Item space - posted by Lance Norskog <go...@gmail.com> on 2011/07/10 04:07:57 UTC, 3 replies.
- Clustering with id - posted by Gabor Makrai <ma...@gmail.com> on 2011/07/11 01:18:57 UTC, 6 replies.
- the problem of importing org.apache.mahout.common.FileLineIterable - posted by surf reta <su...@gmail.com> on 2011/07/11 03:42:58 UTC, 7 replies.
- Re: Plagiarism - document similarity - posted by Luca Natti <il...@gmail.com> on 2011/07/11 09:15:04 UTC, 8 replies.
- Build Failure in Math and Core - posted by Sören Dierkes <so...@informatik.uni-oldenburg.de> on 2011/07/11 10:04:08 UTC, 1 replies.
- Logistic Regression: number of positives and negatives - posted by Svetlomir Kasabov <sk...@smail.inf.fh-brs.de> on 2011/07/11 15:56:57 UTC, 1 replies.
- combination of features worsen the performance - posted by Weihua Zhu <wz...@adconion.com> on 2011/07/11 23:08:48 UTC, 8 replies.
- Connection Pooling - posted by Salil Apte <sa...@offlinelabs.com> on 2011/07/12 03:21:29 UTC, 16 replies.
- What's the accuracy of random forests in Mahout? - posted by Xiaobo Gu <gu...@gmail.com> on 2011/07/12 15:03:20 UTC, 1 replies.
- File format question about Random forest. - posted by Xiaobo Gu <gu...@gmail.com> on 2011/07/12 15:19:42 UTC, 0 replies.
- Using tf-idf vectors to train Naive Bayes - posted by kevin_ravel <ke...@raveldata.com> on 2011/07/12 16:29:14 UTC, 1 replies.
- ItemSimilarity pre-processing - posted by Abmar Barros <ab...@gmail.com> on 2011/07/12 17:32:55 UTC, 4 replies.
- Using Mahout XmlInputFormat with Hadoop Streaming - posted by Diederik van Liere <Di...@Rotman.Utoronto.Ca> on 2011/07/12 18:51:17 UTC, 3 replies.
- Random Forest feature types - posted by Don Pazel <dp...@adconion.com> on 2011/07/12 19:52:39 UTC, 1 replies.
- What about mentaining a short descriptive tables about each algorithms in Mahout on Wiki for new users? - posted by Xiaobo Gu <gu...@gmail.com> on 2011/07/13 16:11:47 UTC, 4 replies.
- similarity metrics? - posted by Ian Upright <ia...@upright.net> on 2011/07/13 23:09:38 UTC, 5 replies.
- Kernels for Text Clustering - posted by Vckay <da...@gmail.com> on 2011/07/14 11:38:30 UTC, 7 replies.
- Understanding mahout's recommendation system parameters - posted by Kris Jack <mr...@gmail.com> on 2011/07/14 16:11:48 UTC, 2 replies.
- Similarity between sparse vectors - posted by marco turchi <ma...@gmail.com> on 2011/07/15 13:36:31 UTC, 4 replies.
- Clustering demographic data - posted by Clive Cox <cl...@rummble.com> on 2011/07/15 22:07:39 UTC, 3 replies.
- Re : Random Forest feature types - posted by deneche abdelhakim <a_...@yahoo.fr> on 2011/07/16 06:26:12 UTC, 0 replies.
- What does the -Dmapred.max.split.size option of org.apache.mahout.df.mapreduce.BuildForest mean for each split ? - posted by Xiaobo Gu <gu...@gmail.com> on 2011/07/16 07:52:33 UTC, 0 replies.
- What does the -p and -mr option of BuildForest and TestForest mean - posted by XiaoboGu <gu...@gmail.com> on 2011/07/16 09:26:06 UTC, 0 replies.
- what file format is required by naive bayes classfier? - posted by Xiaobo Gu <gu...@gmail.com> on 2011/07/17 16:28:09 UTC, 2 replies.
- Including "Unrecommendable" Items - posted by Jamey Wood <ja...@gmail.com> on 2011/07/18 23:26:57 UTC, 5 replies.
- Canopy clustering - posted by Aaron Kaplan <ma...@aaronkaplan.info> on 2011/07/19 01:21:01 UTC, 0 replies.
- fkmeans or Cluster Dumper not working? - posted by Jeffrey <my...@yahoo.com> on 2011/07/20 08:41:28 UTC, 17 replies.
- Problem with method Plus in the Vector class - posted by marco turchi <ma...@gmail.com> on 2011/07/20 15:28:21 UTC, 17 replies.
- Yahoo LDA - posted by Ian Upright <ia...@upright.net> on 2011/07/20 21:52:51 UTC, 1 replies.
- Treating User Demographics as (Pseudo) Items? - posted by Jamey Wood <ja...@gmail.com> on 2011/07/21 04:18:31 UTC, 4 replies.
- Evaluating boolean preference data sets - posted by Marko Ciric <ci...@gmail.com> on 2011/07/21 14:49:31 UTC, 3 replies.
- Wald's Test / parameter significance tests (Logistic Regression) - posted by Svetlomir Kasabov <sk...@smail.inf.fh-brs.de> on 2011/07/21 22:52:02 UTC, 3 replies.
- FW: meanshift reduce task problem - posted by Jeff Eastman <je...@Narus.com> on 2011/07/21 23:30:42 UTC, 1 replies.
- Preserving pairwise distances while normalizing vectors - posted by Lance Norskog <go...@gmail.com> on 2011/07/22 05:25:16 UTC, 7 replies.
- is there exist lda classifier with trained probabilistic model? - posted by jun li <ju...@gmail.com> on 2011/07/22 09:56:07 UTC, 2 replies.
- Pairwise Document Similarity - posted by Niall Riddell <ni...@xspca.com> on 2011/07/22 13:23:38 UTC, 2 replies.
- df-count/data does not exist - posted by Liliana Mamani Sanchez <li...@gmail.com> on 2011/07/22 17:03:13 UTC, 2 replies.
- HMM investigations - posted by Svetlomir Kasabov <sk...@smail.inf.fh-brs.de> on 2011/07/24 15:25:10 UTC, 11 replies.
- Mahout Binary Recommender Evaluator - posted by MT <ma...@telecom-bretagne.eu> on 2011/07/25 11:05:12 UTC, 7 replies.
- HBase & Mahout - Using HBase as a Datastore/source for Mahout - Classification - posted by NightWolf <ni...@gmail.com> on 2011/07/25 14:53:57 UTC, 8 replies.
- What about a universal input data handling mechanism for Mahout? - posted by Xiaobo Gu <gu...@gmail.com> on 2011/07/25 16:42:14 UTC, 3 replies.
- AUC - posted by Marko Ciric <ci...@gmail.com> on 2011/07/25 22:30:36 UTC, 1 replies.
- Item based recommendations - posted by "Antony Corfield [awc]" <aw...@aber.ac.uk> on 2011/07/26 11:50:30 UTC, 0 replies.
- Mahout LDA - posted by Benjamin Heilbrunn <be...@gmail.com> on 2011/07/26 13:27:19 UTC, 1 replies.
- Cluster-center and cluster-radius - posted by Immo Micus <im...@googlemail.com> on 2011/07/26 15:05:56 UTC, 3 replies.
- Re: Item based recommendations - posted by Sean Owen <sr...@gmail.com> on 2011/07/26 15:41:45 UTC, 1 replies.
- using Integer array with NamedVector or RandomAccessSparseVector - posted by Abhik Banerjee <ba...@gmail.com> on 2011/07/26 19:14:05 UTC, 3 replies.
- Article on Mahout recommenders and Cassandra - posted by Sean Owen <sr...@gmail.com> on 2011/07/26 19:22:05 UTC, 1 replies.
- Re: Help building mahout - posted by Abhik Banerjee <ba...@gmail.com> on 2011/07/26 19:22:17 UTC, 0 replies.
- Re: Advice request - posted by Ted Dunning <te...@gmail.com> on 2011/07/26 22:49:25 UTC, 4 replies.
- Classification on Techcrunch - posted by Shrikar archak <sh...@gmail.com> on 2011/07/26 23:17:46 UTC, 4 replies.
- Parallel FPGrowth driver - doc problem? - posted by Lance Norskog <go...@gmail.com> on 2011/07/27 07:11:40 UTC, 2 replies.
- recommendation MetadataModel class? - posted by Lance Norskog <go...@gmail.com> on 2011/07/27 07:13:31 UTC, 0 replies.
- Parallel FPGrowth driver - what is a good demo? - posted by Lance Norskog <go...@gmail.com> on 2011/07/27 08:06:42 UTC, 3 replies.
- Mahout Binary Recommender Evaluation - posted by MT <ma...@telecom-bretagne.eu> on 2011/07/27 09:33:51 UTC, 1 replies.
- Kmeans runs successfully, but no map/reduce jobs - posted by Dave Gettier <dg...@potomacfusion.com> on 2011/07/28 01:20:28 UTC, 6 replies.
- About Building Mahout - posted by 张涛 <49...@163.com> on 2011/07/28 03:59:28 UTC, 1 replies.
- Duplicate documents in a corpus - posted by Rich Heimann <he...@gmail.com> on 2011/07/28 17:49:51 UTC, 5 replies.
- Re: File Not Found Exception - posted by Abhik Banerjee <ba...@gmail.com> on 2011/07/28 17:54:53 UTC, 0 replies.
- Mahout support for SVM - posted by Suneel Marthi <su...@yahoo.com> on 2011/07/29 00:03:00 UTC, 1 replies.
- Problem with ClassNotFoundException - org.apache.mahout.math class - posted by Abhik Banerjee <ba...@gmail.com> on 2011/07/29 00:53:28 UTC, 0 replies.
- Random Decision Forests - Binary Classification of large data set - posted by Night Wolf <ni...@gmail.com> on 2011/07/29 18:25:09 UTC, 1 replies.
- Doubt regarding the kmeans clustering results on mahout - posted by Abhik Banerjee <ba...@gmail.com> on 2011/07/29 20:32:49 UTC, 4 replies.
- Analyzing the clusterdump output - kmeans clustering - posted by Abhik Banerjee <ba...@gmail.com> on 2011/07/30 00:48:28 UTC, 1 replies.
- Mahout in eCommerce - posted by Raymond Richardson <ex...@yahoo.com> on 2011/07/31 00:28:42 UTC, 2 replies.
- OSX/Hadoop problem: filename 'LICENSE' and dir 'license/' clash in mahout-examples-0.6-SNAPSHOT-job.jar - posted by Dan Brickley <da...@danbri.org> on 2011/07/31 19:41:53 UTC, 0 replies.