You are viewing a plain text version of this content. The canonical link for it is here.
- Re: Hi (how to specify the queue parameter) - posted by Wei Li <we...@gmail.com> on 2011/04/01 03:11:56 UTC, 0 replies.
- Naive Bayes score comparison across multiple classifiers - posted by Jyoti Gupta <jy...@gmail.com> on 2011/04/01 07:56:14 UTC, 0 replies.
- RE: FW: OutOfMemoryError: Java heap space - posted by "Sengupta, Sohini IN BLR SISL" <so...@siemens.com> on 2011/04/01 09:32:22 UTC, 1 replies.
- Re: Huge classification engine - posted by Sreejith S <sr...@gmail.com> on 2011/04/01 09:37:37 UTC, 11 replies.
- Re: Wikipedia example FileNotFoundException for categories - posted by Mat Kelcey <ma...@gmail.com> on 2011/04/01 18:36:17 UTC, 1 replies.
- [OFFER] Job - posted by Martin Provencher <mp...@gmail.com> on 2011/04/01 22:03:33 UTC, 0 replies.
- KDD Cup 11 - SVD based Recommender - posted by saral <sa...@gmail.com> on 2011/04/01 23:32:45 UTC, 3 replies.
- Clustering Question - posted by sarath pr <sa...@gmail.com> on 2011/04/03 10:27:54 UTC, 2 replies.
- Re: kmeans - posted by manish <co...@gmail.com> on 2011/04/03 15:14:14 UTC, 2 replies.
- About formatting patches - posted by Lance Norskog <go...@gmail.com> on 2011/04/04 09:41:16 UTC, 11 replies.
- GSOC Application - posted by Harsh <ha...@vit.ac.in> on 2011/04/04 09:53:18 UTC, 2 replies.
- t-test - posted by SIAVASH GHODSI MOGHADDAM <gm...@live.utm.my> on 2011/04/04 11:58:11 UTC, 2 replies.
- PFP Growth : ParallelFPGrowth reduce taking a lo--ong time - posted by Vipul Pandey <vi...@gmail.com> on 2011/04/04 21:44:59 UTC, 2 replies.
- Classification with data from Lucene - posted by David Croley <dc...@renewdata.com> on 2011/04/05 03:51:38 UTC, 4 replies.
- Check the input files present in cluster - posted by Madhusudan Joshi <ma...@gmail.com> on 2011/04/05 07:23:07 UTC, 4 replies.
- Re: Re : Partial Implementation of Random Forest - posted by deneche abdelhakim <a_...@yahoo.fr> on 2011/04/06 06:14:10 UTC, 13 replies.
- How I could run Logistic Regression with a word predictor? - posted by Stanley Xu <we...@gmail.com> on 2011/04/06 12:15:47 UTC, 2 replies.
- Re: is it possible to compute the SVD for a large scale matrix - posted by Danny Bickson <da...@gmail.com> on 2011/04/06 12:55:29 UTC, 19 replies.
- Kmeans clustering options - posted by Kate Ericson <er...@cs.colostate.edu> on 2011/04/07 02:45:27 UTC, 3 replies.
- FuzzyKMeans in distributed execution - posted by Jose Fuentes De Frutos <in...@udc.es> on 2011/04/07 12:59:23 UTC, 7 replies.
- com.google.code.gson:gson:1.3 - posted by Benson Margulies <bi...@gmail.com> on 2011/04/07 15:48:04 UTC, 3 replies.
- Re: Returning number of points in KMeans - posted by john abbott <ab...@gmail.com> on 2011/04/07 18:30:03 UTC, 4 replies.
- Re: Need a little help with using SVD - posted by Timothy Potter <th...@gmail.com> on 2011/04/07 19:32:49 UTC, 0 replies.
- PFPGrowth algorithm and oozie - posted by Mark <st...@gmail.com> on 2011/04/07 19:42:52 UTC, 6 replies.
- Class loader heck in 0.4 with SequenceFileTokenizerMapper.java - posted by Benson Margulies <bi...@gmail.com> on 2011/04/07 21:37:15 UTC, 3 replies.
- ItemSimilarityJob as UserSimilarityJob - posted by Thomas Rewig <tr...@mufin.com> on 2011/04/08 12:04:36 UTC, 2 replies.
- Fwd: [SIG-IRList] RecSys 2011: Call for Tutorial Proposals and 2nd Call for Papers - posted by Grant Ingersoll <gs...@apache.org> on 2011/04/08 14:59:14 UTC, 0 replies.
- help_clusterdump - posted by sarath pr <sa...@gmail.com> on 2011/04/08 17:38:11 UTC, 2 replies.
- Clustering when you can't tune parameters - posted by Benson Margulies <bi...@gmail.com> on 2011/04/08 17:52:50 UTC, 1 replies.
- time-weighted averages - posted by Lance Norskog <go...@gmail.com> on 2011/04/10 05:56:16 UTC, 1 replies.
- Re: Recommended reading - posted by Grant Ingersoll <gs...@apache.org> on 2011/04/10 16:16:22 UTC, 0 replies.
- Can't run clustering example with 0.4 - posted by Xiaobo Gu <gu...@gmail.com> on 2011/04/10 17:42:51 UTC, 5 replies.
- Which maven command to use to put all the binaries into the distribution layout? - posted by Xiaobo Gu <gu...@gmail.com> on 2011/04/10 17:52:59 UTC, 2 replies.
- Spectral Clustering, EigenCuts and Affinity Matrix - posted by Lance Norskog <go...@gmail.com> on 2011/04/11 01:09:34 UTC, 5 replies.
- More for the refactor wish-list - posted by Lance Norskog <go...@gmail.com> on 2011/04/11 03:12:11 UTC, 3 replies.
- How to implement SlopeOne with Hadoop? Anyone from Mahout community can help me? - posted by ke xie <oe...@gmail.com> on 2011/04/11 09:20:13 UTC, 7 replies.
- Speeding RowSimilarityJob-CooccurrencesMapper-SimilarityReducer up in RecommenderJob - posted by Kris Jack <mr...@gmail.com> on 2011/04/11 14:17:03 UTC, 4 replies.
- Incremental training in recommander - posted by Mathieu sgard <ma...@gmail.com> on 2011/04/11 16:37:45 UTC, 2 replies.
- Oops - props file gets ahead of source tree - posted by Lance Norskog <go...@gmail.com> on 2011/04/12 06:55:13 UTC, 2 replies.
- Is any more detailed documentation aout the sgd logistic regression example. - posted by Xiaobo Gu <gu...@gmail.com> on 2011/04/12 18:11:30 UTC, 15 replies.
- Choosing appropriate values for T1 and T2 for canopy clustering - posted by Madhusudan Joshi <ma...@gmail.com> on 2011/04/13 08:22:27 UTC, 1 replies.
- How about a LSH recommender ? - posted by ke xie <oe...@gmail.com> on 2011/04/13 09:19:09 UTC, 9 replies.
- Identify "less similar" documents - posted by Claudia Grieco <gr...@crmpa.unisa.it> on 2011/04/13 11:12:21 UTC, 16 replies.
- 20NewsGroups Error: Illegal Capacity: -40 - posted by Ken Williams <zo...@hotmail.com> on 2011/04/13 13:48:40 UTC, 5 replies.
- Error "Generating an output file from a Lucene Index" - posted by JJJ959 <so...@hotmail.com> on 2011/04/14 09:35:00 UTC, 5 replies.
- Hints for Best Practices for Jobs with amazon EMR - posted by Thomas Rewig <tr...@mufin.com> on 2011/04/14 12:18:00 UTC, 7 replies.
- Vector to DataModel - posted by Julián Limón Núñez <ju...@tukipa.com> on 2011/04/16 00:51:56 UTC, 3 replies.
- Error running bin/mahout rowid - posted by Julián Limón Núñez <ju...@tukipa.com> on 2011/04/16 04:54:43 UTC, 3 replies.
- strange results running lda against westbury corpus - posted by Mat Kelcey <ma...@gmail.com> on 2011/04/17 08:06:34 UTC, 2 replies.
- Misfires in OnlineSummarizer - posted by Lance Norskog <go...@gmail.com> on 2011/04/17 08:53:40 UTC, 5 replies.
- Create vector using existing dictionary and IDF values - posted by Julian Limon <ju...@tukipa.com> on 2011/04/17 09:09:03 UTC, 2 replies.
- How to create a distribution from the snapshot codebase? - posted by Stanley Xu <we...@gmail.com> on 2011/04/18 11:23:10 UTC, 9 replies.
- Re: Question about LDA parameter estimation - posted by Vasil Vasilev <va...@gmail.com> on 2011/04/18 11:44:06 UTC, 0 replies.
- Gitignore file in source tree? - posted by Lance Norskog <go...@gmail.com> on 2011/04/19 06:26:16 UTC, 6 replies.
- How could I set a loss function in SGD? - posted by Stanley Xu <we...@gmail.com> on 2011/04/19 10:33:16 UTC, 11 replies.
- Yet another distributed matrix factorizer - posted by Lance Norskog <go...@gmail.com> on 2011/04/20 06:04:38 UTC, 0 replies.
- Another FP-Growth variant - posted by Lance Norskog <go...@gmail.com> on 2011/04/20 06:06:49 UTC, 0 replies.
- Anyway to speedup the category feature parsing and encoding in the SGD algorithm? - posted by Stanley Xu <we...@gmail.com> on 2011/04/20 15:00:29 UTC, 5 replies.
- Does the Feature Hashing and Collision in the SGD will harm the performance of the algorithm? - posted by Stanley Xu <we...@gmail.com> on 2011/04/20 15:06:40 UTC, 16 replies.
- Custom analyzers for seq2sparse - posted by Camilo Lopez <ca...@camilolopez.com> on 2011/04/20 18:58:51 UTC, 4 replies.
- IDMigration with BooleanPreferences - posted by Ahmet Arslan <io...@yahoo.com> on 2011/04/20 23:54:02 UTC, 2 replies.
- TFIDF based on Field - posted by shambhusingh <sh...@gmail.com> on 2011/04/21 02:27:46 UTC, 2 replies.
- LDA related enhancements - posted by Vasil Vasilev <va...@gmail.com> on 2011/04/21 06:08:47 UTC, 5 replies.
- Blog post about setting up a scalable recommender system with mahout - posted by Sebastian Schelter <ss...@apache.org> on 2011/04/21 19:16:12 UTC, 0 replies.
- kmeans on space-delimited input data, - posted by vs <vi...@gmail.com> on 2011/04/22 20:24:30 UTC, 1 replies.
- Recommend output: User vs. Item, Tanimoto vs. LogLikelihood - posted by Otis Gospodnetic <ot...@yahoo.com> on 2011/04/23 04:41:04 UTC, 3 replies.
- Hoeffding Bound or Additive Chernoff Bound as a SImilarity algorithm? - posted by Lance Norskog <go...@gmail.com> on 2011/04/24 05:58:17 UTC, 5 replies.
- What is the class to extract keyword/category from text/blog? - posted by lai seong au <wh...@yahoo.com> on 2011/04/24 06:13:58 UTC, 0 replies.
- Wrong userSimilarity and itemSimilarity centeredSumXY formula/computation - posted by Shem Cristobal <sh...@gmail.com> on 2011/04/24 17:07:29 UTC, 4 replies.
- Hearst Machine Learning Challenge - posted by Danny Bickson <da...@gmail.com> on 2011/04/25 03:28:51 UTC, 2 replies.
- Cosine distances to Random Vector basis - posted by Lance Norskog <go...@gmail.com> on 2011/04/25 05:56:40 UTC, 5 replies.
- Which exact algorithm is used in the Mahout SGD? - posted by Stanley Xu <we...@gmail.com> on 2011/04/25 15:12:05 UTC, 10 replies.
- Top items in a vector - posted by Julian Limon <ju...@tukipa.com> on 2011/04/25 21:32:21 UTC, 19 replies.
- Can I feed google analytics output to Apache Mahout - posted by Manu <ma...@gmail.com> on 2011/04/25 23:02:38 UTC, 5 replies.
- Fwd: Google Summer of Code 2011 Students Announced - posted by Ted Dunning <te...@gmail.com> on 2011/04/25 23:55:13 UTC, 1 replies.
- How to evaluate a recommender with binary ratings? - posted by Peter Harrington <pe...@gmail.com> on 2011/04/26 02:28:58 UTC, 4 replies.
- best similarity metric for collaborative filtering - posted by Chris Waggoner <ch...@gmail.com> on 2011/04/26 04:21:25 UTC, 5 replies.
- Introduction - posted by Raymond Richardson <ex...@yahoo.com> on 2011/04/26 05:00:11 UTC, 3 replies.
- Convert preference matrix - posted by Mathieu sgard <ma...@gmail.com> on 2011/04/26 09:25:16 UTC, 2 replies.
- Determining Document Cluster Probabilities with LDA - posted by Ian Helmke <ih...@gmail.com> on 2011/04/26 21:16:24 UTC, 8 replies.
- AUC of Random Forest - posted by praneet mhatre <pm...@ics.uci.edu> on 2011/04/27 06:40:37 UTC, 1 replies.
- Mahout first application for biginer - posted by venkat <ve...@gmail.com> on 2011/04/27 08:48:16 UTC, 4 replies.
- Scoring issue - posted by "mohammed.farrag" <mo...@pearlox.com> on 2011/04/27 14:37:19 UTC, 3 replies.
- MovieHackDay - posted by Alan Said <Al...@dai-labor.de> on 2011/04/27 14:57:35 UTC, 0 replies.
- Finding thresholds for canopy - posted by Camilo Lopez <ca...@camilolopez.com> on 2011/04/27 21:39:02 UTC, 8 replies.
- DistributedRowMatrix.transpose Memory Woes - posted by Paul Mahon <pm...@decarta.com> on 2011/04/27 21:43:31 UTC, 6 replies.
- Regarding PCA implementation - posted by Vckay <da...@gmail.com> on 2011/04/28 02:28:20 UTC, 7 replies.
- Genetic algorithm in distributed execution - posted by Jose Fuentes De Frutos <in...@udc.es> on 2011/04/28 13:20:56 UTC, 1 replies.
- Output of transpose and matrixmult - posted by Julian Limon <ju...@tukipa.com> on 2011/04/28 21:33:14 UTC, 3 replies.
- Logistic Regression Tutorial - posted by Benson Margulies <bi...@gmail.com> on 2011/04/28 21:35:22 UTC, 22 replies.
- Mahout examples - posted by sulabh choudhury <su...@gmail.com> on 2011/04/29 00:42:46 UTC, 1 replies.
- Re: Trying to determine the best ML algorithm to use. - posted by Patrick Collins <pa...@ready2sign.com> on 2011/04/29 08:17:54 UTC, 4 replies.
- LanczosSolver Very Slow - posted by Paul Mahon <pm...@decarta.com> on 2011/04/29 18:45:32 UTC, 11 replies.
- Fuzzy matching - posted by James Pettyjohn <ja...@scientology.net> on 2011/04/29 21:50:44 UTC, 1 replies.
- Mahout In Action KMeans Clustering example - posted by Dhruv Kumar <dk...@ecs.umass.edu> on 2011/04/30 03:40:48 UTC, 3 replies.