You are viewing a plain text version of this content. The canonical link for it is here.
- Re: We need help about how to install mahout - posted by bing wang <wa...@gmail.com> on 2011/11/01 04:26:42 UTC, 6 replies.
- Production use cases of Mahout - posted by Tharindu Mathew <mc...@gmail.com> on 2011/11/01 09:05:04 UTC, 10 replies.
- Re: Has anyone tried Spark with Mahout? - posted by Chris K Wensel <ch...@wensel.net> on 2011/11/01 16:35:02 UTC, 0 replies.
- Embedding mahout in a java app - posted by Tharindu Mathew <mc...@gmail.com> on 2011/11/02 09:49:19 UTC, 13 replies.
- does anyone use the "row label bindings" stuff in Vector / Matrix? - posted by Jake Mannix <ja...@gmail.com> on 2011/11/02 10:17:37 UTC, 16 replies.
- Minhash key groups - posted by Grant Ingersoll <gs...@apache.org> on 2011/11/02 15:20:57 UTC, 5 replies.
- Fwd: Mahout In Action - Bayes/CBayes Classification returns NaN - posted by Ted Dunning <te...@gmail.com> on 2011/11/02 19:24:16 UTC, 0 replies.
- Re: NaN - classification results (cbayes) - posted by Sam Cunningham <sa...@yahoo.com> on 2011/11/02 20:19:27 UTC, 5 replies.
- How To Contribute - posted by Grant Ingersoll <gs...@apache.org> on 2011/11/02 20:48:26 UTC, 1 replies.
- How to find which point belongs which cluster after running KMeansClusterer - posted by WangRamon <ra...@hotmail.com> on 2011/11/03 09:53:50 UTC, 10 replies.
- Re: Dirichlet Process Clustering not working - posted by edward choi <mp...@gmail.com> on 2011/11/03 15:25:05 UTC, 0 replies.
- Re: confidence values of one (or more) feature(s) - posted by David Rahman <dr...@googlemail.com> on 2011/11/03 15:25:29 UTC, 9 replies.
- Re: text classification using mahout and lucene index - posted by David Rahman <dr...@googlemail.com> on 2011/11/03 15:50:21 UTC, 0 replies.
- The Elephant in the Room: You are invited - posted by Mi...@emc.com on 2011/11/03 18:12:38 UTC, 0 replies.
- Graphical Mahout Cluster Visualization Tools? - posted by Mark <ma...@sisa.samsung.com> on 2011/11/04 01:40:13 UTC, 2 replies.
- Cluster dumper crashes when run on a large dataset - posted by gaurav redkar <ga...@gmail.com> on 2011/11/04 06:27:42 UTC, 8 replies.
- Watchmaker framework usage - posted by Grant Ingersoll <gs...@apache.org> on 2011/11/04 15:15:03 UTC, 0 replies.
- Can anybody explain the distance method in SquaredEuclideanDistanceMeasure? - posted by WangRamon <ra...@hotmail.com> on 2011/11/04 15:58:01 UTC, 4 replies.
- classification of search queries - posted by abhayd <aj...@hotmail.com> on 2011/11/04 17:20:49 UTC, 4 replies.
- creating vectors from lucene index which does NOT store vectors - posted by Robert Stewart <bs...@gmail.com> on 2011/11/04 18:55:38 UTC, 8 replies.
- SF Apache Mahout User Meeting (MUM) Nov 29th @ Lucid Imagination HQ - posted by Grant Ingersoll <gs...@apache.org> on 2011/11/05 00:54:53 UTC, 1 replies.
- Nearest Neighbor Recommender and Euclidean distance similarity - posted by Lance Norskog <go...@gmail.com> on 2011/11/05 04:27:19 UTC, 1 replies.
- Method "observe" in AbstractCluster - posted by WangRamon <ra...@hotmail.com> on 2011/11/05 11:39:50 UTC, 1 replies.
- getting mahout clustering info back into lucene - posted by Robert Stewart <bs...@gmail.com> on 2011/11/05 13:36:11 UTC, 3 replies.
- Invitation: SF Apache Mahout User Meeting (MUM) Nov 29th @ Lucid Imagination HQ (Nov 29 07:15 AM PST) - posted by Tom Deutsch <td...@us.ibm.com> on 2011/11/05 15:06:08 UTC, 0 replies.
- Obtaining Mahout Version during runtime - posted by "Oliver B. Fischer" <ma...@swe-blog.net> on 2011/11/06 13:42:28 UTC, 4 replies.
- Re: Understanding the SVD recommender - posted by Sean Owen <sr...@gmail.com> on 2011/11/06 18:52:21 UTC, 31 replies.
- Factorizing SVD (online SVDRecommender) - posted by Lance Norskog <go...@gmail.com> on 2011/11/07 00:53:38 UTC, 4 replies.
- Mahout+Hadoop performance bench-marking? - posted by Weiquan Lin <w....@gmail.com> on 2011/11/07 04:47:21 UTC, 1 replies.
- Performance issue in CanopyClusterer - posted by WangRamon <ra...@hotmail.com> on 2011/11/07 09:29:52 UTC, 1 replies.
- Mahout Developer Needed for Interesting Project - posted by Lance <la...@rogers.com> on 2011/11/07 16:43:45 UTC, 0 replies.
- compilation error - posted by edwin <ed...@gmail.com> on 2011/11/07 21:10:08 UTC, 1 replies.
- Relevance Prediction Challenge / WSDM 2012 Web Search Click Data Workshop - posted by Pavel Serdyukov <pa...@yandex-team.ru> on 2011/11/07 22:44:33 UTC, 1 replies.
- How to refresh all components when loading an additional file for the data model to read - posted by Scott Ryan <sr...@wayin.com> on 2011/11/08 00:50:29 UTC, 2 replies.
- Collaborative filtering help needed - posted by Akshay Jain <ja...@gmail.com> on 2011/11/08 10:24:23 UTC, 17 replies.
- Using XML-Data - posted by David Rahman <dr...@googlemail.com> on 2011/11/08 10:50:04 UTC, 0 replies.
- Mahout and multi-label classification - posted by David Rahman <dr...@googlemail.com> on 2011/11/08 11:36:19 UTC, 8 replies.
- Comparing results of Mahout SVD and Scilab - posted by motta <mo...@gmail.com> on 2011/11/08 13:11:54 UTC, 3 replies.
- BayesFeatureDriver Execution on remote cluster - posted by Jamal B <jm...@gmail.com> on 2011/11/08 15:09:20 UTC, 3 replies.
- RowSimilarityJob input - posted by Sören Brunk <so...@deri.org> on 2011/11/08 17:33:00 UTC, 2 replies.
- Mahout session at hadoop world 2011 - posted by ma...@yahoo.com on 2011/11/08 22:24:50 UTC, 2 replies.
- Cluster labeling - posted by Frank Scholten <fr...@frankscholten.nl> on 2011/11/08 23:56:11 UTC, 0 replies.
- Dirichlet Clustering Output - posted by praneet mhatre <pr...@gmail.com> on 2011/11/09 01:12:10 UTC, 1 replies.
- NewsKMeansClustering - the result most people want seems to be missing - posted by Rob Podolski <ro...@yahoo.co.uk> on 2011/11/09 12:17:42 UTC, 2 replies.
- meanshift clustering - posted by gaurav redkar <ga...@gmail.com> on 2011/11/09 13:08:40 UTC, 3 replies.
- Running Mahout SVD on Amazon Elastic Map Reduce - posted by motta <mo...@gmail.com> on 2011/11/09 14:27:05 UTC, 6 replies.
- new posting about (machine learning) mapreduce algorithms - posted by Amund Tveit <at...@gmail.com> on 2011/11/09 14:47:13 UTC, 0 replies.
- SGD TrainNewsGroups interim output - posted by Grant Ingersoll <gr...@gmail.com> on 2011/11/09 16:45:53 UTC, 3 replies.
- AdaptiveLogisticRegression - posted by Koert Kuipers <ko...@tresata.com> on 2011/11/09 16:52:06 UTC, 1 replies.
- Issues with running Mahout LDA over the Reuters data set (Mahout in Action) - posted by Varnit Khanna <va...@gmail.com> on 2011/11/09 18:48:38 UTC, 1 replies.
- User based CF - posted by WangRamon <ra...@hotmail.com> on 2011/11/10 08:34:25 UTC, 5 replies.
- Help needed for Recommendation engine - posted by VIGNESH PRAJAPATI <vi...@gmail.com> on 2011/11/10 11:39:42 UTC, 9 replies.
- AdaptiveLogisticRegression and AbstractVectorClassifier - posted by Koert Kuipers <ko...@tresata.com> on 2011/11/10 16:55:29 UTC, 1 replies.
- logistic regression mishaps - posted by Ilya Gluhovsky <il...@gmail.com> on 2011/11/11 03:21:19 UTC, 0 replies.
- Wiki edit request - posted by Lance Norskog <go...@gmail.com> on 2011/11/11 04:54:48 UTC, 6 replies.
- incosistent output while using clusterdumper - posted by gaurav redkar <ga...@gmail.com> on 2011/11/11 15:20:43 UTC, 2 replies.
- New User to Mahout - posted by thinkingbigdata <th...@gmail.com> on 2011/11/12 00:51:17 UTC, 4 replies.
- Coocurrence job - posted by Lance Norskog <go...@gmail.com> on 2011/11/12 03:48:37 UTC, 3 replies.
- Multi-file Matrices? - posted by Lance Norskog <go...@gmail.com> on 2011/11/13 00:23:02 UTC, 7 replies.
- lsi - posted by Sebastian Schelter <ss...@apache.org> on 2011/11/13 19:47:42 UTC, 16 replies.
- About Mahout. - posted by Ali Ait Abderrahmane <ai...@gmail.com> on 2011/11/13 22:36:46 UTC, 10 replies.
- Classifying documents in database - posted by Sam Cunningham <sa...@yahoo.com> on 2011/11/14 04:47:30 UTC, 5 replies.
- Terminology Extraction - posted by Yuval Feinstein <yu...@citypath.com> on 2011/11/14 08:11:28 UTC, 4 replies.
- Coding format update: Eclipse Lucene conventions - posted by Lance Norskog <go...@gmail.com> on 2011/11/14 08:19:28 UTC, 1 replies.
- distributed similarity calculation for CF - posted by Chris Schilling <ch...@gmail.com> on 2011/11/14 19:24:57 UTC, 3 replies.
- trainclassifier as a command vs. TrainClassifier.java - posted by Sam Cunningham <sa...@yahoo.com> on 2011/11/15 04:46:39 UTC, 4 replies.
- mahout for enterprise search project - posted by Burcu Buyukkagnici <bo...@gmail.com> on 2011/11/15 08:12:51 UTC, 4 replies.
- Re: Getting Mahout LDA to run - posted by Varnit Khanna <va...@gmail.com> on 2011/11/15 18:15:04 UTC, 0 replies.
- Problem compiling mahout - posted by GMailPegasus <pe...@gmail.com> on 2011/11/15 18:27:25 UTC, 7 replies.
- mahout svm - posted by Daniel Siqueira Oliveira <da...@gmail.com> on 2011/11/15 22:08:56 UTC, 1 replies.
- Mahout heap out of space - posted by Mohammed Al khooja <mk...@gmail.com> on 2011/11/15 23:12:14 UTC, 2 replies.
- Documentation - posted by David Kincaid <ki...@gmail.com> on 2011/11/16 03:35:59 UTC, 2 replies.
- NewsKMeansClustering does not find any clusters! - posted by Ahmad Ammari <am...@gmail.com> on 2011/11/16 10:47:54 UTC, 5 replies.
- Exceptions when running kmeans from the mahout launcher - posted by Ahmad Ammari <am...@gmail.com> on 2011/11/16 10:57:44 UTC, 2 replies.
- Maximum number of categories in a Bayesian classifier - posted by Lyall Morrison <ly...@gmail.com> on 2011/11/16 13:51:14 UTC, 1 replies.
- OutofMemoryError when running kmeans or fuzzykmeans cluster method - posted by "zou.cl" <zo...@neusoft.com> on 2011/11/17 02:05:46 UTC, 1 replies.
- clustering hardware requirements - posted by Ioan Eugen Stan <st...@gmail.com> on 2011/11/17 03:39:13 UTC, 6 replies.
- Weighting Preferences for Particular Items in Mahout? - posted by Jamey Wood <ja...@gmail.com> on 2011/11/17 18:16:34 UTC, 6 replies.
- Austin Hacker Dojo - Big Data Machine Learning - posted by David Boney <li...@semanticartifacts.com> on 2011/11/17 19:05:09 UTC, 1 replies.
- LDA clustering: how to print top K topics in each document - posted by Omkar Raut <om...@gmail.com> on 2011/11/17 20:17:32 UTC, 0 replies.
- OutofMemory problem in ClusterDumper - posted by "zou.cl" <zo...@neusoft.com> on 2011/11/18 07:39:26 UTC, 2 replies.
- lambda overfitting param and ParallelALSFactorizationJob -- suggested value? - posted by Sean Owen <sr...@gmail.com> on 2011/11/18 14:24:29 UTC, 4 replies.
- Large Scale Clustering - posted by Grant Ingersoll <gs...@apache.org> on 2011/11/18 21:52:51 UTC, 2 replies.
- Error in executing mahout kmeans - posted by DIPESH KUMAR SINGH <di...@gmail.com> on 2011/11/19 04:54:11 UTC, 9 replies.
- Mahout: NB Model for Text Classification - In Sample Error - posted by Night Wolf <ni...@gmail.com> on 2011/11/19 07:43:41 UTC, 3 replies.
- error - prepare20newsgroups - posted by "Faizan(Aroha)" <fa...@arohalabs.net> on 2011/11/19 13:30:15 UTC, 1 replies.
- Trouble understanding how to use the FP_Growth algorithm - posted by Sébastien Noir <in...@blackos.com> on 2011/11/21 10:49:37 UTC, 5 replies.
- Evaluation of different recommendation algorithms for 12.000 user data set - posted by Manuel Blechschmidt <Ma...@gmx.de> on 2011/11/21 12:06:54 UTC, 7 replies.
- issue about large number of items to recommend - posted by James Li <ja...@gmail.com> on 2011/11/21 17:26:44 UTC, 3 replies.
- Organization Meeting - Austin Hackers Dojo - Big Data Machine Learning - posted by David Boney <li...@semanticartifacts.com> on 2011/11/21 18:54:56 UTC, 0 replies.
- Clustering Question (from a newbie) - posted by "Fernando O." <fo...@gmail.com> on 2011/11/22 11:42:54 UTC, 7 replies.
- Which input formats to use for classifying WEKA's ARFF format? - posted by HorstItUpright <ho...@gmail.com> on 2011/11/22 17:03:40 UTC, 1 replies.
- Relevance score - Classification - posted by "Faizan(Aroha)" <fa...@arohalabs.net> on 2011/11/23 12:21:02 UTC, 9 replies.
- Re: MinHash Clustering in Mahout - posted by Grant Ingersoll <gs...@apache.org> on 2011/11/23 14:54:26 UTC, 4 replies.
- KnnItemBasedRecommender question - posted by Georgi Stanev <Ge...@holidaycheck.com> on 2011/11/23 15:34:15 UTC, 7 replies.
- "Context-aware" recommendations redux - posted by Dimitar Roustchev <d....@googlemail.com> on 2011/11/23 16:38:03 UTC, 2 replies.
- LDA dictionary file - posted by Mohammed Al khooja <mk...@gmail.com> on 2011/11/24 00:31:06 UTC, 3 replies.
- ItemSimilarityJob's results differ from non-distributed version - posted by Greg H <gr...@gmail.com> on 2011/11/24 02:14:34 UTC, 14 replies.
- Load Dataset and Instances from database - posted by "Sturm, Martin" <Ma...@uc4.com> on 2011/11/24 15:34:46 UTC, 10 replies.
- Re:ClusteredPoints - posted by Rachana <ra...@gmail.com> on 2011/11/25 07:36:43 UTC, 3 replies.
- Facing problem while fetching the document id from cluser - posted by syed kather <in...@gmail.com> on 2011/11/25 11:03:40 UTC, 2 replies.
- Reminder: SF Mahout User Meeting - posted by Grant Ingersoll <gs...@apache.org> on 2011/11/26 01:30:48 UTC, 0 replies.
- org.apache.maven.plugins:maven-antrun-plugin:1.6:run grief, copy-dependencies and unpack goals not supported by m2e, importing mahout into Eclipse - posted by Mike Spreitzer <ms...@us.ibm.com> on 2011/11/26 02:09:12 UTC, 5 replies.
- Using Brisk with Mahout - posted by Tan Shern Shiou <sh...@mnc.com.my> on 2011/11/26 18:13:49 UTC, 9 replies.
- Scalable graph clustering implementation - posted by "Bae, Jae Hyeon" <me...@gmail.com> on 2011/11/26 22:21:52 UTC, 1 replies.
- Sequential Pattern Mining - posted by Nishant Chandra <ni...@gmail.com> on 2011/11/27 10:44:52 UTC, 10 replies.
- ItemSimilarity example - posted by bish maten <bi...@gmail.com> on 2011/11/27 12:42:16 UTC, 2 replies.
- mahout command problems - posted by bish maten <bi...@gmail.com> on 2011/11/27 13:02:45 UTC, 9 replies.
- output of Recommender - posted by bish maten <bi...@gmail.com> on 2011/11/27 14:41:38 UTC, 2 replies.
- LDATopic - posted by bish maten <bi...@gmail.com> on 2011/11/28 04:28:33 UTC, 3 replies.
- Mahout distribution download - posted by bish maten <bi...@gmail.com> on 2011/11/28 18:01:21 UTC, 2 replies.
- Organization Meeting - Austin ACM Special Interest Group on Knowledge Discovery and Data Mining - posted by David Boney <li...@semanticartifacts.com> on 2011/11/28 19:16:42 UTC, 0 replies.
- Increasing Mapper/Reducer memory - posted by Mohammed Al khooja <mk...@gmail.com> on 2011/11/28 19:35:14 UTC, 2 replies.
- LDA vocabulary limits - posted by Mohammed Al khooja <mk...@gmail.com> on 2011/11/29 03:38:50 UTC, 2 replies.
- Data class taxonomy for machine learning - posted by Lance Norskog <go...@gmail.com> on 2011/11/29 05:25:11 UTC, 3 replies.
- Time-based preferences for recommendation - posted by Anatoliy Kats <a....@rambler-co.ru> on 2011/11/29 10:32:39 UTC, 8 replies.
- Clustering graph coloring and layout - posted by Grant Ingersoll <gs...@apache.org> on 2011/11/29 14:03:05 UTC, 3 replies.
- Evaluating recommendations with expired items - posted by Anatoliy Kats <a....@rambler-co.ru> on 2011/11/29 17:09:02 UTC, 2 replies.
- Including a timestamp when setting preferences - posted by Jamey Wood <ja...@gmail.com> on 2011/11/29 18:32:55 UTC, 4 replies.
- Successful Organization Meeting for Austin SIGKDD - posted by David Boney <li...@semanticartifacts.com> on 2011/11/30 07:35:05 UTC, 1 replies.
- Mahout performance issues - posted by Daniel Zohar <di...@gmail.com> on 2011/11/30 10:11:45 UTC, 8 replies.
- Clustering - Sequence File from Directory - posted by "Faizan(Aroha)" <fa...@arohalabs.net> on 2011/11/30 11:54:15 UTC, 1 replies.