- Re: Number of reduce tasks of PFP - posted by 戴清灏 <ro...@gmail.com> on 2012/05/01 02:53:14 UTC, 1 replies.
- SVD with arff file format - posted by Yohan Jin <yo...@gmail.com> on 2012/05/01 09:19:38 UTC, 0 replies.
- Re: integrating databases - posted by Manuel Blechschmidt <Ma...@gmx.de> on 2012/05/01 10:37:13 UTC, 1 replies.
- Re: How does SVDRecommender work in mahout? - posted by Daniel Quach <da...@cs.ucla.edu> on 2012/05/02 06:50:03 UTC, 0 replies.
- Announcement: 'Parallel Processing beyond MapReduce' workshop after Berlin Buzzwords - posted by Sebastian Schelter <ss...@apache.org> on 2012/05/02 07:57:50 UTC, 0 replies.
- Mahout - Pig Hackday - posted by Timothy Potter <th...@gmail.com> on 2012/05/02 20:06:31 UTC, 9 replies.
- Re: [mahout] labels in clustering algorythms - posted by Konstantin Shmakov <ks...@gmail.com> on 2012/05/03 05:42:56 UTC, 0 replies.
- Problem Running org.apache.mahout.cf.taste.hadoop.item.RecommenderJob on Hadoop - posted by Utkarsh Gupta <Ut...@infosys.com> on 2012/05/03 08:25:24 UTC, 2 replies.
- Mahout + BigDataR Linux - posted by Nicholas Kolegraff <ni...@gmail.com> on 2012/05/03 16:06:31 UTC, 15 replies.
- A Mahout Naive Bayes classifier problem - posted by Zehao Jin <ze...@gmail.com> on 2012/05/04 15:08:32 UTC, 4 replies.
- Test failure: org.apache.mahout.math.hadoop.decomposer.TestDistributedLanczosSolverCLI - posted by mBria <br...@gmail.com> on 2012/05/04 20:55:30 UTC, 2 replies.
- Heap Space Issues with Complementary Naive Bayes - posted by Ryan Rosario <uc...@gmail.com> on 2012/05/05 01:49:07 UTC, 5 replies.
- Problem running new LDA algorithm (cvb) against the Reuters data - posted by DAN HELM <da...@verizon.net> on 2012/05/05 05:54:22 UTC, 3 replies.
- SGD cold start and model persistence questions - posted by hao wang <wa...@huofar.com> on 2012/05/05 09:06:51 UTC, 0 replies.
- SGD cold start and model persistence questions - posted by hao wang <ou...@gmail.com> on 2012/05/05 12:42:37 UTC, 1 replies.
- Re: Recommendation scores from LogLikelihood Similarity recommender - posted by Will C <wi...@infomofo.com> on 2012/05/06 19:48:24 UTC, 3 replies.
- kmeans not returning k clusters - posted by Pat Ferrel <pa...@occamsmachete.com> on 2012/05/06 22:49:52 UTC, 15 replies.
- Canopies and RowSimilarity - posted by Pat Ferrel <pa...@occamsmachete.com> on 2012/05/06 23:08:59 UTC, 5 replies.
- Re: the analysis of output of Clustering algorithm by mahout - posted by Paritosh Ranjan <pr...@xebia.com> on 2012/05/07 10:23:42 UTC, 0 replies.
- Eclipse + Mahout + Maven: maven-antrun-plugin error? - posted by mBria <br...@gmail.com> on 2012/05/07 21:27:11 UTC, 0 replies.
- Re: Eclipse + Mahout + Maven: maven-antrun-plugin error? - posted by Dmitriy Lyubimov <dl...@gmail.com> on 2012/05/07 23:01:36 UTC, 1 replies.
- how to implement item-based recommender on movie genre data? - posted by Daniel Quach <da...@cs.ucla.edu> on 2012/05/08 09:53:39 UTC, 3 replies.
- How to index by long ID in RandomAccessSparseVector - posted by 冯伟 <wh...@gmail.com> on 2012/05/08 10:13:38 UTC, 1 replies.
- Mahout Naive Bayes - posted by Nimesh Parikh <da...@gmail.com> on 2012/05/08 15:44:52 UTC, 0 replies.
- 2 questions about lda implementation - posted by ivan obeso <se...@gmail.com> on 2012/05/08 17:54:45 UTC, 2 replies.
- rowsimilarity not creating requested number of similar docs - posted by Pat Ferrel <pa...@occamsmachete.com> on 2012/05/08 19:06:57 UTC, 1 replies.
- Realtime classification model - posted by Mohit Anchlia <mo...@gmail.com> on 2012/05/08 19:28:14 UTC, 0 replies.
- Exclusing certain ratings when running recommender - posted by Mugoma Joseph Okomba <mu...@yengas.com> on 2012/05/09 03:00:51 UTC, 10 replies.
- High Dimensional Datasets for Binary Classification - posted by praneet mhatre <pr...@gmail.com> on 2012/05/09 05:06:29 UTC, 0 replies.
- From Item-based Recommender to User-based Recommender - posted by 冯伟 <wh...@gmail.com> on 2012/05/09 16:41:58 UTC, 5 replies.
- Theory question about Pearson Correlation and user based recommender - posted by Daniel Quach <da...@cs.ucla.edu> on 2012/05/09 18:13:08 UTC, 1 replies.
- Having a devil of a time running k-means examples with Mahout 0.6 / Hadoop 0.20.2 - posted by Alex Hasha <al...@bundle.com> on 2012/05/09 23:13:25 UTC, 1 replies.
- Canopy estimator - posted by Pat Ferrel <pa...@farfetchers.com> on 2012/05/10 02:36:15 UTC, 11 replies.
- How to run a mahout clustering job through a web service - posted by "Chandra Mohan, Ananda Vel Murugan" <An...@honeywell.com> on 2012/05/10 08:33:12 UTC, 5 replies.
- Extra low speed mahout distribution with hadoop - posted by Ma...@htc.com on 2012/05/10 11:29:07 UTC, 3 replies.
- Improve Recommendations - posted by Jahangir Mohammed <md...@gmail.com> on 2012/05/10 17:31:47 UTC, 3 replies.
- About Matrix Factorization and Vector/Matrix Manipulation - posted by 冯伟 <wh...@gmail.com> on 2012/05/10 17:33:44 UTC, 2 replies.
- Some guidance for this noob - "Metadata Matching Engine" - posted by mBria <br...@gmail.com> on 2012/05/10 23:57:13 UTC, 5 replies.
- clusterdump lucene document ID - posted by Benjamin Busjaeger <bu...@googlemail.com> on 2012/05/11 09:30:36 UTC, 0 replies.
- Recommender with ratings takes a long time to process - posted by Emilio Suarez <Em...@intela.com> on 2012/05/11 19:18:48 UTC, 4 replies.
- Question about storage in Pig-vector (Pig + Mahout) - posted by Timothy Potter <th...@gmail.com> on 2012/05/11 20:38:58 UTC, 8 replies.
- Re: Checksum error on K-means - posted by Paritosh Ranjan <pr...@xebia.com> on 2012/05/11 21:19:38 UTC, 0 replies.
- Recommender with item features - posted by EDUARDO ANTONIO BUITRAGO ZAPATA <ed...@gmail.com> on 2012/05/12 03:15:25 UTC, 2 replies.
- [Announcement] Giraph talk in Berlin on May 29th - posted by Sebastian Schelter <ss...@apache.org> on 2012/05/12 11:58:42 UTC, 4 replies.
- RowSimilarity - posted by Pat Ferrel <pa...@occamsmachete.com> on 2012/05/13 01:29:44 UTC, 9 replies.
- Exception running 20newsgroups example - posted by mahout-newbie <ra...@gmail.com> on 2012/05/14 03:33:23 UTC, 2 replies.
- 40 hours to run 1/2 Netflix Data? - posted by 许春玲 <xu...@sari.ac.cn> on 2012/05/14 03:44:06 UTC, 3 replies.
- Interpretation of co-efficients of features in Mahout logistic output - posted by "Nowal, Akshay" <Ak...@SYNTELINC.COM> on 2012/05/14 07:49:23 UTC, 1 replies.
- Re: Interpretation of co-efficients of features in Mahout logistic output - posted by "Nowal, Akshay" <Ak...@SYNTELINC.COM> on 2012/05/14 09:39:04 UTC, 0 replies.
- Persistent Data Model - posted by Nikolaos Romanos Katsipoulakis <po...@gmail.com> on 2012/05/14 13:44:41 UTC, 5 replies.
- online clustering with mahout - posted by Ioan Eugen Stan <st...@gmail.com> on 2012/05/14 14:34:39 UTC, 4 replies.
- large scale kmeans - posted by Jiaan Zeng <l....@gmail.com> on 2012/05/15 05:22:32 UTC, 1 replies.
- question on VectorWritable convertor in elephant-bird. - posted by Yohan Chin <yo...@gmail.com> on 2012/05/15 08:43:20 UTC, 6 replies.
- CachingRecommender versus Recommender - posted by Nikolaos Romanos Katsipoulakis <po...@gmail.com> on 2012/05/15 15:27:50 UTC, 1 replies.
- choosing appropriate t1,t2 for canopy clustering - posted by Robert Stewart <bs...@gmail.com> on 2012/05/15 16:45:30 UTC, 7 replies.
- Judging the quality of clustering - posted by Pat Ferrel <pa...@occamsmachete.com> on 2012/05/15 22:46:06 UTC, 9 replies.
- Recall and Precision values in Item Based Collaborative filtering - posted by Utkarsh Gupta <Ut...@infosys.com> on 2012/05/16 09:54:56 UTC, 0 replies.
- mahout job and hadoop - posted by "Chandra Mohan, Ananda Vel Murugan" <An...@honeywell.com> on 2012/05/16 12:00:22 UTC, 2 replies.
- About Random walk with restart - posted by huanchen <ia...@gmail.com> on 2012/05/17 04:33:00 UTC, 8 replies.
- Save a UserSimilarity in a File - posted by Nikolaos Romanos Katsipoulakis <po...@gmail.com> on 2012/05/17 12:25:10 UTC, 4 replies.
- Subcribe - posted by Amol Kulkarni <av...@gmail.com> on 2012/05/17 14:53:53 UTC, 1 replies.
- tokenizer for text - posted by Jiaan Zeng <l....@gmail.com> on 2012/05/18 16:15:19 UTC, 5 replies.
- LDA, printing Topics - posted by Simon Handley <sh...@alumni.stanford.org> on 2012/05/18 17:10:04 UTC, 0 replies.
- Help with running taste-demo on mahout-examples-0.7-SNAPSHOT.jar - posted by Dhananjay Sampath <dh...@gmail.com> on 2012/05/18 20:07:24 UTC, 7 replies.
- NoClassDefFoundError calling custom analyzer in seq2sparse - posted by DAN HELM <da...@verizon.net> on 2012/05/18 20:36:57 UTC, 0 replies.
- How to approach this? Classification vs Recommendation - posted by fht <li...@gmail.com> on 2012/05/18 23:00:25 UTC, 2 replies.
- How to make normal Text suitable for Kmeans using mahout - posted by siddharth0ece <si...@gmail.com> on 2012/05/19 09:46:41 UTC, 1 replies.
- Wikipedia things/strings dataset - posted by Dan Brickley <da...@danbri.org> on 2012/05/19 21:50:12 UTC, 4 replies.
- Mixing simiarity measures - posted by Mugoma Joseph Okomba <mu...@yengas.com> on 2012/05/20 03:31:45 UTC, 9 replies.
- Getting Recommendation for User not in Input Data - posted by Utkarsh Gupta <Ut...@infosys.com> on 2012/05/21 14:28:55 UTC, 3 replies.
- Mahout's Text Similarity using HBase - posted by Junaid Surve <ju...@gmail.com> on 2012/05/21 16:12:38 UTC, 0 replies.
- Exporting Mahout model output as Weka input - posted by Guy Ernest <gu...@gmail.com> on 2012/05/21 23:09:16 UTC, 3 replies.
- sample code for mahout random forest implementation - posted by "Chandra Mohan, Ananda Vel Murugan" <An...@honeywell.com> on 2012/05/22 06:14:01 UTC, 1 replies.
- Forecasting in Mahout - posted by ParvathyPillai <pa...@gmail.com> on 2012/05/22 12:31:04 UTC, 7 replies.
- Questions about LDA topic/term model - posted by ivan obeso <se...@gmail.com> on 2012/05/22 12:39:23 UTC, 0 replies.
- Kmeans in Mahout - posted by erick chiliboyi <si...@gmail.com> on 2012/05/22 17:02:56 UTC, 2 replies.
- CDbw and Evaluator results - posted by Pat Ferrel <pa...@occamsmachete.com> on 2012/05/22 20:02:29 UTC, 2 replies.
- Mahout clusterdump - posted by Bahadır Yılmaz <ba...@gmail.com> on 2012/05/22 23:27:08 UTC, 2 replies.
- Continue K-Means on pseudo-distributed mode after a server failure - posted by Michael Kazekin <Mi...@mediainsight.info> on 2012/05/24 14:29:46 UTC, 2 replies.
- Any other names for generic user/item based recommenders? - posted by Daniel Quach <da...@cs.ucla.edu> on 2012/05/25 07:06:34 UTC, 1 replies.
- Error: Java heap space on mahout cvb command - posted by DAN HELM <da...@verizon.net> on 2012/05/26 01:07:37 UTC, 4 replies.
- Re: Creating vectors from a Lucene Index - posted by Hung Ta <xu...@gmail.com> on 2012/05/26 01:37:12 UTC, 0 replies.
- Running Naive Bayes with csv format input file - posted by "Nowal, Akshay" <Ak...@SYNTELINC.COM> on 2012/05/28 08:48:20 UTC, 1 replies.
- Bug in canopy/kmeans clustering - posted by Nabarun Sengupta <Na...@mindtree.com> on 2012/05/29 06:46:26 UTC, 1 replies.
- extracting output from mahout Naive bayes algo - posted by Venkat Tharun <ve...@gmail.com> on 2012/05/29 11:16:05 UTC, 0 replies.
- Help on how to use Mahout; call Mahout from Netbeans UI - posted by 2008226726 - hanis uitm <ha...@isiswa.uitm.edu.my> on 2012/05/29 12:04:11 UTC, 0 replies.
- extracting output from naive bayes classifier - posted by venkat_tharun <ve...@gmail.com> on 2012/05/29 12:10:50 UTC, 0 replies.
- Storage and Running Time issue running LDA on Cluster - posted by zarthon <mo...@gmail.com> on 2012/05/29 13:04:08 UTC, 1 replies.
- RecommenderJob Hadoop execution times - posted by Nikolaos Romanos Katsipoulakis <po...@gmail.com> on 2012/05/29 13:27:40 UTC, 6 replies.
- mahout FPGrowth problem - posted by "Ungerer, Jens" <je...@student.kit.edu> on 2012/05/29 15:49:51 UTC, 2 replies.
- RecommenderJob not working for boolean Data? - posted by "Oliver B. Fischer" <ma...@swe-blog.net> on 2012/05/29 23:58:04 UTC, 4 replies.
- Using LDA CVB results to match a new document to topics? - posted by "Runkel, Timothy J" <ti...@lmco.com> on 2012/05/30 02:03:53 UTC, 2 replies.
- TwitterAnalyzer for Clustering: How to add to classpath - posted by alexi <al...@gmail.com> on 2012/05/30 02:41:35 UTC, 0 replies.
- Training Data and Precision/Recall evaluation - posted by Daniel Quach <da...@cs.ucla.edu> on 2012/05/30 11:16:53 UTC, 1 replies.
- Server sizing Hadoop + Mahout - posted by jcuencaa <jo...@everis.com> on 2012/05/30 11:32:47 UTC, 1 replies.
- Clustering a large crawl - posted by Pat Ferrel <pa...@occamsmachete.com> on 2012/05/30 19:23:30 UTC, 9 replies.
- about vector from text - posted by Jiaan Zeng <l....@gmail.com> on 2012/05/30 21:00:48 UTC, 0 replies.
- RowSimilarityJob - posted by Pat Ferrel <pa...@occamsmachete.com> on 2012/05/31 04:22:19 UTC, 2 replies.