You are viewing a plain text version of this content. The canonical link for it is here.
- Re: Reading from Lucene, writing to IntWritable VectorWritable does not work - posted by Lance Norskog <go...@gmail.com> on 2011/06/01 03:01:27 UTC, 1 replies.
- Re: What does percentCorrect of CrossFloderLearner mean? - posted by Xiaobo Gu <gu...@gmail.com> on 2011/06/01 03:38:30 UTC, 1 replies.
- Re: Which exact algorithm is used in the Mahout SGD? - posted by Dmitriy Lyubimov <dl...@gmail.com> on 2011/06/01 04:19:42 UTC, 4 replies.
- Re: error when adding generic options - posted by Qiuyan Xu <qi...@mailbox.tu-berlin.de> on 2011/06/01 04:25:10 UTC, 1 replies.
- Why do userid & itemid have to be long? - posted by Mike Khristo <mi...@gmail.com> on 2011/06/01 04:50:40 UTC, 10 replies.
- Talk on dimension reduction from Google talks - Yoav Freund - posted by Lance Norskog <go...@gmail.com> on 2011/06/01 09:08:10 UTC, 0 replies.
- Measuring randomness - posted by Lance Norskog <go...@gmail.com> on 2011/06/01 09:31:30 UTC, 3 replies.
- Do we have to make a seperate hold-out data set for AdaptiveLogisticRegression to measure the performance? - posted by Xiaobo Gu <gu...@gmail.com> on 2011/06/01 11:22:18 UTC, 4 replies.
- Heap Size question. - posted by Ken Williams <zo...@hotmail.com> on 2011/06/01 13:35:17 UTC, 1 replies.
- Exploring the potential of a Mahout classification system - posted by "Baker, Tristan" <Tr...@intuit.com> on 2011/06/02 00:11:57 UTC, 3 replies.
- PearsonCorrelationSimilarity returning NaN for user similarity with perfect match - posted by Jason Smith <ja...@gmail.com> on 2011/06/02 05:00:53 UTC, 1 replies.
- Apache Mahout 0.5 released - posted by Sean Owen <sr...@apache.org> on 2011/06/02 09:25:05 UTC, 4 replies.
- NaiveBayes and Classification of non-documents - posted by "Lancaster, Robert (Orbitz)" <RO...@orbitz.com> on 2011/06/02 16:40:25 UTC, 5 replies.
- Intersection of 2 PreferenceArrays - posted by Chris Schilling <ch...@gmail.com> on 2011/06/02 20:07:43 UTC, 6 replies.
- Logistic Regression + non-CSV examples - posted by Svetlomir Kasabov <sk...@smail.inf.fh-brs.de> on 2011/06/02 21:55:26 UTC, 2 replies.
- LogisticModelParameters#saveTo(OutputStream) generates an empty file - posted by Svetlomir Kasabov <sk...@smail.inf.fh-brs.de> on 2011/06/03 02:01:23 UTC, 5 replies.
- Reg Randomn forest - posted by ex...@nokia.com on 2011/06/03 13:15:59 UTC, 1 replies.
- Computing SVD Of "Large Sparse Data" - posted by Eshwaran Vijaya Kumar <ev...@mozilla.com> on 2011/06/04 01:48:48 UTC, 10 replies.
- Re : Reg Randomn forest - posted by deneche abdelhakim <a_...@yahoo.fr> on 2011/06/04 16:30:03 UTC, 0 replies.
- How is the content of a confusion matrix print? - posted by XiaoboGu <gu...@gmail.com> on 2011/06/04 17:24:29 UTC, 2 replies.
- How to get the predicted target lable using CrossFolderLearner? - posted by XiaoboGu <gu...@gmail.com> on 2011/06/04 17:31:48 UTC, 7 replies.
- ItemSimilarityJob Cooccurrence Question - posted by djn <de...@gmail.com> on 2011/06/04 23:21:53 UTC, 1 replies.
- Logistic Regression + Time Series - posted by Svetlomir Dimitrov Kasabov <sv...@smail.inf.fh-bonn-rhein-sieg.de> on 2011/06/05 17:08:09 UTC, 7 replies.
- Problems running examples - posted by Mark <st...@gmail.com> on 2011/06/05 20:07:01 UTC, 24 replies.
- Need a little help with SVD / Dimensional Reduction - posted by Stefan Wienert <st...@wienert.cc> on 2011/06/05 22:16:41 UTC, 24 replies.
- What's the best practice to specify df for TPrior and alphaByLambda for ElasticBandPrior - posted by XiaoboGu <gu...@gmail.com> on 2011/06/06 10:06:39 UTC, 2 replies.
- Problems with genetic algorithms in Mahout - posted by Jose Fuentes De Frutos <in...@udc.es> on 2011/06/06 12:59:44 UTC, 0 replies.
- Taste-Web - posted by majin <ma...@x-playaz.de> on 2011/06/06 14:03:33 UTC, 1 replies.
- SequenceFilesFromDirectory - posted by Mark <st...@gmail.com> on 2011/06/06 18:04:18 UTC, 5 replies.
- Stacking Algorithms - posted by jeff thomas <mr...@yahoo.com> on 2011/06/06 22:27:22 UTC, 3 replies.
- item similarity based on multiple attributes - posted by mc tell <mc...@gmail.com> on 2011/06/07 14:23:37 UTC, 1 replies.
- [OT] 11 Days to Scale-A-Thon - posted by Grant Ingersoll <gs...@apache.org> on 2011/06/07 14:25:05 UTC, 0 replies.
- how to use bayse classifier to predict - posted by 刘逸哲 <zh...@alibaba-inc.com> on 2011/06/08 10:32:16 UTC, 1 replies.
- Vector truncation for visualization - posted by Lance Norskog <go...@gmail.com> on 2011/06/09 00:25:54 UTC, 4 replies.
- Problems running distributed seq2sparse - posted by Mark <st...@gmail.com> on 2011/06/09 02:47:05 UTC, 2 replies.
- Confused on binary vs source distributions - posted by Mark <st...@gmail.com> on 2011/06/09 03:18:25 UTC, 5 replies.
- Re: Hybrid RecSys — ways to do it - posted by Marko Ciric <ci...@gmail.com> on 2011/06/09 15:51:41 UTC, 3 replies.
- What are the most used machine learning algorithms? - posted by Danny Bickson <da...@gmail.com> on 2011/06/09 16:58:10 UTC, 4 replies.
- Re: Mahout in Action -- Ch 5 Dataset - posted by mail2abin <ma...@gmail.com> on 2011/06/09 18:09:00 UTC, 2 replies.
- plan for mahout 0.6 - posted by Daniel Xiaodan Zhou <da...@gmail.com> on 2011/06/09 21:44:15 UTC, 1 replies.
- Re: Automatically extracted Mahout FAQs - posted by Stefan Henß <st...@googlemail.com> on 2011/06/09 22:18:07 UTC, 1 replies.
- Re: T1 and T2 in Canopy - posted by Konstantin Shmakov <ks...@gmail.com> on 2011/06/09 23:41:09 UTC, 1 replies.
- MinHash implementation - posted by Jeff Hansen <ds...@gmail.com> on 2011/06/10 00:53:23 UTC, 0 replies.
- Yahoo's LDA code - posted by je...@lewi.us on 2011/06/10 02:12:17 UTC, 10 replies.
- Classification beginner questions - posted by Joscha Feth <jo...@feth.com> on 2011/06/10 09:54:35 UTC, 16 replies.
- Re: SGD and per-term annealing. - posted by Nabarun <se...@gmail.com> on 2011/06/10 12:39:44 UTC, 1 replies.
- Stochastic gradient algorithm related queries - posted by Nabarun <se...@gmail.com> on 2011/06/10 13:54:28 UTC, 1 replies.
- Installing mahout on a cluster of Linux VM's Examples not compiling - posted by "Walshe, Maurice (RBI-UK)" <Ma...@rbi.co.uk> on 2011/06/10 17:47:03 UTC, 0 replies.
- Re: Installing mahout on a cluster of Linux VM's Examples not compiling - posted by Sean Owen <sr...@gmail.com> on 2011/06/10 17:59:12 UTC, 1 replies.
- how do you run the kdd examples? - posted by jeff thomas <mr...@yahoo.com> on 2011/06/10 21:30:25 UTC, 4 replies.
- term collocation from lucene index - posted by Peter Andrews <pw...@gmail.com> on 2011/06/10 23:03:11 UTC, 1 replies.
- Uses of linear algebra in text search - posted by Lance Norskog <go...@gmail.com> on 2011/06/11 05:53:37 UTC, 0 replies.
- CsvRecordFactory usage recomendation - posted by Svetlomir Kasabov <sk...@smail.inf.fh-brs.de> on 2011/06/11 23:13:51 UTC, 1 replies.
- Calculating cosine similarity for vectors extracted from Lucene - posted by Andrew Clegg <an...@gmail.com> on 2011/06/12 02:29:37 UTC, 1 replies.
- RecommenderJob uses indirection for ItemIDs - posted by Lance Norskog <go...@gmail.com> on 2011/06/12 04:47:00 UTC, 3 replies.
- Another beginners question - posted by sharath jagannath <sh...@gmail.com> on 2011/06/14 06:52:43 UTC, 2 replies.
- Map reduce job for Recommender - posted by Prashant Sharma <pr...@imaginea.com> on 2011/06/14 07:39:11 UTC, 6 replies.
- tf-idf + svd + cosine similarity - posted by Stefan Wienert <st...@wienert.cc> on 2011/06/14 19:15:09 UTC, 27 replies.
- Comparing Mahout's Lanczos Solver Results With Matlab - posted by Eshwaran Vijaya Kumar <ev...@mozilla.com> on 2011/06/14 23:02:28 UTC, 4 replies.
- Mahout & Solr - posted by Mark <st...@gmail.com> on 2011/06/15 16:38:55 UTC, 4 replies.
- Probabilities in Bayesian classifier - posted by Steven Raemaekers <s....@sig.eu> on 2011/06/15 16:51:07 UTC, 5 replies.
- Cassandra support - posted by Patricio Echagüe <pa...@gmail.com> on 2011/06/15 21:08:32 UTC, 3 replies.
- a modified booleanrecommendation strategy with 'likes' - posted by aaron barnes <aa...@stasis.org> on 2011/06/15 21:27:44 UTC, 3 replies.
- Build error when running example: JesterRecommenderEvaluatorRunner - posted by Patricio Echagüe <pa...@gmail.com> on 2011/06/16 01:06:27 UTC, 1 replies.
- Help to build from sources - posted by Patricio Echagüe <pa...@gmail.com> on 2011/06/16 02:21:16 UTC, 4 replies.
- Request - Release 0.6 feature set listing - posted by Kumar Kandasami <ku...@gmail.com> on 2011/06/16 18:06:31 UTC, 3 replies.
- hadoop version - posted by Ian Upright <ia...@upright.net> on 2011/06/16 19:23:33 UTC, 2 replies.
- Clustering Suggestions - posted by Adam Estrada <es...@gmail.com> on 2011/06/16 22:24:56 UTC, 1 replies.
- mahouts' svd - posted by "stefan.bobocescu" <st...@cti.pub.ro> on 2011/06/17 00:16:55 UTC, 1 replies.
- Compute Covariance Matrix - posted by marco turchi <ma...@gmail.com> on 2011/06/17 11:50:57 UTC, 0 replies.
- the issues on "run the reuters extraction code from the examples directory" - posted by huaiyang gongzi <hu...@gmail.com> on 2011/06/17 18:46:32 UTC, 1 replies.
- LDA - posted by Ian Upright <ia...@upright.net> on 2011/06/18 01:59:47 UTC, 2 replies.
- Test - posted by Ken Williams <zo...@hotmail.com> on 2011/06/18 17:23:34 UTC, 1 replies.
- RE: OutOfMemoryError: GC overhead limit exceeded - posted by Ken Williams <zo...@hotmail.com> on 2011/06/18 18:39:09 UTC, 2 replies.
- Trending patterns - posted by Mark <st...@gmail.com> on 2011/06/18 19:52:33 UTC, 3 replies.
- Generated clusters... now what? - posted by Mark <st...@gmail.com> on 2011/06/18 22:30:12 UTC, 1 replies.
- This is to test whether my subscription is successful! Thanks. - posted by huaiyang gongzi <hu...@gmail.com> on 2011/06/19 02:29:47 UTC, 0 replies.
- lucene.vector SequenceFile formatted dictionary? - posted by Plastic Flat <pl...@gmail.com> on 2011/06/20 03:19:14 UTC, 0 replies.
- Mahout on Github - posted by Mark <st...@gmail.com> on 2011/06/20 17:05:38 UTC, 5 replies.
- Mahout 0.5 seq2sparse gives Error: LUCENE_31 - posted by Camilo Lopez <ca...@camilolopez.com> on 2011/06/20 19:32:34 UTC, 2 replies.
- Exception while testing reuters data - posted by sharath jagannath <sh...@gmail.com> on 2011/06/20 20:40:11 UTC, 6 replies.
- Running Iterative Recursive Least Squares - posted by Vincent Xue <xu...@gmail.com> on 2011/06/20 21:25:22 UTC, 3 replies.
- Re: Grouplens dataset Recommenderjob with Hadoop - posted by sangroya <sa...@gmail.com> on 2011/06/21 09:53:50 UTC, 1 replies.
- Re: how to do recommender incremental update for offline item-base similarity? - posted by Sean Owen <sr...@gmail.com> on 2011/06/21 16:31:05 UTC, 1 replies.
- More on incremental clustering - posted by Camilo Lopez <ca...@camilolopez.com> on 2011/06/21 17:23:37 UTC, 0 replies.
- Mahout for detecting fake profiles in social networks! - posted by Neville Agius <na...@gmail.com> on 2011/06/21 17:37:58 UTC, 5 replies.
- Re: Hints for Best Practices for Jobs with amazon EMR - posted by Mat Kelcey <ma...@gmail.com> on 2011/06/21 23:30:42 UTC, 0 replies.
- Which is more effective? - posted by Marko Ciric <ci...@gmail.com> on 2011/06/22 00:34:24 UTC, 8 replies.
- quickstart: MVN problems - posted by Daniel Weitzenfeld <dw...@gmail.com> on 2011/06/22 12:15:51 UTC, 1 replies.
- meanshift reduce task problem - posted by "Sengupta, Sohini IN BLR SISL" <so...@siemens.com> on 2011/06/22 13:45:09 UTC, 4 replies.
- Version of Lucene - posted by Adam Estrada <es...@gmail.com> on 2011/06/23 13:29:25 UTC, 3 replies.
- Mahout and Kolt - posted by Marko Ciric <ci...@gmail.com> on 2011/06/23 15:25:27 UTC, 4 replies.
- LanczosSVD and Eigenvalues - posted by tr...@cs.drexel.edu on 2011/06/23 18:07:16 UTC, 22 replies.
- java.io.IOException while running itemsimilarity - posted by Andrew Schein <an...@efrontier.com> on 2011/06/23 19:35:11 UTC, 3 replies.
- java.lang.IndexOutOfBoundsException - posted by Patricio Echagüe <pa...@gmail.com> on 2011/06/23 20:38:09 UTC, 15 replies.
- MAHOUT-708 - posted by Patricio Echagüe <pa...@gmail.com> on 2011/06/23 23:11:51 UTC, 2 replies.
- Can all the algorithms in Mahout be run locally without a Hadoop cluster. - posted by XiaoboGu <gu...@gmail.com> on 2011/06/24 10:47:59 UTC, 21 replies.
- Should threadcount and poolsize of AdaptiveLogisticRegression be the same? - posted by XiaoboGu <gu...@gmail.com> on 2011/06/24 12:04:27 UTC, 1 replies.
- Adding dimensions to an existing TF-IDF vector - posted by Mark <st...@gmail.com> on 2011/06/24 17:52:54 UTC, 5 replies.
- How to read/analyze the clustered result - posted by wine lover <wi...@gmail.com> on 2011/06/24 21:24:06 UTC, 1 replies.
- org.apache.mahout.math.hadoop.similarity.RowSimilarityJob - posted by "Paul, Seby" <Se...@searshc.com> on 2011/06/24 23:41:17 UTC, 0 replies.
- Using - posted by rmx <ru...@hotmail.com> on 2011/06/25 23:18:59 UTC, 0 replies.
- KMeans and Canopies - posted by Mark <st...@gmail.com> on 2011/06/26 21:29:05 UTC, 2 replies.
- An inmemory sparse matrix multiplier - posted by Vincent Xue <xu...@gmail.com> on 2011/06/26 22:47:21 UTC, 10 replies.
- Canopy Generation - posted by Mark <st...@gmail.com> on 2011/06/27 00:40:09 UTC, 3 replies.
- Incorrect calculation of pdf - posted by Vasil Vasilev <va...@gmail.com> on 2011/06/27 10:49:38 UTC, 7 replies.
- questions on using build-reuters.sh and the output of ExtractReuters - posted by wine lover <wi...@gmail.com> on 2011/06/27 17:55:05 UTC, 1 replies.
- Limiting lucene.vector by location - posted by Adam Estrada <es...@gmail.com> on 2011/06/27 18:42:26 UTC, 2 replies.
- parameter setting for using Seqdirectory and SequenceFile - posted by wine lover <wi...@gmail.com> on 2011/06/27 22:36:15 UTC, 2 replies.
- Fuzzy logic and Heuristics vs Classification - posted by Patrick Collins <pa...@ready2sign.com> on 2011/06/28 02:51:13 UTC, 3 replies.
- Re: Using with seq2spars org.apache.lucene.analysis.Analyzer - posted by rmx <ru...@hotmail.com> on 2011/06/28 13:02:19 UTC, 0 replies.
- Re: Using with seq2spars org.apache.lucene.analysis.Analyzer - posted by Dhruv Kumar <dk...@ecs.umass.edu> on 2011/06/28 17:05:24 UTC, 0 replies.
- Re: Creating vectors from a Lucene Index - posted by bin lin <do...@126.com> on 2011/06/29 03:15:33 UTC, 2 replies.
- L2 seems does not work - posted by Xiaobo Gu <gu...@gmail.com> on 2011/06/29 18:26:08 UTC, 4 replies.
- What does defaultLabel of ConfusionMatrix mean? - posted by Xiaobo Gu <gu...@gmail.com> on 2011/06/30 04:21:54 UTC, 1 replies.
- Markov Models and Recommenders - posted by Lance Norskog <go...@gmail.com> on 2011/06/30 05:01:07 UTC, 1 replies.
- questions on the results of running lda and ldatopics, thanks - posted by wine lover <wi...@gmail.com> on 2011/06/30 20:08:34 UTC, 2 replies.