You are viewing a plain text version of this content. The canonical link for it is here.
- Re: How to use multi thread in RDD map function ? - posted by myasuka <my...@live.com> on 2014/10/01 08:00:47 UTC, 0 replies.
- Re: parquet predicate / projection pushdown into unionAll - posted by DB Tsai <db...@dbtsai.com> on 2014/10/01 11:25:12 UTC, 0 replies.
- Re: jenkins downtime/system upgrade wednesday morning, 730am PDT - posted by shane knapp <sk...@berkeley.edu> on 2014/10/01 16:14:05 UTC, 1 replies.
- Re: amplab jenkins is down - posted by Nicholas Chammas <ni...@gmail.com> on 2014/10/01 21:53:03 UTC, 2 replies.
- Re: do MIMA checking before all test cases start? - posted by Nicholas Chammas <ni...@gmail.com> on 2014/10/01 21:56:18 UTC, 0 replies.
- Extending Scala style checks - posted by Nicholas Chammas <ni...@gmail.com> on 2014/10/01 23:01:06 UTC, 11 replies.
- HiveContext: cache table not supported for partitioned table? - posted by Du Li <li...@yahoo-inc.com.INVALID> on 2014/10/02 21:39:05 UTC, 2 replies.
- Re: EC2 clusters ready in launch time + 30 seconds - posted by Nicholas Chammas <ni...@gmail.com> on 2014/10/03 01:30:51 UTC, 9 replies.
- What is the best way to build my developing Spark for testing on EC2? - posted by Yu Ishikawa <yu...@gmail.com> on 2014/10/03 02:37:10 UTC, 2 replies.
- Breeze Library usage in Spark - posted by Priya Ch <le...@gmail.com> on 2014/10/03 13:22:44 UTC, 4 replies.
- emergency jenkins restart -- massive security patch released - posted by shane knapp <sk...@berkeley.edu> on 2014/10/03 19:51:13 UTC, 1 replies.
- Parquet schema migrations - posted by Cody Koeninger <co...@koeninger.org> on 2014/10/03 22:33:40 UTC, 6 replies.
- What versions of Hadoop Spark supports? - posted by tomo cocoa <co...@gmail.com> on 2014/10/04 21:10:43 UTC, 0 replies.
- Jython importing pyspark? - posted by Robert C Senkbeil <rc...@us.ibm.com> on 2014/10/05 19:16:25 UTC, 1 replies.
- Impact of input format on timing - posted by Tom Hubregtsen <th...@gmail.com> on 2014/10/05 22:58:16 UTC, 1 replies.
- Hyper Parameter Tuning Algorithms - posted by Lochana Menikarachchi <lo...@gmail.com> on 2014/10/06 04:28:18 UTC, 1 replies.
- Re: SPARK-3660 : Initial RDD for updateStateByKey transformation - posted by Soumitra Kumar <ku...@gmail.com> on 2014/10/06 05:40:48 UTC, 0 replies.
- Too big data Spark SQL on Hive table on version 1.0.2 has some strange output - posted by Trident <cw...@vip.qq.com> on 2014/10/06 08:00:29 UTC, 0 replies.
- Spark on Mesos 0.20 - posted by Fairiz Azizi <co...@gmail.com> on 2014/10/06 08:19:04 UTC, 14 replies.
- TorrentBroadcast slow performance - posted by Guillaume Pitel <gu...@exensa.com> on 2014/10/06 10:27:06 UTC, 5 replies.
- Pull Requests - posted by Bill Bejeck <bb...@gmail.com> on 2014/10/07 04:32:00 UTC, 1 replies.
- Local tests logging to log4j - posted by Debasish Das <de...@gmail.com> on 2014/10/07 20:42:35 UTC, 2 replies.
- Unneeded branches/tags - posted by Nicholas Chammas <ni...@gmail.com> on 2014/10/08 03:25:22 UTC, 3 replies.
- RE: Spark SQL question: why build hashtable for both sides in HashOuterJoin? - posted by Haopu Wang <HW...@qilinsoft.com> on 2014/10/08 08:04:55 UTC, 2 replies.
- Re: How to do broadcast join in SparkSQL - posted by Jianshi Huang <ji...@gmail.com> on 2014/10/08 08:18:37 UTC, 3 replies.
- Standardized Distance Functions in MLlib - posted by Yu Ishikawa <yu...@gmail.com> on 2014/10/08 13:19:47 UTC, 2 replies.
- will/when Spark/SparkSQL will support ORCFile format - posted by James Yu <jy...@gmail.com> on 2014/10/08 19:03:11 UTC, 7 replies.
- spark-ec2 can't initialize spark-standalone module - posted by Nicholas Chammas <ni...@gmail.com> on 2014/10/08 22:50:17 UTC, 1 replies.
- Fwd: Accumulator question - posted by Nathan Kronenfeld <nk...@oculusinfo.com> on 2014/10/08 23:53:34 UTC, 1 replies.
- new jenkins update + tentative release date - posted by shane knapp <sk...@berkeley.edu> on 2014/10/09 00:01:04 UTC, 10 replies.
- Re: [MLlib] LogisticRegressionWithSGD and LogisticRegressionWithLBFGS converge with different weights. - posted by DB Tsai <db...@dbtsai.com> on 2014/10/09 11:23:25 UTC, 0 replies.
- Trouble running tests - posted by Yana <ya...@gmail.com> on 2014/10/09 16:10:57 UTC, 3 replies.
- Introduction to Spark Blog - posted by "devl.development" <de...@gmail.com> on 2014/10/09 20:18:52 UTC, 0 replies.
- spark-prs and mesos/spark-ec2 - posted by Nicholas Chammas <ni...@gmail.com> on 2014/10/10 03:19:44 UTC, 1 replies.
- [Spark SQL] Strange NPE in Spark SQL with Hive - posted by Trident <cw...@vip.qq.com> on 2014/10/10 04:09:41 UTC, 0 replies.
- [Spark SQL Continue] Sorry, it is not only limited in SQL, may due to network - posted by Trident <cw...@vip.qq.com> on 2014/10/10 04:53:45 UTC, 0 replies.
- Breaking the previous large-scale sort record with Spark - posted by Matei Zaharia <ma...@gmail.com> on 2014/10/10 16:54:16 UTC, 12 replies.
- Decision forests don't work with non-trivial categorical features - posted by Sean Owen <so...@cloudera.com> on 2014/10/12 20:50:08 UTC, 7 replies.
- reading/writing parquet decimal type - posted by Michael Allman <mi...@videoamp.com> on 2014/10/12 22:51:21 UTC, 4 replies.
- Scalastyle improvements / large code reformatting - posted by Josh Rosen <ro...@gmail.com> on 2014/10/13 06:37:04 UTC, 8 replies.
- SPARK-3106 fixed? - posted by Jianshi Huang <ji...@gmail.com> on 2014/10/13 10:15:02 UTC, 3 replies.
- Re:Breaking the previous large-scale sort record with Spark - posted by "欧阳晋(欧阳晋)" <ji...@alibaba-inc.com> on 2014/10/13 15:10:30 UTC, 0 replies.
- Default spark.deploy.recoveryMode - posted by Priya Ch <le...@gmail.com> on 2014/10/14 13:33:46 UTC, 0 replies.
- avro-mapred hadoop2 not in assembly jar - posted by Joseph Beynon <jb...@gmail.com> on 2014/10/15 02:21:21 UTC, 0 replies.
- Unit testing Master-Worker Message Passing - posted by Matthew Cheah <ma...@gmail.com> on 2014/10/15 03:17:14 UTC, 9 replies.
- [mllib] Share the simple benchmark result about the cast cost from Spark vector to Breeze vector - posted by Yu Ishikawa <yu...@gmail.com> on 2014/10/15 16:05:33 UTC, 0 replies.
- short jenkins downtime -- trying to get to the bottom of the git fetch timeouts - posted by shane knapp <sk...@berkeley.edu> on 2014/10/15 22:52:20 UTC, 19 replies.
- Issues with ALS positive definite - posted by Debasish Das <de...@gmail.com> on 2014/10/16 01:57:04 UTC, 6 replies.
- accumulators - posted by Sean McNamara <Se...@Webtrends.com> on 2014/10/16 18:10:32 UTC, 1 replies.
- NNLS bug - posted by Debasish Das <de...@gmail.com> on 2014/10/17 08:25:50 UTC, 1 replies.
- Using Docker to Parallelize Tests - posted by Nicholas Chammas <ni...@gmail.com> on 2014/10/17 21:13:06 UTC, 0 replies.
- sampling broken in PySpark with recent NumPy - posted by Jeremy Freeman <fr...@gmail.com> on 2014/10/18 00:23:15 UTC, 0 replies.
- Raise Java dependency from 6 to 7 - posted by Andrew Ash <an...@andrewash.com> on 2014/10/18 02:00:40 UTC, 5 replies.
- Oryx + Spark mllib - posted by Debasish Das <de...@gmail.com> on 2014/10/18 17:46:48 UTC, 7 replies.
- Joining the spark dev community - posted by Saurabh Wadhawan <Sa...@guavus.com> on 2014/10/18 22:46:49 UTC, 1 replies.
- Submissions open for Spark Summit East 2015 - posted by Matei Zaharia <ma...@gmail.com> on 2014/10/19 06:52:13 UTC, 1 replies.
- Get attempt number in a closure - posted by Yin Huai <hu...@gmail.com> on 2014/10/20 16:17:04 UTC, 7 replies.
- something wrong with Jenkins or something untested merged? - posted by Nan Zhu <zh...@gmail.com> on 2014/10/21 00:51:44 UTC, 20 replies.
- Building and Running Spark on OS X - posted by Nicholas Chammas <ni...@gmail.com> on 2014/10/21 01:43:55 UTC, 9 replies.
- [MLlib] Contributing Algorithm for Outlier Detection - posted by Ashutosh <as...@iiitb.org> on 2014/10/21 11:23:00 UTC, 9 replies.
- Easy win: SBT plugin config expert to help on SPARK-3359? - posted by Sean Owen <so...@cloudera.com> on 2014/10/21 13:41:07 UTC, 1 replies.
- Which part of the code deals with communication? - posted by Theodore Si <sj...@gmail.com> on 2014/10/22 13:00:08 UTC, 1 replies.
- Graphx connectComponents API - posted by Manoj Awasthi <aw...@gmail.com> on 2014/10/22 15:26:14 UTC, 1 replies.
- SPARK-3299 jira task question - posted by Bill Bejeck <bb...@gmail.com> on 2014/10/22 18:52:19 UTC, 0 replies.
- Multitenancy in Spark - within/across spark context - posted by Ashwin Shankar <as...@gmail.com> on 2014/10/22 20:47:21 UTC, 7 replies.
- Fwd: Sharing spark context across multiple spark sql cli initializations - posted by Sadhan Sood <sa...@gmail.com> on 2014/10/22 21:18:26 UTC, 1 replies.
- Development testing code - posted by catchmonster <sk...@gmail.com> on 2014/10/23 01:13:21 UTC, 1 replies.
- Exception while running unit tests that makes use of local-cluster mode - posted by Varadharajan Mukundan <sr...@gmail.com> on 2014/10/23 08:34:43 UTC, 1 replies.
- PR for Hierarchical Clustering Needs Review - posted by RJ Nowling <rn...@gmail.com> on 2014/10/23 11:58:47 UTC, 2 replies.
- Memory - posted by Tom Hubregtsen <th...@gmail.com> on 2014/10/23 12:35:39 UTC, 0 replies.
- Receiver/DStream storage level - posted by Michael Allman <mi...@videoamp.com> on 2014/10/23 16:43:10 UTC, 0 replies.
- scalastyle annoys me a little bit - posted by Koert Kuipers <ko...@tresata.com> on 2014/10/23 20:03:24 UTC, 13 replies.
- Spark 1.2 feature freeze on November 1 - posted by Patrick Wendell <pw...@gmail.com> on 2014/10/23 21:06:29 UTC, 0 replies.
- label points with a given index - posted by Lochana Menikarachchi <lo...@gmail.com> on 2014/10/24 04:27:53 UTC, 1 replies.
- your weekly git timeout update! TL;DR: i'm now almost certain we're not hitting rate limits. - posted by shane knapp <sk...@berkeley.edu> on 2014/10/24 22:32:42 UTC, 1 replies.
- Moving PR Builder to mvn - posted by Hari Shreedharan <hs...@cloudera.com> on 2014/10/24 22:39:13 UTC, 7 replies.
- serialVersionUID incompatible error in class BlockManagerId - posted by Qiuzhuang Lian <qi...@gmail.com> on 2014/10/25 03:16:59 UTC, 4 replies.
- Matix operations in Scala \ Spark - posted by salexln <sa...@gmail.com> on 2014/10/25 15:03:23 UTC, 2 replies.
- Potential areas for working - posted by Vibhanshu Prasad <vi...@gmail.com> on 2014/10/26 10:43:31 UTC, 2 replies.
- best IDE for scala + spark development? - posted by ll <du...@gmail.com> on 2014/10/26 16:07:20 UTC, 14 replies.
- Build with Hive 0.13.1 doesn't have datanucleus and parquet dependencies. - posted by Jianshi Huang <ji...@gmail.com> on 2014/10/27 09:47:40 UTC, 2 replies.
- jenkins downtime tomorrow morning ~6am-8am PDT - posted by shane knapp <sk...@berkeley.edu> on 2014/10/27 18:46:32 UTC, 1 replies.
- jenkins emergency restart now, was Re: jenkins downtime tomorrow morning ~6am-8am PDT - posted by shane knapp <sk...@berkeley.edu> on 2014/10/27 21:24:25 UTC, 1 replies.
- HiveContext bug? - posted by Marcelo Vanzin <va...@cloudera.com> on 2014/10/27 23:25:27 UTC, 2 replies.
- Workaround for python's inability to unzip zip64 spark assembly jar - posted by Rahul Singhal <Ra...@guavus.com> on 2014/10/28 06:39:04 UTC, 0 replies.
- Re: Support Hive 0.13 .1 in Spark SQL - posted by Patrick Wendell <pw...@gmail.com> on 2014/10/28 07:51:57 UTC, 0 replies.
- How to run tests properly? - posted by Niklas Wilcke <1w...@informatik.uni-hamburg.de> on 2014/10/28 18:18:11 UTC, 7 replies.
- Breeze::DiffFunction not serializable - posted by Xuepeng Sun <xs...@yahoo.com> on 2014/10/28 20:53:45 UTC, 0 replies.
- HiveShim not found when building in Intellij - posted by Stephen Boesch <ja...@gmail.com> on 2014/10/29 03:42:48 UTC, 16 replies.
- matrix factorization cross validation - posted by Debasish Das <de...@gmail.com> on 2014/10/29 19:23:57 UTC, 11 replies.
- Registering custom metrics - posted by Gerard Maas <ge...@gmail.com> on 2014/10/30 21:53:35 UTC, 0 replies.
- Surprising Spark SQL benchmark - posted by Nicholas Chammas <ni...@gmail.com> on 2014/10/31 18:38:20 UTC, 5 replies.
- Spark consulting - posted by Alessandro Baretta <al...@gmail.com> on 2014/10/31 21:35:04 UTC, 3 replies.
- Parquet Migrations - posted by Gary Malouf <ma...@gmail.com> on 2014/10/31 21:49:56 UTC, 1 replies.