- Downloading Hadoop from s3://spark-related-packages/ - posted by Nicholas Chammas <ni...@gmail.com> on 2015/11/01 04:17:52 UTC, 8 replies.
- Re: If you use Spark 1.5 and disabled Tungsten mode ... - posted by Reynold Xin <rx...@databricks.com> on 2015/11/01 08:22:39 UTC, 1 replies.
- unscribe - posted by Chenxi Li <sp...@gmail.com> on 2015/11/01 09:09:58 UTC, 1 replies.
- Re: Spark 1.6 Release Schedule - posted by Sean Owen <so...@cloudera.com> on 2015/11/01 13:16:49 UTC, 1 replies.
- Some spark apps fail with "All masters are unresponsive", while others pass normally - posted by Romi Kuntsman <ro...@totango.com> on 2015/11/01 17:08:23 UTC, 5 replies.
- Re: [Spark MLlib] about linear regression issue - posted by DB Tsai <db...@dbtsai.com> on 2015/11/02 04:12:14 UTC, 0 replies.
- Implementation of RNN/LSTM in Spark - posted by Disha Shrivastava <di...@gmail.com> on 2015/11/02 05:59:34 UTC, 5 replies.
- Re: Unable to run applications on spark in standalone cluster mode - posted by Akhil Das <ak...@sigmoidanalytics.com> on 2015/11/02 08:48:59 UTC, 1 replies.
- Re: Getting Started - posted by Romi Kuntsman <ro...@totango.com> on 2015/11/02 09:12:40 UTC, 1 replies.
- Lead operator not working as aggregation operator - posted by Shagun Sodhani <ss...@gmail.com> on 2015/11/02 11:33:56 UTC, 3 replies.
- Re: Ability to offer initial coefficients in ml.LogisticRegression - posted by YiZhi Liu <ja...@gmail.com> on 2015/11/02 16:32:59 UTC, 4 replies.
- Re: test failed due to OOME - posted by Ted Yu <yu...@gmail.com> on 2015/11/02 16:42:52 UTC, 2 replies.
- [BUILD SYSTEM] quick jenkins downtime, november 5th 7am - posted by shane knapp <sk...@berkeley.edu> on 2015/11/02 18:55:11 UTC, 11 replies.
- Re: Guaranteed processing orders of each batch in Spark Streaming - posted by Renjie Liu <li...@gmail.com> on 2015/11/03 08:06:29 UTC, 0 replies.
- Anyone has perfect solution for spark source code compilation issue on intellij - posted by canan chen <cc...@gmail.com> on 2015/11/03 09:25:07 UTC, 1 replies.
- Running individual test classes - posted by Stefano Baghino <st...@radicalbit.io> on 2015/11/03 09:27:24 UTC, 7 replies.
- Unchecked contribution (JIRA and PR) - posted by Sergio Ramírez <sr...@ugr.es> on 2015/11/03 11:49:33 UTC, 6 replies.
- Extracting RDD of values per key from PairRDD - posted by Deepak Gopalakrishnan <dg...@gmail.com> on 2015/11/03 11:50:00 UTC, 1 replies.
- Master build fails ? - posted by Jeff Zhang <zj...@gmail.com> on 2015/11/03 11:50:50 UTC, 25 replies.
- Re: SparkLauncher#setJavaHome does not set JAVA_HOME in child process - posted by gus <gu...@gmail.com> on 2015/11/03 13:25:28 UTC, 1 replies.
- Re: Off-heap storage and dynamic allocation - posted by Ryan Williams <ry...@gmail.com> on 2015/11/03 16:57:41 UTC, 7 replies.
- Frozen exception while dynamically creating classes inside Spark using JavaAssist API - posted by Rachana Srivastava <Ra...@markmonitor.com> on 2015/11/03 19:52:29 UTC, 0 replies.
- Re: Pickle Spark DataFrame - posted by Justin Uang <ju...@gmail.com> on 2015/11/03 22:17:49 UTC, 0 replies.
- Info about Dataset - posted by Justin Uang <ju...@gmail.com> on 2015/11/03 22:41:05 UTC, 1 replies.
- [VOTE] Release Apache Spark 1.5.2 (RC2) - posted by Reynold Xin <rx...@databricks.com> on 2015/11/04 00:22:28 UTC, 22 replies.
- Please reply if you use Mesos fine grained mode - posted by Reynold Xin <rx...@databricks.com> on 2015/11/04 00:54:06 UTC, 9 replies.
- Getting new metrics into /api/v1 - posted by Charles Yeh <ch...@eactiv.com> on 2015/11/04 07:24:39 UTC, 1 replies.
- Codegen In Shuffle - posted by 牛兆捷 <nz...@gmail.com> on 2015/11/04 09:21:36 UTC, 2 replies.
- Build a specific module only - posted by gsvic <vi...@gmail.com> on 2015/11/04 12:27:49 UTC, 2 replies.
- Re: PMML version in MLLib - posted by Fazlan Nazeem <fa...@wso2.com> on 2015/11/04 12:42:05 UTC, 5 replies.
- Looking for the method executors uses to write to HDFS - posted by Tóth Zoltán <tz...@looper.hu> on 2015/11/04 14:11:43 UTC, 1 replies.
- Sort Merge Join from the filesystem - posted by Alex Nastetsky <al...@vervemobile.com> on 2015/11/04 15:37:53 UTC, 5 replies.
- How to force statistics calculation of Dataframe? - posted by Charmee Patel <ch...@gmail.com> on 2015/11/04 19:19:27 UTC, 3 replies.
- Why LibSVMRelation and CsvRelation don't extends HadoopFsRelation ? - posted by Jeff Zhang <zj...@gmail.com> on 2015/11/05 03:30:51 UTC, 6 replies.
- Fwd: dataframe slow down with tungsten turn on - posted by gen tang <ge...@gmail.com> on 2015/11/05 05:43:00 UTC, 4 replies.
- pyspark with pypy not work for spark-1.5.1 - posted by Chang Ya-Hsuan <su...@gmail.com> on 2015/11/05 08:56:30 UTC, 6 replies.
- A New Global Numerical Optimization Algo - posted by Shouheng Yi <sh...@gmail.com> on 2015/11/05 09:19:37 UTC, 0 replies.
- Recommended change to core-site.xml template - posted by Christian <en...@gmail.com> on 2015/11/05 17:25:17 UTC, 10 replies.
- Need advice on hooking into Sql query plan - posted by Yana Kadiyska <ya...@gmail.com> on 2015/11/05 23:34:01 UTC, 3 replies.
- State of the Build - posted by Jakob Odersky <jo...@gmail.com> on 2015/11/06 00:38:49 UTC, 16 replies.
- GraphX EdgePartition format - posted by Daniel Margo <dm...@eecs.harvard.edu> on 2015/11/06 12:35:01 UTC, 0 replies.
- Ready to talk about Spark 2.0? - posted by Sean Owen <so...@cloudera.com> on 2015/11/06 13:44:15 UTC, 7 replies.
- Build fails due to...multiple overloaded alternatives of constructor RDDInfo define default arguments? - posted by Jacek Laskowski <ja...@japila.pl> on 2015/11/07 13:41:04 UTC, 2 replies.
- Calling stop on StreamingContext locks up - posted by vonnagy <iv...@vadio.com> on 2015/11/07 21:17:58 UTC, 2 replies.
- [build system] emergency restart to temporarily patch a massive java security hole - posted by shane knapp <sk...@berkeley.edu> on 2015/11/08 23:53:07 UTC, 1 replies.
- Re: [VOTE] Release Apache Spark 1.5.2 (RC2) - posted by 欧锐 <49...@qq.com> on 2015/11/09 04:28:57 UTC, 0 replies.
- OLAP query using spark dataframe with cassandra - posted by "fightfate@163.com" <fi...@163.com> on 2015/11/09 07:02:47 UTC, 8 replies.
- Wrap an RDD with a ShuffledRDD - posted by Muhammad Haseeb Javed <11...@seecs.edu.pk> on 2015/11/09 08:41:48 UTC, 0 replies.
- Re: sample or takeSample or ?? - posted by Akhil Das <ak...@sigmoidanalytics.com> on 2015/11/09 14:31:21 UTC, 0 replies.
- Re: Guidance to get started - posted by Akhil Das <ak...@sigmoidanalytics.com> on 2015/11/09 14:33:42 UTC, 0 replies.
- Re: [VOTE] Release Apache Spark 1.5.2 (RC2) - posted by Ricky <49...@qq.com> on 2015/11/09 15:57:02 UTC, 1 replies.
- Re: Block Transfer Service encryption support - posted by turp1twin <tu...@gmail.com> on 2015/11/09 19:32:27 UTC, 2 replies.
- [build system] shane OOO until monday, nov 16 - posted by shane knapp <sk...@berkeley.edu> on 2015/11/09 23:36:28 UTC, 0 replies.
- ml.feature.Word2Vec.transform() very slow issue - posted by Yuming Wang <q7...@gmail.com> on 2015/11/10 06:08:30 UTC, 2 replies.
- Support for views/ virtual tables in SparkSQL - posted by Sudhir Menon <sm...@pivotal.io> on 2015/11/10 06:34:24 UTC, 3 replies.
- [ANNOUNCE] Announcing Spark 1.5.2 - posted by Reynold Xin <rx...@databricks.com> on 2015/11/10 17:49:31 UTC, 1 replies.
- SPARK-11638: Run Spark on Mesos, in Docker with Bridge networking - posted by Rad Gruchalski <ra...@gruchalski.com> on 2015/11/10 23:52:56 UTC, 0 replies.
- A proposal for Spark 2.0 - posted by Reynold Xin <rx...@databricks.com> on 2015/11/11 00:10:55 UTC, 55 replies.
- Why there's no api for SparkContext#textFiles to support multiple inputs ? - posted by Jeff Zhang <zj...@gmail.com> on 2015/11/11 10:20:48 UTC, 7 replies.
- Map Tasks - Disk I/O - posted by gsvic <vi...@gmail.com> on 2015/11/11 13:35:07 UTC, 0 replies.
- Support for local disk columnar storage for DataFrames - posted by Cristian O <cr...@googlemail.com> on 2015/11/11 13:59:35 UTC, 8 replies.
- Choreographing a Kryo update - posted by Steve Loughran <st...@hortonworks.com> on 2015/11/11 23:01:40 UTC, 1 replies.
- Proposal for SQL join optimization - posted by Zhan Zhang <zz...@hortonworks.com> on 2015/11/11 23:45:30 UTC, 2 replies.
- [build system] short jenkins downtime tomorrow morning, 11-13-2015 @ 7am PST - posted by shane knapp <sk...@berkeley.edu> on 2015/11/12 21:14:48 UTC, 3 replies.
- Seems jenkins is down (or very slow)? - posted by Yin Huai <yh...@databricks.com> on 2015/11/13 03:21:54 UTC, 5 replies.
- let spark streaming sample come to stop - posted by Renyi Xiong <re...@gmail.com> on 2015/11/13 19:52:39 UTC, 1 replies.
- SparkPullRequestBuilder coverage - posted by Ted Yu <yu...@gmail.com> on 2015/11/13 20:17:52 UTC, 3 replies.
- Spark 1.4.2 release and votes conversation? - posted by Andrew Lee <al...@hotmail.com> on 2015/11/13 22:00:51 UTC, 0 replies.
- Re: Spark 1.4.2 release and votes conversation? - posted by Reynold Xin <rx...@databricks.com> on 2015/11/13 22:30:24 UTC, 2 replies.
- Incubator Proposal for Spark-Kernel - posted by DavidFallside <da...@fallside.com> on 2015/11/13 22:58:21 UTC, 0 replies.
- Problem with Breadcast variable not deserialized - posted by Federico Bertola <fe...@gmail.com> on 2015/11/13 23:30:44 UTC, 0 replies.
- Re: spark 1.4 GC issue - posted by Ted Yu <yu...@gmail.com> on 2015/11/15 11:17:16 UTC, 0 replies.
- Are map tasks spilling data to disk? - posted by gsvic <vi...@gmail.com> on 2015/11/15 19:52:29 UTC, 2 replies.
- Map Tasks - Disk Spill (?) - posted by gsvic <vi...@gmail.com> on 2015/11/15 19:53:25 UTC, 0 replies.
- Hive Context incompatible with Sentry enabled Cluster - posted by Charmee Patel <ch...@gmail.com> on 2015/11/16 06:40:10 UTC, 0 replies.
- Hive on Spark Vs Spark SQL - posted by kiran lonikar <lo...@gmail.com> on 2015/11/16 07:37:17 UTC, 3 replies.
- releasing Spark 1.4.2 - posted by Niranda Perera <ni...@gmail.com> on 2015/11/16 07:53:47 UTC, 1 replies.
- Does anyone meet the issue that jars under lib_managed is never downloaded ? - posted by Jeff Zhang <zj...@gmail.com> on 2015/11/16 09:03:56 UTC, 10 replies.
- Streaming Receiverless Kafka API + Offset Management - posted by Nick Evans <me...@nicolasevans.org> on 2015/11/16 16:53:06 UTC, 3 replies.
- Persisting DStreams - posted by "Fernando O." <fo...@gmail.com> on 2015/11/16 19:38:15 UTC, 1 replies.
- Re: slightly more informative error message in MLUtils.loadLibSVMFile - posted by Joseph Bradley <jo...@databricks.com> on 2015/11/17 00:43:54 UTC, 1 replies.
- Re: Spark Implementation of XGBoost - posted by Joseph Bradley <jo...@databricks.com> on 2015/11/17 00:54:02 UTC, 0 replies.
- Mesos cluster dispatcher doesn't respect most args from the submit req - posted by Jo Voordeckers <jo...@gmail.com> on 2015/11/17 02:46:36 UTC, 4 replies.
- Add a function to support Google's Word2Vec - posted by yuming wang <wg...@gmail.com> on 2015/11/17 13:10:44 UTC, 1 replies.
- Fwd: zeppelin (or spark-shell) with HBase fails on executor level - posted by 임정택 <ka...@gmail.com> on 2015/11/18 05:26:15 UTC, 0 replies.
- How to Add builtin geometry type to SparkSQL? - posted by ddcd <ze...@gmail.com> on 2015/11/18 13:22:01 UTC, 1 replies.
- Re: orc read issue n spark - posted by Reynold Xin <rx...@databricks.com> on 2015/11/18 18:19:36 UTC, 0 replies.
- Spark Summit East 2016 CFP - Closing in 5 days - posted by Scott walent <sc...@gmail.com> on 2015/11/18 19:58:45 UTC, 0 replies.
- FW: SequenceFile and object reuse - posted by jeff saremi <je...@hotmail.com> on 2015/11/19 05:04:09 UTC, 3 replies.
- Hash Partitioning & Sort Merge Join - posted by gsvic <vi...@gmail.com> on 2015/11/19 08:02:39 UTC, 0 replies.
- Removing the Mesos fine-grained mode - posted by Iulian Dragoș <iu...@typesafe.com> on 2015/11/19 12:42:47 UTC, 13 replies.
- new datasource - posted by "james.green9@baesystems.com" <ja...@baesystems.com> on 2015/11/19 16:14:28 UTC, 3 replies.
- spark-submit is throwing NPE when trying to submit a random forest model - posted by Rachana Srivastava <Ra...@markmonitor.com> on 2015/11/19 22:21:35 UTC, 1 replies.
- Dropping support for earlier Hadoop versions in Spark 2.0? - posted by Reynold Xin <rx...@databricks.com> on 2015/11/19 23:14:44 UTC, 12 replies.
- Re: Dropping support for earlier Hadoop versions in Spark 2.0? - posted by "张志强(旺轩)" <zz...@alibaba-inc.com> on 2015/11/20 02:16:36 UTC, 0 replies.
- Unhandled case in VectorAssembler - posted by BenFradet <be...@gmail.com> on 2015/11/20 23:39:33 UTC, 2 replies.
- Using spark MLlib without installing Spark - posted by bowen zhang <bo...@yahoo.com.INVALID> on 2015/11/22 00:38:30 UTC, 6 replies.
- [ANNOUNCE] Spark 1.6.0 Release Preview - posted by Michael Armbrust <mi...@databricks.com> on 2015/11/22 23:21:52 UTC, 3 replies.
- Re: Bringing up JDBC Tests to trunk - posted by Luciano Resende <lu...@gmail.com> on 2015/11/23 03:49:03 UTC, 1 replies.
- Spark-1.6.0-preview2 trackStateByKey exception restoring state - posted by jan <ja...@insidin.com> on 2015/11/23 16:22:18 UTC, 1 replies.
- question about combining small input splits - posted by Nezih <ny...@netflix.com> on 2015/11/23 22:24:01 UTC, 0 replies.
- why does shuffle in spark write shuffle data to disk by default? - posted by huan zhang <zh...@gmail.com> on 2015/11/24 02:36:58 UTC, 1 replies.
- Datasets on experimental dataframes? - posted by Jakob Odersky <jo...@gmail.com> on 2015/11/24 02:59:28 UTC, 1 replies.
- Fastest way to build Spark from scratch - posted by Nicholas Chammas <ni...@gmail.com> on 2015/11/24 05:18:05 UTC, 0 replies.
- Re: load multiple directory using dataframe load - posted by Fengdong Yu <fe...@everstring.com> on 2015/11/24 06:19:02 UTC, 0 replies.
- what should I know to implement twitter streaming for pyspark? - posted by Amir Rahnama <am...@gmail.com> on 2015/11/24 10:32:42 UTC, 1 replies.
- Streaming : stopping output transformations explicitly - posted by Yogesh Mahajan <ym...@snappydata.io> on 2015/11/24 13:09:04 UTC, 0 replies.
- sqlContext vs hivecontext - posted by Pranay Tonpay <pt...@gmail.com> on 2015/11/24 20:21:52 UTC, 0 replies.
- pyspark does not seem to start py4j callback server - posted by girishlg <gi...@gmail.com> on 2015/11/25 01:18:42 UTC, 0 replies.
- Spark checkpoint problem - posted by "wyphao.2007" <wy...@163.com> on 2015/11/25 12:06:56 UTC, 4 replies.
- [ANNOUNCE] CFP open for ApacheCon North America 2016 - posted by Rich Bowen <rb...@rcbowen.com> on 2015/11/25 18:32:10 UTC, 0 replies.
- VerifyError running Spark SQL code? - posted by Marcelo Vanzin <va...@cloudera.com> on 2015/11/26 01:51:33 UTC, 3 replies.
- How to add 1.5.2 support to ec2/spark_ec2.py ? - posted by Alexander Pivovarov <ap...@gmail.com> on 2015/11/26 06:19:37 UTC, 0 replies.
- Incremental Analysis with Spark - posted by Sachith Withana <sw...@gmail.com> on 2015/11/26 06:46:57 UTC, 1 replies.
- (Unknown) - posted by Dmitry Tolpeko <dm...@gmail.com> on 2015/11/26 13:46:48 UTC, 0 replies.
- question about combining small parquet files - posted by Nezih Yigitbasi <ny...@netflix.com.INVALID> on 2015/11/26 18:43:38 UTC, 1 replies.
- NettyRpcEnv adverisedPort - posted by Rad Gruchalski <ra...@gruchalski.com> on 2015/11/26 20:45:35 UTC, 2 replies.
- SparkR read.df Option type doesn't match - posted by liushiqi9 <sl...@phemi.com> on 2015/11/26 20:56:58 UTC, 3 replies.
- Grid search with Random Forest - posted by Ndjido Ardo Bar <nd...@gmail.com> on 2015/11/26 21:53:05 UTC, 0 replies.
- steamingContext stop gracefully failed in yarn-cluster mode - posted by "qinggangwang7@gmail.com" <qi...@gmail.com> on 2015/11/27 01:55:19 UTC, 0 replies.
- tests blocked at "don't call ssc.stop in listener" - posted by Nan Zhu <zh...@gmail.com> on 2015/11/27 03:22:17 UTC, 2 replies.
- Subtract implementation using broadcast - posted by Justin Uang <ju...@gmail.com> on 2015/11/27 16:19:03 UTC, 2 replies.
- Problem in running MLlib SVM - posted by Tarek Elgamal <ta...@gmail.com> on 2015/11/28 03:39:59 UTC, 4 replies.
- Export BLAS module on Spark MLlib - posted by Sasaki Kai <le...@me.com> on 2015/11/28 14:20:30 UTC, 2 replies.
- FuzzyCMeans Implementation - posted by salexln <sa...@gmail.com> on 2015/11/29 11:39:17 UTC, 0 replies.
- Need suggestions on monitor Spark progress - posted by Yuhao Yang <hh...@gmail.com> on 2015/11/29 15:12:31 UTC, 2 replies.