- Issue with Spark on EC2 using spark-ec2 script - posted by Ryan Tabora <ra...@gmail.com> on 2014/08/01 00:04:41 UTC, 6 replies.
- makeLinkRDDs in MLlib ALS - posted by alwayforver <wa...@gmail.com> on 2014/08/01 00:39:29 UTC, 0 replies.
- Readin from Amazon S3 behaves inconsistently: return different number of lines... - posted by nit <ni...@gmail.com> on 2014/08/01 01:37:38 UTC, 3 replies.
- Re: Installing Spark 0.9.1 on EMR Cluster - posted by Rahul Bhojwani <ra...@gmail.com> on 2014/08/01 01:39:44 UTC, 2 replies.
- Spark job finishes then command shell is blocked/hangs? - posted by bumble123 <tc...@att.com> on 2014/08/01 01:51:00 UTC, 2 replies.
- Re: Example standalone app error! - posted by Tathagata Das <ta...@gmail.com> on 2014/08/01 02:36:40 UTC, 1 replies.
- Re: spark.scheduler.pool seems not working in spark streaming - posted by Tathagata Das <ta...@gmail.com> on 2014/08/01 02:37:29 UTC, 1 replies.
- sbt package failed: wrong libraryDependencies for spark-streaming? - posted by durin <ma...@simon-schaefer.net> on 2014/08/01 02:48:07 UTC, 5 replies.
- Accessing spark context from executors? - posted by Sung Hwan Chung <co...@cs.stanford.edu> on 2014/08/01 03:16:04 UTC, 2 replies.
- Re: spark.shuffle.consolidateFiles seems not working - posted by Aaron Davidson <il...@gmail.com> on 2014/08/01 03:58:12 UTC, 1 replies.
- Extracting an element from the feature vector in LabeledPoint - posted by SK <sk...@gmail.com> on 2014/08/01 04:01:41 UTC, 5 replies.
- Re: HiveContext is creating metastore warehouse locally instead of in hdfs - posted by chenjie <ch...@gmail.com> on 2014/08/01 04:09:06 UTC, 2 replies.
- spark-submit registers the driver twice - posted by salemi <al...@udo.edu> on 2014/08/01 04:16:06 UTC, 0 replies.
- Re: Fwd: pyspark crash on mesos - posted by daijia <ji...@intsig.com> on 2014/08/01 04:29:21 UTC, 1 replies.
- Re: SQLCtx cacheTable - posted by Michael Armbrust <mi...@databricks.com> on 2014/08/01 04:42:29 UTC, 5 replies.
- [GraphX] The best way to construct a graph - posted by Bin <wu...@126.com> on 2014/08/01 05:23:49 UTC, 5 replies.
- Re:Re: java.util.concurrent.TimeoutException: Futures timed out after [30 seconds] - posted by Bin <wu...@126.com> on 2014/08/01 05:28:31 UTC, 1 replies.
- Re: configuration needed to run twitter(25GB) dataset - posted by shijiaxin <sh...@gmail.com> on 2014/08/01 06:40:39 UTC, 3 replies.
- Issue using kryo serilization - posted by gpatcham <gp...@gmail.com> on 2014/08/01 07:23:58 UTC, 5 replies.
- Re: java.lang.OutOfMemoryError: Java heap space - posted by Haiyang Fu <ha...@gmail.com> on 2014/08/01 07:29:13 UTC, 1 replies.
- graphx and subgraph query - posted by dizzy5112 <da...@gmail.com> on 2014/08/01 08:42:42 UTC, 0 replies.
- Re: Ports required for running spark - posted by Andrew Ash <an...@andrewash.com> on 2014/08/01 08:46:27 UTC, 0 replies.
- Re: Hbase - posted by Akhil Das <ak...@sigmoidanalytics.com> on 2014/08/01 08:47:44 UTC, 3 replies.
- RDD to DStream - posted by Aniket Bhatnagar <an...@gmail.com> on 2014/08/01 10:25:36 UTC, 4 replies.
- Iterator over RDD in PySpark - posted by Andrei <fa...@gmail.com> on 2014/08/01 10:38:05 UTC, 4 replies.
- Spark-sql with Tachyon cache - posted by Dariusz Kobylarz <da...@gmail.com> on 2014/08/01 10:42:58 UTC, 1 replies.
- Environment Variables question - posted by redocpot <ju...@gmail.com> on 2014/08/01 11:02:39 UTC, 0 replies.
- Reading from HDFS no faster than reading from S3 - how to tell if data locality respected? - posted by Martin Goodson <ma...@skimlinks.com> on 2014/08/01 11:44:00 UTC, 1 replies.
- Compiling Spark master (284771ef) with sbt/sbt assembly fails on EC2 - posted by Ankur Dave <an...@gmail.com> on 2014/08/01 12:42:17 UTC, 9 replies.
- Spark 0.9.2 sbt build issue - posted by Arun Kumar <to...@gmail.com> on 2014/08/01 13:16:22 UTC, 0 replies.
- Re: access hdfs file name in map() - posted by Roberto Torella <ro...@gmail.com> on 2014/08/01 13:38:30 UTC, 2 replies.
- Re: streaming window not behaving as advertised (v1.0.1) - posted by Venkat Subramanian <vs...@gmail.com> on 2014/08/01 16:12:47 UTC, 2 replies.
- Spark SQL, Parquet and Impala - posted by Patrick McGloin <mc...@gmail.com> on 2014/08/01 16:18:02 UTC, 4 replies.
- Tasks fail when ran in cluster but they work fine when submited using local local - posted by salemi <al...@udo.edu> on 2014/08/01 16:30:24 UTC, 1 replies.
- Accumulator and Accumulable vs classic MR - posted by Julien Naour <ju...@gmail.com> on 2014/08/01 16:38:20 UTC, 1 replies.
- Re: how to publish spark inhouse? - posted by Koert Kuipers <ko...@tresata.com> on 2014/08/01 17:38:46 UTC, 0 replies.
- Re: Number of partitions and Number of concurrent tasks - posted by Daniel Siegmann <da...@velos.io> on 2014/08/01 17:49:53 UTC, 2 replies.
- What should happen if we try to cache more data than the cluster can hold in memory? - posted by Nicholas Chammas <ni...@gmail.com> on 2014/08/01 18:24:22 UTC, 5 replies.
- RE: Data from Mysql using JdbcRDD - posted by srinivas <ku...@gmail.com> on 2014/08/01 18:25:30 UTC, 1 replies.
- Spark SQL Query Plan optimization - posted by "N.Venkata Naga Ravi" <nv...@hotmail.com> on 2014/08/01 19:13:41 UTC, 1 replies.
- Spark Streaming : Could not compute split, block not found - posted by Kanwaldeep <ka...@gmail.com> on 2014/08/01 19:55:24 UTC, 9 replies.
- persisting RDD in memory - posted by Sujee Maniyam <su...@sujee.net> on 2014/08/01 19:59:07 UTC, 1 replies.
- Re: Deploying spark applications from within Eclipse? - posted by nunarob <ro...@nunahealth.com> on 2014/08/01 20:42:32 UTC, 0 replies.
- Computing mean and standard deviation by key - posted by kriskalish <kr...@kalish.net> on 2014/08/01 20:55:09 UTC, 12 replies.
- correct upgrade process - posted by SK <sk...@gmail.com> on 2014/08/01 20:59:42 UTC, 2 replies.
- spark sql - posted by Madabhattula Rajesh Kumar <mr...@gmail.com> on 2014/08/01 22:32:11 UTC, 3 replies.
- creating a distributed index - posted by Philip Ogren <ph...@oracle.com> on 2014/08/01 22:50:22 UTC, 5 replies.
- How to read from OpenTSDB using PySpark (or Scala Spark)? - posted by bumble123 <tc...@att.com> on 2014/08/01 23:39:07 UTC, 6 replies.
- Re: Is there a way to write spark RDD to Avro files - posted by touchdown <yu...@gmail.com> on 2014/08/02 00:26:07 UTC, 4 replies.
- Spark ReduceByKey - Working in Java - posted by Anil Karamchandani <an...@gmail.com> on 2014/08/02 10:55:21 UTC, 1 replies.
- GraphX - posted by Deep Pradhan <pr...@gmail.com> on 2014/08/02 17:55:42 UTC, 5 replies.
- [GraphX] how to compute only a subset of vertices in the whole graph? - posted by Yifan LI <ia...@gmail.com> on 2014/08/02 19:04:22 UTC, 1 replies.
- Low Level Kafka Consumer for Spark - posted by Dibyendu Bhattacharya <di...@gmail.com> on 2014/08/02 19:09:42 UTC, 24 replies.
- spark-shell ip address issues - posted by Mohit Jaggi <mo...@gmail.com> on 2014/08/03 06:50:18 UTC, 0 replies.
- GraphX runs without Spark? - posted by Deep Pradhan <pr...@gmail.com> on 2014/08/03 09:44:52 UTC, 2 replies.
- Re: Starting with spark - posted by Mahebub Sayyed <ma...@gmail.com> on 2014/08/03 11:09:14 UTC, 1 replies.
- error while running kafka-spark-example - posted by Mahebub Sayyed <ma...@gmail.com> on 2014/08/03 13:47:11 UTC, 2 replies.
- pyspark script fails on EMR with an ERROR in configuring object. - posted by Rahul Bhojwani <ra...@gmail.com> on 2014/08/03 14:34:28 UTC, 3 replies.
- disable log4j for spark-shell - posted by Gil Vernik <GI...@il.ibm.com> on 2014/08/03 15:07:51 UTC, 4 replies.
- Writing to RabbitMQ - posted by jschindler <jo...@utexas.edu> on 2014/08/03 19:31:03 UTC, 8 replies.
- Cached RDD Block Size - Uneven Distribution - posted by iramaraju <ir...@gmail.com> on 2014/08/03 22:19:51 UTC, 2 replies.
- Re: How to share a NonSerializable variable among tasks in the same worker node? - posted by Fengyun RAO <ra...@gmail.com> on 2014/08/04 03:38:19 UTC, 9 replies.
- Kafka and Spark application after polling twice. - posted by salemi <al...@udo.edu> on 2014/08/04 05:26:54 UTC, 0 replies.
- MLLib: implementing ALS with distributed matrix - posted by Wei Tan <wt...@us.ibm.com> on 2014/08/04 05:39:02 UTC, 4 replies.
- Timing the codes in GraphX - posted by Deep Pradhan <pr...@gmail.com> on 2014/08/04 06:37:59 UTC, 1 replies.
- NoClassDefFoundError: org/codehaus/jackson/annotate/JsonClass with spark-submit - posted by Ryan Braley <ry...@traintracks.io> on 2014/08/04 07:28:51 UTC, 3 replies.
- Re: Spark SQL 1.0.1 error on reading fixed length byte array - posted by Pei-Lun Lee <pl...@appier.com> on 2014/08/04 08:53:30 UTC, 0 replies.
- Issues with HDP 2.4.0.2.1.3.0-563 - posted by Ron's Yahoo! <zl...@yahoo.com.INVALID> on 2014/08/04 09:13:35 UTC, 13 replies.
- Re: Compiling Spark master (6ba6c3eb) with sbt/sbt assembly - posted by Larry Xiao <xi...@sjtu.edu.cn> on 2014/08/04 10:44:14 UTC, 1 replies.
- Re: Bad Digest error while doing aws s3 put - posted by lmk <la...@gmail.com> on 2014/08/04 10:45:33 UTC, 4 replies.
- Spark on HDFS with replication - posted by Deep Pradhan <pr...@gmail.com> on 2014/08/04 11:51:45 UTC, 2 replies.
- Can we throttle the individual queries using SPARK - posted by Mahesh Govind <ma...@redknee.com> on 2014/08/04 13:24:43 UTC, 0 replies.
- Spark Training Course? - posted by Chris London <me...@chrislondon.co> on 2014/08/04 14:24:57 UTC, 3 replies.
- [GraphX] How spark parameters relate to Pregel implementation - posted by Bin <wu...@126.com> on 2014/08/04 14:52:26 UTC, 1 replies.
- [SparkStreaming]StackOverflow on restart - posted by Yana Kadiyska <ya...@gmail.com> on 2014/08/04 17:47:46 UTC, 1 replies.
- Re: Using countApproxDistinct in pyspark - posted by Diederik <dv...@gmail.com> on 2014/08/04 18:23:09 UTC, 0 replies.
- Spark SQL cache table using different storage level - posted by chutium <te...@gmail.com> on 2014/08/04 18:33:52 UTC, 0 replies.
- Spark app throwing java.lang.OutOfMemoryError: GC overhead limit exceeded - posted by buntu <bu...@gmail.com> on 2014/08/04 20:03:58 UTC, 1 replies.
- Re: Spark job tracker. - posted by abhiguruvayya <sh...@gmail.com> on 2014/08/04 20:04:39 UTC, 0 replies.
- Spark Streaming fails - where is the problem? - posted by durin <ma...@simon-schaefer.net> on 2014/08/04 20:08:23 UTC, 11 replies.
- Configuration setup and Connection refused - posted by "alamin.ishak" <al...@gmail.com> on 2014/08/04 20:10:00 UTC, 6 replies.
- Re: Can't see any thing one the storage panel of application UI - posted by "anthonyjschulte@gmail.com" <an...@gmail.com> on 2014/08/04 21:10:18 UTC, 5 replies.
- spark streaming kafka - posted by salemi <al...@udo.edu> on 2014/08/04 22:12:44 UTC, 1 replies.
- Spark SQL JDBC - posted by John Omernik <jo...@omernik.com> on 2014/08/04 22:54:05 UTC, 6 replies.
- Create a new object by given classtag - posted by Parthus <pe...@gmail.com> on 2014/08/04 22:59:59 UTC, 4 replies.
- Streaming + SQL : How to resgister a DStream content as a table and access it - posted by salemi <al...@udo.edu> on 2014/08/04 23:45:52 UTC, 1 replies.
- Substring in Spark SQL - posted by Tom <th...@gmail.com> on 2014/08/04 23:52:33 UTC, 2 replies.
- MovieLensALS - Scala Pattern Magic - posted by Steve Nunez <sn...@hortonworks.com> on 2014/08/05 00:17:32 UTC, 2 replies.
- Re: Memory & compute-intensive tasks - posted by rpandya <ra...@iecommerce.com> on 2014/08/05 00:20:33 UTC, 0 replies.
- Running Examples - posted by cetaylor <co...@hp.com> on 2014/08/05 00:37:30 UTC, 0 replies.
- java.lang.IllegalArgumentException: Unable to create serializer "com.esotericsoftware.kryo.serializers.FieldSerializer" - posted by Sameer Tilak <ss...@live.com> on 2014/08/05 01:40:49 UTC, 1 replies.
- Unit Test for Spark Streaming - posted by JiajiaJing <jj...@gmail.com> on 2014/08/05 03:30:50 UTC, 7 replies.
- Visualizing stage & task dependency graph - posted by rpandya <ra...@iecommerce.com> on 2014/08/05 05:55:03 UTC, 3 replies.
- about spark and using machine learning model - posted by Hoai-Thu Vuong <th...@gmail.com> on 2014/08/05 06:28:52 UTC, 2 replies.
- Re: java.lang.IllegalStateException: unread block data while running the sampe WordCount program from Eclipse - posted by nightwolf <ni...@gmail.com> on 2014/08/05 09:15:17 UTC, 0 replies.
- Re: Spark Deployment Patterns - Automated Deployment & Performance Testing - posted by nightwolf <ni...@gmail.com> on 2014/08/05 09:19:15 UTC, 0 replies.
- Spark stream data from kafka topics and output as parquet file on HDFS - posted by rafeeq s <ra...@gmail.com> on 2014/08/05 09:22:15 UTC, 6 replies.
- Running driver/SparkContent locally - posted by nightwolf <ni...@gmail.com> on 2014/08/05 09:22:54 UTC, 1 replies.
- Re: Spark streaming at-least once guarantee - posted by lalit1303 <la...@sigmoidanalytics.com> on 2014/08/05 09:45:15 UTC, 3 replies.
- java.lang.StackOverflowError - posted by Chengi Liu <ch...@gmail.com> on 2014/08/05 10:10:17 UTC, 2 replies.
- Running Hive UDF from spark-shell fails due to datatype issue - posted by visakh <vi...@gmail.com> on 2014/08/05 13:15:04 UTC, 0 replies.
- Understanding RDD.GroupBy OutOfMemory Exceptions - posted by Jens Kristian Geyti <sp...@jkg.dk> on 2014/08/05 15:13:12 UTC, 5 replies.
- Setting spark.executor.memory problem - posted by Grzegorz Białek <gr...@codilime.com> on 2014/08/05 15:43:07 UTC, 3 replies.
- Re: spark sql left join gives KryoException: Buffer overflow - posted by Dima Zhiyanov <Di...@hotmail.com> on 2014/08/05 16:38:10 UTC, 2 replies.
- master=local vs master=local[*] - posted by Grzegorz Białek <gr...@codilime.com> on 2014/08/05 16:43:22 UTC, 2 replies.
- Spark shell creating a local SparkContext instead of connecting to connecting to Spark Master - posted by Aniket Bhatnagar <an...@gmail.com> on 2014/08/05 17:21:24 UTC, 2 replies.
- Spark SQL Thrift Server - posted by John Omernik <jo...@omernik.com> on 2014/08/05 18:02:18 UTC, 2 replies.
- Re: Gradient Boosted Machines - posted by Manish Amde <ma...@gmail.com> on 2014/08/05 18:29:45 UTC, 0 replies.
- graph reduceByKey - posted by Omer Holzinger <om...@gmail.com> on 2014/08/05 19:18:44 UTC, 0 replies.
- Problem running Spark shell (1.0.0) on EMR - posted by Omer Holzinger <om...@gmail.com> on 2014/08/05 19:29:43 UTC, 0 replies.
- pyspark inferSchema - posted by Brad Miller <bm...@eecs.berkeley.edu> on 2014/08/05 19:31:39 UTC, 10 replies.
- Spark Memory Issues - posted by Sunny Khatri <su...@gmail.com> on 2014/08/05 20:08:12 UTC, 6 replies.
- trouble with jsonRDD and jsonFile in pyspark - posted by Brad Miller <bm...@eecs.berkeley.edu> on 2014/08/05 20:55:57 UTC, 13 replies.
- SELECT DISTINCT generates random results? - posted by Nan Zhu <zh...@gmail.com> on 2014/08/05 21:42:12 UTC, 1 replies.
- spark-submit symlink - posted by Koert Kuipers <ko...@tresata.com> on 2014/08/05 22:19:59 UTC, 0 replies.
- spark-ec2 script with VPC - posted by Erik Shilts <er...@opower.com> on 2014/08/05 22:30:46 UTC, 0 replies.
- [PySpark] [SQL] Going from RDD[dict] to SchemaRDD - posted by Nicholas Chammas <ni...@gmail.com> on 2014/08/05 22:30:51 UTC, 2 replies.
- issue with spark and bson input - posted by Dmitriy Selivanov <se...@gmail.com> on 2014/08/05 22:43:00 UTC, 1 replies.
- Re: Include permalinks in mail footer - posted by Nicholas Chammas <ni...@gmail.com> on 2014/08/05 23:03:03 UTC, 3 replies.
- [Streaming] Akka-based receiver with messages defined in uploaded jar - posted by Anton Brazhnyk <an...@genesys.com> on 2014/08/06 00:55:53 UTC, 6 replies.
- python dependencies loaded but not on PYTHONPATH - posted by Dominik Hübner <co...@dhuebner.com> on 2014/08/06 01:12:19 UTC, 0 replies.
- Re: Using sbt-pack with Spark 1.0.0 - posted by lbustelo <gi...@bustelos.com> on 2014/08/06 01:17:53 UTC, 0 replies.
- Can't zip RDDs with unequal numbers of partitions - posted by Bin <wu...@126.com> on 2014/08/06 05:27:30 UTC, 0 replies.
- Save an RDD to a SQL Database - posted by Vida Ha <vi...@gmail.com> on 2014/08/06 05:29:34 UTC, 14 replies.
- type issue: found RDD[T] expected RDD[A] - posted by Amit Kumar <ku...@gmail.com> on 2014/08/06 07:58:27 UTC, 4 replies.
- Problem reading from S3 in standalone application - posted by sparkuser2345 <hm...@gmail.com> on 2014/08/06 09:13:56 UTC, 4 replies.
- [GraphX] Can't zip RDDs with unequal numbers of partitions - posted by Bin <wu...@126.com> on 2014/08/06 10:54:39 UTC, 1 replies.
- fail to run LBFS in 5G KDD data in spark 1.0.1? - posted by "Lizhengbing (bing, BIPA)" <zh...@huawei.com> on 2014/08/06 10:58:27 UTC, 1 replies.
- Spark GraphX remembering old message - posted by Arun Kumar <to...@gmail.com> on 2014/08/06 10:58:55 UTC, 0 replies.
- can't submit my application on standalone spark cluster - posted by Andres Gomez Ferrer <ag...@redborder.net> on 2014/08/06 11:35:09 UTC, 2 replies.
- Spark Streaming multiple streams problem - posted by Laeeq Ahmed <la...@yahoo.com.INVALID> on 2014/08/06 12:08:57 UTC, 1 replies.
- Spark SQL (version 1.1.0-SNAPSHOT) should allow SELECT with duplicated columns - posted by Jianshi Huang <ji...@gmail.com> on 2014/08/06 12:45:42 UTC, 0 replies.
- SparkR : lapplyPartition transforms the data in vertical format - posted by Pranay Dave <pr...@gmail.com> on 2014/08/06 14:47:30 UTC, 5 replies.
- Spark Hbase job taking long time - posted by Amit Singh Hora <ho...@gmail.com> on 2014/08/06 14:54:35 UTC, 2 replies.
- how spark read hdfs data with kerberose - posted by Zhanfeng Huo <hu...@gmail.com> on 2014/08/06 14:57:20 UTC, 0 replies.
- [Streaming] updateStateByKey trouble - posted by Yana Kadiyska <ya...@gmail.com> on 2014/08/06 16:51:28 UTC, 1 replies.
- Runnning a Spark Shell locally against EC2 - posted by Gary Malouf <ma...@gmail.com> on 2014/08/06 17:29:23 UTC, 2 replies.
- PySpark, numpy arrays and binary data - posted by Rok Roskar <ro...@gmail.com> on 2014/08/06 17:41:27 UTC, 3 replies.
- Regarding tooling/performance vs RedShift - posted by Gary Malouf <ma...@gmail.com> on 2014/08/06 18:06:38 UTC, 7 replies.
- Re: Submitting to a cluster behind a VPN, configuring different IP address - posted by nunarob <ro...@nunahealth.com> on 2014/08/06 20:34:13 UTC, 0 replies.
- GraphX Pagerank application - posted by AlexanderRiggers <al...@gmail.com> on 2014/08/06 20:37:38 UTC, 1 replies.
- heterogeneous cluster hardware - posted by "anthonyjschulte@gmail.com" <an...@gmail.com> on 2014/08/06 21:11:26 UTC, 8 replies.
- Spark memory management - posted by Gary Malouf <ma...@gmail.com> on 2014/08/06 21:23:27 UTC, 1 replies.
- Spark build error - posted by Priya Ch <le...@gmail.com> on 2014/08/06 21:34:32 UTC, 0 replies.
- Help with debugging a performance issue - posted by Sameer Tilak <ss...@live.com> on 2014/08/06 22:22:31 UTC, 0 replies.
- UpdateStateByKey - How to improve performance? - posted by Venkat Subramanian <vs...@gmail.com> on 2014/08/06 22:29:55 UTC, 2 replies.
- Re: Stopping StreamingContext does not kill receiver - posted by lbustelo <gi...@bustelos.com> on 2014/08/06 23:41:09 UTC, 4 replies.
- Spark Streaming + reduceByWindow(reduceFunc, invReduceFunc, windowDuration, slideDuration - posted by salemi <al...@udo.edu> on 2014/08/07 00:43:04 UTC, 9 replies.
- Naive Bayes parameters - posted by SK <sk...@gmail.com> on 2014/08/07 00:45:09 UTC, 4 replies.
- Trying to make sense of the actual executed code - posted by Tom <th...@gmail.com> on 2014/08/07 00:55:48 UTC, 2 replies.
- Using Python IDE for Spark Application Development - posted by Sathish Kumaran Vairavelu <vs...@gmail.com> on 2014/08/07 01:16:10 UTC, 5 replies.
- Hive 11 / CDH 4.6/ Spark 0.9.1 dilemmna - posted by Anurag Tangri <at...@groupon.com> on 2014/08/07 01:46:39 UTC, 1 replies.
- Regularization parameters - posted by SK <sk...@gmail.com> on 2014/08/07 03:18:43 UTC, 7 replies.
- PySpark + executor lost - posted by Avishek Saha <av...@gmail.com> on 2014/08/07 04:25:37 UTC, 6 replies.
- memory issue on standalone master - posted by BQ <bq...@gmail.com> on 2014/08/07 04:51:35 UTC, 2 replies.
- Column width limits? - posted by "Daniel, Ronald (ELS-SDG)" <R....@elsevier.com> on 2014/08/07 05:11:42 UTC, 1 replies.
- Re: fail to run LBFS in 5G KDD data in spark 1.0.1? - posted by "Lizhengbing (bing, BIPA)" <zh...@huawei.com> on 2014/08/07 05:41:25 UTC, 0 replies.
- spark-cassandra-connector issue - posted by Gary Zhao <ga...@gmail.com> on 2014/08/07 07:56:32 UTC, 0 replies.
- Spark SQL - posted by "vdiwakar.malladi" <vd...@gmail.com> on 2014/08/07 08:33:52 UTC, 2 replies.
- Spark with HBase - posted by Deepa Jayaveer <de...@tcs.com> on 2014/08/07 10:18:08 UTC, 2 replies.
- Re: Spark Streaming- Input from Kafka, output to HBase - posted by Khanderao Kand <kh...@gmail.com> on 2014/08/07 10:41:46 UTC, 2 replies.
- Where do my partitions go? - posted by losmi83 <mi...@gmail.com> on 2014/08/07 11:20:20 UTC, 3 replies.
- Re: spark streaming actor receiver doesn't play well with kryoserializer - posted by Rohit Rai <ro...@tuplejump.com> on 2014/08/07 11:30:36 UTC, 1 replies.
- Got error “"java.lang.IllegalAccessError" when using HiveContext in Spark shell on AWS - posted by Zhun Shen <sh...@gmail.com> on 2014/08/07 12:18:56 UTC, 1 replies.
- Re: How to read a multipart s3 file? - posted by sparkuser2345 <hm...@gmail.com> on 2014/08/07 13:57:04 UTC, 5 replies.
- reduceByKey to get all associated values - posted by Konstantin Kudryavtsev <ku...@gmail.com> on 2014/08/07 15:18:03 UTC, 4 replies.
- How can I implement eigenvalue decomposition in Spark? - posted by yaochunnan <ya...@gmail.com> on 2014/08/07 15:21:33 UTC, 13 replies.
- Low Performance of Shark over Spark. - posted by vi...@socialinfra.net on 2014/08/07 15:51:48 UTC, 5 replies.
- KMeans Input Format - posted by AlexanderRiggers <al...@gmail.com> on 2014/08/07 16:37:40 UTC, 8 replies.
- [Compile error] Spark 1.0.2 against cloudera 2.0.0-cdh4.6.0 error - posted by linkpatrickliu <li...@live.com> on 2014/08/07 17:11:08 UTC, 4 replies.
- spark streaming multiple file output paths - posted by Chen Song <ch...@gmail.com> on 2014/08/07 17:39:08 UTC, 1 replies.
- JVM Error while building spark - posted by Rasika Pohankar <ra...@gmail.com> on 2014/08/07 17:45:51 UTC, 2 replies.
- Spark 1.0.1 NotSerialized exception (a bit of a head scratcher) - posted by Padmanabhan, "Mahesh  (contractor)" <ma...@twc-contractor.com> on 2014/08/07 18:03:59 UTC, 11 replies.
- Initial job has not accepted any resources - posted by arnaudbriche <br...@gmail.com> on 2014/08/07 18:11:11 UTC, 3 replies.
- Spark Streaming Workflow Validation - posted by "Dan H." <dc...@gmail.com> on 2014/08/07 19:18:07 UTC, 2 replies.
- How to Start a JOB programatically from an EC2 machine? - posted by SankarS <sm...@yahoo.com> on 2014/08/07 19:35:29 UTC, 0 replies.
- trouble with saveAsParquetFile - posted by Brad Miller <bm...@eecs.berkeley.edu> on 2014/08/07 19:55:18 UTC, 4 replies.
- questions about MLLib recommendation models - posted by Jay Hutfles <ja...@gmail.com> on 2014/08/07 22:06:33 UTC, 4 replies.
- Lost executors - posted by rpandya <ra...@iecommerce.com> on 2014/08/07 23:56:26 UTC, 8 replies.
- Missing SparkSQLCLIDriver and Beeline drivers in Spark - posted by ajatix <aj...@sigmoidanalytics.com> on 2014/08/08 00:12:20 UTC, 2 replies.
- Re: All of the tasks have been completed but the Stage is still shown as "Active"? - posted by "anthonyjschulte@gmail.com" <an...@gmail.com> on 2014/08/08 00:21:36 UTC, 0 replies.
- Use SparkStreaming to find the max of a dataset? - posted by bumble123 <tc...@att.com> on 2014/08/08 02:31:08 UTC, 5 replies.
- [MLLib]:choosing the Loss function - posted by SK <sk...@gmail.com> on 2014/08/08 03:31:14 UTC, 5 replies.
- Spark: Could not load native gpl library - posted by Jikai Lei <ha...@gmail.com> on 2014/08/08 03:59:23 UTC, 5 replies.
- Re: Got error“"java.lang.IllegalAccessError" when using HiveContext in Spark shell on AWS - posted by Zhun Shen <sh...@gmail.com> on 2014/08/08 05:04:02 UTC, 0 replies.
- Internal Error: Missing Template ERR_DNS_FAIL - posted by hansen <ha...@neusoft.com> on 2014/08/08 05:56:24 UTC, 0 replies.
- Java API for GraphX - posted by "vdiwakar.malladi" <vd...@gmail.com> on 2014/08/08 06:17:24 UTC, 0 replies.
- Spark Streaming on Yarn Input from Flume - posted by XiaoQinyu <xi...@outlook.com> on 2014/08/08 07:02:18 UTC, 1 replies.
- How to use spark-cassandra-connector in spark-shell? - posted by Gary Zhao <ga...@gmail.com> on 2014/08/08 07:18:20 UTC, 5 replies.
- Shared variable in Spark Streaming - posted by Soumitra Kumar <ku...@gmail.com> on 2014/08/08 08:16:28 UTC, 3 replies.
- Spark hang with mesos after many task failures - posted by Xu Zhongxing <xu...@163.com> on 2014/08/08 09:53:35 UTC, 0 replies.
- Unable to access worker web UI or application UI (EC2) - posted by sparkuser2345 <hm...@gmail.com> on 2014/08/08 10:07:08 UTC, 1 replies.
- [GraphX] Is it normal to shuffle write 15GB while the data is only 30MB? - posted by Bin <wu...@126.com> on 2014/08/08 10:36:53 UTC, 0 replies.
- Time series in Spark / Spark Streaming - posted by PiR <pi...@cnrs.fr> on 2014/08/08 10:38:38 UTC, 0 replies.
- How to detect mesos slave down in Spark programs - posted by Xu Zhongxing <xu...@163.com> on 2014/08/08 10:58:41 UTC, 0 replies.
- Job ACL's on SPark - posted by Manoj kumar <ma...@gmail.com> on 2014/08/08 15:56:58 UTC, 1 replies.
- Minimum Split of Hadoop RDD - posted by Deep Pradhan <pr...@gmail.com> on 2014/08/08 17:56:33 UTC, 0 replies.
- error with pyspark - posted by Baoqiang Cao <bq...@gmail.com> on 2014/08/08 18:12:23 UTC, 3 replies.
- Re: scopt.OptionParser - posted by SK <sk...@gmail.com> on 2014/08/08 19:05:01 UTC, 1 replies.
- Custom Transformations in Spark - posted by Jeevak Kasarkod <je...@gmail.com> on 2014/08/08 20:10:09 UTC, 2 replies.
- Spark Streaming worker underutilized? - posted by maddenpj <ma...@gmail.com> on 2014/08/08 21:42:59 UTC, 0 replies.
- Spark sql failed in yarn-cluster mode when connecting to non-default hive database - posted by Jenny Zhao <li...@gmail.com> on 2014/08/08 21:56:37 UTC, 8 replies.
- RDD partitioner or repartition examples? - posted by buntu <bu...@gmail.com> on 2014/08/08 22:14:14 UTC, 0 replies.
- Does Spark 1.0.1 stil collect results in serial??? - posted by makevnin <ma...@lanl.gov> on 2014/08/08 22:44:20 UTC, 1 replies.
- Executors for Spark shell take much longer to be ready - posted by durin <ma...@simon-schaefer.net> on 2014/08/09 01:07:18 UTC, 0 replies.
- Spark SQL dialect - posted by Sathish Kumaran Vairavelu <vs...@gmail.com> on 2014/08/09 02:11:17 UTC, 1 replies.
- increase parallelism of reading from hdfs - posted by Chen Song <ch...@gmail.com> on 2014/08/09 05:13:50 UTC, 2 replies.
- OOM writing out sorted RDD - posted by Bharath Ravi Kumar <re...@gmail.com> on 2014/08/09 07:11:33 UTC, 1 replies.
- No space left on device - posted by kmatzen <km...@gmail.com> on 2014/08/09 08:02:21 UTC, 2 replies.
- set SPARK_LOCAL_DIRS issue - posted by Baoqiang Cao <bq...@gmail.com> on 2014/08/09 16:21:04 UTC, 1 replies.
- How to read zip files from HDFS into spark-shell using scala - posted by Alton Alexander <al...@gmail.com> on 2014/08/09 20:44:25 UTC, 0 replies.
- feature space search - posted by filipus <fl...@gmail.com> on 2014/08/09 22:54:08 UTC, 0 replies.
- Overriding dstream window definition - posted by Ruchir Jha <ru...@gmail.com> on 2014/08/10 01:44:31 UTC, 0 replies.
- Spark SQL JSON dataset query nested datastructures - posted by Sathish Kumaran Vairavelu <vs...@gmail.com> on 2014/08/10 03:43:30 UTC, 1 replies.
- Sharing memory across applications - posted by Tushar Khairnar <tk...@pivotal.io> on 2014/08/10 10:55:39 UTC, 0 replies.
- Re: saveAsTextFile - posted by durin <ma...@simon-schaefer.net> on 2014/08/10 11:12:54 UTC, 0 replies.
- how to use SPARK_PUBLIC_DNS - posted by 诺铁 <no...@gmail.com> on 2014/08/10 14:27:42 UTC, 1 replies.
- Partitioning a libsvm format file - posted by ayandas84 <ay...@gmail.com> on 2014/08/10 18:35:02 UTC, 1 replies.
- CDH5, HiveContext, Parquet - posted by Eric Friedman <er...@gmail.com> on 2014/08/10 18:36:11 UTC, 9 replies.
- SparkStreaming 0.9.0 / Java / Twitter issue - posted by Jörn Franke <jo...@gmail.com> on 2014/08/10 23:25:14 UTC, 1 replies.
- Second Attempt: Custom transformations in Spark - posted by Jeevak Kasarkod <je...@gmail.com> on 2014/08/11 04:08:07 UTC, 0 replies.
- Explain throws exception in SparkSQL - posted by Yu Gavin <ga...@gmail.com> on 2014/08/11 06:28:50 UTC, 0 replies.
- Does MLlib in spark 1.0.2 only work for tall-and-skinny matrix? - posted by Andy Zhao <an...@gmail.com> on 2014/08/11 06:35:01 UTC, 1 replies.
- Exception when call couchbase sdk in Spark Job - posted by sunchen <su...@gmail.com> on 2014/08/11 07:44:29 UTC, 0 replies.
- collect failed for unknow reason when deploy use standalone mode - posted by "wangyi@testbird.com" <wa...@testbird.com> on 2014/08/11 09:14:01 UTC, 1 replies.
- Spark RuntimeException due to Unsupported datatype NullType - posted by rafeeq s <ra...@gmail.com> on 2014/08/11 09:53:54 UTC, 1 replies.
- spark sql (can it call impala udf) - posted by marspoc <so...@gmail.com> on 2014/08/11 11:17:09 UTC, 0 replies.
- [spark-streaming] kafka source and flow control - posted by gpasquiers <gw...@ericsson.com> on 2014/08/11 11:19:41 UTC, 5 replies.
- Re: How to direct insert vaules into SparkSQL tables? - posted by chutium <te...@gmail.com> on 2014/08/11 12:13:05 UTC, 2 replies.
- how to split RDD by key and save to different path - posted by 诺铁 <no...@gmail.com> on 2014/08/11 14:42:04 UTC, 2 replies.
- ERROR UserGroupInformation: Can't find user in Subject: - posted by Dan Foisy <da...@gmail.com> on 2014/08/11 15:00:11 UTC, 0 replies.
- looking for a definitive RDD.Pipe() example? - posted by pjv0580 <pv...@agoragames.com> on 2014/08/11 15:30:55 UTC, 0 replies.
- Spark app slowing down and I'm unable to kill it - posted by Grzegorz Białek <gr...@codilime.com> on 2014/08/11 16:14:05 UTC, 1 replies.
- Re: saveAsTextFiles file not found exception - posted by Chen Song <ch...@gmail.com> on 2014/08/11 16:29:39 UTC, 7 replies.
- Compile spark code with idea succesful but run SparkPi error with "java.lang.SecurityException" - posted by Zhanfeng Huo <hu...@gmail.com> on 2014/08/11 16:36:41 UTC, 2 replies.
- share/reuse off-heap persisted (tachyon) RDD in SparkContext or saveAsParquetFile on tachyon in SQLContext - posted by chutium <te...@gmail.com> on 2014/08/11 17:08:29 UTC, 3 replies.
- Parallelizing a task makes it freeze - posted by sparkuser2345 <hm...@gmail.com> on 2014/08/11 17:31:11 UTC, 1 replies.
- Re: Can I share the RDD between multiprocess - posted by coolfrood <aa...@quantcast.com> on 2014/08/11 17:48:36 UTC, 1 replies.
- Re: ClassNotFound for user class in uber-jar - posted by lbustelo <gi...@bustelos.com> on 2014/08/11 17:50:38 UTC, 0 replies.
- ClassNotFound exception on class in uber.jar - posted by lbustelo <gi...@bustelos.com> on 2014/08/11 18:06:00 UTC, 1 replies.
- Re: Running a task once on each executor - posted by RodrigoB <ro...@aspect.com> on 2014/08/11 19:46:34 UTC, 0 replies.
- Random Forest implementation in MLib - posted by Sameer Tilak <ss...@live.com> on 2014/08/11 19:52:12 UTC, 1 replies.
- Spark "completed application" not showing example job - posted by "Mozumder, Monir" <Mo...@amd.com> on 2014/08/11 20:21:19 UTC, 2 replies.
- mllib style - posted by Koert Kuipers <ko...@tresata.com> on 2014/08/11 21:07:49 UTC, 1 replies.
- Failed jobs show up as succeeded in YARN? - posted by Shay Rojansky <ro...@roji.org> on 2014/08/11 21:17:04 UTC, 1 replies.
- spark.files.userClassPathFirst=true Not Working Correctly - posted by DNoteboom <da...@wibidata.com> on 2014/08/11 21:24:58 UTC, 5 replies.
- RE: Spark on an HPC setup - posted by Sidharth Kashyap <si...@outlook.com> on 2014/08/11 22:41:54 UTC, 0 replies.
- Gathering Information about Standalone Cluster - posted by Wonha Ryu <wo...@gmail.com> on 2014/08/11 22:58:51 UTC, 0 replies.
- Spark streaming error - Task not serializable - posted by Xuri Nagarin <se...@gmail.com> on 2014/08/12 03:00:29 UTC, 0 replies.
- Using very large files for KMeans training -- cluster centers size? - posted by durin <ma...@simon-schaefer.net> on 2014/08/12 03:26:31 UTC, 1 replies.
- Re: java.lang.StackOverflowError when calling count() - posted by randylu <ra...@gmail.com> on 2014/08/12 04:14:37 UTC, 2 replies.
- Benchmark on physical Spark cluster - posted by "Mozumder, Monir" <Mo...@amd.com> on 2014/08/12 04:17:56 UTC, 1 replies.
- KMeans - java.lang.IllegalArgumentException: requirement failed - posted by "Ge, Yao (Y.)" <yg...@ford.com> on 2014/08/12 05:44:11 UTC, 2 replies.
- Is there any way to control the parallelism in LogisticRegression - posted by "ZHENG, Xu-dong" <do...@gmail.com> on 2014/08/12 05:46:08 UTC, 6 replies.
- Transform RDD[List] - posted by Kevin Jung <it...@samsung.com> on 2014/08/12 06:42:24 UTC, 5 replies.
- Support for ORC Table in Shark/Spark - posted by vi...@socialinfra.net on 2014/08/12 07:23:33 UTC, 5 replies.
- How to save mllib model to hdfs and reload it - posted by XiaoQinyu <xi...@outlook.com> on 2014/08/12 07:27:45 UTC, 13 replies.
- Re: Mllib : Save SVM model to disk - posted by XiaoQinyu <xi...@outlook.com> on 2014/08/12 07:36:48 UTC, 1 replies.
- Serialization with com.twitter.chill.MeatLocker - posted by jerryye <je...@gmail.com> on 2014/08/12 08:54:22 UTC, 0 replies.
- Killing spark app problem - posted by Grzegorz Białek <gr...@codilime.com> on 2014/08/12 09:33:20 UTC, 1 replies.
- LDA in MLBase - posted by Aslan Bekirov <as...@gmail.com> on 2014/08/12 14:43:39 UTC, 0 replies.
- Re: anaconda and spark integration - posted by Oleg Ruchovets <or...@gmail.com> on 2014/08/12 17:41:07 UTC, 1 replies.
- Reference External Variables in Map Function (Inner class) - posted by Sunny Khatri <su...@gmail.com> on 2014/08/12 19:02:09 UTC, 4 replies.
- DistCP - Spark-based - posted by Gary Malouf <ma...@gmail.com> on 2014/08/12 20:03:56 UTC, 1 replies.
- Jobs get stuck at reduceByKey stage with spark 1.0.1 - posted by Shivani Rao <ra...@gmail.com> on 2014/08/12 20:28:38 UTC, 0 replies.
- about how to use trees with apache-spark and map-reduce tasks. - posted by Alonso Isidoro Roman <al...@gmail.com> on 2014/08/12 20:47:33 UTC, 0 replies.
- Spark Streaming example on your mesos cluster - posted by Zia Syed <xi...@gmail.com> on 2014/08/12 21:24:54 UTC, 1 replies.
- Task closures and synchronization - posted by Tom Vacek <mi...@gmail.com> on 2014/08/12 22:35:18 UTC, 1 replies.
- how to access workers from spark context - posted by "S. Zhou" <my...@yahoo.com.INVALID> on 2014/08/13 01:39:34 UTC, 3 replies.
- Running time bottleneck on a few worker - posted by Bin <wu...@126.com> on 2014/08/13 04:24:48 UTC, 1 replies.
- training recsys model - posted by Hoai-Thu Vuong <th...@gmail.com> on 2014/08/13 05:01:58 UTC, 3 replies.
- OutOfMemoryError when spark streaming receive flume events - posted by jason chen <py...@gmail.com> on 2014/08/13 05:10:59 UTC, 0 replies.
- Viewing web UI after fact - posted by grzegorz-bialek <gr...@codilime.com> on 2014/08/13 11:02:41 UTC, 4 replies.
- Running GraphX through Java - posted by Sonal Goyal <so...@gmail.com> on 2014/08/13 12:23:45 UTC, 1 replies.
- Is there any interest in handling XML within Spark ? - posted by Darin McBeath <dd...@yahoo.com.INVALID> on 2014/08/13 15:30:16 UTC, 1 replies.
- SparkSQL Hive partitioning support - posted by Silvio Fiorito <si...@granturing.com> on 2014/08/13 17:22:28 UTC, 3 replies.
- Re: Contribution to Spark MLLib - posted by Debasish Das <de...@gmail.com> on 2014/08/13 17:40:43 UTC, 0 replies.
- Getting percentile from Spark Streaming? - posted by bumble123 <tc...@att.com> on 2014/08/13 18:43:03 UTC, 0 replies.
- groupByKey() completes 99% on Spark + EC2 + S3 but then throws java.net.SocketException: Connection reset - posted by Arpan Ghosh <ar...@automatic.com> on 2014/08/13 21:21:13 UTC, 9 replies.
- Re: Kafka - streaming from multiple topics - posted by maddenpj <ma...@gmail.com> on 2014/08/13 21:37:26 UTC, 0 replies.
- Open source project: Deploy Spark to a cluster with Puppet and Fabric. - posted by bdamos <am...@adobe.com> on 2014/08/13 22:44:56 UTC, 1 replies.
- Open source project: Example Spark project using Parquet as a columnar store with Thrift objects. - posted by bdamos <am...@adobe.com> on 2014/08/13 22:48:24 UTC, 0 replies.
- spark streaming : what is the best way to make a driver highly available - posted by salemi <al...@udo.edu> on 2014/08/13 22:49:16 UTC, 3 replies.
- Using Hadoop InputFormat in Python - posted by Tassilo Klein <TJ...@gmail.com> on 2014/08/13 23:59:33 UTC, 6 replies.
- Spark Akka/actor failures. - posted by ldmtwo <ld...@gmail.com> on 2014/08/14 00:56:17 UTC, 2 replies.
- How to access the individual elements of RDD[Iterable[Float]] to do sum(),stdev() ? - posted by KRaman <kr...@gmail.com> on 2014/08/14 01:18:53 UTC, 0 replies.
- java.lang.UnknownError: no bin was found for continuous variable. - posted by Sameer Tilak <ss...@live.com> on 2014/08/14 01:43:54 UTC, 2 replies.
- SPARK_LOCAL_DIRS option - posted by Debasish Das <de...@gmail.com> on 2014/08/14 03:47:53 UTC, 1 replies.
- How to debug: Runs locally but not on cluster - posted by jerryye <je...@gmail.com> on 2014/08/14 04:29:15 UTC, 1 replies.
- Re: Python + Spark unable to connect to S3 bucket .... "Invalid hostname in URI" - posted by jerryye <je...@gmail.com> on 2014/08/14 04:37:59 UTC, 2 replies.
- Ways to partition the RDD - posted by bdev <bu...@gmail.com> on 2014/08/14 04:45:52 UTC, 6 replies.
- Spark SQL Stackoverflow error - posted by Vishal Vibhandik <vi...@gmail.com> on 2014/08/14 05:16:35 UTC, 1 replies.
- Script to deploy spark to Google compute engine - posted by Soumya Simanta <so...@gmail.com> on 2014/08/14 05:17:43 UTC, 2 replies.
- Re: Job aborted due to stage failure: TID x failed for unknown reasons - posted by jerryye <je...@gmail.com> on 2014/08/14 09:53:12 UTC, 0 replies.
- how to use the method saveAsTextFile of a RDD like javaRDD - posted by Gefei Li <ge...@gmail.com> on 2014/08/14 10:23:10 UTC, 4 replies.
- read performance issue - posted by Gurvinder Singh <gu...@uninett.no> on 2014/08/14 10:27:25 UTC, 0 replies.
- Should the memory of worker nodes be constrained to the size of the master node? - posted by Darin McBeath <dd...@yahoo.com.INVALID> on 2014/08/14 14:32:41 UTC, 1 replies.
- Re: Down-scaling Spark on EC2 cluster - posted by Shubhabrata <ma...@gmail.com> on 2014/08/14 16:34:52 UTC, 0 replies.
- Using Spark Streaming to listen to HDFS directory and handle different files by file name - posted by ZhangYi <yi...@thoughtworks.com> on 2014/08/14 16:50:31 UTC, 0 replies.
- SPARK_DRIVER_MEMORY - posted by Brad Miller <bm...@eecs.berkeley.edu> on 2014/08/14 18:54:46 UTC, 0 replies.
- Mlib model: viewing and saving - posted by Sameer Tilak <ss...@live.com> on 2014/08/14 19:21:54 UTC, 0 replies.
- SPARK_LOCAL_DIRS - posted by Brad Miller <bm...@eecs.berkeley.edu> on 2014/08/14 19:27:40 UTC, 1 replies.
- Re: Subscribing to news releases - posted by Nicholas Chammas <ni...@gmail.com> on 2014/08/14 20:33:52 UTC, 0 replies.
- How to transform large local files into Parquet format and write into HDFS? - posted by Parthus <pe...@gmail.com> on 2014/08/14 20:44:04 UTC, 0 replies.
- Documentation to start with - posted by Abhilash K Challa <ab...@gmail.com> on 2014/08/14 21:33:31 UTC, 0 replies.
- Spark on HDP - posted by Padmanabh <re...@gmail.com> on 2014/08/14 22:50:43 UTC, 0 replies.
- Seattle Spark Meetup: Spark at eBay - Troubleshooting the everyday issues Slides - posted by Denny Lee <de...@gmail.com> on 2014/08/14 23:14:33 UTC, 1 replies.
- spark streaming - lamda architecture - posted by salemi <al...@udo.edu> on 2014/08/14 23:27:22 UTC, 6 replies.
- Re: Spark webUI - application details page - posted by SK <sk...@gmail.com> on 2014/08/14 23:48:59 UTC, 15 replies.
- Performance hit for using sc.setCheckPointDir - posted by Debasish Das <de...@gmail.com> on 2014/08/15 00:00:58 UTC, 0 replies.
- Dealing with Idle shells - posted by Gary Malouf <ma...@gmail.com> on 2014/08/15 00:03:08 UTC, 0 replies.
- Compiling SNAPTSHOT - posted by Jim Blomo <ji...@gmail.com> on 2014/08/15 00:25:00 UTC, 1 replies.
- SparkR: split, apply, combine strategy for dataframes? - posted by "Carlos J. Gil Bellosta " <gi...@gmail.com> on 2014/08/15 00:53:19 UTC, 2 replies.
- Getting hadoop distcp to work on ephemeral-hsfs in spark-ec2 cluster - posted by Arpan Ghosh <ar...@automatic.com> on 2014/08/15 02:57:01 UTC, 0 replies.
- Spark working directories - posted by Yana Kadiyska <ya...@gmail.com> on 2014/08/15 03:24:21 UTC, 1 replies.
- None in RDD - posted by guoxu1231 <gu...@gmail.com> on 2014/08/15 07:30:23 UTC, 0 replies.
- spark on yarn cluster can't launch - posted by centerqi hu <ce...@gmail.com> on 2014/08/15 09:23:36 UTC, 4 replies.
- Re: Debugging "Task not serializable" - posted by Juan Rodríguez Hortalá <ju...@gmail.com> on 2014/08/15 10:01:26 UTC, 0 replies.
- Re: How to implement multinomial logistic regression(softmax regression) in Spark? - posted by Cui xp <li...@gmail.com> on 2014/08/15 11:24:45 UTC, 4 replies.
- Re: spark won't build with maven - posted by visakh <vi...@gmail.com> on 2014/08/15 11:32:08 UTC, 0 replies.
- Issues with S3 client library and Apache Spark - posted by Darin McBeath <dd...@yahoo.com.INVALID> on 2014/08/15 16:21:04 UTC, 0 replies.
- Running Spark shell on YARN - posted by Soumya Simanta <so...@gmail.com> on 2014/08/15 19:45:50 UTC, 6 replies.
- [Spar Streaming] How can we use consecutive data points as the features ? - posted by Yan Fang <ya...@gmail.com> on 2014/08/15 20:29:30 UTC, 1 replies.
- Hardware Context on Spark Worker Hosts - posted by Chris Brown <cb...@infoblox.com> on 2014/08/15 20:48:42 UTC, 0 replies.
- closure issue - works in scalatest but not in spark-shell - posted by Mohit Jaggi <mo...@gmail.com> on 2014/08/15 20:50:55 UTC, 0 replies.
- ALS checkpoint performance - posted by Debasish Das <de...@gmail.com> on 2014/08/15 21:30:57 UTC, 1 replies.
- spark streaming - saving kafka DStream into hadoop throws exception - posted by salemi <al...@udo.edu> on 2014/08/15 22:37:26 UTC, 3 replies.
- Open sourcing Spindle by Adobe Research, a web analytics processing engine in Scala, Spark, and Parquet. - posted by Brandon Amos <am...@adobe.com> on 2014/08/15 23:06:37 UTC, 5 replies.
- mlib model viewing and saving - posted by Sameer Tilak <ss...@live.com> on 2014/08/16 01:28:11 UTC, 0 replies.
- Does HiveContext support Parquet? - posted by lyc <ya...@huawei.com> on 2014/08/16 01:29:50 UTC, 13 replies.
- Re: Scala Spark Distinct on a case class doesn't work - posted by clarkroberts <cl...@curalate.com> on 2014/08/16 01:46:28 UTC, 0 replies.
- Updating exising JSON files - posted by ejb11235 <er...@ericjbell.com> on 2014/08/16 01:53:52 UTC, 1 replies.
- Question regarding spark data partition and coalesce. Need info on my use case. - posted by abhiguruvayya <sh...@gmail.com> on 2014/08/16 03:34:06 UTC, 2 replies.
- Error in sbt/sbt package - posted by Deep Pradhan <pr...@gmail.com> on 2014/08/16 07:09:12 UTC, 2 replies.
- kryo out of buffer exception - posted by Mohit Jaggi <mo...@gmail.com> on 2014/08/16 20:43:33 UTC, 0 replies.
- Does anyone have a stand alone spark instance running on Windows - posted by Steve Lewis <lo...@gmail.com> on 2014/08/16 21:14:00 UTC, 4 replies.
- iterating with index in psypark - posted by Chengi Liu <ch...@gmail.com> on 2014/08/16 23:22:27 UTC, 1 replies.
- s3:// sequence file startup time - posted by kmatzen <km...@gmail.com> on 2014/08/17 03:46:20 UTC, 2 replies.
- Program without doing assembly - posted by Deep Pradhan <pr...@gmail.com> on 2014/08/17 07:56:25 UTC, 1 replies.
- akka error : play framework (2.3.3) and spark (1.0.2) - posted by Sujee Maniyam <su...@sujee.net> on 2014/08/17 08:04:55 UTC, 2 replies.
- Finding Data Gaps or missing data count for a time series data - posted by kushagrathakur <ku...@gmail.com> on 2014/08/17 13:54:07 UTC, 0 replies.
- application as a service - posted by Zhanfeng Huo <hu...@gmail.com> on 2014/08/17 17:27:44 UTC, 5 replies.
- Vast network traffic during RDD creation - posted by Acco <gi...@hotmail.com> on 2014/08/17 17:39:37 UTC, 0 replies.
- Spark: why need a masterLock when sending heartbeat to master - posted by Victor Sheng <vi...@gmail.com> on 2014/08/17 17:58:56 UTC, 2 replies.
- Question on mappartitionwithsplit - posted by Chengi Liu <ch...@gmail.com> on 2014/08/17 19:25:28 UTC, 5 replies.
- OutOfMemory Error - posted by Ghousia Taj <gh...@gmail.com> on 2014/08/18 07:10:57 UTC, 7 replies.
- Segmented fold count - posted by fil <fi...@pobox.com> on 2014/08/18 07:34:38 UTC, 7 replies.
- NullPointerException when connecting from Spark to a Hive table backed by HBase - posted by Cesar Arevalo <ce...@zephyrhealthinc.com> on 2014/08/18 08:39:16 UTC, 8 replies.
- a noob question for how to implement setup and cleanup in Spark map - posted by Henry Hung <YT...@winbond.com> on 2014/08/18 08:42:21 UTC, 8 replies.
- Merging complicated small matrices to one big matrix - posted by Chengi Liu <ch...@gmail.com> on 2014/08/18 11:42:10 UTC, 1 replies.
- [GraphX] how to set memory configurations to avoid OutOfMemoryError "GC overhead limit exceeded" - posted by Yifan LI <ia...@gmail.com> on 2014/08/18 15:29:12 UTC, 1 replies.
- Working with many RDDs in parallel? - posted by David Tinker <da...@gmail.com> on 2014/08/18 15:31:07 UTC, 2 replies.
- spark kryo serilizable exception - posted by adu <du...@hzduozhun.com> on 2014/08/18 15:50:02 UTC, 1 replies.
- Spark Streaming Data Sharing - posted by Levi Bowman <Le...@markit.com> on 2014/08/18 16:30:31 UTC, 1 replies.
- Spark Streaming - saving DStream into hadoop throws execption if checkpoint is enabled. - posted by salemi <al...@udo.edu> on 2014/08/18 19:20:45 UTC, 0 replies.
- Bug or feature? Overwrite broadcasted variables. - posted by Peng Cheng <pc...@uow.edu.au> on 2014/08/18 20:26:03 UTC, 3 replies.
- java.nio.channels.CancelledKeyException in Graphx Connected Components - posted by Jeffrey Picard <jp...@columbia.edu> on 2014/08/18 22:02:33 UTC, 0 replies.
- Extracting unique elements of an ArrayBuffer - posted by SK <sk...@gmail.com> on 2014/08/18 22:09:32 UTC, 1 replies.
- spark - reading hfds files every 5 minutes - posted by salemi <al...@udo.edu> on 2014/08/18 23:23:12 UTC, 3 replies.
- setCallSite for API backtraces not showing up in logs? - posted by John Salvatier <js...@gmail.com> on 2014/08/18 23:33:42 UTC, 0 replies.
- How to use Spark Streaming from an HTTP api? - posted by bumble123 <tc...@att.com> on 2014/08/19 00:20:32 UTC, 1 replies.
- spark-submit with HA YARN - posted by Matt Narrell <ma...@gmail.com> on 2014/08/19 01:07:08 UTC, 7 replies.
- Processing multiple files in parallel - posted by SK <sk...@gmail.com> on 2014/08/19 03:14:32 UTC, 2 replies.
- Data loss - Spark streaming and network receiver - posted by Wei Liu <we...@stellarloyalty.com> on 2014/08/19 03:18:40 UTC, 5 replies.
- sqlContext.parquetFile(path) fails if path is a file but succeeds if a directory - posted by Fengyun RAO <ra...@gmail.com> on 2014/08/19 04:59:50 UTC, 1 replies.
- spark - Identifying and skipping processed data in hdfs - posted by salemi <al...@udo.edu> on 2014/08/19 05:30:37 UTC, 2 replies.
- Cannot run program "Rscript" using SparkR - posted by Stuti Awasthi <st...@hcl.com> on 2014/08/19 07:05:47 UTC, 2 replies.
- Re: spark streaming updataStateByKey clear old data - posted by darwen <hu...@gmail.com> on 2014/08/19 07:40:56 UTC, 0 replies.
- Problem in running a job on more than one workers - posted by Rasika Pohankar <ra...@gmail.com> on 2014/08/19 08:33:10 UTC, 1 replies.
- Naive Bayes - posted by Phuoc Do <ph...@vida.io> on 2014/08/19 09:07:38 UTC, 4 replies.
- Performance problem on collect - posted by Emmanuel Castanier <em...@gmail.com> on 2014/08/19 09:43:28 UTC, 4 replies.
- Partitioning under Spark 1.0.x - posted by losmi83 <mi...@gmail.com> on 2014/08/19 12:53:04 UTC, 0 replies.
- spark error when distinct on more than one cloume - posted by "wangyi@testbird.com" <wa...@testbird.com> on 2014/08/19 13:57:10 UTC, 1 replies.
- Executor Memory, Task hangs - posted by "Laird, Benjamin" <Be...@capitalone.com> on 2014/08/19 15:05:57 UTC, 3 replies.
- Running time is significantly unbalanced - posted by Bin <wu...@126.com> on 2014/08/19 16:30:40 UTC, 1 replies.
- [Streaming]Executor OOM - posted by Yana Kadiyska <ya...@gmail.com> on 2014/08/19 17:07:58 UTC, 0 replies.
- How to configure SPARK_EXECUTOR_URI to access files from maprfs - posted by "Lee Strawther (lstrawth)" <ls...@cisco.com> on 2014/08/19 17:14:05 UTC, 0 replies.
- spark on disk executions - posted by Oleg Ruchovets <or...@gmail.com> on 2014/08/19 18:38:55 UTC, 1 replies.
- Updating shared data structure between executors - posted by Tim Smith <se...@gmail.com> on 2014/08/19 19:29:58 UTC, 0 replies.
- Python script runs fine in local mode, errors in other modes - posted by Aaron <aa...@target.com> on 2014/08/19 19:47:09 UTC, 5 replies.
- EC2 instances missing SSD drives randomly? - posted by Andras Barjak <an...@lynxanalytics.com> on 2014/08/19 20:54:49 UTC, 1 replies.
- noob: how to extract different members of a VertexRDD - posted by spr <sp...@yarcdata.com> on 2014/08/19 21:05:15 UTC, 3 replies.
- pyspark/yarn and inconsistent number of executors - posted by Calvin <ip...@gmail.com> on 2014/08/19 21:51:18 UTC, 1 replies.
- RDD Grouping - posted by TJ Klein <TJ...@gmail.com> on 2014/08/19 22:02:01 UTC, 2 replies.
- Only master is really busy at KMeans training - posted by durin <ma...@simon-schaefer.net> on 2014/08/19 22:41:42 UTC, 4 replies.
- saveAsTextFile hangs with hdfs - posted by David <da...@gmail.com> on 2014/08/19 22:44:18 UTC, 3 replies.
- Re: OpenCV + Spark : Where to put System.loadLibrary ? - posted by kmatzen <km...@gmail.com> on 2014/08/19 22:50:16 UTC, 1 replies.
- spark streaming - how to prevent that empty dstream to get written out to hdfs - posted by salemi <al...@udo.edu> on 2014/08/19 23:15:21 UTC, 0 replies.
- spark-submit with Yarn - posted by Arun Ahuja <aa...@gmail.com> on 2014/08/19 23:34:18 UTC, 3 replies.
- Multiple column families vs Multiple tables - posted by Wei Liu <we...@stellarloyalty.com> on 2014/08/20 00:06:38 UTC, 3 replies.
- Re: Task's "Scheduler Delay" in web ui - posted by Chris Fregly <ch...@fregly.com> on 2014/08/20 00:54:26 UTC, 0 replies.
- First Bay Area Tachyon meetup: August 25th, hosted by Yahoo! (Limited Space) - posted by Haoyuan Li <ha...@gmail.com> on 2014/08/20 01:08:46 UTC, 1 replies.
- Re: How to incorporate the new data in the MLlib-NaiveBayes model along with predicting? - posted by Chris Fregly <ch...@fregly.com> on 2014/08/20 01:22:11 UTC, 1 replies.
- Decision tree: categorical variables - posted by Sameer Tilak <ss...@live.com> on 2014/08/20 01:24:05 UTC, 3 replies.
- High-Level Implementation Documentation - posted by Kenny Ballou <kb...@devnulllabs.io> on 2014/08/20 01:30:36 UTC, 0 replies.
- Re: slower worker node in the cluster - posted by Chris Fregly <ch...@fregly.com> on 2014/08/20 01:41:06 UTC, 0 replies.
- Re: Is hive UDF are supported in HiveContext - posted by chutium <te...@gmail.com> on 2014/08/20 01:57:51 UTC, 0 replies.
- Spark SQL: Caching nested structures extremely slow - posted by Evan Chan <ve...@gmail.com> on 2014/08/20 03:01:14 UTC, 2 replies.
- What about implementing various hypothesis test for Logistic Regression in MLlib - posted by guxiaobo1982 <gu...@qq.com> on 2014/08/20 06:50:40 UTC, 1 replies.
- JVM heap and native allocation questions - posted by kmatzen <km...@gmail.com> on 2014/08/20 07:29:54 UTC, 0 replies.
- Got NotSerializableException when access broadcast variable - posted by 田毅 <ti...@asiainfo.com> on 2014/08/20 08:27:00 UTC, 5 replies.
- RDD Row Index - posted by TJ Klein <TJ...@gmail.com> on 2014/08/20 09:35:20 UTC, 2 replies.
- [Spark SQL] How to select first row in each GROUP BY group? - posted by Fengyun RAO <ra...@gmail.com> on 2014/08/20 09:52:28 UTC, 4 replies.
- Accessing to elements in JavaDStream - posted by cuongpham92 <cu...@gmail.com> on 2014/08/20 09:55:16 UTC, 2 replies.
- Re: NullPointerException from '.count.foreachRDD' - posted by anoldbrain <an...@gmail.com> on 2014/08/20 10:12:43 UTC, 2 replies.
- Broadcast vs simple variable - posted by Julien Naour <ju...@gmail.com> on 2014/08/20 10:18:00 UTC, 4 replies.
- Difference between amplab docker and spark docker? - posted by Josh J <jo...@gmail.com> on 2014/08/20 10:28:58 UTC, 0 replies.
- Re: hdfs read performance issue - posted by Gurvinder Singh <gu...@uninett.no> on 2014/08/20 11:01:27 UTC, 0 replies.
- Hi - posted by rapelly kartheek <ka...@gmail.com> on 2014/08/20 11:29:11 UTC, 1 replies.
- (Unknown) - posted by Cường Phạm <cu...@gmail.com> on 2014/08/20 12:47:26 UTC, 0 replies.
- Web UI doesn't show some stages - posted by Grzegorz Białek <gr...@codilime.com> on 2014/08/20 13:22:46 UTC, 2 replies.
- Potential Thrift Server Bug on Spark SQL,perhaps with cache table? - posted by John Omernik <jo...@omernik.com> on 2014/08/20 15:11:48 UTC, 1 replies.
- Advantage of using cache() - posted by Grzegorz Białek <gr...@codilime.com> on 2014/08/20 16:48:58 UTC, 4 replies.
- How to pass env variables from master to executors within spark-shell - posted by Darin McBeath <dd...@yahoo.com.INVALID> on 2014/08/20 18:19:18 UTC, 2 replies.
- Spark exception while reading different inputs - posted by durga <du...@gmail.com> on 2014/08/20 18:23:21 UTC, 0 replies.
- Stage failure in BlockManager due to FileNotFoundException on long-running streaming job - posted by Silvio Fiorito <si...@granturing.com> on 2014/08/20 18:28:38 UTC, 2 replies.
- Is Spark SQL Thrift Server part of the 1.0.2 release - posted by "Tam, Ken K" <ke...@verizon.com> on 2014/08/20 18:35:53 UTC, 0 replies.
- Re: Is Spark SQL Thrift Server part of the 1.0.2 release - posted by Michael Armbrust <mi...@databricks.com> on 2014/08/20 18:37:51 UTC, 2 replies.
- GraphX question about graph traversal - posted by Cesar Arevalo <ce...@zephyrhealthinc.com> on 2014/08/20 19:34:50 UTC, 4 replies.
- MLlib: issue with increasing maximum depth of the decision tree - posted by Sameer Tilak <ss...@live.com> on 2014/08/20 19:37:13 UTC, 2 replies.
- Spark-job error on writing result into hadoop w/ switch_user=false - posted by Jongyoul Lee <jo...@gmail.com> on 2014/08/20 19:46:20 UTC, 1 replies.
- Personalized Page rank in graphx - posted by Mohit Singh <mo...@gmail.com> on 2014/08/20 19:57:57 UTC, 2 replies.
- How to set KryoRegistrator class in spark-shell - posted by Benyi Wang <be...@gmail.com> on 2014/08/20 21:25:14 UTC, 2 replies.
- Small input split sizes - posted by David Rosenstrauch <da...@darose.net> on 2014/08/20 22:33:18 UTC, 0 replies.
- java.io.NotSerializableException: org.scalatest.Assertions$AssertionsHelper - posted by Chris Jones <cv...@yahoo.com.INVALID> on 2014/08/21 01:21:14 UTC, 2 replies.
- Spark QL and protobuf schema - posted by Dmitriy Lyubimov <dl...@gmail.com> on 2014/08/21 02:57:27 UTC, 5 replies.
- Spark memory settings on yarn - posted by centerqi hu <ce...@gmail.com> on 2014/08/21 03:56:56 UTC, 2 replies.
- DStream cannot write to text file - posted by cuongpham92 <cu...@gmail.com> on 2014/08/21 04:59:57 UTC, 7 replies.
- Trying to run SparkSQL over Spark Streaming - posted by praveshjain1991 <pr...@gmail.com> on 2014/08/21 07:19:27 UTC, 11 replies.
- Merging two Spark SQL tables? - posted by Evan Chan <ve...@gmail.com> on 2014/08/21 08:30:24 UTC, 4 replies.
- Mapping with extra arguments - posted by TJ Klein <TJ...@gmail.com> on 2014/08/21 08:33:39 UTC, 4 replies.
- Matrix multiplication in spark - posted by phoenix bai <mi...@gmail.com> on 2014/08/21 11:07:22 UTC, 4 replies.
- ClassCastException, when calling saveToCassandra() - posted by keiffster <ke...@first-utility.com> on 2014/08/21 11:34:47 UTC, 0 replies.
- out of memory errors -- per core memory limits? - posted by Rok Roskar <ro...@gmail.com> on 2014/08/21 11:39:34 UTC, 0 replies.
- Spark Streaming checkpoint recovery causes IO re-execution - posted by RodrigoB <ro...@aspect.com> on 2014/08/21 12:16:43 UTC, 5 replies.
- More than one worker freezes (some) applications - posted by Alexander Matz <al...@ziti.uni-heidelberg.de> on 2014/08/21 12:40:21 UTC, 0 replies.
- Job aborted due to stage failure: Master removed our application: FAILED - posted by Kristoffer Sjögren <st...@gmail.com> on 2014/08/21 13:36:43 UTC, 1 replies.
- Launching history server problem - posted by Grzegorz Białek <gr...@codilime.com> on 2014/08/21 14:05:09 UTC, 0 replies.
- [Spark Streaming] kafka consumer announce - posted by Evgeniy Shishkin <it...@gmail.com> on 2014/08/21 15:17:47 UTC, 2 replies.
- Re: Spark Streaming with Flume event - posted by Spidy <yo...@gmail.com> on 2014/08/21 16:30:16 UTC, 1 replies.
- multiple windows from the same DStream ? - posted by Josh J <jo...@gmail.com> on 2014/08/21 16:58:06 UTC, 1 replies.
- DStream start a separate DStream - posted by Josh J <jo...@gmail.com> on 2014/08/21 17:08:26 UTC, 1 replies.
- Tracking memory usage - posted by Grzegorz Białek <gr...@codilime.com> on 2014/08/21 17:18:36 UTC, 0 replies.
- I am struggling to run Spark Examples on my local machine - posted by Steve Lewis <lo...@gmail.com> on 2014/08/21 18:37:52 UTC, 1 replies.
- Debugging cluster stability, configuration issues - posted by Shay Seng <sh...@urbanengines.com> on 2014/08/21 19:23:09 UTC, 2 replies.
- Kafka Spark Streaming job has an issue when the worker reading from Kafka is killed - posted by Bharat Venkat <bv...@gmail.com> on 2014/08/21 21:01:20 UTC, 0 replies.
- Writeup on Spark SQL with GDELT - posted by Evan Chan <ve...@gmail.com> on 2014/08/21 21:04:54 UTC, 1 replies.
- [Tachyon] Error reading from Parquet files in HDFS - posted by Evan Chan <ve...@gmail.com> on 2014/08/21 21:22:29 UTC, 2 replies.
- Spark Streaming Twitter Example Error - posted by danilopds <da...@gmail.com> on 2014/08/21 22:09:59 UTC, 1 replies.
- OOM Java heap space error on saveAsTextFile - posted by Daniil Osipov <da...@shazam.com> on 2014/08/21 22:36:02 UTC, 1 replies.
- Re: Spark on Yarn: Connecting to Existing Instance - posted by Chris Fregly <ch...@fregly.com> on 2014/08/21 22:45:20 UTC, 0 replies.
- Development environment issues - posted by pierred <pi...@demartines.com> on 2014/08/22 00:21:25 UTC, 1 replies.
- AppMaster OOME on YARN - posted by Vipul Pandey <vi...@gmail.com> on 2014/08/22 00:30:54 UTC, 2 replies.
- saveAsTextFile makes no progress without caching RDD - posted by jerryye <je...@gmail.com> on 2014/08/22 00:36:05 UTC, 0 replies.
- Configuration for big worker nodes - posted by soroka21 <so...@gmail.com> on 2014/08/22 00:42:09 UTC, 1 replies.
- Re: Hive From Spark - posted by Du Li <li...@yahoo-inc.com.INVALID> on 2014/08/22 01:23:04 UTC, 7 replies.
- Spark-JobServer moving to a new location - posted by Evan Chan <ve...@gmail.com> on 2014/08/22 01:43:24 UTC, 0 replies.
- Finding previous and next element in a sorted RDD - posted by cjwang <cj...@cjwang.us> on 2014/08/22 02:42:41 UTC, 4 replies.
- Re: Spark Streaming - What does Spark Streaming checkpoint? - posted by Chris Fregly <ch...@fregly.com> on 2014/08/22 03:02:28 UTC, 0 replies.
- The running time of spark - posted by Denis RP <qq...@gmail.com> on 2014/08/22 03:49:29 UTC, 5 replies.
- Spark SQL Parser error - posted by S Malligarjunan <sm...@yahoo.com.INVALID> on 2014/08/22 05:12:16 UTC, 8 replies.
- LDA example? - posted by Denny Lee <de...@gmail.com> on 2014/08/22 07:10:35 UTC, 2 replies.
- Spark on Mesos cause mesos-master OOM - posted by Chengwei Yang <ch...@gmail.com> on 2014/08/22 07:57:23 UTC, 0 replies.
- iterator cause NotSerializableException - posted by Kevin Jung <it...@samsung.com> on 2014/08/22 10:51:25 UTC, 0 replies.
- countByWindow save the count ? - posted by Josh J <jo...@gmail.com> on 2014/08/22 10:58:28 UTC, 2 replies.
- Installation On Windows machine - posted by "Mishra, Abhishek" <Ab...@xerox.com> on 2014/08/22 12:01:43 UTC, 3 replies.
- On Spark Standalone mode, Where the driver program will run? - posted by "taoistwar@gmail.com" <ta...@gmail.com> on 2014/08/22 12:04:22 UTC, 0 replies.
- [PySpark][Python 2.7.8][Spark 1.0.2] count() with TypeError: an integer is required - posted by Earthson <Ea...@gmail.com> on 2014/08/22 12:14:12 UTC, 3 replies.
- "Block input-* already exists on this machine; not re-adding it" warnings - posted by Aniket Bhatnagar <an...@gmail.com> on 2014/08/22 12:24:07 UTC, 1 replies.
- Re: Finding Rank in Spark - posted by athiradas <at...@flutura.com> on 2014/08/22 13:14:34 UTC, 1 replies.
- Understanding how to create custom DStreams in Spark streaming - posted by Aniket Bhatnagar <an...@gmail.com> on 2014/08/22 13:36:39 UTC, 0 replies.
- Losing Executors on cluster with RDDs of 100GB - posted by Yadid Ayzenberg <ya...@media.mit.edu> on 2014/08/22 13:47:53 UTC, 1 replies.
- Do we have to install the snappy when running the shuffle jobs - posted by carlmartin <ca...@qq.com> on 2014/08/22 16:06:07 UTC, 0 replies.
- Manipulating/Analyzing CSV files in Spark on local machine - posted by "Hingorani, Vineet" <vi...@sap.com> on 2014/08/22 16:58:20 UTC, 0 replies.
- why classTag not typeTag? - posted by Mohit Jaggi <mo...@gmail.com> on 2014/08/22 18:15:14 UTC, 1 replies.
- importing scala libraries from python? - posted by Jonathan Haddad <jo...@jonhaddad.com> on 2014/08/22 20:29:54 UTC, 0 replies.
- How to start master and workers on Windows - posted by Steve Lewis <lo...@gmail.com> on 2014/08/22 20:57:28 UTC, 0 replies.
- FetchFailed when collect at YARN cluster - posted by Jiayu Zhou <de...@gmail.com> on 2014/08/22 21:00:05 UTC, 3 replies.
- [PySpark] order of values in GroupByKey() - posted by Arpan Ghosh <ar...@automatic.com> on 2014/08/22 22:32:34 UTC, 3 replies.
- spark streaming - realtime reports - storing current state of resources - posted by salemi <al...@udo.edu> on 2014/08/22 22:43:04 UTC, 0 replies.
- cache table with JDBC - posted by ken <ke...@verizon.com> on 2014/08/23 01:30:48 UTC, 0 replies.
- Re: wholeTextFiles not working with HDFS - posted by pierred <pi...@demartines.com> on 2014/08/23 02:14:38 UTC, 1 replies.
- ODBC and HiveThriftServer2 - posted by prnicolas <pn...@yahoo.com> on 2014/08/23 04:06:09 UTC, 0 replies.
- Spark: Why Standalone mode can not set Executor Number. - posted by Victor Sheng <vi...@gmail.com> on 2014/08/23 05:37:42 UTC, 0 replies.
- Re: What about implementing various hypothesis test for LogisticRegression in MLlib - posted by guxiaobo1982 <gu...@qq.com> on 2014/08/23 05:44:48 UTC, 1 replies.
- FetchFailedException from Block Manager - posted by Victor Tso-Guillen <vt...@paxata.com> on 2014/08/23 06:24:55 UTC, 0 replies.
- Spark Cluster Benchmarking Frameworks - posted by Jonathan Hodges <ho...@gmail.com> on 2014/08/23 18:57:20 UTC, 0 replies.
- Printing the RDDs in SparkPageRank - posted by Deep Pradhan <pr...@gmail.com> on 2014/08/24 08:28:35 UTC, 4 replies.
- How to make Spark Streaming write its output so that Impala can read it? - posted by rafeeq s <ra...@gmail.com> on 2014/08/24 10:19:44 UTC, 2 replies.
- Spark Streaming API and Performance Clarifications - posted by didi <di...@gmail.com> on 2014/08/24 18:29:31 UTC, 0 replies.
- amp lab spark streaming twitter example - posted by Forest D <de...@gmail.com> on 2014/08/24 19:21:20 UTC, 3 replies.
- Return multiple [K,V] pairs from a Java Function - posted by Tom <th...@gmail.com> on 2014/08/24 22:15:21 UTC, 1 replies.
- pipe raw binary data - posted by "Emeric, Viel" <em...@jp.fujitsu.com> on 2014/08/25 02:49:32 UTC, 0 replies.
- Spark Stream + HDFS Append - posted by Dean Chen <de...@gmail.com> on 2014/08/25 02:56:07 UTC, 1 replies.
- How to join two PairRDD together? - posted by Gefei Li <ge...@gmail.com> on 2014/08/25 08:37:19 UTC, 3 replies.
- many fetch failure in "BlockManager" - posted by 余根茂 <hu...@gmail.com> on 2014/08/25 09:03:51 UTC, 0 replies.
- spark and matlab - posted by Jaonary Rabarisoa <ja...@gmail.com> on 2014/08/25 09:40:59 UTC, 3 replies.
- apply at Option.scala:120 - posted by "Wang, Jensen" <je...@sap.com> on 2014/08/25 10:32:34 UTC, 1 replies.
- StorageLevel error. - posted by rapelly kartheek <ka...@gmail.com> on 2014/08/25 11:52:07 UTC, 1 replies.
- Spark - GraphX pregel like with global variables (accumulator / broadcast) - posted by BertrandR <be...@gmail.com> on 2014/08/25 15:41:36 UTC, 4 replies.
- Request for help in writing to Textfile - posted by yh18190 <yh...@gmail.com> on 2014/08/25 15:56:57 UTC, 1 replies.
- Manipulating columns in CSV file or Transpose of Array[Array[String]] RDD - posted by "Hingorani, Vineet" <vi...@sap.com> on 2014/08/25 16:09:26 UTC, 3 replies.
- SPARK Hive Context UDF Class Not Found Exception, - posted by S Malligarjunan <sm...@yahoo.com.INVALID> on 2014/08/25 18:57:04 UTC, 2 replies.
- How do you hit breakpoints using IntelliJ In functions used by an RDD - posted by Steve Lewis <lo...@gmail.com> on 2014/08/25 19:32:11 UTC, 4 replies.
- GraphX usecases - posted by Sunita Arvind <su...@gmail.com> on 2014/08/25 20:23:37 UTC, 2 replies.
- HiveContext ouput log file - posted by S Malligarjunan <sm...@yahoo.com.INVALID> on 2014/08/25 20:46:35 UTC, 2 replies.
- Read timeout while running a Job on data in S3 - posted by Arpan Ghosh <ar...@automatic.com> on 2014/08/25 21:01:50 UTC, 0 replies.
- Request for Help - posted by yh18190 <yh...@gmail.com> on 2014/08/25 21:55:00 UTC, 1 replies.
- Re: Storage Handlers in Spark SQL - posted by Michael Armbrust <mi...@databricks.com> on 2014/08/25 22:17:07 UTC, 1 replies.
- Spark Screencast doesn't show in Chrome on OS X - posted by Nick Chammas <ni...@gmail.com> on 2014/08/25 22:55:49 UTC, 7 replies.
- unable to instantiate HiveMetaStoreClient on LocalHiveContext - posted by Du Li <li...@yahoo-inc.com.INVALID> on 2014/08/26 00:33:39 UTC, 2 replies.
- Re: error from DecisonTree Training: - posted by Joseph Bradley <jo...@databricks.com> on 2014/08/26 01:27:31 UTC, 0 replies.
- Does Spark Streaming count the number of windows processed? - posted by jchen <jc...@pivotal.io> on 2014/08/26 01:50:56 UTC, 0 replies.
- Pair RDD - posted by Deep Pradhan <pr...@gmail.com> on 2014/08/26 06:55:40 UTC, 1 replies.
- creating a subgraph with an edge predicate - posted by dizzy5112 <da...@gmail.com> on 2014/08/26 06:56:51 UTC, 0 replies.
- My Post Related Query - posted by Sandeep Vaid <ca...@gmail.com> on 2014/08/26 09:15:22 UTC, 0 replies.
- Re: Running Wordcount on large file stucks and throws OOM exception - posted by motte1988 <wi...@studserv.uni-leipzig.de> on 2014/08/26 09:35:44 UTC, 0 replies.
- Spark SQL insertInto - posted by praveshjain1991 <pr...@gmail.com> on 2014/08/26 09:44:31 UTC, 0 replies.
- Key-Value in PairRDD - posted by Deep Pradhan <pr...@gmail.com> on 2014/08/26 11:54:12 UTC, 1 replies.
- spark.default.parallelism bug? - posted by Grzegorz Białek <gr...@codilime.com> on 2014/08/26 13:51:57 UTC, 1 replies.
- CoGroupedDStream similar to CoGroupedRDD - posted by Akhil Das <ak...@sigmoidanalytics.com> on 2014/08/26 14:08:14 UTC, 0 replies.
- Re: Spark Streaming Output to DB - posted by Ravi Sharma <ra...@gmail.com> on 2014/08/26 14:42:15 UTC, 4 replies.
- Prevent too many partitions - posted by Grzegorz Białek <gr...@codilime.com> on 2014/08/26 15:16:07 UTC, 0 replies.
- What is a Block Manager? - posted by Victor Tso-Guillen <vt...@paxata.com> on 2014/08/26 17:42:24 UTC, 4 replies.
- Submit to the "Powered By Spark" Page! - posted by Patrick Wendell <pw...@gmail.com> on 2014/08/26 18:13:43 UTC, 0 replies.
- Spark Streaming - Small file in HDFS - posted by Ravi Sharma <ra...@gmail.com> on 2014/08/26 18:14:26 UTC, 0 replies.
- Kinesis receiver & spark streaming partition - posted by Wei Liu <we...@stellarloyalty.com> on 2014/08/26 20:37:50 UTC, 1 replies.
- OutofMemoryError when generating output - posted by SK <sk...@gmail.com> on 2014/08/26 21:38:00 UTC, 3 replies.
- Spark 1.1. doesn't work with hive context - posted by S Malligarjunan <sm...@yahoo.com.INVALID> on 2014/08/26 21:53:18 UTC, 1 replies.
- Re: Out of memory on large RDDs - posted by Andrew Ash <an...@andrewash.com> on 2014/08/26 23:15:08 UTC, 1 replies.
- Re: Parsing Json object definition spanning multiple lines - posted by Chris Fregly <ch...@fregly.com> on 2014/08/27 01:01:16 UTC, 1 replies.
- Re: Spark-Streaming collect/take functionality. - posted by Chris Fregly <ch...@fregly.com> on 2014/08/27 02:26:42 UTC, 0 replies.
- Specifying classpath - posted by Ashish Jain <as...@gmail.com> on 2014/08/27 02:42:28 UTC, 1 replies.
- Upgrading 1.0.0 to 1.0.2 - posted by Victor Tso-Guillen <vt...@paxata.com> on 2014/08/27 03:10:07 UTC, 5 replies.
- CUDA in spark, especially in MLlib? - posted by Wei Tan <wt...@us.ibm.com> on 2014/08/27 03:57:35 UTC, 8 replies.
- Ask for help, how to integrate Sparkstreaming and IBM MQ - posted by "35597813@qq.com" <35...@qq.com> on 2014/08/27 04:16:15 UTC, 0 replies.
- Execute HiveFormSpark ERROR. - posted by CharlieLin <ch...@gmail.com> on 2014/08/27 05:29:06 UTC, 2 replies.
- Spark on Hadoop with Java 8 - posted by jatinpreet <ja...@gmail.com> on 2014/08/27 08:06:29 UTC, 1 replies.
- Is there a way to insert data into existing parquet file using spark ? - posted by rafeeq s <ra...@gmail.com> on 2014/08/27 09:28:20 UTC, 0 replies.
- Developing a spark streaming application - posted by Filip Andrei <an...@gmail.com> on 2014/08/27 10:28:52 UTC, 0 replies.
- hive on spark &yarn - posted by centerqi hu <ce...@gmail.com> on 2014/08/27 10:44:06 UTC, 1 replies.
- Example File not running - posted by "Hingorani, Vineet" <vi...@sap.com> on 2014/08/27 12:22:00 UTC, 6 replies.
- Replicate RDDs - posted by rapelly kartheek <ka...@gmail.com> on 2014/08/27 12:38:52 UTC, 0 replies.
- External dependencies management with spark - posted by Jaonary Rabarisoa <ja...@gmail.com> on 2014/08/27 13:23:29 UTC, 0 replies.
- NotSerializableException while doing rdd.saveToCassandra - posted by lmk <la...@gmail.com> on 2014/08/27 13:56:24 UTC, 0 replies.
- Example file not running - posted by "Hingorani, Vineet" <vi...@sap.com> on 2014/08/27 14:02:55 UTC, 0 replies.
- How to get prerelease thriftserver working? - posted by Matt Chu <mc...@kabam.com> on 2014/08/27 15:02:33 UTC, 3 replies.
- Execution time increasing with increase of cluster size - posted by Sarath Chandra <sa...@algofusiontech.com> on 2014/08/27 16:59:38 UTC, 1 replies.
- Saddle structure in Spark - posted by LPG <lu...@gmail.com> on 2014/08/27 17:22:02 UTC, 0 replies.
- Reference Accounts & Large Node Deployments - posted by Steve Nunez <sn...@hortonworks.com> on 2014/08/27 18:08:48 UTC, 1 replies.
- Re: Issue Connecting to HBase in spark shell - posted by kpeng1 <kp...@gmail.com> on 2014/08/27 19:58:00 UTC, 0 replies.
- Amplab: big-data-benchmark - posted by Sameer Tilak <ss...@live.com> on 2014/08/27 20:42:28 UTC, 2 replies.
- Spark N.C. - posted by am <am...@gmail.com> on 2014/08/27 21:38:12 UTC, 0 replies.
- MLBase status - posted by Sameer Tilak <ss...@live.com> on 2014/08/27 21:52:02 UTC, 1 replies.
- [Streaming] Cannot get executors to stay alive - posted by Yana Kadiyska <ya...@gmail.com> on 2014/08/27 22:24:37 UTC, 0 replies.
- Historic data and clocks - posted by Frank van Lankvelt <f....@onehippo.com> on 2014/08/27 22:53:53 UTC, 0 replies.
- SchemaRDD - posted by Koert Kuipers <ko...@tresata.com> on 2014/08/27 23:31:15 UTC, 1 replies.
- Spark Streaming: DStream - zipWithIndex - posted by Soumitra Kumar <ku...@gmail.com> on 2014/08/27 23:37:27 UTC, 9 replies.
- minPartitions ignored for bz2? - posted by jerryye <je...@gmail.com> on 2014/08/27 23:49:55 UTC, 1 replies.
- FileNotFoundException (No space left on device) writing to S3 - posted by Daniil Osipov <da...@shazam.com> on 2014/08/28 01:12:56 UTC, 1 replies.
- SparkSQL returns ArrayBuffer for fields of type Array - posted by Du Li <li...@yahoo-inc.com.INVALID> on 2014/08/28 01:27:44 UTC, 3 replies.
- Re: Apache Spark- Cassandra - NotSerializable Exception while saving to cassandra - posted by Yana <ya...@gmail.com> on 2014/08/28 01:43:45 UTC, 1 replies.
- Kafka stream receiver stops input - posted by Tim Smith <se...@gmail.com> on 2014/08/28 02:56:10 UTC, 1 replies.
- Compilaon Error: Spark 1.0.2 with HBase 0.98 - posted by "Arthur.hk.chan@gmail.com" <ar...@gmail.com> on 2014/08/28 03:54:33 UTC, 0 replies.
- Compilation Error: Spark 1.0.2 with HBase 0.98 - posted by "Arthur.hk.chan@gmail.com" <ar...@gmail.com> on 2014/08/28 03:57:19 UTC, 0 replies.
- how to correctly run scala script using spark-shell through stdin (spark v1.0.0) - posted by Henry Hung <YT...@winbond.com> on 2014/08/28 04:01:13 UTC, 2 replies.
- Re: Compilation Error: Spark 1.0.2 with HBase 0.98 - posted by Ted Yu <yu...@gmail.com> on 2014/08/28 04:13:23 UTC, 11 replies.
- Compilation FAILURE : Spark 1.0.2 / Project Hive (0.13.1) - posted by "Arthur.hk.chan@gmail.com" <ar...@gmail.com> on 2014/08/28 05:54:05 UTC, 1 replies.
- Update on Pig on Spark initiative - posted by Mayur Rustagi <ma...@gmail.com> on 2014/08/28 07:03:25 UTC, 2 replies.
- Submitting multiple files pyspark - posted by Chengi Liu <ch...@gmail.com> on 2014/08/28 07:58:52 UTC, 1 replies.
- java.lang.OutOfMemoryError: Requested array size exceeds VM limit - posted by durin <ma...@simon-schaefer.net> on 2014/08/28 08:45:34 UTC, 0 replies.
- Using unshaded akka in Spark driver - posted by Aniket Bhatnagar <an...@gmail.com> on 2014/08/28 08:59:15 UTC, 0 replies.
- Key-Value Operations - posted by Deep Pradhan <pr...@gmail.com> on 2014/08/28 10:34:52 UTC, 1 replies.
- The concurrent model of spark job/stage/task - posted by 李华 <35...@qq.com> on 2014/08/28 10:38:48 UTC, 0 replies.
- sbt package assembly run spark examples - posted by filipus <fl...@gmail.com> on 2014/08/28 11:29:52 UTC, 1 replies.
- Spark SQL : how to find element where a field is in a given set - posted by Jaonary Rabarisoa <ja...@gmail.com> on 2014/08/28 12:09:23 UTC, 7 replies.
- Spark-submit not running - posted by "Hingorani, Vineet" <vi...@sap.com> on 2014/08/28 12:37:09 UTC, 8 replies.
- how to filter value in spark - posted by marylucy <qa...@hotmail.com> on 2014/08/28 13:20:13 UTC, 2 replies.
- how to specify columns in groupby - posted by MEETHU MATHEW <me...@yahoo.co.in> on 2014/08/28 14:04:48 UTC, 2 replies.
- Re: Compilaon Error: Spark 1.0.2 with HBase 0.98 - posted by Sean Owen <so...@cloudera.com> on 2014/08/28 14:22:12 UTC, 0 replies.
- SPARK on YARN, containers fails - posted by Control <fi...@credit-suisse.com> on 2014/08/28 15:22:36 UTC, 0 replies.
- repartitioning an RDD yielding imbalance - posted by Rok Roskar <ro...@gmail.com> on 2014/08/28 16:00:39 UTC, 1 replies.
- Re: Graphx: undirected graph support - posted by FokkoDriesprong <fo...@driesprongen.nl> on 2014/08/28 16:07:12 UTC, 0 replies.
- Change delimiter when collecting SchemaRDD - posted by yadid ayzenberg <ya...@media.mit.edu> on 2014/08/28 16:58:07 UTC, 2 replies.
- Print to spark log - posted by jamborta <ja...@gmail.com> on 2014/08/28 17:06:41 UTC, 2 replies.
- SPARK-1297 patch error (spark-1297-v4.txt ) - posted by "Arthur.hk.chan@gmail.com" <ar...@gmail.com> on 2014/08/28 18:50:56 UTC, 4 replies.
- New SparkR mailing list, JIRA - posted by Shivaram Venkataraman <sh...@eecs.berkeley.edu> on 2014/08/28 19:27:16 UTC, 0 replies.
- Converting a DStream's RDDs to SchemaRDDs - posted by "Verma, Rishi (398J)" <Ri...@jpl.nasa.gov> on 2014/08/28 19:28:54 UTC, 2 replies.
- org.apache.hadoop.io.compress.SnappyCodec not found - posted by "Arthur.hk.chan@gmail.com" <ar...@gmail.com> on 2014/08/28 20:39:38 UTC, 4 replies.
- org.apache.spark.examples.xxx - posted by filipus <fl...@gmail.com> on 2014/08/28 21:09:30 UTC, 6 replies.
- What happens if I have a function like a PairFunction but which might return multiple values - posted by Steve Lewis <lo...@gmail.com> on 2014/08/28 21:17:54 UTC, 1 replies.
- Q on downloading spark for standalone cluster - posted by Sanjeev Sagar <sa...@mypointscorp.com> on 2014/08/28 22:07:37 UTC, 3 replies.
- Where to save intermediate results? - posted by huylv <hu...@insight-centre.org> on 2014/08/28 22:30:07 UTC, 2 replies.
- DStream repartitioning, performance tuning processing - posted by Tim Smith <se...@gmail.com> on 2014/08/29 00:16:31 UTC, 5 replies.
- Failed to run runJob at ReceiverTracker.scala - posted by Tim Smith <se...@gmail.com> on 2014/08/29 00:54:44 UTC, 4 replies.
- transforming a Map object to RDD - posted by SK <sk...@gmail.com> on 2014/08/29 00:56:17 UTC, 1 replies.
- problem connection to hdfs on localhost from spark-shell - posted by Bharath Bhushan <ma...@outlook.com> on 2014/08/29 01:18:29 UTC, 1 replies.
- Memory statistics in the Application detail UI - posted by SK <sk...@gmail.com> on 2014/08/29 03:32:32 UTC, 4 replies.
- The concurrent model of spark job/stage/task - posted by "35597813@qq.com" <35...@qq.com> on 2014/08/29 03:34:43 UTC, 3 replies.
- Problem using accessing HiveContext - posted by "Zitser, Igor " <ig...@citi.com> on 2014/08/29 03:40:50 UTC, 0 replies.
- Odd saveAsSequenceFile bug - posted by Shay Seng <sh...@urbanengines.com> on 2014/08/29 04:06:37 UTC, 0 replies.
- RE: Sorting Reduced/Groupd Values without Explicit Sorting - posted by fluke777 <sv...@gmail.com> on 2014/08/29 05:16:23 UTC, 0 replies.
- Spark / Thrift / ODBC connectivity - posted by Denny Lee <de...@gmail.com> on 2014/08/29 06:42:26 UTC, 1 replies.
- How to debug this error? - posted by Gary Zhao <ga...@gmail.com> on 2014/08/29 06:43:26 UTC, 1 replies.
- Spark Hive max key length is 767 bytes - posted by "Arthur.hk.chan@gmail.com" <ar...@gmail.com> on 2014/08/29 06:47:11 UTC, 5 replies.
- u'' notation with pyspark output data - posted by Oleg Ruchovets <or...@gmail.com> on 2014/08/29 08:22:08 UTC, 1 replies.
- how can I get the number of cores - posted by Kevin Jung <it...@samsung.com> on 2014/08/29 09:39:44 UTC, 1 replies.
- Ensuring object in spark streaming runs on specific node - posted by Filip Andrei <an...@gmail.com> on 2014/08/29 10:26:12 UTC, 0 replies.
- Running Spark On Yarn without Spark-Submit - posted by Archit Thakur <ar...@gmail.com> on 2014/08/29 10:33:36 UTC, 2 replies.
- Spark Streaming reset state - posted by Eko Susilo <ek...@gmail.com> on 2014/08/29 17:30:32 UTC, 3 replies.
- /tmp/spark-events permissions problem - posted by Brad Miller <bm...@eecs.berkeley.edu> on 2014/08/29 17:43:40 UTC, 0 replies.
- Announce: Smoke - a web frontend to Spark - posted by "Horacio G. de Oro" <hg...@gmail.com> on 2014/08/29 19:34:32 UTC, 0 replies.
- Problem Accessing Hive Table from hiveContext - posted by "Zitser, Igor " <ig...@citi.com> on 2014/08/29 19:50:17 UTC, 0 replies.
- Re: Too many open files - posted by SK <sk...@gmail.com> on 2014/08/29 20:06:51 UTC, 1 replies.
- Possible to make one executor be able to work on multiple tasks simultaneously? - posted by Victor Tso-Guillen <vt...@paxata.com> on 2014/08/29 20:23:42 UTC, 3 replies.
- SparkSql is slow over yarn - posted by Chirag Aggarwal <Ch...@guavus.com> on 2014/08/29 20:33:20 UTC, 1 replies.
- Spark Streaming with Kafka, building project with 'sbt assembly' is extremely slow - posted by Aris <ar...@gmail.com> on 2014/08/29 22:30:49 UTC, 0 replies.
- Re: Anyone know hot to submit spark job to yarn in java code? - posted by Archit Thakur <ar...@gmail.com> on 2014/08/29 23:09:08 UTC, 0 replies.
- [PySpark] large # of partitions causes OOM - posted by Nick Chammas <ni...@gmail.com> on 2014/08/30 00:05:32 UTC, 0 replies.
- What is the better data structure in an RDD - posted by cjwang <cj...@cjwang.us> on 2014/08/30 01:00:23 UTC, 1 replies.
- What does "appMasterRpcPort: -1" indicate ? - posted by Tao Xiao <xi...@gmail.com> on 2014/08/30 04:44:31 UTC, 3 replies.
- SparkSQL HiveContext No Suitable Driver / Cannot Find Driver - posted by Denny Lee <de...@gmail.com> on 2014/08/30 07:55:46 UTC, 1 replies.
- spark-ec2 [Errno 110] Connection time out - posted by David Matheson <da...@gmail.com> on 2014/08/30 09:53:40 UTC, 0 replies.
- spark on yarn with hive - posted by centerqi hu <ce...@gmail.com> on 2014/08/30 12:08:33 UTC, 0 replies.
- Mapping Hadoop Reduce to Spark - posted by Steve Lewis <lo...@gmail.com> on 2014/08/30 18:04:10 UTC, 3 replies.
- Spark Master/Slave and HA - posted by "Arthur.hk.chan@gmail.com" <ar...@gmail.com> on 2014/08/31 01:53:49 UTC, 1 replies.
- Spark and Shark Node: RAM Allocation - posted by "Arthur.hk.chan@gmail.com" <ar...@gmail.com> on 2014/08/31 02:06:41 UTC, 0 replies.
- Powered By Spark - posted by Yi Tian <ti...@gmail.com> on 2014/08/31 04:24:53 UTC, 0 replies.
- Re: saveAsSequenceFile for DStream - posted by Chris Fregly <ch...@fregly.com> on 2014/08/31 05:34:20 UTC, 0 replies.
- How can a "deserialized Java object" be stored on disk? - posted by Tao Xiao <xi...@gmail.com> on 2014/08/31 06:02:35 UTC, 1 replies.
- Re: data locality - posted by Chris Fregly <ch...@fregly.com> on 2014/08/31 06:13:19 UTC, 0 replies.
- jdbcRDD from JAVA - posted by Ahmad Osama <ao...@gmail.com> on 2014/08/31 16:07:21 UTC, 1 replies.
- This always tries to connect to HDFS: user$ export MASTER=local[NN]; pyspark --master local[NN] ... - posted by didata <su...@didata.us> on 2014/08/31 23:17:41 UTC, 1 replies.