You are viewing a plain text version of this content. The canonical link for it is here.
- hadoopRDD stalls reading entire directory - posted by Russell Jurney <ru...@gmail.com> on 2014/06/01 02:16:16 UTC, 30 replies.
- can not access app details on ec2 - posted by wxhsdp <wx...@gmail.com> on 2014/06/01 02:19:07 UTC, 0 replies.
- spark 1.0.0 on yarn - posted by "Xu (Simon) Chen" <xc...@gmail.com> on 2014/06/01 02:51:51 UTC, 7 replies.
- Re: possible typos in spark 1.0 documentation - posted by Yadid Ayzenberg <ya...@media.mit.edu> on 2014/06/01 04:39:26 UTC, 0 replies.
- Spark on EC2 - posted by superback <an...@gmail.com> on 2014/06/01 06:21:25 UTC, 3 replies.
- Re: Yay for 1.0.0! EC2 Still has problems. - posted by Jeremy Lee <un...@gmail.com> on 2014/06/01 07:53:18 UTC, 12 replies.
- Re: Using Spark on Data size larger than Memory size - posted by Aaron Davidson <il...@gmail.com> on 2014/06/01 08:10:19 UTC, 9 replies.
- SparkSQL Table schema in Java - posted by Kuldeep Bora <ku...@gmail.com> on 2014/06/01 09:38:45 UTC, 0 replies.
- Using sbt-pack with Spark 1.0.0 - posted by Pierre B <pi...@realimpactanalytics.com> on 2014/06/01 11:34:09 UTC, 3 replies.
- sc.textFileGroupByPath("*/*.txt") - posted by Oleg Proudnikov <ol...@gmail.com> on 2014/06/01 14:37:42 UTC, 6 replies.
- Re: Akka disassociation on Java SE Embedded - posted by Chanwit Kaewkasi <ch...@gmail.com> on 2014/06/01 18:48:09 UTC, 1 replies.
- [Spark Streaming] Distribute custom receivers evenly across excecutors - posted by Guang Gao <bi...@gmail.com> on 2014/06/02 00:06:44 UTC, 1 replies.
- Re: Trouble with EC2 - posted by PJ$ <p...@chickenandwaffl.es> on 2014/06/02 00:11:39 UTC, 3 replies.
- Re: Create/shutdown objects before/after RDD use (or: Non-serializable classes) - posted by Tobias Pfeiffer <tg...@preferred.jp> on 2014/06/02 03:13:22 UTC, 0 replies.
- Please put me into the mail list, thanks. - posted by Yunmeng Ban <ba...@gmail.com> on 2014/06/02 04:11:19 UTC, 0 replies.
- Re: Is uberjar a recommended way of running Spark/Scala applications? - posted by Ngoc Dao <ng...@gmail.com> on 2014/06/02 04:12:31 UTC, 2 replies.
- Can anyone help me set memory for standalone cluster? - posted by Yunmeng Ban <ba...@gmail.com> on 2014/06/02 04:36:46 UTC, 1 replies.
- Re: apache whirr for spark - posted by chirag lakhani <ch...@gmail.com> on 2014/06/02 04:51:15 UTC, 0 replies.
- Is there a step-by-step instruction on how to build Spark App with IntelliJ IDEA? - posted by Wei Da <xw...@gmail.com> on 2014/06/02 06:15:33 UTC, 1 replies.
- Re: Unable to execute saveAsTextFile on multi node mesos - posted by prabeesh k <pr...@gmail.com> on 2014/06/02 07:02:13 UTC, 0 replies.
- Spark Streaming not processing file with particular number of entries - posted by praveshjain1991 <pr...@gmail.com> on 2014/06/02 09:16:19 UTC, 7 replies.
- Re: Using String Dataset for Logistic Regression - posted by praveshjain1991 <pr...@gmail.com> on 2014/06/02 09:19:21 UTC, 3 replies.
- How can I make Spark 1.0 saveAsTextFile to overwrite existing file - posted by Kexin Xie <ke...@bigcommerce.com> on 2014/06/02 10:08:05 UTC, 31 replies.
- pyspark problems on yarn (job not parallelized, and Py4JJavaError) - posted by "Xu (Simon) Chen" <xc...@gmail.com> on 2014/06/02 17:24:00 UTC, 9 replies.
- Re: Failed to remove RDD error - posted by Michael Chang <mi...@tellapart.com> on 2014/06/02 18:42:54 UTC, 4 replies.
- Is Hadoop MR now comparable with Spark? - posted by Ian Ferreira <ia...@hotmail.com> on 2014/06/02 19:33:59 UTC, 0 replies.
- How to create RDDs from another RDD? - posted by Gerard Maas <ge...@gmail.com> on 2014/06/02 22:13:16 UTC, 3 replies.
- EC2 Simple Cluster - posted by Gianluca Privitera <gi...@studio.unibo.it> on 2014/06/02 22:29:33 UTC, 1 replies.
- Interactive modification of DStreams - posted by lbustelo <gi...@bustelos.com> on 2014/06/02 23:46:15 UTC, 3 replies.
- NoSuchElementException: key not found - posted by Michael Chang <mi...@tellapart.com> on 2014/06/03 00:27:49 UTC, 6 replies.
- using Log4j to log INFO level messages on workers - posted by Shivani Rao <ra...@gmail.com> on 2014/06/03 01:18:20 UTC, 2 replies.
- Fwd: SecurityException when running tests with Spark 1.0.0 - posted by Mohit Nayak <wi...@gmail.com> on 2014/06/03 01:21:15 UTC, 5 replies.
- Processing audio/video/images - posted by jamal sasha <ja...@gmail.com> on 2014/06/03 02:02:05 UTC, 7 replies.
- Window slide duration - posted by Vadim Chekan <ko...@gmail.com> on 2014/06/03 02:22:37 UTC, 6 replies.
- how to construct a ClassTag object as a method parameter in Java - posted by bluejoe2008 <bl...@gmail.com> on 2014/06/03 02:59:21 UTC, 4 replies.
- A single build.sbt file to start Spark REPL? - posted by Alexy Khrabrov <al...@scalable.pro> on 2014/06/03 04:56:17 UTC, 1 replies.
- WebUI's Application count doesn't get updated - posted by "MrAsanjar ." <af...@gmail.com> on 2014/06/03 09:30:46 UTC, 5 replies.
- Need equallyWeightedPartitioner Algorithm - posted by Joe L <se...@yahoo.com> on 2014/06/03 09:33:04 UTC, 0 replies.
- Upgradation to Spark 1.0.0 - posted by MEETHU MATHEW <me...@yahoo.co.in> on 2014/06/03 09:38:34 UTC, 1 replies.
- Re: Having spark-ec2 join new slaves to existing cluster - posted by sirisha_devineni <si...@persistent.co.in> on 2014/06/03 12:52:09 UTC, 1 replies.
- Reg: Add/Remove slave nodes spark-ec2 - posted by Sirisha Devineni <Si...@persistent.co.in> on 2014/06/03 13:00:40 UTC, 1 replies.
- Spark block manager registration extreme slow - posted by Denes <te...@outlook.com> on 2014/06/03 13:55:35 UTC, 0 replies.
- Kyro deserialisation error - posted by Denes <te...@outlook.com> on 2014/06/03 14:02:13 UTC, 0 replies.
- Error related to serialisation in spark streaming - posted by nilmish <ni...@gmail.com> on 2014/06/03 15:23:39 UTC, 9 replies.
- Reconnect to an application/RDD - posted by Oleg Proudnikov <ol...@gmail.com> on 2014/06/03 15:45:24 UTC, 2 replies.
- Prepare spark executor - posted by yaoxin <ya...@gmail.com> on 2014/06/03 15:52:16 UTC, 0 replies.
- Spark not working with mesos - posted by praveshjain1991 <pr...@gmail.com> on 2014/06/03 16:43:00 UTC, 8 replies.
- spark 1.0 not using properties file from SPARK_CONF_DIR - posted by Eugen Cepoi <ce...@gmail.com> on 2014/06/03 16:47:33 UTC, 2 replies.
- ---cores option in spark-shell - posted by Marek Wiewiorka <ma...@gmail.com> on 2014/06/03 17:15:01 UTC, 3 replies.
- Re: Using MLLib in Scala - posted by Xiangrui Meng <me...@gmail.com> on 2014/06/03 18:07:22 UTC, 0 replies.
- Spark 1.0.0 fails if mesos.coarse set to true - posted by Marek Wiewiorka <ma...@gmail.com> on 2014/06/03 18:17:05 UTC, 5 replies.
- wholeTextFiles() : java.lang.IncompatibleClassChangeError: Found class org.apache.hadoop.mapreduce.TaskAttemptContext, but interface was expected - posted by toivoa <to...@gmail.com> on 2014/06/03 18:23:54 UTC, 0 replies.
- Re: wholeTextFiles() : java.lang.IncompatibleClassChangeError: Found class org.apache.hadoop.mapreduce.TaskAttemptContext, but interface was expected - posted by Sean Owen <so...@cloudera.com> on 2014/06/03 18:26:35 UTC, 4 replies.
- mounting SSD devices of EC2 r3.8xlarge instances - posted by Andras Barjak <an...@lynxanalytics.com> on 2014/06/03 19:05:07 UTC, 2 replies.
- Problems with connecting Spark to Hive - posted by Lars Selsaas <la...@thinkbiganalytics.com> on 2014/06/03 19:44:02 UTC, 2 replies.
- Strange problem with saveAsTextFile after upgrade Spark 0.9.0->1.0.0 - posted by Marek Wiewiorka <ma...@gmail.com> on 2014/06/03 20:46:29 UTC, 13 replies.
- SchemaRDD's saveAsParquetFile() throws java.lang.IncompatibleClassChangeError - posted by "k.tham" <ke...@gmail.com> on 2014/06/03 21:25:01 UTC, 5 replies.
- RDD with a Map - posted by Amit Kumar <ku...@gmail.com> on 2014/06/03 23:56:29 UTC, 6 replies.
- Invalid Class Exception - posted by Suman Somasundar <su...@oracle.com> on 2014/06/04 02:18:46 UTC, 4 replies.
- Better line number hints for logging? - posted by John Salvatier <js...@gmail.com> on 2014/06/04 02:22:20 UTC, 5 replies.
- spark is dead and pid file exists - posted by Sophia <sl...@163.com> on 2014/06/04 03:26:21 UTC, 1 replies.
- Re: access hdfs file name in map() - posted by "Xu (Simon) Chen" <xc...@gmail.com> on 2014/06/04 06:22:20 UTC, 1 replies.
- How to stop a running SparkContext in the proper way? - posted by MEETHU MATHEW <me...@yahoo.co.in> on 2014/06/04 06:54:57 UTC, 2 replies.
- KMeans.train() throws NotSerializableException - posted by bluejoe2008 <bl...@gmail.com> on 2014/06/04 07:11:13 UTC, 1 replies.
- ZeroMQ Stream -> stack guard problem and no data - posted by Tobias Pfeiffer <tg...@preferred.jp> on 2014/06/04 08:10:11 UTC, 3 replies.
- SocketException when reading from S3 (s3n format) - posted by yuzeh <de...@gmail.com> on 2014/06/04 10:02:53 UTC, 3 replies.
- IllegalArgumentException on calling KMeans.train() - posted by bluejoe2008 <bl...@gmail.com> on 2014/06/04 10:43:20 UTC, 2 replies.
- Problem understanding log message in SparkStreaming - posted by nilmish <ni...@gmail.com> on 2014/06/04 10:45:18 UTC, 0 replies.
- How to change default storage levels - posted by Salih Kardan <ka...@gmail.com> on 2014/06/04 10:52:45 UTC, 1 replies.
- executor idle during task schedule - posted by wxhsdp <wx...@gmail.com> on 2014/06/04 11:08:30 UTC, 0 replies.
- compile spark 1.0.0 error - posted by ch huang <ju...@gmail.com> on 2014/06/04 11:13:55 UTC, 2 replies.
- Can this be done in map-reduce technique (in parallel) - posted by lmk <la...@gmail.com> on 2014/06/04 12:49:12 UTC, 9 replies.
- Facing MetricsSystem error on Running Spark applications - posted by Vibhor Banga <vi...@gmail.com> on 2014/06/04 14:31:43 UTC, 1 replies.
- Join : Giving incorrect result - posted by Ajay Srivastava <a_...@yahoo.com> on 2014/06/04 14:32:32 UTC, 7 replies.
- Can't seem to link "external/twitter" classes from my own app - posted by Jeremy Lee <un...@gmail.com> on 2014/06/04 14:49:18 UTC, 11 replies.
- is there any easier way to define a custom RDD in Java - posted by bluejoe2008 <bl...@gmail.com> on 2014/06/04 15:30:48 UTC, 3 replies.
- Re: spark on yarn fail with IOException - posted by sam <sa...@gmail.com> on 2014/06/04 15:37:16 UTC, 0 replies.
- Spark Usecase - posted by Shahab Yunus <sh...@gmail.com> on 2014/06/04 15:57:02 UTC, 1 replies.
- error with cdh 5 spark installation - posted by chirag lakhani <ch...@gmail.com> on 2014/06/04 16:19:33 UTC, 2 replies.
- Java IO Stream Corrupted - Invalid Type AC? - posted by Matt Kielo <mk...@oculusinfo.com> on 2014/06/04 16:33:58 UTC, 5 replies.
- Trouble launching EC2 Cluster with Spark - posted by Sam Taylor Steyer <ss...@stanford.edu> on 2014/06/04 16:45:14 UTC, 7 replies.
- pyspark join crash - posted by Brad Miller <bm...@eecs.berkeley.edu> on 2014/06/04 17:23:12 UTC, 3 replies.
- Re: Using mongo with PySpark - posted by Samarth Mailinglist <ma...@gmail.com> on 2014/06/04 18:57:06 UTC, 1 replies.
- Re: Spark streaming on load run - How to increase single node capacity? - posted by Wayne Adams <wm...@comcast.net> on 2014/06/04 19:20:47 UTC, 1 replies.
- RDD[(K,V)] for a Map File on HDFS - posted by Amit Kumar <ku...@gmail.com> on 2014/06/04 21:15:00 UTC, 0 replies.
- Re: custom receiver in java - posted by lbustelo <gi...@bustelos.com> on 2014/06/04 21:15:02 UTC, 1 replies.
- reuse hadoop code in Spark - posted by Wei Tan <wt...@us.ibm.com> on 2014/06/04 22:08:19 UTC, 3 replies.
- Re: SQLContext and HiveContext Query Performance - posted by ssb61 <sa...@gmail.com> on 2014/06/04 23:16:04 UTC, 4 replies.
- Re: How can I dispose an Accumulator? - posted by Daniel Siegmann <da...@velos.io> on 2014/06/04 23:22:55 UTC, 1 replies.
- Cassandra examples don't work for me - posted by Tim Kellogg <ti...@2lemetry.com> on 2014/06/05 00:01:42 UTC, 1 replies.
- Re: Running a spark-submit compatible app in spark-shell - posted by Roger Hoover <ro...@gmail.com> on 2014/06/05 00:03:21 UTC, 0 replies.
- Re: error loading large files in PySpark 0.9.0 - posted by Jeremy Freeman <fr...@gmail.com> on 2014/06/05 00:28:10 UTC, 3 replies.
- Spark assembly error. - posted by Sung Hwan Chung <co...@cs.stanford.edu> on 2014/06/05 01:16:49 UTC, 1 replies.
- Re: Why Scala? - posted by John Omernik <jo...@omernik.com> on 2014/06/05 02:02:32 UTC, 4 replies.
- Logistic Regression MLLib Slow - posted by Srikrishna S <sr...@gmail.com> on 2014/06/05 02:47:24 UTC, 9 replies.
- Using log4j.xml - posted by Michael Chang <mi...@tellapart.com> on 2014/06/05 03:09:47 UTC, 0 replies.
- mismatched hdfs protocol - posted by bluejoe2008 <bl...@gmail.com> on 2014/06/05 04:25:45 UTC, 5 replies.
- spark1.0 spark sql saveAsParquetFile Error - posted by victor sheng <vi...@gmail.com> on 2014/06/05 04:31:19 UTC, 2 replies.
- Re: job offering - posted by hassan <He...@gmail.com> on 2014/06/05 06:35:14 UTC, 0 replies.
- Re: Unable to run a Standalone job - posted by Shrikar archak <sh...@gmail.com> on 2014/06/05 07:40:15 UTC, 2 replies.
- Re: ClassCastException when using saveAsTextFile - posted by Kanwaldeep <ka...@gmail.com> on 2014/06/05 07:53:55 UTC, 1 replies.
- Native library can not be loaded when using Mllib PCA - posted by yangliuyu <ya...@163.com> on 2014/06/05 11:36:08 UTC, 4 replies.
- How to shut down Spark Streaming with Kafka properly? - posted by Tobias Pfeiffer <tg...@preferred.jp> on 2014/06/05 11:40:32 UTC, 4 replies.
- Spark Kafka streaming - ClassNotFoundException: org.apache.spark.streaming.kafka.KafkaReceiver - posted by Gaurav Dasgupta <ga...@gmail.com> on 2014/06/05 11:45:29 UTC, 2 replies.
- Serialization problem in Spark - posted by Vibhor Banga <vi...@gmail.com> on 2014/06/05 12:11:09 UTC, 6 replies.
- Problem with serialization and deserialization - posted by "ANEESH .V.V" <an...@gmail.com> on 2014/06/05 13:47:35 UTC, 1 replies.
- spark worker and yarn memory - posted by "Xu (Simon) Chen" <xc...@gmail.com> on 2014/06/05 15:44:47 UTC, 2 replies.
- Loading Python libraries into Spark - posted by mrm <ma...@skimlinks.com> on 2014/06/05 16:29:35 UTC, 3 replies.
- compress in-memory cache? - posted by "Xu (Simon) Chen" <xc...@gmail.com> on 2014/06/05 16:41:34 UTC, 2 replies.
- Scala By the Bay Developer Conference and Training Registration - posted by Alexy Khrabrov <al...@scalable.pro> on 2014/06/05 16:55:30 UTC, 0 replies.
- Re: Unable to run a Standalone job([NOT FOUND ] org.eclipse.jetty.orbit#javax.mail.glassfish;1.4.1.v201005082020) - posted by Shrikar archak <sh...@gmail.com> on 2014/06/05 17:05:50 UTC, 1 replies.
- implicit ALS dataSet - posted by redocpot <ju...@gmail.com> on 2014/06/05 17:46:47 UTC, 8 replies.
- Seattle Spark Meetup: Machine Learning Streams with Spark 1.0 - posted by Denny Lee <de...@gmail.com> on 2014/06/05 21:05:46 UTC, 0 replies.
- creating new ami image for spark ec2 commands - posted by Matt Work Coarr <ma...@gmail.com> on 2014/06/05 22:44:23 UTC, 7 replies.
- Examples - posted by Tim Kellogg <ti...@2lemetry.com> on 2014/06/05 23:07:13 UTC, 0 replies.
- Setting executor memory when using spark-shell - posted by Oleg Proudnikov <ol...@gmail.com> on 2014/06/05 23:15:05 UTC, 8 replies.
- Spark Streaming, download a s3 file to run a script shell on it - posted by Gianluca Privitera <gi...@studio.unibo.it> on 2014/06/05 23:30:37 UTC, 4 replies.
- When does Spark switch from PROCESS_LOCAL to NODE_LOCAL or RACK_LOCAL? - posted by Sung Hwan Chung <co...@cs.stanford.edu> on 2014/06/06 00:13:38 UTC, 5 replies.
- Twitter feed options? - posted by Jeremy Lee <un...@gmail.com> on 2014/06/06 05:07:44 UTC, 1 replies.
- KyroException: Unable to find class - posted by Justin Yip <yi...@gmail.com> on 2014/06/06 06:21:53 UTC, 0 replies.
- Spark Streaming NeteorkReceiver problems - posted by zzzzzqf12345 <zz...@gmail.com> on 2014/06/06 07:21:53 UTC, 0 replies.
- Identify #iterations KMeans executing - posted by Stuti Awasthi <st...@hcl.com> on 2014/06/06 11:08:16 UTC, 1 replies.
- RE: range partitioner with updateStateByKey - posted by RodrigoB <ro...@aspect.com> on 2014/06/06 12:36:45 UTC, 0 replies.
- ERROR Worker: All masters are unresponsive! Giving up. - posted by Ivy <yt...@gmail.com> on 2014/06/06 13:38:00 UTC, 0 replies.
- Is Spark-1.0.0 not backward compatible with Shark-0.9.1 ? - posted by bijoy deb <bi...@gmail.com> on 2014/06/06 15:02:16 UTC, 2 replies.
- Using Java functions in Spark - posted by Oleg Proudnikov <ol...@gmail.com> on 2014/06/06 17:24:01 UTC, 2 replies.
- Bayes Net with Graphx? - posted by Greg <gr...@zooniverse.org> on 2014/06/06 17:41:33 UTC, 1 replies.
- Spark 1.0 & embedded Hive libraries - posted by Silvio Fiorito <si...@granturing.com> on 2014/06/06 21:08:28 UTC, 2 replies.
- Spark Streaming window functions bug 1.0.0 - posted by Gianluca Privitera <gi...@studio.unibo.it> on 2014/06/06 23:44:29 UTC, 0 replies.
- Showing key cluster stats in the Web UI - posted by Nick Chammas <ni...@gmail.com> on 2014/06/07 03:09:50 UTC, 0 replies.
- best practice: write and debug Spark application in scala-ide and maven - posted by Wei Tan <wt...@us.ibm.com> on 2014/06/07 03:10:26 UTC, 4 replies.
- stage kill link is awfully close to the stage name - posted by Nick Chammas <ni...@gmail.com> on 2014/06/07 03:12:31 UTC, 2 replies.
- unit test - posted by b0c1 <bo...@gmail.com> on 2014/06/07 03:32:02 UTC, 0 replies.
- New user streaming question - posted by Michael Campbell <mi...@gmail.com> on 2014/06/07 03:50:48 UTC, 3 replies.
- cache spark sql parquet file in memory? - posted by "Xu (Simon) Chen" <xc...@gmail.com> on 2014/06/07 05:13:54 UTC, 3 replies.
- Best practise for 'Streaming' dumps? - posted by Jeremy Lee <un...@gmail.com> on 2014/06/07 06:49:07 UTC, 4 replies.
- Scheduling code for Spark - posted by rapelly kartheek <ka...@gmail.com> on 2014/06/07 10:24:23 UTC, 1 replies.
- Spark with Spark Streaming - posted by b0c1 <bo...@gmail.com> on 2014/06/07 11:18:16 UTC, 0 replies.
- Gradient Descent with MLBase - posted by Aslan Bekirov <as...@gmail.com> on 2014/06/07 15:24:26 UTC, 2 replies.
- ec2 deployment regions supported - posted by Joe Mathai <jo...@gmail.com> on 2014/06/07 15:28:12 UTC, 0 replies.
- How to process multiple classification with SVM in MLlib - posted by littlebird <cx...@163.com> on 2014/06/07 16:15:21 UTC, 6 replies.
- serialization a model - posted by filipus <fl...@gmail.com> on 2014/06/07 17:46:14 UTC, 0 replies.
- [graphx] PageRank with Edge weights - posted by Lee Becker <le...@hapara.com> on 2014/06/07 18:14:08 UTC, 0 replies.
- Dumping Metics on HDFS - posted by Rahul Singhal <Ra...@guavus.com> on 2014/06/08 01:08:09 UTC, 0 replies.
- Spark Worker Core Allocation - posted by Subacini B <su...@gmail.com> on 2014/06/08 07:54:48 UTC, 3 replies.
- How to get the help or explanation for the functions in Spark shell? - posted by Carter <gy...@hotmail.com> on 2014/06/08 13:00:12 UTC, 3 replies.
- How to compile a Spark project in Scala IDE for Eclipse? - posted by Carter <gy...@hotmail.com> on 2014/06/08 17:06:01 UTC, 4 replies.
- Are "scala.MatchError" messages a problem? - posted by Jeremy Lee <un...@gmail.com> on 2014/06/08 18:44:52 UTC, 5 replies.
- Spark Streaming union expected behaviour? - posted by Shrikar archak <sh...@gmail.com> on 2014/06/08 20:00:59 UTC, 0 replies.
- Classpath errors with Breeze - posted by dlaw <di...@gmail.com> on 2014/06/09 06:52:36 UTC, 5 replies.
- How to achieve a reasonable performance on Spark Streaming - posted by onpoq <on...@gmail.com> on 2014/06/09 07:02:43 UTC, 0 replies.
- Re: Comprehensive Port Configuration reference? - posted by Andrew Ash <an...@andrewash.com> on 2014/06/09 08:26:25 UTC, 0 replies.
- how to improve sharkserver2's parallelism performance? - posted by qingyang li <li...@gmail.com> on 2014/06/09 09:24:06 UTC, 0 replies.
- mllib, python and SVD - posted by Håvard Wahl Kongsgård <ha...@gmail.com> on 2014/06/09 11:32:54 UTC, 1 replies.
- Spark-Streaming window processing - posted by Yingjun Wu <wu...@gmail.com> on 2014/06/09 11:39:52 UTC, 3 replies.
- ArrayIndexOutOfBoundsException when reading bzip2 files - posted by MEETHU MATHEW <me...@yahoo.co.in> on 2014/06/09 13:54:08 UTC, 5 replies.
- Spark 1.0.0 Maven dependencies problems. - posted by toivoa <to...@gmail.com> on 2014/06/09 15:34:50 UTC, 6 replies.
- Re: How to enable fault-tolerance? - posted by Peng Cheng <pc...@uow.edu.au> on 2014/06/09 17:59:49 UTC, 5 replies.
- Re: implementing the VectorAccumulatorParam - posted by Sean Owen <so...@cloudera.com> on 2014/06/09 18:20:49 UTC, 1 replies.
- How to achieve reasonable performance on Spark Streaming? - posted by onpoq l <on...@gmail.com> on 2014/06/09 18:48:33 UTC, 3 replies.
- Re: Occasional failed tasks - posted by Peng Cheng <pc...@uow.edu.au> on 2014/06/09 19:27:14 UTC, 0 replies.
- Re: Spark SQL JDBC Connectivity and more - posted by Venkat Subramanian <vs...@gmail.com> on 2014/06/09 20:25:01 UTC, 1 replies.
- Spark 0.9.1 - saveAsTextFile() exception: _temporary doesn't exist! - posted by Oleg Proudnikov <ol...@gmail.com> on 2014/06/09 22:19:45 UTC, 0 replies.
- Spark Streaming application not working on EC2 Cluster - posted by Gianluca Privitera <gi...@studio.unibo.it> on 2014/06/09 23:59:15 UTC, 0 replies.
- Optimizing reduce for 'huge' aggregated outputs. - posted by Sung Hwan Chung <co...@cs.stanford.edu> on 2014/06/10 00:19:52 UTC, 3 replies.
- Spilled shuffle files not being cleared - posted by Michael Chang <mi...@tellapart.com> on 2014/06/10 00:22:31 UTC, 3 replies.
- Setting spark memory limit - posted by Henggang Cui <cu...@gmail.com> on 2014/06/10 00:24:27 UTC, 1 replies.
- Is spark context in local mode thread-safe? - posted by DB Tsai <db...@stanford.edu> on 2014/06/10 00:50:46 UTC, 6 replies.
- Errors when building Spark with sbt - posted by SK <sk...@gmail.com> on 2014/06/10 00:58:07 UTC, 1 replies.
- SaveAsTextfile per day instead of window? - posted by Shrikar archak <sh...@gmail.com> on 2014/06/10 02:05:24 UTC, 0 replies.
- SPARK on HPC Podcast - posted by Brock Palen <br...@umich.edu> on 2014/06/10 02:46:48 UTC, 0 replies.
- Merging all Spark Streaming RDDs to one RDD - posted by Henggang Cui <cu...@gmail.com> on 2014/06/10 03:00:00 UTC, 1 replies.
- FileNotFoundException when using persist(DISK_ONLY) - posted by Surendranauth Hiraman <su...@velos.io> on 2014/06/10 04:05:59 UTC, 3 replies.
- performance difference between spark-shell and spark-submit - posted by "Xu (Simon) Chen" <xc...@gmail.com> on 2014/06/10 05:19:21 UTC, 0 replies.
- Spark SQL standalone application compile error - posted by shlee0605 <sh...@gmail.com> on 2014/06/10 06:25:52 UTC, 1 replies.
- Writing data to HBase using Spark - posted by Vibhor Banga <vi...@gmail.com> on 2014/06/10 08:33:44 UTC, 6 replies.
- NullPointerException on reading checkpoint files - posted by Kanwaldeep <ka...@gmail.com> on 2014/06/10 08:58:46 UTC, 2 replies.
- Shark over Spark-Streaming - posted by praveshjain1991 <pr...@gmail.com> on 2014/06/10 09:15:46 UTC, 0 replies.
- Problem in Spark Streaming - posted by nilmish <ni...@gmail.com> on 2014/06/10 11:56:57 UTC, 9 replies.
- pmml with augustus - posted by filipus <fl...@gmail.com> on 2014/06/10 13:52:32 UTC, 8 replies.
- abnormal latency when running Spark Streaming - posted by Yingjun Wu <wu...@gmail.com> on 2014/06/10 14:01:16 UTC, 1 replies.
- Performance of Akka or TCP Socket input sources vs HDFS: Data locality in Spark Streaming - posted by Nilesh Chakraborty <ni...@nileshc.com> on 2014/06/10 14:05:52 UTC, 2 replies.
- Calling JavaPairRDD.first after calling JavaPairRDD.groupByKey results in NullPointerException - posted by Gaurav Jain <ja...@student.ethz.ch> on 2014/06/10 14:06:40 UTC, 0 replies.
- Can't find pyspark when using PySpark on YARN - posted by 李奇平 <qi...@alibaba-inc.com> on 2014/06/10 15:35:17 UTC, 1 replies.
- Spark Streaming socketTextStream - posted by fredwolfinger <fw...@cyberpointllc.com> on 2014/06/10 15:41:37 UTC, 2 replies.
- HDFS Server/Client IPC version mismatch while trying to access HDFS files using Spark-0.9.1 - posted by bijoy deb <bi...@gmail.com> on 2014/06/10 20:16:41 UTC, 3 replies.
- Hanging Spark jobs - posted by "Hurwitz, Daniel" <dh...@ebay.com> on 2014/06/10 20:22:25 UTC, 2 replies.
- Spark Logging - posted by Robert James <sr...@gmail.com> on 2014/06/10 20:39:21 UTC, 3 replies.
- getting started with mllib.recommendation.ALS - posted by Sandeep Parikh <sa...@clusterbeep.org> on 2014/06/10 22:59:52 UTC, 2 replies.
- Re: NoSuchMethodError in KafkaReciever - posted by mpieck <mp...@gazeta.pl> on 2014/06/10 23:15:13 UTC, 3 replies.
- How to specify executor memory in EC2 ? - posted by Aliaksei Litouka <al...@gmail.com> on 2014/06/10 23:38:10 UTC, 6 replies.
- Information on Spark UI - posted by Shuo Xiang <sh...@gmail.com> on 2014/06/11 02:23:56 UTC, 6 replies.
- spark streaming, kafka, SPARK_CLASSPATH - posted by lannyripple <la...@gmail.com> on 2014/06/11 02:48:58 UTC, 8 replies.
- groupBy question - posted by SK <sk...@gmail.com> on 2014/06/11 03:10:31 UTC, 2 replies.
- Monitoring spark dis-associated workers - posted by Allen Chang <al...@yahoo.com> on 2014/06/11 03:15:59 UTC, 0 replies.
- problem starting the history server on EC2 - posted by zhen <z....@latrobe.edu.au> on 2014/06/11 03:29:28 UTC, 7 replies.
- output tuples in CSV format - posted by SK <sk...@gmail.com> on 2014/06/11 03:34:08 UTC, 2 replies.
- Question about RDD cache, unpersist, materialization - posted by innowireless TaeYun Kim <ta...@innowireless.co.kr> on 2014/06/11 04:40:23 UTC, 7 replies.
- Re: little confused about SPARK_JAVA_OPTS alternatives - posted by elyast <lu...@gmail.com> on 2014/06/11 08:40:25 UTC, 6 replies.
- Number of Spark streams in Yarn cluster - posted by tnegi <tn...@gmail.com> on 2014/06/11 09:59:05 UTC, 0 replies.
- Spark SQL incorrect result on GROUP BY query - posted by Pei-Lun Lee <pl...@appier.com> on 2014/06/11 12:01:07 UTC, 4 replies.
- Error During ReceivingConnection - posted by Surendranauth Hiraman <su...@velos.io> on 2014/06/11 14:38:47 UTC, 1 replies.
- Normalizations in MLBase - posted by Aslan Bekirov <as...@gmail.com> on 2014/06/11 15:25:30 UTC, 5 replies.
- MLLib : Decision Tree not getting built for 5 or more levels(maxDepth=5) and the one built for 3 levels is performing poorly - posted by SURAJ SHETH <sh...@gmail.com> on 2014/06/11 17:44:50 UTC, 5 replies.
- Adding external jar to spark-shell classpath in spark 1.0 - posted by "Ulanov, Alexander" <al...@hp.com> on 2014/06/11 19:25:11 UTC, 7 replies.
- Having trouble with streaming (updateStateByKey) - posted by Michael Campbell <mi...@gmail.com> on 2014/06/11 19:47:20 UTC, 1 replies.
- Powered by Spark addition - posted by Derek Mansen <de...@vistarmedia.com> on 2014/06/11 22:28:34 UTC, 6 replies.
- Kafka client - specify offsets? - posted by Michael Campbell <mi...@gmail.com> on 2014/06/11 22:53:07 UTC, 2 replies.
- Compression with DISK_ONLY persistence - posted by Surendranauth Hiraman <su...@velos.io> on 2014/06/11 23:56:23 UTC, 1 replies.
- When to use CombineByKey vs reduceByKey? - posted by Diana Hu <si...@gmail.com> on 2014/06/12 00:21:06 UTC, 2 replies.
- Not fully cached when there is enough memory - posted by Shuo Xiang <sh...@gmail.com> on 2014/06/12 00:24:01 UTC, 3 replies.
- json parsing with json4s - posted by SK <sk...@gmail.com> on 2014/06/12 00:26:41 UTC, 2 replies.
- Using Spark to crack passwords - posted by Nick Chammas <ni...@gmail.com> on 2014/06/12 02:24:40 UTC, 8 replies.
- Hive classes for Catalyst - posted by Stephen Boesch <ja...@gmail.com> on 2014/06/12 02:37:07 UTC, 3 replies.
- History Server renered page not suitable for load balancing - posted by elyast <lu...@gmail.com> on 2014/06/12 04:53:56 UTC, 1 replies.
- shuffling using netty in spark streaming - posted by onpoq l <on...@gmail.com> on 2014/06/12 08:35:10 UTC, 1 replies.
- Re: running Spark Streaming just once and stop it - posted by Ravi Hemnani <ra...@gmail.com> on 2014/06/12 10:06:34 UTC, 1 replies.
- initial basic question from new user - posted by Toby Douglass <to...@avocet.io> on 2014/06/12 11:24:46 UTC, 9 replies.
- How to read a snappy-compressed text file? - posted by innowireless TaeYun Kim <ta...@innowireless.co.kr> on 2014/06/12 12:20:29 UTC, 0 replies.
- Re: how to set spark.executor.memory and heap size - posted by Laurent T <la...@ldmobile.net> on 2014/06/12 12:29:20 UTC, 1 replies.
- How to use SequenceFileRDDFunctions.saveAsSequenceFile() in Java? - posted by innowireless TaeYun Kim <ta...@innowireless.co.kr> on 2014/06/12 14:16:51 UTC, 0 replies.
- Spark 1.0.0 Standalone AppClient cannot connect Master - posted by Hao Wang <wh...@gmail.com> on 2014/06/12 15:24:17 UTC, 2 replies.
- HELP!? Re: Streaming trouble (reduceByKey causes all printing to stop) - posted by Michael Campbell <mi...@gmail.com> on 2014/06/12 16:18:25 UTC, 1 replies.
- Access DStream content? - posted by "Wolfinger, Fred" <fw...@cyberpointllc.com> on 2014/06/12 17:06:33 UTC, 1 replies.
- wholeTextFiles not working with HDFS - posted by Sguj <tp...@yahoo.com> on 2014/06/12 18:05:48 UTC, 6 replies.
- Re: use spark-shell in the source - posted by Andrew Or <an...@databricks.com> on 2014/06/12 18:47:18 UTC, 1 replies.
- Announcing MesosCon schedule, incl Spark presentation - posted by Dave Lester <da...@gmail.com> on 2014/06/12 19:29:22 UTC, 0 replies.
- overwriting output directory - posted by SK <sk...@gmail.com> on 2014/06/12 19:57:32 UTC, 1 replies.
- spark EC2 bring-up problems - posted by Toby Douglass <to...@avocet.io> on 2014/06/12 21:38:19 UTC, 4 replies.
- Increase storage.MemoryStore size - posted by ericjohnston1989 <er...@gmail.com> on 2014/06/12 22:16:06 UTC, 1 replies.
- An attempt to implement dbscan algorithm on top of Spark - posted by Aliaksei Litouka <al...@gmail.com> on 2014/06/12 23:31:15 UTC, 2 replies.
- NullPointerExceptions when using val or broadcast on a standalone cluster. - posted by bdamos <am...@adobe.com> on 2014/06/13 00:32:50 UTC, 2 replies.
- Java Custom Receiver onStart method never called - posted by jsabin <je...@gmail.com> on 2014/06/13 00:41:47 UTC, 1 replies.
- specifying fields for join() - posted by SK <sk...@gmail.com> on 2014/06/13 01:25:09 UTC, 3 replies.
- Re: Doubts about MLlib.linalg in python - posted by ericjohnston1989 <er...@gmail.com> on 2014/06/13 01:47:05 UTC, 0 replies.
- Spark SQL - input command in web ui/event log - posted by shlee0605 <sh...@gmail.com> on 2014/06/13 02:49:33 UTC, 1 replies.
- Fwd: ApacheCon CFP closes June 25 - posted by Matei Zaharia <ma...@gmail.com> on 2014/06/13 07:10:50 UTC, 0 replies.
- multiple passes in mapPartitions - posted by zhen <z....@latrobe.edu.au> on 2014/06/13 07:30:14 UTC, 2 replies.
- Command exited with code 137 - posted by libl <27...@qq.com> on 2014/06/13 08:45:01 UTC, 1 replies.
- openstack swift integration with Spark - posted by Reynold Xin <rx...@databricks.com> on 2014/06/13 09:15:53 UTC, 0 replies.
- Spark 1.0.0 on yarn cluster problem - posted by Sophia <sl...@163.com> on 2014/06/13 09:46:18 UTC, 1 replies.
- Convert text into tfidf vectors for Classification - posted by Stuti Awasthi <st...@hcl.com> on 2014/06/13 10:21:36 UTC, 1 replies.
- list of persisted rdds - posted by mrm <ma...@skimlinks.com> on 2014/06/13 13:02:10 UTC, 7 replies.
- Master not seeing recovered nodes("Got heartbeat from unregistered worker ....") - posted by Yana Kadiyska <ya...@gmail.com> on 2014/06/13 14:57:47 UTC, 3 replies.
- BUG? Why does MASTER have to be set to spark://hostname:port? - posted by Hao Wang <wh...@gmail.com> on 2014/06/13 16:27:26 UTC, 0 replies.
- Re: Transform pair to a new pair - posted by lalit1303 <la...@sigmoidanalytics.com> on 2014/06/13 16:44:23 UTC, 0 replies.
- process local vs node local subtlety question/issue - posted by Albert Chu <ch...@llnl.gov> on 2014/06/13 19:55:03 UTC, 1 replies.
- Re: MLlib-Missing Regularization Parameter and Intercept for Logistic Regression - posted by Congrui Yi <fi...@us.bosch.com> on 2014/06/13 20:32:32 UTC, 7 replies.
- MLlib-a problem of example code for L-BFGS - posted by Congrui Yi <fi...@us.bosch.com> on 2014/06/13 20:50:31 UTC, 4 replies.
- spark-submit fails to get jar from http source - posted by lbustelo <gi...@bustelos.com> on 2014/06/13 21:41:35 UTC, 0 replies.
- Odd pyspark state - posted by Jim Blomo <ji...@gmail.com> on 2014/06/13 22:11:09 UTC, 0 replies.
- How Spark Choose Worker Nodes for respective HDFS block - posted by "anishsneh@yahoo.co.in" <an...@yahoo.co.in> on 2014/06/13 23:17:50 UTC, 1 replies.
- guidance on simple unit testing with Spark - posted by SK <sk...@gmail.com> on 2014/06/13 23:42:57 UTC, 2 replies.
- spark.eventLog.enabled not working on spark on AWS EC2 - posted by zhen <z....@latrobe.edu.au> on 2014/06/14 00:47:27 UTC, 0 replies.
- convert List to RDD - posted by SK <sk...@gmail.com> on 2014/06/14 03:08:57 UTC, 3 replies.
- spark master UI does not keep detailed application history - posted by zhen <z....@latrobe.edu.au> on 2014/06/14 03:21:48 UTC, 2 replies.
- printing in unit test - posted by SK <sk...@gmail.com> on 2014/06/14 03:38:28 UTC, 0 replies.
- Multi-dimensional Uniques over large dataset - posted by Krishna Sankar <ks...@gmail.com> on 2014/06/14 05:52:36 UTC, 2 replies.
- MLLib : Decision Tree with minimum points per node - posted by Justin Yip <yi...@gmail.com> on 2014/06/14 05:55:03 UTC, 2 replies.
- SparkSQL registerAsTable - No TypeTag available Error - posted by premdass <pr...@yahoo.co.in> on 2014/06/14 14:03:48 UTC, 2 replies.
- Accumulable with huge accumulated value? - posted by Nilesh Chakraborty <ni...@nileshc.com> on 2014/06/14 15:30:31 UTC, 0 replies.
- GroupByKey results in OOM - Any other alternative - posted by Vivek YS <vi...@gmail.com> on 2014/06/14 19:58:58 UTC, 6 replies.
- DStream are not processed after upgrade to Spark 1.0 - posted by Chang Lim <ch...@gmail.com> on 2014/06/14 20:37:45 UTC, 0 replies.
- Is shuffle "stable"? - posted by Daniel Darabos <da...@lynxanalytics.com> on 2014/06/14 21:14:43 UTC, 2 replies.
- Re: guidance on simple unit testing with Sprk - posted by Gerard Maas <ge...@gmail.com> on 2014/06/14 22:32:05 UTC, 0 replies.
- Failing to run standalone streaming app: IOException; classNotFoundException; and more - posted by pns <pe...@gmail.com> on 2014/06/14 23:29:28 UTC, 0 replies.
- long GC pause during file.cache() - posted by Wei Tan <wt...@us.ibm.com> on 2014/06/15 04:24:11 UTC, 0 replies.
- Using custom class as a key for groupByKey() or reduceByKey() - posted by Gaurav Jain <ja...@student.ethz.ch> on 2014/06/15 17:45:51 UTC, 2 replies.
- Re: long GC pause during file.cache() - posted by Hao Wang <wh...@gmail.com> on 2014/06/15 18:13:01 UTC, 6 replies.
- Akka listens to hostname while user may spark-submit with master in IP url - posted by Hao Wang <wh...@gmail.com> on 2014/06/15 23:06:48 UTC, 0 replies.
- pyspark serializer can't handle functions? - posted by madeleine <ma...@gmail.com> on 2014/06/16 01:49:09 UTC, 3 replies.
- Is There Any Benchmarks Comparing C++ MPI with Spark - posted by Wei Da <xw...@gmail.com> on 2014/06/16 09:17:57 UTC, 5 replies.
- Spark streaming with Redis? Working with large number of model objects at spark compute nodes. - posted by tnegi <tn...@gmail.com> on 2014/06/16 10:47:57 UTC, 0 replies.
- What is the best way to handle transformations or actions that takes forever? - posted by Peng Cheng <pc...@uow.edu.au> on 2014/06/16 10:52:18 UTC, 3 replies.
- Need help. Spark + Accumulo => Error: java.lang.NoSuchMethodError: org.apache.commons.codec.binary.Base64.encodeBase64String - posted by Jianshi Huang <ji...@gmail.com> on 2014/06/16 13:37:12 UTC, 8 replies.
- pyspark regression results way off - posted by jamborta <ja...@gmail.com> on 2014/06/16 17:13:06 UTC, 6 replies.
- Memory footprint of Calliope: Spark -> Cassandra writes - posted by Gerard Maas <ge...@gmail.com> on 2014/06/16 18:27:45 UTC, 4 replies.
- Re: Java updateStateByKey - posted by Gaurav Jain <ja...@student.ethz.ch> on 2014/06/16 18:47:56 UTC, 0 replies.
- Need some Streaming help - posted by Yana Kadiyska <ya...@gmail.com> on 2014/06/16 18:57:26 UTC, 0 replies.
- Fwd: spark streaming questions - posted by Chen Song <ch...@gmail.com> on 2014/06/16 19:12:36 UTC, 3 replies.
- Worker dies while submitting a job - posted by Luis Ángel Vicente Sánchez <la...@gmail.com> on 2014/06/16 19:25:20 UTC, 3 replies.
- Accessing the per-key state maintained by updateStateByKey for transformation of JavaPairDStream - posted by Gaurav Jain <ja...@student.ethz.ch> on 2014/06/16 20:01:27 UTC, 0 replies.
- pyspark-Failed to run first - posted by Congrui Yi <fi...@us.bosch.com> on 2014/06/16 20:54:41 UTC, 4 replies.
- No Intercept for Python - posted by Naftali Harris <na...@affirm.com> on 2014/06/16 21:19:16 UTC, 2 replies.
- Can't get Master Kerberos principal for use as renewer - posted by "Finamore A." <al...@polito.it> on 2014/06/16 23:03:03 UTC, 1 replies.
- Set comparison - posted by SK <sk...@gmail.com> on 2014/06/16 23:09:33 UTC, 3 replies.
- spark with docker: errors with akka, NAT? - posted by Mohit Jaggi <mo...@gmail.com> on 2014/06/17 00:36:19 UTC, 8 replies.
- Spark sql unable to connect to db2 hive metastore - posted by Jenny Zhao <li...@gmail.com> on 2014/06/17 01:29:21 UTC, 3 replies.
- akka.FrameSize - posted by Chen Jin <ka...@gmail.com> on 2014/06/17 06:59:50 UTC, 0 replies.
- How to add jar with SparkSQL HiveContext? - posted by Earthson <Ea...@gmail.com> on 2014/06/17 08:41:09 UTC, 1 replies.
- DoNotRetryIOException: IllegalAccessError - posted by wanbo <ge...@163.com> on 2014/06/17 09:04:23 UTC, 0 replies.
- Contribution to Spark MLLib - posted by Jayati <ti...@gmail.com> on 2014/06/17 09:22:58 UTC, 5 replies.
- Re: Spark 0.9.1 core dumps on Mesos 0.18.0 - posted by qingyang li <li...@gmail.com> on 2014/06/17 09:50:29 UTC, 1 replies.
- Yarn-client mode and standalone-client mode hang during job start - posted by Jianshi Huang <ji...@gmail.com> on 2014/06/17 11:41:52 UTC, 3 replies.
- news20-binary classification with LogisticRegressionWithSGD - posted by Makoto Yui <yu...@gmail.com> on 2014/06/17 14:32:02 UTC, 15 replies.
- join operation is taking too much time - posted by MEETHU MATHEW <me...@yahoo.co.in> on 2014/06/17 14:59:36 UTC, 3 replies.
- Execution stalls in LogisticRegressionWithSGD - posted by Bharath Ravi Kumar <re...@gmail.com> on 2014/06/17 15:48:12 UTC, 5 replies.
- Spark 1.0.0 java.lang.outOfMemoryError: Java Heap Space - posted by Sguj <tp...@yahoo.com> on 2014/06/17 17:06:51 UTC, 4 replies.
- Executors not utilized properly. - posted by abhiguruvayya <sh...@gmail.com> on 2014/06/17 18:36:59 UTC, 8 replies.
- Problems running Spark job on mesos in fine-grained mode - posted by Sébastien Rainville <se...@gmail.com> on 2014/06/17 20:57:55 UTC, 4 replies.
- Spark streaming RDDs to Parquet records - posted by maheshtwc <ma...@twc-contractor.com> on 2014/06/17 21:52:32 UTC, 7 replies.
- Spark SQL: No function to evaluate expression - posted by Zuhair Khayyat <zu...@gmail.com> on 2014/06/17 22:02:41 UTC, 2 replies.
- Unit test failure: Address already in use - posted by SK <sk...@gmail.com> on 2014/06/17 23:01:37 UTC, 3 replies.
- Best practices for removing lineage of a RDD or Graph object? - posted by dash <bs...@nd.edu> on 2014/06/18 00:29:58 UTC, 2 replies.
- Why MLLib classes are so badly organized? - posted by frol <fr...@gmail.com> on 2014/06/18 00:35:43 UTC, 0 replies.
- Enormous EC2 price jump makes "r3.large" patch more important - posted by Jeremy Lee <un...@gmail.com> on 2014/06/18 01:17:18 UTC, 6 replies.
- Issue while trying to aggregate with a sliding window - posted by Hatch M <ha...@gmail.com> on 2014/06/18 02:19:54 UTC, 4 replies.
- Spark Streaming Example with CDH5 - posted by manas Kar <Ma...@exactearth.com> on 2014/06/18 04:14:57 UTC, 1 replies.
- rdd.cache() is not faster? - posted by Wei Tan <wt...@us.ibm.com> on 2014/06/18 05:36:28 UTC, 3 replies.
- Wildcard support in input path - posted by Jianshi Huang <ji...@gmail.com> on 2014/06/18 05:58:37 UTC, 8 replies.
- question about setting SPARK_CLASSPATH IN spark_env.sh - posted by santhoma <sa...@yahoo.com> on 2014/06/18 06:26:11 UTC, 3 replies.
- Un-serializable 3rd-party classes (Spark, Java) - posted by Daedalus <tu...@gmail.com> on 2014/06/18 07:11:13 UTC, 2 replies.
- SparkR Installation - posted by Stuti Awasthi <st...@hcl.com> on 2014/06/18 07:59:48 UTC, 1 replies.
- get schema from SchemaRDD - posted by Kevin Jung <it...@samsung.com> on 2014/06/18 10:36:26 UTC, 1 replies.
- Shark Tasks in parallel execution - posted by majian <ma...@nq.com> on 2014/06/18 11:36:47 UTC, 0 replies.
- Re: Cannot print a derived DStream after reduceByKey - posted by haopu <hw...@qilinsoft.com> on 2014/06/18 13:15:17 UTC, 1 replies.
- BSP realization on Spark - posted by Ghousia <gh...@gmail.com> on 2014/06/18 14:11:52 UTC, 2 replies.
- HDFS folder .sparkStaging not deleted and filled up HDFS in yarn mode - posted by Andrew Lee <al...@hotmail.com> on 2014/06/18 20:05:12 UTC, 2 replies.
- Spark is now available via Homebrew - posted by Nick Chammas <ni...@gmail.com> on 2014/06/18 22:37:46 UTC, 6 replies.
- Spark 0.9.1 java.lang.outOfMemoryError: Java Heap Space - posted by Shivani Rao <ra...@gmail.com> on 2014/06/18 23:17:50 UTC, 9 replies.
- java.lang.OutOfMemoryError with saveAsTextFile - posted by "Muttineni, Vinay" <vm...@ebay.com> on 2014/06/19 00:06:45 UTC, 0 replies.
- Spark streaming and rate limit - posted by Flavio Pompermaier <po...@okkam.it> on 2014/06/19 00:13:18 UTC, 8 replies.
- create SparkContext dynamically - posted by jamborta <ja...@gmail.com> on 2014/06/19 01:10:29 UTC, 0 replies.
- Trailing Tasks Saving to HDFS - posted by Surendranauth Hiraman <su...@velos.io> on 2014/06/19 01:16:30 UTC, 3 replies.
- Patterns for making multiple aggregations in one pass - posted by Nick Chammas <ni...@gmail.com> on 2014/06/19 01:28:43 UTC, 7 replies.
- options set in spark-env.sh is not reflecting on actual execution - posted by MEETHU MATHEW <me...@yahoo.co.in> on 2014/06/19 07:21:13 UTC, 1 replies.
- python worker crash in spark 1.0 - posted by "Schein, Sagi" <sa...@hp.com> on 2014/06/19 08:07:53 UTC, 0 replies.
- MLLib inside Storm : silly or not ? - posted by Eustache DIEMERT <eu...@diemert.fr> on 2014/06/19 09:26:57 UTC, 6 replies.
- write event logs with YARN - posted by Christophe Préaud <ch...@kelkoo.com> on 2014/06/19 11:18:51 UTC, 0 replies.
- DStream join with a RDD (v1.0.0) - posted by haopu <hw...@qilinsoft.com> on 2014/06/19 11:51:18 UTC, 0 replies.
- Query on Merge Message (Graph: pregel operator) - posted by Ghousia Taj <gh...@gmail.com> on 2014/06/19 14:26:36 UTC, 1 replies.
- Getting started : Spark on YARN issue - posted by Praveen Seluka <ps...@qubole.com> on 2014/06/19 15:04:03 UTC, 3 replies.
- Spark and RDF - posted by Flavio Pompermaier <po...@okkam.it> on 2014/06/19 16:49:44 UTC, 5 replies.
- Long running Spark Streaming Job increasing executing time per batch - posted by "Skogberg, Fredrik" <Fr...@paddypower.com> on 2014/06/19 17:45:27 UTC, 3 replies.
- Getting different answers running same line of code - posted by mrm <ma...@skimlinks.com> on 2014/06/19 17:54:35 UTC, 1 replies.
- trying to understand yarn-client mode - posted by Koert Kuipers <ko...@tresata.com> on 2014/06/19 19:22:59 UTC, 7 replies.
- 1.0.1 release plan - posted by Mingyu Kim <mk...@palantir.com> on 2014/06/19 19:47:56 UTC, 3 replies.
- Possible approaches for adding extra metadata (Spark Streaming) - posted by Shrikar archak <sh...@gmail.com> on 2014/06/19 20:19:03 UTC, 0 replies.
- Parallel LogisticRegression? - posted by Kyle Ellrott <ke...@soe.ucsc.edu> on 2014/06/19 20:21:24 UTC, 2 replies.
- How do you run your spark app? - posted by ldmtwo <ld...@gmail.com> on 2014/06/19 20:36:08 UTC, 9 replies.
- increasing concurrency of saveAsNewAPIHadoopFile? - posted by Sandeep Parikh <sa...@clusterbeep.org> on 2014/06/19 21:38:50 UTC, 1 replies.
- Submiting multiple jobs via different threads - posted by Zhang Haoming <ha...@outlook.com> on 2014/06/19 22:40:19 UTC, 0 replies.
- pyspark bug with unittest and scikit-learn - posted by Brad Miller <bm...@eecs.berkeley.edu> on 2014/06/19 22:58:53 UTC, 0 replies.
- How to store JavaRDD as a sequence file using spark java API? - posted by abhiguruvayya <sh...@gmail.com> on 2014/06/20 02:54:43 UTC, 6 replies.
- Repeated Broadcasts - posted by Daedalus <tu...@gmail.com> on 2014/06/20 07:54:03 UTC, 1 replies.
- problem about cluster mode of spark 1.0.0 - posted by randylu <ra...@gmail.com> on 2014/06/20 09:30:07 UTC, 4 replies.
- broadcast in spark streaming - posted by Hahn Jiang <ha...@gmail.com> on 2014/06/20 10:39:31 UTC, 2 replies.
- How could I set the number of executor? - posted by Earthson <Ea...@gmail.com> on 2014/06/20 10:52:07 UTC, 1 replies.
- Anything like grid search available for mlbase? - posted by Charles Earl <ch...@gmail.com> on 2014/06/20 15:46:46 UTC, 1 replies.
- parallel Reduce within a key - posted by ansriniv <an...@gmail.com> on 2014/06/20 15:57:59 UTC, 2 replies.
- java.net.SocketTimeoutException: Read timed out and java.io.IOException: Filesystem closed on Spark 1.0 - posted by Arun Ahuja <aa...@gmail.com> on 2014/06/20 16:55:46 UTC, 0 replies.
- Performance problems on SQL JOIN - posted by mathias <ma...@socialsignificance.co.uk> on 2014/06/20 17:16:23 UTC, 4 replies.
- Better way to use a large data set? - posted by "Muttineni, Vinay" <vm...@ebay.com> on 2014/06/20 17:29:04 UTC, 0 replies.
- broadcast not working in yarn-cluster mode - posted by Christophe Préaud <ch...@kelkoo.com> on 2014/06/20 18:13:44 UTC, 1 replies.
- spark on yarn is trying to use file:// instead of hdfs:// - posted by Koert Kuipers <ko...@tresata.com> on 2014/06/20 18:40:31 UTC, 6 replies.
- Can not checkpoint Graph object's vertices but could checkpoint edges - posted by dash <bs...@nd.edu> on 2014/06/20 19:27:59 UTC, 0 replies.
- Possible approaches for adding extra metadata (Spark Streaming)? - posted by Shrikar archak <sh...@gmail.com> on 2014/06/20 20:16:54 UTC, 3 replies.
- Running Spark alongside Hadoop - posted by Sameer Tilak <ss...@live.com> on 2014/06/20 21:41:00 UTC, 3 replies.
- Set the number/memory of workers under mesos - posted by Shuo Xiang <sh...@gmail.com> on 2014/06/20 22:30:52 UTC, 3 replies.
- kibana like frontend for spark - posted by Mohit Jaggi <mo...@gmail.com> on 2014/06/20 23:18:01 UTC, 0 replies.
- Fwd: Using Spark - posted by Ricky Thomas <ri...@truedash.io> on 2014/06/20 23:52:26 UTC, 2 replies.
- sc.textFile can't recognize '\004' - posted by anny9699 <an...@gmail.com> on 2014/06/21 02:08:11 UTC, 2 replies.
- How to terminate job from the task code? - posted by Piotr Kołaczkowski <pk...@datastax.com> on 2014/06/21 07:08:45 UTC, 1 replies.
- Spark throws NoSuchFieldError when testing on cluster mode - posted by Peng Cheng <pc...@uow.edu.au> on 2014/06/21 10:30:10 UTC, 7 replies.
- zip in pyspark truncates RDD to number of processors - posted by madeleine <ma...@gmail.com> on 2014/06/21 18:37:50 UTC, 1 replies.
- Spark Processing Large Data Stuck - posted by yxzhao <yx...@ualr.edu> on 2014/06/21 21:55:52 UTC, 3 replies.
- InputStreamsSuite test failed - posted by crazymb <cr...@163.com> on 2014/06/22 11:30:45 UTC, 0 replies.
- Shark vs Impala - posted by Flavio Pompermaier <po...@okkam.it> on 2014/06/22 13:32:00 UTC, 7 replies.
- MLLib sample data format - posted by Justin Yip <yi...@gmail.com> on 2014/06/22 23:35:22 UTC, 6 replies.
- hi - posted by rapelly kartheek <ka...@gmail.com> on 2014/06/23 07:26:18 UTC, 4 replies.
- Persistent Local Node variables - posted by Daedalus <tu...@gmail.com> on 2014/06/23 07:34:57 UTC, 2 replies.
- Kafka Streaming - Error Could not compute split - posted by Kanwaldeep <ka...@gmail.com> on 2014/06/23 08:39:38 UTC, 1 replies.
- Multiclass classification evaluation measures - posted by "Ulanov, Alexander" <al...@hp.com> on 2014/06/23 12:02:29 UTC, 0 replies.
- Help with object access from mapper (simple question) - posted by Yana Kadiyska <ya...@gmail.com> on 2014/06/23 16:44:03 UTC, 2 replies.
- Basic Scala and Spark questions - posted by Sameer Tilak <ss...@live.com> on 2014/06/23 19:38:04 UTC, 3 replies.
- about a JavaWordCount example with spark-core_2.10-1.0.0.jar - posted by Alonso Isidoro Roman <al...@gmail.com> on 2014/06/23 20:15:40 UTC, 6 replies.
- Error in run spark.ContextCleaner under Spark 1.0.0 - posted by Haoming Zhang <ha...@outlook.com> on 2014/06/23 21:13:07 UTC, 1 replies.
- How to use K-fold validation in spark-1.0? - posted by holdingonrobin <ro...@gmail.com> on 2014/06/23 23:21:38 UTC, 4 replies.
- Re: how to make saveAsTextFile NOT split output into multiple file? - posted by holdingonrobin <ro...@gmail.com> on 2014/06/23 23:26:47 UTC, 1 replies.
- Efficiently doing an analysis with Cartesian product (pyspark) - posted by Aaron Dossett <aa...@target.com> on 2014/06/23 23:29:21 UTC, 3 replies.
- Run Spark on Mesos? Add yourself to the #PoweredByMesos list - posted by Dave Lester <da...@gmail.com> on 2014/06/24 00:15:37 UTC, 0 replies.
- balancing RDDs - posted by Sean McNamara <Se...@Webtrends.com> on 2014/06/24 00:40:52 UTC, 2 replies.
- Error when running unit tests - posted by SK <sk...@gmail.com> on 2014/06/24 01:30:25 UTC, 0 replies.
- Bug in Spark REPL - posted by Shivani Rao <ra...@gmail.com> on 2014/06/24 01:38:18 UTC, 1 replies.
- DAGScheduler: Failed to run foreach - posted by Sameer Tilak <ss...@live.com> on 2014/06/24 02:05:03 UTC, 5 replies.
- apache spark 1.0.0 sha1 & md5 checksum fails - posted by "MrAsanjar ." <af...@gmail.com> on 2014/06/24 03:37:36 UTC, 1 replies.
- which function can generate a ShuffleMapTask - posted by lihu <li...@gmail.com> on 2014/06/24 05:23:59 UTC, 0 replies.
- How to Reload Spark Configuration Files - posted by Sirisha Devineni <Si...@persistent.co.in> on 2014/06/24 08:05:02 UTC, 2 replies.
- How data is distributed while processing in spark cluster? - posted by srujana <sr...@persistent.co.in> on 2014/06/24 08:21:07 UTC, 2 replies.
- Questions regarding different spark pre-built packages - posted by Sourav Chandra <so...@livestream.com> on 2014/06/24 08:46:14 UTC, 1 replies.
- Using Spark as web app backend - posted by Jaonary Rabarisoa <ja...@gmail.com> on 2014/06/24 09:12:30 UTC, 5 replies.
- Prediction using Classification with text attributes in Apache Spark MLLib - posted by lmk <la...@gmail.com> on 2014/06/24 13:17:08 UTC, 9 replies.
- Streaming aggregation - posted by john levingston <jo...@gmail.com> on 2014/06/24 17:06:43 UTC, 0 replies.
- Integrate Spark Editor with Hue for source compiled installation of spark/spark-jobServer - posted by Sunita Arvind <su...@gmail.com> on 2014/06/24 18:04:57 UTC, 1 replies.
- Setting user permissions for Spark and Shark - posted by ajatix <aj...@sigmoidanalytics.com> on 2014/06/24 19:12:45 UTC, 0 replies.
- Centralized Spark Logging solution - posted by Robert James <sr...@gmail.com> on 2014/06/24 20:58:07 UTC, 0 replies.
- LinearRegression giving different weights in Spark 1.0 and Spark 0.9 - posted by fintis <fi...@gmail.com> on 2014/06/24 21:15:26 UTC, 0 replies.
- partitions, coalesce() and parallelism - posted by Alex Boisvert <al...@gmail.com> on 2014/06/24 21:50:58 UTC, 10 replies.
- Spark switch to debug loglevel - posted by Philip Limbeck <ph...@gmail.com> on 2014/06/24 22:27:46 UTC, 0 replies.
- Graphx SubGraph - posted by aymanshalaby <aa...@marketwired.com> on 2014/06/24 23:12:52 UTC, 1 replies.
- JavaRDD.mapToPair throws NPE - posted by Mingyu Kim <mk...@palantir.com> on 2014/06/24 23:44:53 UTC, 1 replies.
- Spark slave fail to start with wierd error information - posted by Peng Cheng <pc...@uow.edu.au> on 2014/06/25 00:28:28 UTC, 3 replies.
- Upgrading to Spark 1.0.0 causes NoSuchMethodError - posted by Robert James <sr...@gmail.com> on 2014/06/25 00:39:41 UTC, 7 replies.
- ElasticSearch enrich - posted by boci <bo...@gmail.com> on 2014/06/25 00:42:33 UTC, 23 replies.
- Does PUBLIC_DNS environment parameter really works? - posted by Peng Cheng <pc...@uow.edu.au> on 2014/06/25 06:01:16 UTC, 1 replies.
- Changing log level of spark - posted by Philip Limbeck <ph...@gmail.com> on 2014/06/25 07:25:40 UTC, 2 replies.
- Need help to make spark sql works in stand alone application - posted by Jaonary Rabarisoa <ja...@gmail.com> on 2014/06/25 09:27:35 UTC, 0 replies.
- TaskSchedulerImpl: Initial job has not accepted any resources; check your cluster UI to ensure that workers are registered and have sufficient memory - posted by Peng Cheng <pc...@uow.edu.au> on 2014/06/25 10:54:05 UTC, 3 replies.
- Is there anyone who can explain why the function of ALS.train give different shuffle results when execute the same transformation flatMap - posted by "Lizhengbing (bing, BIPA)" <zh...@huawei.com> on 2014/06/25 11:39:34 UTC, 0 replies.
- Re: Is there anyone who can explain why the function of ALS.train give different shuffle results when execute the same transformation flatMap - posted by Nick Pentreath <ni...@gmail.com> on 2014/06/25 11:48:17 UTC, 0 replies.
- Cassandra and Spark checkpoints - posted by toivoa <to...@gmail.com> on 2014/06/25 15:02:39 UTC, 0 replies.
- Spark and Cassandra - NotSerializableException - posted by shaiw75 <sh...@intentiq.com> on 2014/06/25 16:08:20 UTC, 0 replies.
- Spark's Hadooop Dependency - posted by Robert James <sr...@gmail.com> on 2014/06/25 17:26:16 UTC, 1 replies.
- graphx Joining two VertexPartitions with different indexes is slow. - posted by Koert Kuipers <ko...@tresata.com> on 2014/06/25 18:09:38 UTC, 0 replies.
- jsonFile function in SQLContext does not work - posted by durin <ma...@simon-schaefer.net> on 2014/06/25 19:57:27 UTC, 6 replies.
- Spark's Maven dependency on Hadoop 1 - posted by Robert James <sr...@gmail.com> on 2014/06/25 20:11:08 UTC, 0 replies.
- Using CQLSSTableWriter to batch load data from Spark to Cassandra. - posted by Gerard Maas <ge...@gmail.com> on 2014/06/25 20:44:38 UTC, 6 replies.
- wholeTextFiles and gzip - posted by Nick Chammas <ni...@gmail.com> on 2014/06/25 21:17:13 UTC, 0 replies.
- semi join spark streaming - posted by Chen Song <ch...@gmail.com> on 2014/06/25 21:23:03 UTC, 0 replies.
- Worker nodes: Error messages - posted by Sameer Tilak <ss...@live.com> on 2014/06/25 22:56:54 UTC, 1 replies.
- Hadoop interface vs class - posted by Robert James <sr...@gmail.com> on 2014/06/25 23:41:13 UTC, 5 replies.
- wholeTextFiles like for binary files ? - posted by Jaonary Rabarisoa <ja...@gmail.com> on 2014/06/26 00:01:51 UTC, 1 replies.
- trouble: Launching spark on hadoop + yarn. - posted by sdeb <sa...@gmail.com> on 2014/06/26 00:09:22 UTC, 0 replies.
- Number of executors smaller than requested in YARN. - posted by Sung Hwan Chung <co...@cs.stanford.edu> on 2014/06/26 02:13:03 UTC, 0 replies.
- Does Spark restart cached workers even without failures? - posted by Sung Hwan Chung <co...@cs.stanford.edu> on 2014/06/26 02:19:54 UTC, 0 replies.
- 答复: Is there anyone who can explain why the function of ALS.train give different shuffle results when execute the same transformation flatMap - posted by "Lizhengbing (bing, BIPA)" <zh...@huawei.com> on 2014/06/26 02:51:48 UTC, 0 replies.
- Spark standalone network configuration problems - posted by Shannon Quinn <sq...@gatech.edu> on 2014/06/26 03:07:34 UTC, 17 replies.
- Spark vs Google cloud dataflow - posted by Aureliano Buendia <bu...@gmail.com> on 2014/06/26 03:23:41 UTC, 13 replies.
- Spark executor error - posted by Sung Hwan Chung <co...@cs.stanford.edu> on 2014/06/26 05:08:38 UTC, 1 replies.
- Where Can I find the full documentation for Spark SQL? - posted by guxiaobo1982 <gu...@qq.com> on 2014/06/26 06:36:07 UTC, 4 replies.
- Spark Streaming Window without slideDuration parameter - posted by haopu <hw...@qilinsoft.com> on 2014/06/26 10:53:10 UTC, 0 replies.
- What's the best practice to deploy spark on Big SMP servers? - posted by guxiaobo1982 <gu...@qq.com> on 2014/06/26 12:29:21 UTC, 0 replies.
- LiveListenerBus throws exception and weird web UI bug - posted by Pei-Lun Lee <pl...@appier.com> on 2014/06/26 12:41:43 UTC, 2 replies.
- running multiple applications at the same time - posted by jamborta <ja...@gmail.com> on 2014/06/26 14:07:31 UTC, 3 replies.
- About StorageLevel - posted by "tomsheep.cn@gmail.com" <to...@gmail.com> on 2014/06/26 14:36:25 UTC, 4 replies.
- [ANNOUNCE] Apache MRQL 0.9.2-incubating released - posted by Leonidas Fegaras <fe...@apache.org> on 2014/06/26 14:45:57 UTC, 0 replies.
- Fine-grained mesos execution hangs on Debian 7.4 - posted by Fedechicco <fe...@gmail.com> on 2014/06/26 18:11:21 UTC, 2 replies.
- Serialization of objects - posted by Sameer Tilak <ss...@live.com> on 2014/06/26 18:30:31 UTC, 1 replies.
- Spark Streaming RDD transformation - posted by Bill Jay <bi...@gmail.com> on 2014/06/26 20:40:50 UTC, 2 replies.
- Improving Spark multithreaded performance? - posted by Kyle Ellrott <ke...@soe.ucsc.edu> on 2014/06/26 21:06:19 UTC, 6 replies.
- Running new code on a Spark Cluster - posted by Pat Ferrel <pa...@occamsmachete.com> on 2014/06/26 21:13:22 UTC, 2 replies.
- Spark job tracker. - posted by abhiguruvayya <sh...@gmail.com> on 2014/06/26 22:55:14 UTC, 4 replies.
- Spark-submit failing on cluster - posted by ajatix <aj...@sigmoidanalytics.com> on 2014/06/26 23:01:41 UTC, 1 replies.
- SparkSQL- saveAsParquetFile - posted by "anthonyjschulte@gmail.com" <an...@gmail.com> on 2014/06/26 23:55:00 UTC, 0 replies.
- SparkSQL- Nested CaseClass Parquet failure - posted by "anthonyjschulte@gmail.com" <an...@gmail.com> on 2014/06/27 00:03:30 UTC, 2 replies.
- Task progress in ipython? - posted by "Xu (Simon) Chen" <xc...@gmail.com> on 2014/06/27 01:09:19 UTC, 0 replies.
- Google Cloud Engine adds out of the box Spark/Shark support - posted by Mayur Rustagi <ma...@gmail.com> on 2014/06/27 01:42:33 UTC, 0 replies.
- numpy + pyspark - posted by Avishek Saha <av...@gmail.com> on 2014/06/27 02:45:00 UTC, 7 replies.
- Spark Streaming to capture packets from interface - posted by swezzz <sw...@gmail.com> on 2014/06/27 08:29:33 UTC, 0 replies.
- org.jboss.netty.channel.ChannelException: Failed to bind to: master/1xx.xx..xx:0 - posted by MEETHU MATHEW <me...@yahoo.co.in> on 2014/06/27 08:30:57 UTC, 4 replies.
- Map with filter on JavaRdd - posted by ajay garg <aj...@mobileum.com> on 2014/06/27 08:42:48 UTC, 3 replies.
- Issue in using classes with constructor as vertex attribute in graphx - posted by harsh2005_7 <ha...@yahoo.com> on 2014/06/27 13:11:02 UTC, 0 replies.
- Re: [GraphX] Cast error when comparing a vertex attribute after its type has changed - posted by Pierre-Alexandre Fonta <pi...@gmail.com> on 2014/06/27 13:21:36 UTC, 0 replies.
- How to use .newAPIHadoopRDD() from Java (w/ Cassandra) - posted by Martin Gammelsæter <ma...@gmail.com> on 2014/06/27 15:06:10 UTC, 0 replies.
- Spark RDD member of class loses it's value when the class being used as graph attribute - posted by harsh2005_7 <ha...@yahoo.com> on 2014/06/27 16:29:44 UTC, 2 replies.
- problem when start spark streaming in cluster mode - posted by Siyuan he <hs...@gmail.com> on 2014/06/27 16:52:54 UTC, 3 replies.
- scopt.OptionParser - posted by SK <sk...@gmail.com> on 2014/06/27 20:44:11 UTC, 0 replies.
- jackson-core-asl jar (1.8.8 vs 1.9.x) conflict with the spark-sql (version 1.x) - posted by M Singh <ma...@yahoo.com> on 2014/06/27 21:58:30 UTC, 3 replies.
- Integrate spark-shell into officially supported web ui/api plug-in? What do you think? - posted by Peng Cheng <pc...@uow.edu.au> on 2014/06/27 22:24:33 UTC, 2 replies.
- Could not compute split, block not found - posted by Bill Jay <bi...@gmail.com> on 2014/06/27 22:59:14 UTC, 4 replies.
- hadoop + yarn + spark - posted by sdeb <sa...@gmail.com> on 2014/06/28 02:00:32 UTC, 1 replies.
- Interconnect benchmarking - posted by Ryan Compton <co...@gmail.com> on 2014/06/28 02:07:26 UTC, 3 replies.
- Anybody changed their mind about going to the Spark Summit 2014 - posted by Cesar Arevalo <ce...@zephyrhealthinc.com> on 2014/06/28 02:14:36 UTC, 0 replies.
- HBase 0.96+ with Spark 1.0+ - posted by Stephen Boesch <ja...@gmail.com> on 2014/06/28 05:21:07 UTC, 4 replies.
- Distribute data from Kafka evenly on cluster - posted by Tobias Pfeiffer <tg...@preferred.jp> on 2014/06/28 06:49:04 UTC, 1 replies.
- collect on partitions get very slow near the last few partitions. - posted by Sung Hwan Chung <co...@cs.stanford.edu> on 2014/06/28 08:35:20 UTC, 2 replies.
- Alternative to checkpointing and materialization for truncating lineage in high iteration jobs - posted by Nilesh Chakraborty <ni...@nileshc.com> on 2014/06/28 18:36:58 UTC, 1 replies.
- high minimum query latency - posted by Toby Douglass <to...@avocet.io> on 2014/06/29 11:29:03 UTC, 1 replies.
- Spark with HBase - posted by "N.Venkata Naga Ravi" <nv...@hotmail.com> on 2014/06/29 11:58:43 UTC, 1 replies.
- Kafka/ES question - posted by boci <bo...@gmail.com> on 2014/06/29 14:40:50 UTC, 0 replies.
- Spark Streaming with HBase - posted by "N.Venkata Naga Ravi" <nv...@hotmail.com> on 2014/06/29 18:46:32 UTC, 1 replies.
- Memory/Network Intensive Workload - posted by danilopds <da...@gmail.com> on 2014/06/29 20:55:08 UTC, 1 replies.
- Is it possible to use Spark, Maven, and Hadoop 2? - posted by Robert James <sr...@gmail.com> on 2014/06/29 21:20:51 UTC, 4 replies.
- Issues starting up Spark on mesos - akka.version - posted by _soumya_ <so...@gmail.com> on 2014/06/29 22:10:00 UTC, 0 replies.
- Sorting Reduced/Groupd Values without Explicit Sorting - posted by "Parsian, Mahmoud" <mp...@illumina.com> on 2014/06/30 02:59:59 UTC, 4 replies.
- Re: Selecting first ten values in a RDD/partition - posted by Chris Fregly <ch...@fregly.com> on 2014/06/30 03:38:15 UTC, 0 replies.
- questions about shuffle time and parallel degree - posted by wxhsdp <wx...@gmail.com> on 2014/06/30 04:45:27 UTC, 0 replies.
- How to control a spark application(executor) using memory amount per node? - posted by hansen <ha...@neusoft.com> on 2014/06/30 09:58:18 UTC, 1 replies.
- Configuration properties for Spark - posted by M Singh <ma...@yahoo.com> on 2014/06/30 13:02:25 UTC, 0 replies.
- Callbacks on freeing up of RDDs - posted by Jaideep Dhok <ja...@inmobi.com> on 2014/06/30 13:18:34 UTC, 0 replies.
- TaskNotSerializable when invoking KMeans.run - posted by Daniel Micol <dm...@gmail.com> on 2014/06/30 16:03:23 UTC, 1 replies.
- Serializer or Out-of-Memory issues? - posted by Sguj <tp...@yahoo.com> on 2014/06/30 16:17:43 UTC, 0 replies.
- Help alleviating OOM errors - posted by Yana Kadiyska <ya...@gmail.com> on 2014/06/30 16:39:50 UTC, 0 replies.
- Spark 1.0 docs out of sync? - posted by Diana Carroll <dc...@cloudera.com> on 2014/06/30 17:03:10 UTC, 0 replies.
- spark streaming counter metrics - posted by Chen Song <ch...@gmail.com> on 2014/06/30 18:28:55 UTC, 0 replies.
- odd caching behavior or accounting - posted by Brad Miller <bm...@eecs.berkeley.edu> on 2014/06/30 19:20:12 UTC, 1 replies.
- Help understanding spark.task.maxFailures - posted by Yana Kadiyska <ya...@gmail.com> on 2014/06/30 21:10:33 UTC, 0 replies.
- Re: persistence and fault tolerance in Spark Streaming - posted by Chris Fregly <ch...@fregly.com> on 2014/06/30 21:27:02 UTC, 0 replies.
- Spark 1.0: Reading JSON LZH Compressed File - posted by "Uddin, Nasir M." <nu...@dtcc.com> on 2014/06/30 22:09:50 UTC, 0 replies.
- Spark 1.0 and Logistic Regression Python Example - posted by Sam Jacobs <sa...@us.abb.com> on 2014/06/30 23:04:37 UTC, 0 replies.