You are viewing a plain text version of this content. The canonical link for it is here.
- Re: RDDs join problem: incorrect result - posted by Harihar Nahak <hn...@wynyardgroup.com> on 2014/12/01 00:31:42 UTC, 0 replies.
- Re: GraphX:java.lang.NoSuchMethodError:org.apache.spark.graphx.Graph$.apply - posted by Harihar Nahak <hn...@wynyardgroup.com> on 2014/12/01 00:37:43 UTC, 0 replies.
- How can a function access Executor ID, Function ID and other parameters - posted by Steve Lewis <lo...@gmail.com> on 2014/12/01 01:05:18 UTC, 0 replies.
- Re: reduceByKey and empty output files - posted by Rishi Yadav <ri...@infoobjects.com> on 2014/12/01 01:19:51 UTC, 0 replies.
- Re: Edge List File in GraphX - posted by Harihar Nahak <hn...@wynyardgroup.com> on 2014/12/01 02:16:21 UTC, 0 replies.
- Re: kafka pipeline exactly once semantics - posted by Tobias Pfeiffer <tg...@preferred.jp> on 2014/12/01 04:44:44 UTC, 0 replies.
- RE: Unable to compile spark 1.1.0 on windows 8.1 - posted by Judy Nash <ju...@exchange.microsoft.com> on 2014/12/01 05:12:28 UTC, 2 replies.
- Re: spark.akka.frameSize setting problem - posted by Ke Wang <jk...@gmail.com> on 2014/12/01 06:14:18 UTC, 3 replies.
- RE: latest Spark 1.2 thrift server fail with NoClassDefFoundError on Guava - posted by Judy Nash <ju...@exchange.microsoft.com> on 2014/12/01 07:53:36 UTC, 4 replies.
- merge elements in a Spark RDD under custom condition - posted by Pengcheng YIN <pc...@gmail.com> on 2014/12/01 09:05:58 UTC, 0 replies.
- Re: java.io.InvalidClassException: org.apache.spark.api.java.JavaUtils$SerializableMapWrapper; no valid constructor - posted by lokeshkumar <lo...@dataken.net> on 2014/12/01 09:17:26 UTC, 1 replies.
- akka.remote.transport.Transport$InvalidAssociationException: The remote system terminated the association because it is shutting down - posted by Alexey Romanchuk <al...@gmail.com> on 2014/12/01 09:37:27 UTC, 2 replies.
- Re: Spark SQL 1.0.0 - RDD from snappy compress avro file - posted by cjdc <cr...@cern.ch> on 2014/12/01 09:55:59 UTC, 2 replies.
- Re: Creating a SchemaRDD from an existing API - posted by Niranda Perera <ni...@wso2.com> on 2014/12/01 10:34:55 UTC, 1 replies.
- Kryo exception for CassandraSQLRow - posted by shahab <sh...@gmail.com> on 2014/12/01 10:48:25 UTC, 1 replies.
- Spark 1.1.0: weird spark-shell behavior - posted by Reinis Vicups <sp...@orbit-x.de> on 2014/12/01 10:50:19 UTC, 0 replies.
- Kafka+Spark-streaming issue: Stream 0 received 0 blocks - posted by m....@accenture.com on 2014/12/01 11:12:40 UTC, 0 replies.
- Re: Kafka+Spark-streaming issue: Stream 0 received 0 blocks - posted by Akhil Das <ak...@sigmoidanalytics.com> on 2014/12/01 11:25:43 UTC, 8 replies.
- Re: Setting network variables in spark-shell - posted by Shixiong Zhu <zs...@gmail.com> on 2014/12/01 13:26:45 UTC, 0 replies.
- Is Spark the right tool for me? - posted by "Stadin, Benjamin" <Be...@heidelberg-mobil.com> on 2014/12/01 14:48:12 UTC, 10 replies.
- ensuring RDD indices remain immutable - posted by rok <ro...@gmail.com> on 2014/12/01 15:36:38 UTC, 2 replies.
- How take top N of top M from RDD as RDD - posted by Xuefeng Wu <be...@gmail.com> on 2014/12/01 15:44:00 UTC, 3 replies.
- Time based aggregation in Real time Spark Streaming - posted by pankaj <pa...@gmail.com> on 2014/12/01 16:31:40 UTC, 2 replies.
- Problem creating EC2 cluster using spark-ec2 - posted by Dave Challis <da...@aistemos.com> on 2014/12/01 16:34:50 UTC, 5 replies.
- Re: Spark Job submit - posted by Matt Narrell <ma...@gmail.com> on 2014/12/01 17:38:12 UTC, 0 replies.
- Re: Mllib native netlib-java/OpenBLAS - posted by agg212 <al...@brown.edu> on 2014/12/01 17:49:44 UTC, 3 replies.
- packaging from source gives protobuf compatibility issues. - posted by akhandeshi <am...@gmail.com> on 2014/12/01 18:18:34 UTC, 0 replies.
- Spark Summit East CFP - 5 days until deadline - posted by Scott walent <sc...@gmail.com> on 2014/12/01 18:52:48 UTC, 0 replies.
- How to Integrate openNLP with Spark - posted by Nikhil <ni...@yahoo.com> on 2014/12/01 19:31:47 UTC, 1 replies.
- Minimum cluster size for empirical testing - posted by "Valdes, Pablo" <pv...@comscore.com> on 2014/12/01 20:54:25 UTC, 0 replies.
- Re: How to use FlumeInputDStream in spark cluster? - posted by Ping Tang <pt...@aerohive.com> on 2014/12/01 21:02:05 UTC, 0 replies.
- Remove added jar from spark context - posted by ankits <an...@gmail.com> on 2014/12/01 21:06:28 UTC, 0 replies.
- RE: Inaccurate Estimate of weights model from StreamingLinearRegressionWithSGD - posted by "Bui, Tri" <Tr...@VerizonWireless.com.INVALID> on 2014/12/01 22:36:10 UTC, 0 replies.
- StreamingLinearRegressionWithSGD - posted by Joanne Contact <jo...@gmail.com> on 2014/12/01 23:00:29 UTC, 1 replies.
- Spark SQL table Join, one task is taking long - posted by Venkat Subramanian <vs...@gmail.com> on 2014/12/01 23:35:14 UTC, 4 replies.
- hdfs streaming context - posted by Benjamin Cuthbert <cu...@gmail.com> on 2014/12/01 23:41:02 UTC, 7 replies.
- Passing Java Options to Spark AM launching - posted by Mohammad Islam <mi...@yahoo.com.INVALID> on 2014/12/02 01:49:06 UTC, 3 replies.
- Loading RDDs in a streaming fashion - posted by Keith Simmons <ke...@pulse.io> on 2014/12/02 01:50:12 UTC, 6 replies.
- numpy arrays and spark sql - posted by Joseph Winston <jo...@me.com> on 2014/12/02 03:33:45 UTC, 1 replies.
- Re: ALS failure with size > Integer.MAX_VALUE - posted by Bharath Ravi Kumar <re...@gmail.com> on 2014/12/02 03:40:51 UTC, 5 replies.
- Re: Calling spark from a java web application. - posted by ryaminal <ta...@gmail.com> on 2014/12/02 04:16:46 UTC, 2 replies.
- java.io.IOException: Filesystem closed - posted by rapelly kartheek <ka...@gmail.com> on 2014/12/02 07:29:39 UTC, 6 replies.
- Is it possible to just change the value of the items in RDD without making a full copy? - posted by Xuelin Cao <xu...@yahoo.com.INVALID> on 2014/12/02 12:18:55 UTC, 3 replies.
- Spark Streaming on Mesos: How is the nr of coarse-grained executors calculated? - posted by Gerard Maas <ge...@gmail.com> on 2014/12/02 13:55:12 UTC, 0 replies.
- Parallelize independent tasks - posted by Anselme Vignon <an...@flaminem.com> on 2014/12/02 14:23:23 UTC, 1 replies.
- Does filter on an RDD scan every data item ? - posted by nsareen <ns...@gmail.com> on 2014/12/02 15:29:46 UTC, 11 replies.
- pySpark saveAsSequenceFile append overwrite - posted by Csaba Ragany <ra...@gmail.com> on 2014/12/02 15:40:17 UTC, 1 replies.
- IP to geo information in spark streaming application - posted by Noam Kfir <No...@perion.com> on 2014/12/02 16:14:12 UTC, 2 replies.
- Re: MLlib Naive Bayes classifier confidence - posted by MariusFS <ma...@sien.com> on 2014/12/02 17:44:01 UTC, 2 replies.
- sort algorithm using sortBy - posted by bchazalet <bc...@companywatch.net> on 2014/12/02 18:19:59 UTC, 0 replies.
- Re: Low Level Kafka Consumer for Spark - posted by RodrigoB <ro...@aspect.com> on 2014/12/02 18:59:38 UTC, 4 replies.
- Re: Spark setup on local windows machine - posted by Sunita Arvind <su...@gmail.com> on 2014/12/02 19:13:33 UTC, 0 replies.
- Re: Negative Accumulators - posted by Peter Thai <th...@gmail.com> on 2014/12/02 19:37:20 UTC, 1 replies.
- Scala Dependency Injection - posted by Venkat Subramanian <vs...@gmail.com> on 2014/12/02 19:41:09 UTC, 0 replies.
- Help understanding - Not enough space to cache rdd - posted by akhandeshi <am...@gmail.com> on 2014/12/02 19:53:08 UTC, 3 replies.
- SchemaRDD + SQL , loading projection columns - posted by Vishnusaran Ramaswamy <vi...@gmail.com> on 2014/12/02 21:43:53 UTC, 2 replies.
- Unresolved attributes - posted by Eric Tanner <er...@justenough.com> on 2014/12/02 22:00:46 UTC, 1 replies.
- WordCount fails in .textFile() method - posted by Rahul Swaminathan <ra...@duke.edu> on 2014/12/02 22:17:43 UTC, 1 replies.
- SaveAsTextFile brings down data nodes with IO Exceptions - posted by "Ganelin, Ilya" <Il...@capitalone.com> on 2014/12/02 22:20:07 UTC, 0 replies.
- Using SparkSQL to query Hbase entity takes very long time - posted by bonnahu <bo...@gmail.com> on 2014/12/02 22:26:04 UTC, 0 replies.
- Announcing Spark 1.1.1! - posted by Andrew Or <an...@databricks.com> on 2014/12/02 22:36:47 UTC, 4 replies.
- Standard SQL tool access to SchemaRDD - posted by Jim Carroll <ji...@gmail.com> on 2014/12/02 23:34:33 UTC, 2 replies.
- executor logging management from python - posted by freedafeng <fr...@yahoo.com> on 2014/12/02 23:37:18 UTC, 1 replies.
- object xxx is not a member of package com - posted by flyson <m_...@msn.com> on 2014/12/02 23:59:31 UTC, 1 replies.
- Any ideas why a few tasks would stall - posted by Steve Lewis <lo...@gmail.com> on 2014/12/03 00:03:58 UTC, 7 replies.
- Re: Kryo NPE with Array - posted by Simone Franzini <ca...@gmail.com> on 2014/12/03 00:55:50 UTC, 0 replies.
- Reading from Kerberos Secured HDFS in Spark? - posted by Matt Cheah <mc...@palantir.com> on 2014/12/03 01:09:48 UTC, 0 replies.
- Re: Viewing web UI after fact - posted by lihu <li...@gmail.com> on 2014/12/03 05:06:33 UTC, 1 replies.
- Monitoring Spark - posted by Isca Harmatz <po...@gmail.com> on 2014/12/03 05:57:27 UTC, 5 replies.
- Filter using the Vertex Ids - posted by Deep Pradhan <pr...@gmail.com> on 2014/12/03 07:01:20 UTC, 6 replies.
- Spark with HBase - posted by Jai <ja...@gmail.com> on 2014/12/03 07:21:25 UTC, 3 replies.
- Spark SQL UDF returning a list? - posted by Jerry Raj <je...@gmail.com> on 2014/12/03 08:31:40 UTC, 2 replies.
- getting firs N messages froma Kafka topic using Spark Streaming - posted by Hafiz Mujadid <ha...@gmail.com> on 2014/12/03 08:35:08 UTC, 2 replies.
- Does count() evaluate all mapped functions? - posted by Tobias Pfeiffer <tg...@preferred.jp> on 2014/12/03 08:44:39 UTC, 0 replies.
- Using sparkSQL to convert a collection of python dictionary of dictionaries to schma RDD - posted by sahanbull <sa...@skimlinks.com> on 2014/12/03 09:11:08 UTC, 4 replies.
- textFileStream() issue? - posted by Bahubali Jain <ba...@gmail.com> on 2014/12/03 09:31:31 UTC, 1 replies.
- Re: WordCount fails in .textFile() method - posted by Akhil Das <ak...@sigmoidanalytics.com> on 2014/12/03 09:55:44 UTC, 3 replies.
- How does 2.3.4-spark differ from typesafe 2.3.4 akka? - posted by dresnick <ab...@gmail.com> on 2014/12/03 10:28:11 UTC, 0 replies.
- How to enforce RDD to be cached? - posted by shahab <sh...@gmail.com> on 2014/12/03 10:52:00 UTC, 4 replies.
- converting DStream[String] into RDD[String] in spark streaming - posted by Hafiz Mujadid <ha...@gmail.com> on 2014/12/03 14:45:58 UTC, 2 replies.
- collecting fails - requirements for collecting (clone, hashCode etc?) - posted by Ron Ayoub <ro...@live.com> on 2014/12/03 14:48:53 UTC, 1 replies.
- Re: Logging problem in Spark when using Flume Log4jAppender - posted by QiaoanChen <ka...@gmail.com> on 2014/12/03 14:50:37 UTC, 0 replies.
- Failed fetch: "Could not get block(s)" - posted by Al M <al...@gmail.com> on 2014/12/03 15:55:51 UTC, 0 replies.
- Spark MOOC by Berkeley and Databricks - posted by Marco Didonna <m....@gmail.com> on 2014/12/03 16:27:28 UTC, 0 replies.
- what is the best way to implement mini batches? - posted by ll <du...@gmail.com> on 2014/12/03 16:28:44 UTC, 9 replies.
- Providing query dsl to Elasticsearch for Spark (2.1.0.Beta3) - posted by Ian Wilkinson <ia...@me.com> on 2014/12/03 16:31:24 UTC, 1 replies.
- Serializing with Kryo NullPointerException - Java - posted by Robin Keunen <ro...@lampiris.be> on 2014/12/03 18:15:40 UTC, 1 replies.
- [SQL] Wildcards in SQLContext.parquetFile? - posted by Yana Kadiyska <ya...@gmail.com> on 2014/12/03 18:25:13 UTC, 1 replies.
- GraphX Pregel halting condition - posted by Jay Hutfles <ja...@gmail.com> on 2014/12/03 18:37:01 UTC, 2 replies.
- Re: heterogeneous cluster setup - posted by Victor Tso-Guillen <vt...@paxata.com> on 2014/12/03 18:41:51 UTC, 2 replies.
- dockerized spark executor on mesos? - posted by Dick Davies <di...@hellooperator.net> on 2014/12/03 18:46:48 UTC, 4 replies.
- Best way to have some singleton per worker - posted by Ashic Mahtab <as...@live.com> on 2014/12/03 18:59:38 UTC, 1 replies.
- MLLib: loading saved model - posted by Sameer Tilak <ss...@live.com> on 2014/12/03 19:12:25 UTC, 1 replies.
- Re: How can I read an avro file in HDFS in Java? - posted by Prannoy <pr...@sigmoidanalytics.com> on 2014/12/03 20:04:16 UTC, 0 replies.
- Re: Insert new data into specific partition of an RDD - posted by dsiegel <de...@gmail.com> on 2014/12/03 22:01:51 UTC, 0 replies.
- Alternatives to groupByKey - posted by ameyc <am...@gmail.com> on 2014/12/03 22:26:22 UTC, 4 replies.
- How to create a new SchemaRDD which is not based on original SparkPlan? - posted by Tim Chou <ti...@gmail.com> on 2014/12/04 00:17:55 UTC, 0 replies.
- Spark executor lost - posted by "S. Zhou" <my...@yahoo.com.INVALID> on 2014/12/04 00:29:22 UTC, 4 replies.
- single key-value pair fitting in memory - posted by dsiegel <de...@gmail.com> on 2014/12/04 01:16:59 UTC, 0 replies.
- SQL query in scala API - posted by Arun Luthra <ar...@gmail.com> on 2014/12/04 01:47:03 UTC, 5 replies.
- How can a function running on a slave access the Executor - posted by Steve Lewis <lo...@gmail.com> on 2014/12/04 02:05:46 UTC, 0 replies.
- wordcount accross several files - posted by BC <bq...@gmail.com> on 2014/12/04 03:43:40 UTC, 0 replies.
- reading dynamoDB with spark - posted by Tyson <ty...@gmail.com> on 2014/12/04 04:11:56 UTC, 0 replies.
- Spark SQL with a sorted file - posted by Jerry Raj <je...@gmail.com> on 2014/12/04 04:34:18 UTC, 4 replies.
- Re: Having problem with Spark streaming with Kinesis - posted by "A.K.M. Ashrafuzzaman" <as...@gmail.com> on 2014/12/04 05:05:03 UTC, 3 replies.
- spark-submit on YARN is slow - posted by Tobias Pfeiffer <tg...@preferred.jp> on 2014/12/04 05:10:39 UTC, 15 replies.
- cannot submit python files on EC2 cluster - posted by chocjy <ji...@gmail.com> on 2014/12/04 05:17:38 UTC, 1 replies.
- Issue in executing Spark Application from Eclipse - posted by Stuti Awasthi <st...@hcl.com> on 2014/12/04 07:05:10 UTC, 3 replies.
- MLLIB model export: PMML vs MLLIB serialization - posted by sourabh <ch...@gmail.com> on 2014/12/04 07:11:19 UTC, 4 replies.
- Re: netty on classpath when using spark-submit - posted by Tobias Pfeiffer <tg...@preferred.jp> on 2014/12/04 07:13:12 UTC, 0 replies.
- Spark Streaming empty RDD issue - posted by Hafiz Mujadid <ha...@gmail.com> on 2014/12/04 07:47:02 UTC, 2 replies.
- serialization issue in case of case class is more than 1 - posted by Rahul Bindlish <ra...@nectechnologies.in> on 2014/12/04 08:54:15 UTC, 0 replies.
- Necessity for rdd replication. - posted by rapelly kartheek <ka...@gmail.com> on 2014/12/04 08:54:58 UTC, 1 replies.
- (Unknown) - posted by Subong Kim <su...@gmail.com> on 2014/12/04 10:13:48 UTC, 0 replies.
- map function - posted by Yifan LI <ia...@gmail.com> on 2014/12/04 10:26:26 UTC, 1 replies.
- Re: [question]Where can I get the log file - posted by Prannoy <pr...@sigmoidanalytics.com> on 2014/12/04 10:30:41 UTC, 0 replies.
- SchemaRDD partition on specific column values? - posted by nitin <ni...@gmail.com> on 2014/12/04 11:00:39 UTC, 5 replies.
- Determination of number of RDDs - posted by Deep Pradhan <pr...@gmail.com> on 2014/12/04 11:08:45 UTC, 2 replies.
- Re: map function - posted by Mark Hamstra <ma...@clearstorydata.com> on 2014/12/04 11:13:51 UTC, 1 replies.
- Example usage of StreamingListener - posted by Hafiz Mujadid <ha...@gmail.com> on 2014/12/04 12:00:10 UTC, 2 replies.
- Re: running Spark Streaming just once and stop it - posted by Hafiz Mujadid <ha...@gmail.com> on 2014/12/04 12:08:05 UTC, 0 replies.
- Spark-Streaming: output to cassandra - posted by m....@accenture.com on 2014/12/04 15:51:01 UTC, 9 replies.
- Efficient way to get top K values per key in (key, value) RDD? - posted by Theodore Vasiloudis <th...@gmail.com> on 2014/12/04 15:53:12 UTC, 1 replies.
- Market Basket Analysis - posted by Rohit Pujari <rp...@hortonworks.com> on 2014/12/04 15:58:31 UTC, 7 replies.
- Failed to read chunk exception - posted by Steve Lewis <lo...@gmail.com> on 2014/12/04 17:48:26 UTC, 1 replies.
- How can a function get a TaskContext - posted by Steve Lewis <lo...@gmail.com> on 2014/12/04 18:51:51 UTC, 0 replies.
- Stateful mapPartitions - posted by Akshat Aranya <aa...@gmail.com> on 2014/12/04 19:56:23 UTC, 4 replies.
- how do you turn off info logging when running in local mode - posted by Ron Ayoub <ro...@live.com> on 2014/12/04 20:18:26 UTC, 1 replies.
- Re: Can not see any spark metrics on ganglia-web - posted by danilopds <da...@gmail.com> on 2014/12/04 20:24:37 UTC, 0 replies.
- Re: Spark metrics for ganglia - posted by danilopds <da...@gmail.com> on 2014/12/04 20:35:52 UTC, 2 replies.
- spark-ec2 Web UI Problem - posted by Xingwei Yang <ha...@gmail.com> on 2014/12/04 22:41:57 UTC, 1 replies.
- How to make symbol for one column in Spark SQL. - posted by Tim Chou <ti...@gmail.com> on 2014/12/04 23:26:28 UTC, 2 replies.
- Unable to run applications on clusters on EC2 - posted by Xingwei Yang <ha...@gmail.com> on 2014/12/04 23:26:57 UTC, 1 replies.
- How to extend an one-to-one RDD of Spark that can be persisted? - posted by Peng Cheng <pc...@uow.edu.au> on 2014/12/04 23:35:11 UTC, 0 replies.
- Loading a large Hbase table into SPARK RDD takes quite long time - posted by bonnahu <bo...@gmail.com> on 2014/12/04 23:56:26 UTC, 4 replies.
- Re: Spark 1.1.0 Can not read snappy compressed sequence file - posted by Stéphane Verlet <ka...@gmail.com> on 2014/12/05 00:21:03 UTC, 0 replies.
- printing mllib.linalg.vector - posted by debbie <de...@hotmail.com> on 2014/12/05 00:32:42 UTC, 1 replies.
- Getting all the results from MySQL table - posted by gargp <pr...@gmail.com> on 2014/12/05 00:58:02 UTC, 0 replies.
- representing RDF literals as vertex properties - posted by spr <sp...@yarcdata.com> on 2014/12/05 01:26:50 UTC, 3 replies.
- Re: scopt.OptionParser - posted by Caron <ca...@gmail.com> on 2014/12/05 02:45:01 UTC, 0 replies.
- spark assembly jar caused "changed on src filesystem" error - posted by "Hu, Leo" <le...@sap.com> on 2014/12/05 03:23:24 UTC, 1 replies.
- Exception adding resource files in latest Spark - posted by Jianshi Huang <ji...@gmail.com> on 2014/12/05 04:37:50 UTC, 6 replies.
- SPARK LIMITATION - more than one case class is not allowed !! - posted by Rahul Bindlish <ra...@nectechnologies.in> on 2014/12/05 04:53:11 UTC, 10 replies.
- Window function by Spark SQL - posted by "Dai, Kevin" <yu...@ebay.com> on 2014/12/05 05:22:20 UTC, 1 replies.
- SparkContext.textfile() cannot load file using UNC path on windows - posted by Ningjun Wang <ni...@gmail.com> on 2014/12/05 05:28:33 UTC, 0 replies.
- Re: How to incrementally compile spark examples using mvn - posted by MEETHU MATHEW <me...@yahoo.co.in> on 2014/12/05 07:23:35 UTC, 6 replies.
- RDD.aggregate? - posted by ll <du...@gmail.com> on 2014/12/05 07:36:48 UTC, 3 replies.
- Clarifications on Spark - posted by Ajay <aj...@gmail.com> on 2014/12/05 08:25:03 UTC, 1 replies.
- Re: Auto BroadcastJoin optimization failed in latest Spark - posted by Jianshi Huang <ji...@gmail.com> on 2014/12/05 08:30:08 UTC, 2 replies.
- drop table if exists throws exception - posted by Jianshi Huang <ji...@gmail.com> on 2014/12/05 08:31:56 UTC, 3 replies.
- Profiling GraphX codes. - posted by Deep Pradhan <pr...@gmail.com> on 2014/12/05 09:33:25 UTC, 0 replies.
- Spark streaming for v1.1.1 - unable to start application - posted by Sourav Chandra <so...@livestream.com> on 2014/12/05 09:36:28 UTC, 2 replies.
- Issue on [SPARK-3877][YARN]: Return code of the spark-submit in yarn-cluster mode - posted by LinQili <li...@outlook.com> on 2014/12/05 09:55:37 UTC, 4 replies.
- Re: NullPointerException When Reading Avro Sequence Files - posted by cjdc <cr...@cern.ch> on 2014/12/05 10:52:34 UTC, 3 replies.
- Increasing the number of retry in case of job failure - posted by shahab <sh...@gmail.com> on 2014/12/05 11:02:10 UTC, 2 replies.
- [Graphx] which way is better to access faraway neighbors? - posted by Yifan LI <ia...@gmail.com> on 2014/12/05 11:26:52 UTC, 1 replies.
- scala.MatchError on SparkSQL when creating ArrayType of StructType - posted by Hao Ren <in...@gmail.com> on 2014/12/05 11:36:37 UTC, 1 replies.
- How can I compile only the core and streaming (so that I can get test utilities of streaming)? - posted by Emre Sevinc <em...@gmail.com> on 2014/12/05 12:52:54 UTC, 3 replies.
- cartesian on pyspark not paralleised - posted by Antony Mayi <an...@yahoo.com.INVALID> on 2014/12/05 14:07:13 UTC, 1 replies.
- spark streaming kafa best practices ? - posted by david <da...@free.fr> on 2014/12/05 14:26:10 UTC, 4 replies.
- subscribe me to the list - posted by "Wang, Ningjun (LNG-NPV)" <ni...@lexisnexis.com> on 2014/12/05 15:36:25 UTC, 1 replies.
- Why my default partition size is set to 52 ? - posted by Jaonary Rabarisoa <ja...@gmail.com> on 2014/12/05 15:51:40 UTC, 2 replies.
- Why KMeans with mllib is so slow ? - posted by Jaonary Rabarisoa <ja...@gmail.com> on 2014/12/05 16:50:07 UTC, 8 replies.
- pyspark exception catch - posted by Igor Mazor <ig...@rocket-internet.de> on 2014/12/05 17:12:57 UTC, 2 replies.
- Spark Streaming Reusing JDBC Connections - posted by Asim Jalis <as...@gmail.com> on 2014/12/05 18:32:24 UTC, 1 replies.
- Adding Spark Cassandra dependency breaks Spark Streaming? - posted by Ashic Mahtab <as...@live.com> on 2014/12/05 18:35:59 UTC, 5 replies.
- I am having problems reading files in the 4GB range - posted by Steve Lewis <lo...@gmail.com> on 2014/12/05 19:51:07 UTC, 0 replies.
- Optimized spark configuration - posted by "vdiwakar.malladi" <vd...@gmail.com> on 2014/12/05 19:51:29 UTC, 1 replies.
- Java RDD Union - posted by Ron Ayoub <ro...@live.com> on 2014/12/05 20:27:03 UTC, 6 replies.
- Including data nucleus tools - posted by sp...@seznam.cz on 2014/12/05 21:25:04 UTC, 8 replies.
- Re: Using data in RDD to specify HDFS directory to write to - posted by Nathan Murthy <na...@gmail.com> on 2014/12/05 22:22:02 UTC, 0 replies.
- Cannot PredictOnValues or PredictOn base on the model build with StreamingLinearRegressionWithSGD - posted by "Bui, Tri" <Tr...@VerizonWireless.com.INVALID> on 2014/12/05 23:01:14 UTC, 0 replies.
- Running two different Spark jobs vs multi-threading RDDs - posted by Corey Nolet <cj...@gmail.com> on 2014/12/05 23:19:26 UTC, 2 replies.
- Transfer from RDD to JavaRDD - posted by Xingwei Yang <ha...@gmail.com> on 2014/12/06 00:51:09 UTC, 1 replies.
- Hive Problem in Pig generated Parquet file schema in CREATE EXTERNAL TABLE (e.g. bag::col1) - posted by Jianshi Huang <ji...@gmail.com> on 2014/12/06 01:41:42 UTC, 8 replies.
- Problems creating and reading a large test file - posted by Steve Lewis <lo...@gmail.com> on 2014/12/06 02:21:23 UTC, 0 replies.
- Re: rdd.saveAsTextFile problem - posted by dylanhogg <dy...@gmail.com> on 2014/12/06 03:26:20 UTC, 0 replies.
- Trying to understand a basic difference between these two configurations - posted by Soumya Simanta <so...@gmail.com> on 2014/12/06 04:31:36 UTC, 2 replies.
- Fair scheduling accross applications in stand-alone mode - posted by Mohammed Guller <mo...@glassbeam.com> on 2014/12/06 08:25:32 UTC, 1 replies.
- Modifying an RDD in forEach - posted by Ron Ayoub <ro...@live.com> on 2014/12/06 13:42:00 UTC, 4 replies.
- PySpark Loading Json Following by groupByKey seems broken in spark 1.1.1 - posted by Brad Willard <in...@bradwillard.com> on 2014/12/06 15:02:29 UTC, 0 replies.
- Where can you get nightly builds? - posted by Simone Franzini <ca...@gmail.com> on 2014/12/06 15:41:11 UTC, 2 replies.
- Is there a way to force spark to use specific ips? - posted by Ashic Mahtab <as...@live.com> on 2014/12/06 16:37:07 UTC, 2 replies.
- Spark on YARN memory utilization - posted by Denny Lee <de...@gmail.com> on 2014/12/06 22:27:49 UTC, 4 replies.
- run JavaAPISuite with mavem - posted by Koert Kuipers <ko...@tresata.com> on 2014/12/07 02:43:31 UTC, 11 replies.
- java.lang.ExceptionInInitializerError/Unable to load YARN support - posted by maven <ni...@gmail.com> on 2014/12/07 02:55:37 UTC, 4 replies.
- "vcores used" in cluster metrics(yarn resource manager ui) when running spark on yarn - posted by yuemeng1 <yu...@huawei.com> on 2014/12/07 04:29:22 UTC, 1 replies.
- Recovered executor num in yarn-client mode - posted by yuemeng1 <yu...@huawei.com> on 2014/12/07 04:29:30 UTC, 0 replies.
- Convert RDD[Map[String, Any]] to SchemaRDD - posted by Jianshi Huang <ji...@gmail.com> on 2014/12/07 07:32:30 UTC, 4 replies.
- MLlib(Logistic Regression) + Spark Streaming. - posted by Nasir Khan <na...@gmail.com> on 2014/12/07 11:30:12 UTC, 2 replies.
- RE: Bulk-load to HBase - posted by fralken <fm...@tiscali.it> on 2014/12/07 18:17:52 UTC, 0 replies.
- NoClassDefFoundError - posted by Julius K <fo...@gmail.com> on 2014/12/07 18:35:45 UTC, 2 replies.
- saveAsParquetFile and DirectFileOutputCommitter Class not found Error - posted by "Addanki, Santosh Kumar" <sa...@sap.com> on 2014/12/07 20:28:25 UTC, 0 replies.
- Spark SQL: How to get the hierarchical element with SQL? - posted by Xuelin Cao <xu...@yahoo.com.INVALID> on 2014/12/08 06:08:19 UTC, 5 replies.
- Print Node info. of Decision Tree - posted by jake Lim <it...@gmail.com> on 2014/12/08 06:17:34 UTC, 1 replies.
- spark Exception while performing saveAsTextFiles - posted by Hafiz Mujadid <ha...@gmail.com> on 2014/12/08 08:19:46 UTC, 1 replies.
- monitoring for spark standalone - posted by Judy Nash <ju...@exchange.microsoft.com> on 2014/12/08 08:35:06 UTC, 2 replies.
- Is there a way to get column names using hiveContext ? - posted by abhishek <re...@gmail.com> on 2014/12/08 08:36:11 UTC, 1 replies.
- Count-based windows - posted by Tobias Pfeiffer <tg...@preferred.jp> on 2014/12/08 09:56:46 UTC, 0 replies.
- How can I make Spark Streaming count the words in a file in a unit test? - posted by Emre Sevinc <em...@gmail.com> on 2014/12/08 11:36:41 UTC, 1 replies.
- Conference in Paris next Thursday - posted by Alexis Seigneurin <al...@gmail.com> on 2014/12/08 12:03:00 UTC, 0 replies.
- column level encryption/decryption with key management - posted by Chirag Aggarwal <Ch...@guavus.com> on 2014/12/08 12:05:08 UTC, 0 replies.
- Efficient self-joins - posted by Theodore Vasiloudis <th...@gmail.com> on 2014/12/08 14:50:47 UTC, 8 replies.
- Programmatically running spark jobs using yarn-client - posted by Aniket Bhatnagar <an...@gmail.com> on 2014/12/08 14:53:23 UTC, 2 replies.
- Error when mapping a schema RDD when converting lists - posted by sahanbull <sa...@skimlinks.com> on 2014/12/08 15:38:13 UTC, 2 replies.
- Locking for shared RDDs - posted by "aditya.athalye" <ad...@gmail.com> on 2014/12/08 15:43:30 UTC, 2 replies.
- Understanding reported times on the Spark UI [+ Streaming] - posted by Gerard Maas <ge...@gmail.com> on 2014/12/08 16:38:59 UTC, 0 replies.
- Install Apache Spark on a Cluster - posted by riginos <sa...@gmail.com> on 2014/12/08 18:31:04 UTC, 1 replies.
- Error: Spark-streaming to Cassandra - posted by m....@accenture.com on 2014/12/08 18:47:31 UTC, 2 replies.
- Spark-SQL JDBC driver - posted by Anas Mosaad <an...@incorta.com> on 2014/12/08 20:01:04 UTC, 11 replies.
- How can I create an RDD with millions of entries created programmatically - posted by Steve Lewis <lo...@gmail.com> on 2014/12/08 20:05:38 UTC, 3 replies.
- SOLVED -- Re: scopt.OptionParser - posted by Caron <ca...@gmail.com> on 2014/12/08 20:36:53 UTC, 0 replies.
- MLLIb: Linear regression: Loss was due to java.lang.ArrayIndexOutOfBoundsException - posted by Sameer Tilak <ss...@live.com> on 2014/12/08 21:54:37 UTC, 0 replies.
- Transform SchemaRDDs into new SchemaRDDs - posted by Sunita Arvind <su...@gmail.com> on 2014/12/08 22:43:54 UTC, 0 replies.
- Learning rate or stepsize automation - posted by "Bui, Tri" <Tr...@VerizonWireless.com.INVALID> on 2014/12/09 00:04:52 UTC, 2 replies.
- Error outputing to CSV file - posted by DataNut <ro...@fmr.com> on 2014/12/09 00:49:59 UTC, 2 replies.
- MLLib /ALS : java.lang.OutOfMemoryError: Java heap space - posted by jaykatukuri <jk...@apple.com> on 2014/12/09 01:19:27 UTC, 5 replies.
- In Java how can I create an RDD with a large number of elements - posted by Steve Lewis <lo...@gmail.com> on 2014/12/09 03:17:33 UTC, 1 replies.
- Is there an efficient way to append new data to a registered Spark SQL Table? - posted by Xuelin Cao <xu...@yahoo.com.INVALID> on 2014/12/09 03:29:19 UTC, 2 replies.
- Can not write out data as snappy-compressed files - posted by Tao Xiao <xi...@gmail.com> on 2014/12/09 03:30:52 UTC, 0 replies.
- query classification using Apache spark Mlib - posted by "Huang,Jin" <hu...@gmail.com> on 2014/12/09 04:09:11 UTC, 0 replies.
- How to convert RDD to JSON? - posted by YaoPau <jo...@gmail.com> on 2014/12/09 05:52:54 UTC, 2 replies.
- Saving Data only if Dstream is not empty - posted by Hafiz Mujadid <ha...@gmail.com> on 2014/12/09 06:53:38 UTC, 3 replies.
- Writing and reading file faster than memory option - posted by Malte <ma...@gmail.com> on 2014/12/09 07:30:03 UTC, 0 replies.
- spark broadcast unavailable - posted by 十六夜涙 <cr...@qq.com> on 2014/12/09 07:42:20 UTC, 2 replies.
- PhysicalRDD problem? - posted by nitin <ni...@gmail.com> on 2014/12/09 08:22:24 UTC, 4 replies.
- Re: spark sql - save to Parquet file - Unsupported datatype TimestampType - posted by "ZHENG, Xu-dong" <do...@gmail.com> on 2014/12/09 08:27:40 UTC, 1 replies.
- Issues on schemaRDD's function got stuck - posted by LinQili <li...@outlook.com> on 2014/12/09 08:54:14 UTC, 2 replies.
- Re: do not assemble the spark example jar - posted by lihu <li...@gmail.com> on 2014/12/09 09:27:18 UTC, 0 replies.
- KafkaUtils explicit acks - posted by Mukesh Jha <me...@gmail.com> on 2014/12/09 10:23:13 UTC, 8 replies.
- spark 1.1.1 Maven dependency - posted by sivarani <wh...@gmail.com> on 2014/12/09 10:46:01 UTC, 1 replies.
- NoSuchMethodError: writing spark-streaming data to cassandra - posted by m....@accenture.com on 2014/12/09 11:16:41 UTC, 2 replies.
- Using S3 block file system - posted by Paul Colomiets <pa...@colomiets.name> on 2014/12/09 11:44:13 UTC, 0 replies.
- Stack overflow Error while executing spark SQL - posted by ji...@wipro.com on 2014/12/09 12:34:55 UTC, 1 replies.
- reg JDBCRDD code - posted by Deepa Jayaveer <de...@tcs.com> on 2014/12/09 14:09:13 UTC, 6 replies.
- PySpark elasticsearch question - posted by Mohamed Lrhazi <Mo...@georgetown.edu> on 2014/12/09 14:15:55 UTC, 2 replies.
- registerTempTable: Table not found - posted by Hao Ren <in...@gmail.com> on 2014/12/09 14:30:49 UTC, 2 replies.
- Help realted with spark streaming usage error - posted by Saurabh Pateriya <sa...@happiestminds.com> on 2014/12/09 14:44:48 UTC, 0 replies.
- Specifying number of executors in Mesos - posted by Gerard Maas <ge...@gmail.com> on 2014/12/09 15:20:02 UTC, 2 replies.
- Unable to start Spark 1.3 after building:java.lang. NoClassDefFoundError: org/codehaus/jackson/map/deser/std/StdDeserializer - posted by Daniel Haviv <da...@gmail.com> on 2014/12/09 15:58:29 UTC, 7 replies.
- Submit application to spark on mesos cluster - posted by Han JU <ju...@gmail.com> on 2014/12/09 16:11:48 UTC, 0 replies.
- pyspark sc.textFile uses only 4 out of 32 threads per node - posted by Gautham <ga...@gmail.com> on 2014/12/09 19:59:41 UTC, 3 replies.
- PySprak and UnsupportedOperationException - posted by Mohamed Lrhazi <Mo...@georgetown.edu> on 2014/12/09 20:32:22 UTC, 3 replies.
- yarn log on EMR - posted by Tyson <ty...@gmail.com> on 2014/12/09 21:50:40 UTC, 0 replies.
- Caching RDDs with shared memory - bug or feature? - posted by insperatum <in...@gmail.com> on 2014/12/09 22:42:12 UTC, 0 replies.
- spark shell and hive context problem - posted by minajagi <ch...@jpmorgan.com> on 2014/12/09 23:07:21 UTC, 1 replies.
- spark shell session crashes when trying to obtain hive context - posted by minajagi <ch...@jpmorgan.com> on 2014/12/09 23:10:04 UTC, 0 replies.
- implement query to sparse vector representation in spark - posted by "Huang,Jin" <hu...@gmail.com> on 2014/12/09 23:12:35 UTC, 0 replies.
- equivalent to sql in - posted by dizzy5112 <da...@gmail.com> on 2014/12/09 23:15:35 UTC, 3 replies.
- Re: query classification using spark Mlib - posted by sparkEbay <jx...@hotmail.com> on 2014/12/09 23:24:30 UTC, 0 replies.
- Fwd: Please add us to the Spark users page - posted by Abhik Majumdar <ab...@vidora.com> on 2014/12/10 00:34:22 UTC, 0 replies.
- Cluster getting a null pointer error - posted by Eric Tanner <er...@justenough.com> on 2014/12/10 00:58:43 UTC, 1 replies.
- Reading Yarn log on EMR - posted by Nagy István <ty...@gmail.com> on 2014/12/10 01:48:53 UTC, 1 replies.
- Workers keep dying on EC2 Spark cluster: PriviledgedActionException - posted by Jeff Schecter <je...@levelmoney.com> on 2014/12/10 02:35:49 UTC, 0 replies.
- Can HiveContext be used without using Hive? - posted by Manoj Samel <ma...@gmail.com> on 2014/12/10 02:51:19 UTC, 3 replies.
- Mllib error - posted by amin mohebbi <am...@yahoo.com.INVALID> on 2014/12/10 06:11:39 UTC, 1 replies.
- Stack overflow Error while executing spark SQL - posted by Jishnu Prathap <ji...@wipro.com> on 2014/12/10 08:07:40 UTC, 1 replies.
- Re: Spark SQL Stackoverflow error - posted by Jishnu Prathap <ji...@wipro.com> on 2014/12/10 08:52:51 UTC, 0 replies.
- 回复: spark broadcast unavailable - posted by 十六夜涙 <cr...@qq.com> on 2014/12/10 09:30:55 UTC, 0 replies.
- Actor System Corrupted! - posted by "Stephen Samuel (Sam)" <sa...@sksamuel.com> on 2014/12/10 11:18:38 UTC, 0 replies.
- MLLib in Production - posted by Klausen Schaefersinho <kl...@gmail.com> on 2014/12/10 11:25:51 UTC, 4 replies.
- Maven profile in MLLib netlib-lgpl not working (1.1.1) - posted by Guillaume Pitel <gu...@exensa.com> on 2014/12/10 11:42:36 UTC, 0 replies.
- flatMap and spilling of output to disk - posted by Johannes Simon <jo...@mail.de> on 2014/12/10 13:13:04 UTC, 4 replies.
- DIMSUM and ColumnSimilarity use case ? - posted by Jaonary Rabarisoa <ja...@gmail.com> on 2014/12/10 15:53:21 UTC, 3 replies.
- KryoException: Buffer overflow for very small input - posted by JoeWass <jo...@afandian.com> on 2014/12/10 16:49:34 UTC, 0 replies.
- how to run spark function in a tomcat servlet - posted by bai阿蒙 <sm...@hotmail.com> on 2014/12/10 17:19:32 UTC, 1 replies.
- Spark 1.1.0 does not spawn more than 6 executors in yarn-client mode and ignores --num-executors - posted by Aniket Bhatnagar <an...@gmail.com> on 2014/12/10 19:24:02 UTC, 1 replies.
- Issue when upgrading from Spark 1.1.0 to 1.1.1: Exception of java.lang.NoClassDefFoundError: io/netty/util/TimerTask - posted by "S. Zhou" <my...@yahoo.com.INVALID> on 2014/12/10 19:45:04 UTC, 1 replies.
- Spark 1.1.1 SQLContext.jsonFile dumps trace if JSON has newlines ... - posted by Manoj Samel <ma...@gmail.com> on 2014/12/10 20:48:13 UTC, 1 replies.
- Spark 1.0.0 Standalone mode config - posted by 9000revs <90...@gmail.com> on 2014/12/10 20:59:40 UTC, 1 replies.
- MLlib: Libsvm: Loss was due to java.lang.ArrayIndexOutOfBoundsException - posted by Sameer Tilak <ss...@live.com> on 2014/12/10 21:34:25 UTC, 0 replies.
- Key not valid / already cancelled using Spark Streaming - posted by Flávio Santos <ba...@chaordicsystems.com> on 2014/12/10 21:44:25 UTC, 5 replies.
- Trouble with cache() and parquet - posted by Yana Kadiyska <ya...@gmail.com> on 2014/12/10 21:46:01 UTC, 3 replies.
- Filtering nested data using Spark SQL - posted by Jerry Lam <ch...@gmail.com> on 2014/12/11 00:10:41 UTC, 0 replies.
- Unread block issue w/ spark 1.1.0 on CDH5 - posted by Anson Abraham <an...@gmail.com> on 2014/12/11 00:44:48 UTC, 0 replies.
- Regarding Classification of Big Data - posted by Chintan Bhatt <ch...@charusat.ac.in> on 2014/12/11 01:34:48 UTC, 0 replies.
- Using Intellij for pyspark - posted by Stephen Boesch <ja...@gmail.com> on 2014/12/11 01:46:18 UTC, 0 replies.
- RDDs being cleaned too fast - posted by ankits <an...@gmail.com> on 2014/12/11 02:34:52 UTC, 3 replies.
- Partitioner in sortBy - posted by Kevin Jung <it...@samsung.com> on 2014/12/11 02:42:54 UTC, 0 replies.
- Decision Tree with libsvmtools datasets - posted by "Ge, Yao (Y.)" <yg...@ford.com> on 2014/12/11 04:40:55 UTC, 1 replies.
- Decision Tree with Categorical Features - posted by "Ge, Yao (Y.)" <yg...@ford.com> on 2014/12/11 04:51:16 UTC, 0 replies.
- Compare performance of sqlContext.jsonFile and sqlContext.jsonRDD - posted by Rakesh Nair <ra...@gmail.com> on 2014/12/11 05:50:08 UTC, 1 replies.
- parquet file not loading (spark v 1.1.0) - posted by Rahul Bindlish <ra...@nectechnologies.in> on 2014/12/11 07:06:15 UTC, 1 replies.
- Error on JavaSparkContext.stop() - posted by Taeyun Kim <ta...@innowireless.com> on 2014/12/11 08:37:09 UTC, 3 replies.
- Spark steaming : work with collect() but not without collect() - posted by david <da...@free.fr> on 2014/12/11 10:24:13 UTC, 2 replies.
- spark logging issue - posted by Sourav Chandra <so...@livestream.com> on 2014/12/11 12:32:06 UTC, 0 replies.
- Can spark job have sideeffects (write files to FileSystem) - posted by Paweł Szulc <pa...@gmail.com> on 2014/12/11 12:50:36 UTC, 3 replies.
- Re: Standalone spark cluster. Can't submit job programmatically -> java.io.InvalidClassException - posted by sivarani <wh...@gmail.com> on 2014/12/11 13:19:41 UTC, 0 replies.
- "Session" for connections? - posted by Ashic Mahtab <as...@live.com> on 2014/12/11 13:28:35 UTC, 8 replies.
- ERROR YarnClientClusterScheduler: Lost executor Akka client disassociated - posted by Muhammad Ahsan <mu...@gmail.com> on 2014/12/11 13:42:09 UTC, 2 replies.
- Spark streaming: missing classes when kafka consumer classes - posted by Mario Pastorelli <ma...@teralytics.ch> on 2014/12/11 13:52:43 UTC, 7 replies.
- Spark SQL Vs CQL performance on Cassandra - posted by Ajay <aj...@gmail.com> on 2014/12/11 14:05:09 UTC, 0 replies.
- Standalone app: IOException due to broadcast.destroy() - posted by Alberto Garcia <ag...@inf.uc3m.es> on 2014/12/11 14:38:31 UTC, 0 replies.
- Newbie Question - posted by "Fernando O." <fo...@gmail.com> on 2014/12/11 16:50:06 UTC, 0 replies.
- Exception using amazonaws library - posted by Albert Manyà <al...@eml.cc> on 2014/12/11 17:43:38 UTC, 2 replies.
- Different Vertex Ids in Graph and Edges - posted by th0rsten <th...@online.de> on 2014/12/11 17:57:00 UTC, 0 replies.
- broadcast: OutOfMemoryError - posted by ll <du...@gmail.com> on 2014/12/11 19:14:05 UTC, 1 replies.
- why is spark + scala code so slow, compared to python? - posted by ll <du...@gmail.com> on 2014/12/11 19:23:41 UTC, 6 replies.
- custom spark app name in yarn-cluster mode - posted by Tomer Benyamini <to...@gmail.com> on 2014/12/11 19:27:40 UTC, 3 replies.
- Proper way to check SparkContext's status within code - posted by Edwin <al...@yahoo.com> on 2014/12/11 20:40:09 UTC, 0 replies.
- Native library error when trying to use Spark with Snappy files - posted by Rich Haase <rh...@pandora.com> on 2014/12/11 22:53:02 UTC, 0 replies.
- Running spark-submit from a remote machine using a YARN application - posted by ryaminal <ta...@gmail.com> on 2014/12/11 23:01:51 UTC, 1 replies.
- Job status from Python - posted by Michael Nazario <mn...@palantir.com> on 2014/12/11 23:41:42 UTC, 0 replies.
- Spark Streaming in Production - posted by twizansk <tw...@gmail.com> on 2014/12/12 01:03:28 UTC, 5 replies.
- Spark Server - How to implement - posted by Manoj Samel <ma...@gmail.com> on 2014/12/12 02:33:48 UTC, 4 replies.
- Re: KryoRegistrator exception and Kryo class not found while compiling - posted by bonnahu <bo...@gmail.com> on 2014/12/12 02:51:12 UTC, 0 replies.
- Re: KryoSerializer exception in Spark Streaming JAVA - posted by bonnahu <bo...@gmail.com> on 2014/12/12 02:55:24 UTC, 1 replies.
- Job doesn't start when external shuffle service enabled - posted by Tsuyoshi OZAWA <oz...@gmail.com> on 2014/12/12 04:25:27 UTC, 0 replies.
- GroupBy and nested Top on - posted by sparkuser2014 <Co...@crowdstrike.com> on 2014/12/12 06:23:51 UTC, 0 replies.
- Using Spark at the U.S.Treasury - posted by Max Funk <ma...@systemaccounting.org> on 2014/12/12 07:04:25 UTC, 0 replies.
- Adding a column to a SchemaRDD - posted by Nathan Kronenfeld <nk...@oculusinfo.com> on 2014/12/12 07:11:43 UTC, 5 replies.
- Mllib Error - posted by amin mohebbi <am...@yahoo.com.INVALID> on 2014/12/12 07:52:01 UTC, 1 replies.
- Re: Remote jar file - posted by rahulkumar-aws <ra...@gmail.com> on 2014/12/12 08:45:33 UTC, 0 replies.
- Re: Access to s3 from spark - posted by rahulkumar-aws <ra...@gmail.com> on 2014/12/12 09:16:41 UTC, 0 replies.
- Serialization issue when using HBase with Spark - posted by yangliuyu <ya...@163.com> on 2014/12/12 09:35:57 UTC, 5 replies.
- [Graphx] the communication cost of leftJoin - posted by Yifan LI <ia...@gmail.com> on 2014/12/12 12:40:05 UTC, 0 replies.
- Spark CDH5 packages - posted by Jing Dong <ji...@qubitdigital.com> on 2014/12/12 13:26:35 UTC, 2 replies.
- Read data from SparkStreaming from Java socket. - posted by Guillermo Ortiz <ko...@gmail.com> on 2014/12/12 13:55:37 UTC, 8 replies.
- ...FileNotFoundException: Path is not a file: - error on accessing HDFS with sc.wholeTextFiles - posted by Karen Murphy <k....@qub.ac.uk> on 2014/12/12 14:04:21 UTC, 2 replies.
- Unit testing and Spark Streaming - posted by Eric Loots <er...@gmail.com> on 2014/12/12 14:17:37 UTC, 2 replies.
- Re: Spark 1.1.1, Hadoop 2.6 - Protobuf conflict - posted by kmurph <k....@qub.ac.uk> on 2014/12/12 15:08:16 UTC, 1 replies.
- Cannot pickle DecisionTreeModel in the pyspark - posted by Gen <ge...@gmail.com> on 2014/12/12 16:41:37 UTC, 0 replies.
- Do I need to applied feature scaling via StandardScaler for LBFGS for Linear Regression? - posted by "Bui, Tri" <Tr...@VerizonWireless.com.INVALID> on 2014/12/12 17:49:22 UTC, 6 replies.
- RDD lineage and broadcast variables - posted by Ron Ayoub <ro...@live.com> on 2014/12/12 17:52:09 UTC, 0 replies.
- Passing Spark Configuration from Driver (Master) to all of the Slave nodes - posted by Demi Ben-Ari <de...@gmail.com> on 2014/12/12 18:38:12 UTC, 2 replies.
- How to get driver id? - posted by Xingwei Yang <ha...@gmail.com> on 2014/12/12 19:10:39 UTC, 0 replies.
- resource allocation spark on yarn - posted by gpatcham <gp...@gmail.com> on 2014/12/12 19:52:25 UTC, 3 replies.
- GraphX for large scale PageRank (~4 billion nodes, ~128 billion edges) - posted by Stephen Merity <st...@commoncrawl.org> on 2014/12/12 20:18:44 UTC, 1 replies.
- how to convert an rdd to a single output file - posted by Steve Lewis <lo...@gmail.com> on 2014/12/12 20:19:45 UTC, 4 replies.
- Spark 1.2 + Avro file does not work in HDP2.2 - posted by Manas Kar <ma...@gmail.com> on 2014/12/12 20:38:04 UTC, 1 replies.
- Spark 1.2 + Avro does not work in HDP2.2 - posted by manasdebashiskar <ma...@gmail.com> on 2014/12/12 20:48:45 UTC, 1 replies.
- java.lang.IllegalStateException: unread block data - posted by Morbious <kn...@gmail.com> on 2014/12/12 21:37:18 UTC, 5 replies.
- IBM open-sources Spark Kernel - posted by Robert C Senkbeil <rc...@us.ibm.com> on 2014/12/12 23:16:45 UTC, 2 replies.
- Re: Submiting multiple jobs via different threads - posted by Michael Quinlan <mq...@gmail.com> on 2014/12/12 23:22:26 UTC, 0 replies.
- Spark SQL API Doc & IsCached as SQL command - posted by Judy Nash <ju...@exchange.microsoft.com> on 2014/12/13 00:14:21 UTC, 3 replies.
- SVMWithSGD.run source code - posted by Caron <ca...@gmail.com> on 2014/12/13 01:52:03 UTC, 1 replies.
- sbt assembly with hive - posted by Stephen Boesch <ja...@gmail.com> on 2014/12/13 03:40:48 UTC, 1 replies.
- clean up of state in State Dstream - posted by Sunil Yarram <yv...@gmail.com> on 2014/12/13 03:44:03 UTC, 1 replies.
- Spark SQL Roadmap? - posted by Xiaoyong Zhu <xi...@microsoft.com> on 2014/12/13 13:59:55 UTC, 3 replies.
- unread block data when reading from NFS - posted by gtinside <gt...@gmail.com> on 2014/12/13 15:23:46 UTC, 1 replies.
- JSON Input files - posted by Madabhattula Rajesh Kumar <mr...@gmail.com> on 2014/12/13 17:18:54 UTC, 9 replies.
- Building Desktop application for ALS-MlLib/ Training ALS - posted by Saurabh Agrawal <sa...@markit.com> on 2014/12/13 20:05:33 UTC, 3 replies.
- Nabble mailing list mirror errors: "This post has NOT been accepted by the mailing list yet" - posted by Josh Rosen <ro...@gmail.com> on 2014/12/13 23:05:56 UTC, 4 replies.
- Calling ALS-MlLib from desktop application/ Training ALS - posted by Saurabh Agrawal <sa...@markit.com> on 2014/12/14 04:17:07 UTC, 1 replies.
- pyspark is crashing in this case. why? - posted by genesis fatum <ge...@gmail.com> on 2014/12/14 15:03:39 UTC, 2 replies.
- MLlib vs Madlib - posted by "Venkat, Ankam" <An...@centurylink.com> on 2014/12/14 15:26:09 UTC, 3 replies.
- Limit the # of columns in Spark Scala - posted by Denny Lee <de...@gmail.com> on 2014/12/14 17:15:42 UTC, 6 replies.
- DStream demultiplexer based on a key - posted by Jean-Pascal Billaud <jp...@tellapart.com> on 2014/12/14 19:50:45 UTC, 3 replies.
- HTTP 500 Error for SparkUI in YARN Cluster mode - posted by Benyi Wang <be...@gmail.com> on 2014/12/14 20:26:42 UTC, 0 replies.
- spark kafka batch integration - posted by Koert Kuipers <ko...@tresata.com> on 2014/12/14 21:41:58 UTC, 2 replies.
- Run Spark job on Playframework + Spark Master/Worker in one Mac - posted by Tomoya Igarashi <to...@gmail.com> on 2014/12/15 03:12:17 UTC, 5 replies.
- Spark Streaming Python APIs? - posted by Xiaoyong Zhu <xi...@microsoft.com> on 2014/12/15 03:52:30 UTC, 5 replies.
- Q about Spark MLlib- Decision tree - scala.MatchError: 2.0 (of class java.lang.Double) - posted by jake Lim <it...@gmail.com> on 2014/12/15 06:46:12 UTC, 0 replies.
- Spark inserting into parquet files with different schema - posted by AdamPD <ad...@pharmadata.net.au> on 2014/12/15 09:10:47 UTC, 1 replies.
- RDD vs Broadcast - posted by elitejyo <el...@yahoo.co.in> on 2014/12/15 09:31:28 UTC, 0 replies.
- Why my SQL UDF cannot be registered? - posted by Xuelin Cao <xu...@yahoo.com.INVALID> on 2014/12/15 10:27:40 UTC, 1 replies.
- HiveQL support in Cassandra-Spark connector - posted by shahab <sh...@gmail.com> on 2014/12/15 11:11:56 UTC, 0 replies.
- Migrating Parquet inputs - posted by Marius Soutier <mp...@gmail.com> on 2014/12/15 12:44:25 UTC, 0 replies.
- Re: stage failure: java.lang.IllegalStateException: unread block data - posted by Akhil <ak...@sigmoidanalytics.com> on 2014/12/15 14:42:01 UTC, 0 replies.
- is there a way to interact with Spark clusters remotely? - posted by Xiaoyong Zhu <xi...@microsoft.com> on 2014/12/15 15:17:00 UTC, 3 replies.
- Re: Pagerank implementation - posted by kmurph <k....@qub.ac.uk> on 2014/12/15 16:55:25 UTC, 0 replies.
- integrating long-running Spark jobs with Thriftserver - posted by Tim Schweichler <Ti...@healthination.com> on 2014/12/15 16:56:20 UTC, 2 replies.
- Serialize mllib's MatrixFactorizationModel - posted by Albert Manyà <al...@eml.cc> on 2014/12/15 17:33:36 UTC, 4 replies.
- Intermittent test failures - posted by Marius Soutier <mp...@gmail.com> on 2014/12/15 17:48:33 UTC, 5 replies.
- Custom UDTF with Lateral View throws ClassNotFound exception in Spark SQL CLI - posted by shenghua <wa...@gmail.com> on 2014/12/15 19:54:05 UTC, 2 replies.
- Accessing rows of a row in Spark - posted by Jerry Lam <ch...@gmail.com> on 2014/12/15 20:04:45 UTC, 3 replies.
- Re: MLLIb: Linear regression: Loss was due to java.lang.ArrayIndexOutOfBoundsException - posted by Xiangrui Meng <me...@gmail.com> on 2014/12/15 20:52:05 UTC, 0 replies.
- Stop streaming context gracefully when SIGTERM is passed - posted by "Budde, Adam" <bu...@amazon.com> on 2014/12/15 22:32:54 UTC, 1 replies.
- NumberFormatException - posted by yu <yu...@iastate.edu> on 2014/12/15 22:49:02 UTC, 4 replies.
- MLLib: Saving and loading a model - posted by Sameer Tilak <ss...@live.com> on 2014/12/15 23:21:47 UTC, 1 replies.
- Executor memory - posted by Pala M Muthaia <mc...@rocketfuelinc.com> on 2014/12/16 01:53:21 UTC, 3 replies.
- Re: NotSerializableException in Spark Streaming - posted by Nicholas Chammas <ni...@gmail.com> on 2014/12/16 03:24:09 UTC, 0 replies.
- Fetch Failed caused job failed. - posted by Mars Max <ma...@baidu.com> on 2014/12/16 04:00:38 UTC, 1 replies.
- Can I set max execution time for any task in a job? - posted by Mohamed Lrhazi <Mo...@georgetown.edu> on 2014/12/16 05:29:17 UTC, 1 replies.
- Accessing Apache Spark from Java - posted by Jai <ja...@gmail.com> on 2014/12/16 08:55:03 UTC, 1 replies.
- Multiple Filter Effiency - posted by zkidkid <zk...@gmail.com> on 2014/12/16 09:06:28 UTC, 1 replies.
- GC problem while filtering large data - posted by Joe L <se...@yahoo.com> on 2014/12/16 09:09:17 UTC, 0 replies.
- GC problem while filtering - posted by Batselem <se...@gmail.com> on 2014/12/16 10:40:56 UTC, 0 replies.
- 答复: Fetch Failed caused job failed. - posted by "Ma,Xi" <ma...@baidu.com> on 2014/12/16 10:46:58 UTC, 1 replies.
- Locality level and Kryo - posted by aecc <al...@gmail.com> on 2014/12/16 11:24:35 UTC, 1 replies.
- Re: protobuf error running spark on hadoop 2.4 - posted by RodrigoB <ro...@aspect.com> on 2014/12/16 11:42:23 UTC, 0 replies.
- RDD "toarray","first" behavior - posted by buring <qy...@gmail.com> on 2014/12/16 11:57:24 UTC, 0 replies.
- Could not find the main class: org.apache.spark.deploy.SparkSubmit - posted by Daniel Haviv <da...@gmail.com> on 2014/12/16 12:34:15 UTC, 9 replies.
- Re: Data Loss - Spark streaming - posted by Gerard Maas <ge...@gmail.com> on 2014/12/16 13:12:43 UTC, 1 replies.
- Why so many tasks? - posted by bethesda <sw...@mac.com> on 2014/12/16 13:51:53 UTC, 6 replies.
- Pyspark 1.1.1 error with large number of records - serializer.dump_stream(func(split_index, iterator), outfile) - posted by mj <jo...@gmail.com> on 2014/12/16 14:04:20 UTC, 1 replies.
- Error when Applying schema to a dictionary with a Tuple as key - posted by sahanbull <sa...@skimlinks.com> on 2014/12/16 14:49:31 UTC, 2 replies.
- Streaming | Partition count mismatch exception while saving data in RDD - posted by Aniket Bhatnagar <an...@gmail.com> on 2014/12/16 15:38:12 UTC, 1 replies.
- Appending an incrental value to each RDD record - posted by bethesda <sw...@mac.com> on 2014/12/16 16:12:51 UTC, 3 replies.
- NullPointerException on cluster mode when using foreachPartition - posted by richiesgr <ri...@gmail.com> on 2014/12/16 16:21:40 UTC, 1 replies.
- Re: Spark 1.2 + Avro does not work in HDP2.2 - posted by Sean Owen <so...@cloudera.com> on 2014/12/16 17:49:00 UTC, 0 replies.
- Kafka Receiver not recreated after executor died - posted by Luis Ángel Vicente Sánchez <la...@gmail.com> on 2014/12/16 17:53:33 UTC, 1 replies.
- No disk single pass RDD aggregation - posted by Jim Carroll <ji...@gmail.com> on 2014/12/16 18:50:38 UTC, 4 replies.
- Spark handling of a file://xxxx.gz Uri - posted by Jim Carroll <ji...@gmail.com> on 2014/12/16 19:22:41 UTC, 2 replies.
- Understanding disk usage with Accumulators - posted by "Ganelin, Ilya" <Il...@capitalone.com> on 2014/12/16 19:23:05 UTC, 1 replies.
- Running Spark Job on Yarn from Java Code - posted by Rahul Swaminathan <ra...@duke.edu> on 2014/12/16 19:45:25 UTC, 1 replies.
- Spark Sql on Yarn using python - posted by Sam Flint <sa...@magnetic.com> on 2014/12/16 20:01:30 UTC, 0 replies.
- Re: streaming linear regression is not building the model - posted by tsu-wt <wa...@yahoo.com> on 2014/12/16 20:11:34 UTC, 0 replies.
- Cannot parse ListBuffer - StreamingLinearRegression - posted by tsu-wt <wa...@yahoo.com> on 2014/12/16 20:29:14 UTC, 0 replies.
- Control default partition when load a RDD from HDFS - posted by Shuai Zheng <sz...@gmail.com> on 2014/12/16 21:15:42 UTC, 4 replies.
- Spark eating exceptions in multi-threaded local mode - posted by Corey Nolet <cj...@gmail.com> on 2014/12/16 21:56:10 UTC, 0 replies.
- S3 globbing - posted by durga <du...@gmail.com> on 2014/12/17 00:35:30 UTC, 3 replies.
- How do I stop the automatic partitioning of my RDD? - posted by Jim Carroll <ji...@gmail.com> on 2014/12/17 00:43:26 UTC, 1 replies.
- "toArray","first" get the different result from one element RDD - posted by buring <qy...@gmail.com> on 2014/12/17 01:55:56 UTC, 1 replies.
- when will the spark 1.3.0 be released? - posted by 张建轶 <zh...@youku.com> on 2014/12/17 04:43:27 UTC, 4 replies.
- 答复: 答复: Fetch Failed caused job failed. - posted by "Ma,Xi" <ma...@baidu.com> on 2014/12/17 05:33:13 UTC, 0 replies.
- Spark SQL DSL for joins? - posted by Jerry Raj <je...@gmail.com> on 2014/12/17 06:43:01 UTC, 4 replies.
- Rolling upgrade Spark cluster - posted by Kenichi Maehashi <we...@kenichimaehashi.com> on 2014/12/17 07:40:55 UTC, 1 replies.
- Locality Level Kryo - posted by aecc <al...@gmail.com> on 2014/12/17 08:56:57 UTC, 0 replies.
- wordcount job slow while input from NFS mount - posted by Larry Liu <la...@gmail.com> on 2014/12/17 09:47:34 UTC, 6 replies.
- When will Spark SQL support building DB index natively? - posted by Xuelin Cao <xu...@yahoo.com.INVALID> on 2014/12/17 11:25:14 UTC, 3 replies.
- SchemaRDD.sample problem - posted by Hao Ren <in...@gmail.com> on 2014/12/17 11:28:50 UTC, 4 replies.
- Hadoop and spark together - posted by Morbious <kn...@gmail.com> on 2014/12/17 12:19:33 UTC, 0 replies.
- weird bytecode incompatability issue between spark-core jar from mvn repo and official spark prebuilt binary - posted by "Sun, Rui" <ru...@intel.com> on 2014/12/17 13:07:09 UTC, 9 replies.
- Are lazy values created once per node or once per partition? - posted by Ashic Mahtab <as...@live.com> on 2014/12/17 14:01:32 UTC, 1 replies.
- Apache Spark 1.1.1 with Hbase 0.98.8-hadoop2 and hadoop 2.3.0 - posted by Amit Singh Hora <ho...@gmail.com> on 2014/12/17 14:43:04 UTC, 1 replies.
- spark-ec2 starts hdfs1, tachyon but not spark - posted by Al Thompson <at...@gmail.com> on 2014/12/17 14:58:40 UTC, 0 replies.
- Get the value of DStream[(String, Iterable[String])] - posted by Guillermo Ortiz <ko...@gmail.com> on 2014/12/17 17:11:03 UTC, 3 replies.
- Spark SQL 1.1.1 reading LZO compressed json files - posted by Jerry Lam <ch...@gmail.com> on 2014/12/17 17:21:40 UTC, 7 replies.
- Who is using Spark and related technologies for bioinformatics applications? - posted by Steve Lewis <lo...@gmail.com> on 2014/12/17 17:27:11 UTC, 0 replies.
- Implementing a spark version of Haskell's partition - posted by Juan Rodríguez Hortalá <ju...@gmail.com> on 2014/12/17 17:56:42 UTC, 4 replies.
- SparkSQL 1.2.1-snapshot Left Join problem - posted by Hao Ren <in...@gmail.com> on 2014/12/17 19:32:03 UTC, 1 replies.
- Help with updateStateByKey - posted by Pierce Lamb <ri...@gmail.com> on 2014/12/17 23:07:10 UTC, 6 replies.
- spark-sql with join terribly slow. - posted by harirajaram <ha...@gmail.com> on 2014/12/17 23:15:22 UTC, 3 replies.
- building with Hadoop 1.0.4 / where is hadoop-yarn-common:1.0.4 ? - posted by Tim Harsch <th...@cray.com> on 2014/12/18 01:35:53 UTC, 1 replies.
- Re: java.io.NotSerializableException: org.apache.avro.mapred.AvroKey using spark with avro - posted by touchdown <yu...@gmail.com> on 2014/12/18 02:14:10 UTC, 3 replies.
- SPARK-2243 Support multiple SparkContexts in the same JVM - posted by Anton Brazhnyk <an...@genesys.com> on 2014/12/18 03:23:56 UTC, 3 replies.
- Spark Shell slowness on Google Cloud - posted by Alessandro Baretta <al...@gmail.com> on 2014/12/18 07:08:29 UTC, 6 replies.
- Getting OutOfMemoryError and Worker.run caught exception - posted by "A.K.M. Ashrafuzzaman" <as...@gmail.com> on 2014/12/18 08:02:55 UTC, 1 replies.
- How to specify the driver on a specific machine on yarn cluster mode? - posted by LinQili <li...@outlook.com> on 2014/12/18 08:14:55 UTC, 0 replies.
- Can Spark 1.0.2 run on CDH-4.3.0 with yarn? And Will Spark 1.2.0 support CDH5.1.2 with yarn? - posted by Canoe <ca...@gmail.com> on 2014/12/18 10:18:48 UTC, 2 replies.
- Semantics of foreachPartition() - posted by Tobias Pfeiffer <tg...@preferred.jp> on 2014/12/18 10:43:03 UTC, 1 replies.
- Incorrect results when calling collect() ? - posted by Tristan Blakers <tr...@blackfrog.org> on 2014/12/18 11:06:00 UTC, 4 replies.
- Can we specify driver running on a specific machine of the cluster on yarn-cluster mode? - posted by LinQili <li...@outlook.com> on 2014/12/18 11:14:58 UTC, 2 replies.
- create table in yarn-cluster mode vs yarn-client mode - posted by Chirag Aggarwal <Ch...@guavus.com> on 2014/12/18 12:07:45 UTC, 0 replies.
- pyspark 1.1.1 on windows saveAsTextFile - NullPointerException - posted by mj <jo...@gmail.com> on 2014/12/18 14:40:40 UTC, 1 replies.
- Downloads from S3 exceedingly slow when running on spark-ec2 - posted by Jon Chase <jo...@gmail.com> on 2014/12/18 14:56:04 UTC, 3 replies.
- Spark 1.2 Release Date - posted by Al M <al...@gmail.com> on 2014/12/18 15:09:44 UTC, 3 replies.
- EC2 VPC script - posted by Eduardo Cusa <ed...@usmediaconsulting.com> on 2014/12/18 15:42:21 UTC, 2 replies.
- undefined - posted by Eduardo Cusa <ed...@usmediaconsulting.com> on 2014/12/18 16:22:10 UTC, 0 replies.
- Effects problems in logistic regression - posted by Franco Barrientos <fr...@exalitica.com> on 2014/12/18 17:34:57 UTC, 6 replies.
- Standalone Spark program - posted by Akshat Aranya <aa...@gmail.com> on 2014/12/18 17:36:39 UTC, 2 replies.
- does spark sql support columnar compression with encoding when caching tables - posted by Sadhan Sood <sa...@gmail.com> on 2014/12/18 21:07:03 UTC, 6 replies.
- UNION two RDDs - posted by Jerry Lam <ch...@gmail.com> on 2014/12/18 21:52:01 UTC, 3 replies.
- Spark GraphX question. - posted by Tae-Hyuk Ahn <ah...@gmail.com> on 2014/12/18 23:11:33 UTC, 2 replies.
- Creating a smaller, derivative RDD from an RDD - posted by bethesda <sw...@mac.com> on 2014/12/18 23:18:20 UTC, 1 replies.
- Re: hello - posted by Harihar Nahak <hn...@wynyardgroup.com> on 2014/12/18 23:31:42 UTC, 0 replies.
- How to increase parallelism in Yarn - posted by Suman Somasundar <su...@oracle.com> on 2014/12/18 23:37:52 UTC, 1 replies.
- Sharing sqlContext between Akka router and "routee" actors ... - posted by Manoj Samel <ma...@gmail.com> on 2014/12/19 03:45:00 UTC, 2 replies.
- When will spark 1.2 released? - posted by "vboylin1987@gmail.com" <vb...@gmail.com> on 2014/12/19 06:47:53 UTC, 7 replies.
- [SPARK-SQL]how to run cache command with Running the Thrift JDBC/ODBC server - posted by jeanlyn92 <je...@gmail.com> on 2014/12/19 09:08:15 UTC, 1 replies.
- Who manage the log4j appender while running spark on yarn? - posted by WangTaoTheTonic <ba...@aliyun.com> on 2014/12/19 09:37:05 UTC, 2 replies.
- Announcing Spark 1.2! - posted by Patrick Wendell <pw...@gmail.com> on 2014/12/19 09:52:19 UTC, 3 replies.
- How to run an action and get output? - posted by Ashic Mahtab <as...@live.com> on 2014/12/19 10:53:09 UTC, 1 replies.
- spark streaming python + kafka - posted by Oleg Ruchovets <or...@gmail.com> on 2014/12/19 11:15:08 UTC, 2 replies.
- How to run an action and get output?‏ - posted by ashic <as...@live.com> on 2014/12/19 11:57:18 UTC, 2 replies.
- Does Spark 1.2.0 support Scala 2.11? - posted by Jonathan Chayat <jo...@supersonic.com> on 2014/12/19 12:00:44 UTC, 2 replies.
- reading files recursively using spark - posted by Hafiz Mujadid <ha...@gmail.com> on 2014/12/19 12:13:38 UTC, 4 replies.
- Scala Lazy values and partitions - posted by Ashic Mahtab <as...@live.com> on 2014/12/19 12:21:42 UTC, 3 replies.
- Re: too many small files and task - posted by bethesda <sw...@mac.com> on 2014/12/19 12:31:33 UTC, 0 replies.
- 300% "Fraction Cached"? - posted by Yifan LI <ia...@gmail.com> on 2014/12/19 12:55:13 UTC, 1 replies.
- Batch timestamp in spark streaming - posted by nelson <ne...@ysance.com> on 2014/12/19 12:59:21 UTC, 2 replies.
- "Fetch Failure" - posted by bethesda <sw...@mac.com> on 2014/12/19 13:46:33 UTC, 9 replies.
- Querying Temp table using JDBC - posted by shahab <sh...@gmail.com> on 2014/12/19 15:04:14 UTC, 1 replies.
- Can Spark 1.1.0 save checkpoint to HDFS 2.5.1? - posted by Haopu Wang <HW...@qilinsoft.com> on 2014/12/19 15:11:14 UTC, 6 replies.
- Querying registered RDD (AsTable) using JDBC - posted by shahab <sh...@gmail.com> on 2014/12/19 17:32:49 UTC, 3 replies.
- Spark Streaming Threading Model - posted by Asim Jalis <as...@gmail.com> on 2014/12/19 19:16:18 UTC, 2 replies.
- Is there a way (in Java) to turn Java Iterable into a JavaRDD? - posted by Steve Lewis <lo...@gmail.com> on 2014/12/19 19:23:53 UTC, 0 replies.
- run spark on mesos localy - posted by Nagy Istvan <sp...@gmail.com> on 2014/12/19 19:37:31 UTC, 0 replies.
- spark/yarn ignoring num-executors (python, Amazon EMR, spark-submit, yarn-client) - posted by Tim Schweichler <Ti...@healthination.com> on 2014/12/19 19:52:18 UTC, 0 replies.
- spark-shell bug with RDD distinct? - posted by Jay Hutfles <ja...@gmail.com> on 2014/12/19 19:55:50 UTC, 0 replies.
- spark-shell bug with RDDs and case classes? - posted by Jay Hutfles <ja...@gmail.com> on 2014/12/19 20:21:13 UTC, 1 replies.
- DAGScheduler StackOverflowError - posted by David McWhorter <mc...@ccri.com> on 2014/12/19 20:35:10 UTC, 0 replies.
- Any potentiail issue if I create a SparkContext in executor - posted by Shuai Zheng <sz...@gmail.com> on 2014/12/19 21:05:34 UTC, 0 replies.
- Hadoop 2.6 compatibility? - posted by sa <as...@gmail.com> on 2014/12/19 21:51:21 UTC, 5 replies.
- Yarn not running as many executors as I'd like - posted by Jon Chase <jo...@gmail.com> on 2014/12/19 21:57:01 UTC, 1 replies.
- Using Customized Hadoop InputFormat class with Spark Streaming - posted by soroka21 <so...@gmail.com> on 2014/12/19 23:51:05 UTC, 1 replies.
- java.sql.SQLException: No suitable driver found - posted by durga <du...@gmail.com> on 2014/12/20 00:47:28 UTC, 4 replies.
- SchemaRDD to Hbase - posted by Subacini B <su...@gmail.com> on 2014/12/20 07:47:25 UTC, 2 replies.
- Interpreting MLLib's linear regression o/p - posted by Sameer Tilak <ss...@live.com> on 2014/12/20 17:13:04 UTC, 3 replies.
- Are failures normal / to be expected on an AWS cluster? - posted by Joe Wass <jw...@crossref.org> on 2014/12/20 18:28:30 UTC, 0 replies.
- Using "SparkSubmit.main()" to submit SparkContext in web application - posted by Corey Nolet <cj...@gmail.com> on 2014/12/20 18:52:15 UTC, 0 replies.
- v1.2.0 (re?)introduces Wrong FS behavior in thriftserver - posted by Matt Mead <ma...@matthewcmead.com> on 2014/12/20 21:44:30 UTC, 3 replies.
- How to deploy my java code which invokes Spark in Tomcat? - posted by Tao Lu <ta...@gmail.com> on 2014/12/21 05:31:06 UTC, 1 replies.
- spark-repl_1.2.0 was not uploaded to central maven repository. - posted by Peng Cheng <pc...@uow.edu.au> on 2014/12/21 06:50:50 UTC, 4 replies.
- Re: Spark on Tachyon - posted by Peng Cheng <pc...@uow.edu.au> on 2014/12/21 07:45:50 UTC, 0 replies.
- Network file input cannot be recognized? - posted by Shuai Zheng <sz...@gmail.com> on 2014/12/21 13:04:53 UTC, 1 replies.
- need help with simple http request mapper - posted by kmatzen <km...@gmail.com> on 2014/12/21 20:06:01 UTC, 0 replies.
- Find the file info of when load the data into RDD - posted by Shuai Zheng <sz...@gmail.com> on 2014/12/21 22:43:18 UTC, 2 replies.
- Issue with Parquet on Spark 1.2 and Amazon EMR - posted by Adam Gilmore <dr...@gmail.com> on 2014/12/22 02:37:47 UTC, 0 replies.
- locality sensitive hashing for spark - posted by morr0723 <mi...@gmail.com> on 2014/12/22 03:10:37 UTC, 2 replies.
- Parquet schema changes - posted by Adam Gilmore <dr...@gmail.com> on 2014/12/22 06:11:14 UTC, 2 replies.
- S3 files , Spark job hungsup - posted by durga <du...@gmail.com> on 2014/12/22 07:13:37 UTC, 6 replies.
- Python:Streaming Question - posted by Samarth Mailinglist <ma...@gmail.com> on 2014/12/22 07:57:20 UTC, 1 replies.
- Question about TTL with TorrentBroadcastFactory in Spark-1.2.0 - posted by 顾亮亮 <gu...@qiyi.com> on 2014/12/22 08:14:40 UTC, 0 replies.
- Spark SQL 1.2 with CDH 4, Hive UDF is not working. - posted by Ji ZHANG <zh...@gmail.com> on 2014/12/22 09:15:22 UTC, 1 replies.
- what is the default log4j configuration passed to yarn container - posted by Venkata ramana gollamudi <ra...@huawei.com> on 2014/12/22 09:25:16 UTC, 0 replies.
- Graceful shutdown in spark streaming - posted by Jesper Lundgren <ko...@gmail.com> on 2014/12/22 10:22:43 UTC, 0 replies.
- Re: Is Spark? or GraphX runs fast? a performance comparison on Page Rank - posted by pradhandeep <pr...@gmail.com> on 2014/12/22 11:04:31 UTC, 1 replies.
- Re: How to get list of edges between two Vertex ? - posted by pradhandeep <pr...@gmail.com> on 2014/12/22 11:14:51 UTC, 0 replies.
- Possible problems in packaging mlllib - posted by shkesar <sh...@live.com> on 2014/12/22 11:16:09 UTC, 1 replies.
- Using more cores on machines - posted by Ashic Mahtab <as...@live.com> on 2014/12/22 11:39:04 UTC, 5 replies.
- Can Spark SQL thrift server UI provide JOB kill operate or any REST API? - posted by Xiaoyu Wang <wa...@gmail.com> on 2014/12/22 14:09:33 UTC, 1 replies.
- Spark exception when sending message to akka actor - posted by Priya Ch <le...@gmail.com> on 2014/12/22 14:45:28 UTC, 0 replies.
- MLlib, classification label problem - posted by Hao Ren <in...@gmail.com> on 2014/12/22 17:02:02 UTC, 1 replies.
- Mesos resource allocation - posted by Josh Devins <jo...@soundcloud.com> on 2014/12/22 17:23:57 UTC, 1 replies.
- Tuning Spark Streaming jobs - posted by Gerard Maas <ge...@gmail.com> on 2014/12/22 17:33:17 UTC, 4 replies.
- custom python converter from HBase Result to tuple - posted by Antony Mayi <an...@yahoo.com.INVALID> on 2014/12/22 20:02:39 UTC, 3 replies.
- Announcing Spark Packages - posted by Xiangrui Meng <me...@gmail.com> on 2014/12/22 21:37:43 UTC, 7 replies.
- MLLib beginner question - posted by boci <bo...@gmail.com> on 2014/12/22 22:47:41 UTC, 3 replies.
- Long-running job cleanup - posted by "Ganelin, Ilya" <Il...@capitalone.com> on 2014/12/22 23:36:10 UTC, 5 replies.
- Re: Spark in Standalone mode - posted by durga <du...@gmail.com> on 2014/12/22 23:55:45 UTC, 0 replies.
- SparkSQL Array type support - Unregonized Thrift TTypeId value: ARRAY_TYPE - posted by David Allan <da...@yahoo.com> on 2014/12/23 00:45:00 UTC, 1 replies.
- broadcasting object issue - posted by Henry Hung <YT...@winbond.com> on 2014/12/23 04:42:57 UTC, 1 replies.
- Joins in Spark - posted by Deep Pradhan <pr...@gmail.com> on 2014/12/23 05:16:02 UTC, 4 replies.
- Consistent hashing of RDD row - posted by lev <ka...@gmail.com> on 2014/12/23 06:48:50 UTC, 1 replies.
- Re: Spark Job hangs up on multi-node cluster but passes on a single node - posted by shaiw75 <sh...@intentiq.com> on 2014/12/23 08:57:25 UTC, 1 replies.
- RDD for Storm Streaming in Spark - posted by Ajay <aj...@gmail.com> on 2014/12/23 09:03:50 UTC, 4 replies.
- How to export data from hive into hdfs in spark program? - posted by LinQili <li...@outlook.com> on 2014/12/23 09:09:41 UTC, 2 replies.
- Spark SQL job block when use hive udf from_unixtime - posted by "Dai, Kevin" <yu...@ebay.com> on 2014/12/23 09:42:40 UTC, 1 replies.
- JavaRDD (Data Aggregation) based on key - posted by sachin Singh <sa...@gmail.com> on 2014/12/23 10:47:41 UTC, 1 replies.
- ReceiverInputDStream#saveAsTextFiles with a S3 URL results in double forward slash key names in S3 - posted by Enno Shioji <es...@gmail.com> on 2014/12/23 13:06:36 UTC, 4 replies.
- Spark Installation Maven PermGen OutOfMemoryException - posted by Vladimir Protsenko <pr...@gmail.com> on 2014/12/23 15:57:41 UTC, 9 replies.
- Spark UI port issue when deploying Spark driver on YARN in yarn-cluster mode on EMR - posted by Roberto Coluccio <ro...@gmail.com> on 2014/12/23 17:04:25 UTC, 1 replies.
- removing first record from RDD[String] - posted by Hafiz Mujadid <ha...@gmail.com> on 2014/12/23 17:35:50 UTC, 6 replies.
- retry in combineByKey at BinaryClassificationMetrics.scala - posted by Thomas Kwan <th...@manage.com> on 2014/12/23 18:00:24 UTC, 2 replies.
- Single worker locked at 100% CPU - posted by Phil Wills <ot...@gmail.com> on 2014/12/23 18:40:58 UTC, 2 replies.
- MLlib + Streaming - posted by Gianmarco De Francisci Morales <gd...@apache.org> on 2014/12/23 19:01:36 UTC, 3 replies.
- Spark Port Configuration - posted by "Dan H." <dc...@gmail.com> on 2014/12/23 20:40:33 UTC, 0 replies.
- serialize protobuf messages - posted by Chen Song <ch...@gmail.com> on 2014/12/23 21:08:56 UTC, 1 replies.
- SparkSQL: CREATE EXTERNAL TABLE with a SchemaRDD - posted by Jerry Lam <ch...@gmail.com> on 2014/12/23 21:26:02 UTC, 2 replies.
- Updating RDD used in DStream transform - posted by Sean McKibben <gr...@graphex.com> on 2014/12/23 23:12:12 UTC, 0 replies.
- Escape commas in file names - posted by Daniel Siegmann <da...@velos.io> on 2014/12/24 00:33:05 UTC, 3 replies.
- weights not changed with different reg param - posted by Thomas Kwan <th...@manage.com> on 2014/12/24 00:36:11 UTC, 1 replies.
- Exception after changing RDDs - posted by "kai.lu" <qi...@falkonry.com> on 2014/12/24 01:13:28 UTC, 0 replies.
- Debugging a Spark application using Eclipse throws SecurityException - posted by ey-chih chow <ey...@hotmail.com> on 2014/12/24 02:31:20 UTC, 1 replies.
- How to build Spark against the latest - posted by guxiaobo1982 <gu...@qq.com> on 2014/12/24 05:00:33 UTC, 8 replies.
- Not Serializable exception when integrating SQL and Spark Streaming - posted by bigdata4u <bi...@live.com> on 2014/12/24 06:32:07 UTC, 5 replies.
- SchemaRDD to RDD[String] - posted by Hafiz Mujadid <ha...@gmail.com> on 2014/12/24 07:18:01 UTC, 3 replies.
- Best to execute SQL in Streaming data - posted by bigdata4u <bi...@live.com> on 2014/12/24 08:18:26 UTC, 1 replies.
- got ”org.apache.thrift.protocol.TProtocolException: Expected protocol id ffffff82 but got ffffff80“ from hive metastroe service when I use show tables command in spark-sql shell - posted by Roc Chu <ch...@gmail.com> on 2014/12/24 11:31:21 UTC, 1 replies.
- Need help for Spark-JobServer setup on Maven (for Java programming) - posted by Sasi <sa...@gmail.com> on 2014/12/24 12:21:48 UTC, 7 replies.
- Why does consuming a RESTful web service (using javax.ws.rs.* and Jsersey) work in unit test but not when submitted to Spark? - posted by Emre Sevinc <em...@gmail.com> on 2014/12/24 13:02:26 UTC, 5 replies.
- saveAsNewAPIHadoopDataset against hbase hanging in pyspark 1.2.0 - posted by Antony Mayi <an...@yahoo.com.INVALID> on 2014/12/24 13:49:00 UTC, 7 replies.
- SVDPlusPlus Recommender in MLLib - posted by Prafulla Wani <pr...@gmail.com> on 2014/12/24 15:46:28 UTC, 1 replies.
- null Error in ALS model predict - posted by Franco Barrientos <fr...@exalitica.com> on 2014/12/24 16:44:24 UTC, 1 replies.
- How to identify erroneous input record ? - posted by Sanjay Subramanian <sa...@yahoo.com.INVALID> on 2014/12/24 17:28:01 UTC, 3 replies.
- hiveContext.jsonFile fails with "Unexpected close marker " - posted by elliott cordo <el...@gmail.com> on 2014/12/24 18:34:12 UTC, 1 replies.
- Discourse: A proposed alternative to the Spark User list - posted by Nick Chammas <ni...@gmail.com> on 2014/12/24 21:50:18 UTC, 5 replies.
- Question on saveAsTextFile with overwrite option - posted by "Shao, Saisai" <sa...@intel.com> on 2014/12/25 07:52:41 UTC, 4 replies.
- Re: Debian package for spark? - posted by varun sharma <va...@gmail.com> on 2014/12/25 11:38:16 UTC, 0 replies.
- Do you know any Spark modeling tool? - posted by Haopu Wang <HW...@qilinsoft.com> on 2014/12/25 11:42:01 UTC, 0 replies.
- Corrupted Exception while deserialize task - posted by WangTaoTheTonic <ba...@aliyun.com> on 2014/12/25 15:27:30 UTC, 0 replies.
- serialization issue with mapPartitions - posted by ey-chih chow <ey...@hotmail.com> on 2014/12/25 17:32:08 UTC, 5 replies.
- action progress in ipython notebook? - posted by Eric Friedman <er...@gmail.com> on 2014/12/25 19:01:34 UTC, 10 replies.
- ReliableDeliverySupervisor: Association with remote system - posted by SamyaMaiti <sa...@gmail.com> on 2014/12/25 20:13:02 UTC, 2 replies.
- unable to do group by with 1st column - posted by Amit Behera <am...@gmail.com> on 2014/12/25 21:22:52 UTC, 8 replies.
- fail to run spark PortfolioDemo with dse Cassandra - posted by Zhang Jiaqiang <zh...@gmail.com> on 2014/12/26 02:53:18 UTC, 1 replies.
- An exception about broadcast in concurrent environment - posted by 净`L <lj...@foxmail.com> on 2014/12/26 05:26:30 UTC, 1 replies.
- 回复:An exception about broadcast in concurrent environment - posted by 净`L <lj...@foxmail.com> on 2014/12/26 05:29:51 UTC, 0 replies.
- Profiling a spark application. - posted by rapelly kartheek <ka...@gmail.com> on 2014/12/26 06:04:36 UTC, 0 replies.
- how to do incremental model updates using spark streaming and mllib - posted by vishnu <jo...@gmail.com> on 2014/12/26 07:36:41 UTC, 1 replies.
- Storage Locations of an rdd - posted by rapelly kartheek <ka...@gmail.com> on 2014/12/26 08:58:54 UTC, 1 replies.
- Spark Streaming and Windows, it always counts the logs during all the windows. Why? - posted by Guillermo Ortiz <ko...@gmail.com> on 2014/12/26 10:56:24 UTC, 2 replies.
- Serious issues with class not found exceptions of classes in uber jar - posted by critikaled <is...@gmail.com> on 2014/12/26 10:56:27 UTC, 4 replies.
- Re: Using the DataStax Cassandra Connector from PySpark - posted by Stephen Boesch <ja...@gmail.com> on 2014/12/26 21:50:23 UTC, 0 replies.
- Can't submit the SparkPi example to local Yarn 2.6.0 installed by ambari 1.7.0 - posted by guxiaobo1982 <gu...@qq.com> on 2014/12/27 06:53:51 UTC, 0 replies.
- Compile error from Spark 1.2.0 - posted by Zigen Zigen <db...@gmail.com> on 2014/12/27 08:13:51 UTC, 2 replies.
- Dynamic Allocation in Spark 1.2.0 - posted by Anders Arpteg <ar...@spotify.com> on 2014/12/27 15:06:49 UTC, 4 replies.
- Playing along at home: recommendations as to system requirements? - posted by Amy Brown <te...@gmail.com> on 2014/12/27 15:27:56 UTC, 2 replies.
- Re: Can't submit the SparkPi example to local Yarn 2.6.0 installed by ambari 1.7.0 - posted by Sean Owen <so...@cloudera.com> on 2014/12/27 20:08:03 UTC, 0 replies.
- init / shutdown for complex map job? - posted by Kevin Burton <bu...@spinn3r.com> on 2014/12/27 21:23:05 UTC, 5 replies.
- unable to check whether an item is present in RDD - posted by Amit Behera <am...@gmail.com> on 2014/12/27 21:24:42 UTC, 7 replies.
- Problem with StreamingContext - getting SPARK-2243 - posted by tfrisk <tf...@gmail.com> on 2014/12/27 23:24:45 UTC, 3 replies.
- Re: Using YARN on a cluster created with spark-ec2 - posted by firemonk9 <dh...@gmail.com> on 2014/12/28 02:17:31 UTC, 0 replies.
- Compile error since Spark 1.2.0 - posted by zigen <db...@gmail.com> on 2014/12/28 04:20:57 UTC, 1 replies.
- Spark core maven error - posted by lalitagarw <la...@gmail.com> on 2014/12/28 05:15:46 UTC, 3 replies.
- Strange results of running Spark GenSort.scala - posted by Sam Liu <li...@sina.com> on 2014/12/28 13:57:31 UTC, 0 replies.
- Anaconda iPython notebook working with CDH Spark - posted by Bin Wang <bi...@gmail.com> on 2014/12/28 19:57:31 UTC, 1 replies.
- sample is not a member of org.apache.spark.streaming.dstream.DStream - posted by Josh J <jo...@gmail.com> on 2014/12/28 23:44:50 UTC, 1 replies.
- Re: Using TF-IDF from MLlib - posted by Yao <yg...@ford.com> on 2014/12/29 04:37:01 UTC, 3 replies.
- Re: TF-IDF in Spark 1.1.0 - posted by Yao <yg...@ford.com> on 2014/12/29 04:42:30 UTC, 0 replies.
- recent join/iterator fix - posted by Stephen Haberman <st...@gmail.com> on 2014/12/29 05:28:00 UTC, 5 replies.
- Setting up Simple Kafka Consumer via Spark Java app - posted by suhshekar52 <su...@gmail.com> on 2014/12/29 07:56:24 UTC, 16 replies.
- Spark 1.2.0 Yarn not published - posted by Aniket Bhatnagar <an...@gmail.com> on 2014/12/29 08:13:43 UTC, 2 replies.
- word count aggregation - posted by Hoai-Thu Vuong <th...@gmail.com> on 2014/12/29 09:00:50 UTC, 2 replies.
- Mapping directory structure to columns in SparkSQL - posted by Mickalas <Mi...@gmail.com> on 2014/12/29 11:19:33 UTC, 2 replies.
- How to set up spark sql on ec2 - posted by critikaled <is...@gmail.com> on 2014/12/29 11:34:19 UTC, 1 replies.
- What are all the Hadoop Major Versions in spark-ec2 script? - posted by critikaled <is...@gmail.com> on 2014/12/29 11:39:36 UTC, 0 replies.
- Clustering text data with MLlib - posted by jatinpreet <ja...@gmail.com> on 2014/12/29 11:55:09 UTC, 4 replies.
- Spark Configurations - posted by Chirag Aggarwal <Ch...@guavus.com> on 2014/12/29 13:10:02 UTC, 2 replies.
- DecisionTree Algorithm used in Spark MLLib - posted by Anoop Shiralige <an...@gmail.com> on 2014/12/29 13:27:31 UTC, 0 replies.
- SPARK-streaming app running 10x slower on YARN vs STANDALONE cluster - posted by Mukesh Jha <me...@gmail.com> on 2014/12/29 13:36:23 UTC, 10 replies.
- python: module pyspark.daemon not found - posted by Naveen Kumar Pokala <np...@spcapitaliq.com> on 2014/12/29 14:01:43 UTC, 3 replies.
- Re: Can't submit the SparkPi example to local Yarn 2.6.0 installed byambari 1.7.0 - posted by guxiaobo1982 <gu...@qq.com> on 2014/12/29 14:41:03 UTC, 0 replies.
- Spark profiler - posted by rapelly kartheek <ka...@gmail.com> on 2014/12/29 16:24:51 UTC, 1 replies.
- Can we say 1 RDD is generated every batch interval? - posted by SamyaMaiti <sa...@gmail.com> on 2014/12/29 17:49:51 UTC, 3 replies.
- How to pass options to KeyConverter using PySpark - posted by Brett Meyer <Br...@crowdstrike.com> on 2014/12/29 19:14:56 UTC, 0 replies.
- Re: trying to understand yarn-client mode - posted by Fernando Otero <fo...@gmail.com> on 2014/12/29 20:47:34 UTC, 1 replies.
- Clean up app folders in worker nodes - posted by hutashan <hu...@gmail.com> on 2014/12/29 21:01:24 UTC, 1 replies.
- Submit spark jobs inside web application - posted by Corey Nolet <cj...@gmail.com> on 2014/12/29 21:38:49 UTC, 0 replies.
- Building Spark 1.2 jmx and jmxtools issue? - posted by John Omernik <jo...@omernik.com> on 2014/12/29 23:20:45 UTC, 1 replies.
- How to tell if RDD no longer has any children - posted by Corey Nolet <cj...@gmail.com> on 2014/12/30 00:43:54 UTC, 0 replies.
- RE: Spark sql failed in yarn-cluster mode when connecting to non-default hive database - posted by Andrew Lee <al...@hotmail.com> on 2014/12/30 01:01:26 UTC, 1 replies.
- Spark Streaming: HiveContext within Custom Actor - posted by sranga <sr...@gmail.com> on 2014/12/30 03:45:50 UTC, 2 replies.
- Number of cores is 0 in WebUI when use bin/spark-submit - posted by Wei Da <xw...@qq.com> on 2014/12/30 03:55:11 UTC, 0 replies.
- Shuffle write increases in spark 1.2 - posted by Kevin Jung <it...@samsung.com> on 2014/12/30 04:08:43 UTC, 0 replies.
- Cached RDD - posted by Corey Nolet <cj...@gmail.com> on 2014/12/30 04:53:42 UTC, 1 replies.
- Re: A question about using insert into in rdd foreach in spark 1.2 - posted by Michael Armbrust <mi...@databricks.com> on 2014/12/30 05:42:22 UTC, 0 replies.
- Spark SQL implementation error - posted by sachin Singh <sa...@gmail.com> on 2014/12/30 10:43:42 UTC, 1 replies.
- Spark SQL insert overwrite table failed. - posted by Mars Max <ma...@baidu.com> on 2014/12/30 11:19:25 UTC, 0 replies.
- Writing and reading sequence file results in trailing extra data - posted by Enno Shioji <es...@gmail.com> on 2014/12/30 11:58:30 UTC, 0 replies.
- [SOLVED] Re: Writing and reading sequence file results in trailing extra data - posted by Enno Shioji <es...@gmail.com> on 2014/12/30 12:26:24 UTC, 0 replies.
- Re: building spark1.2 meet error - posted by xhudik <xh...@gmail.com> on 2014/12/30 12:45:45 UTC, 5 replies.
- How to collect() each partition in scala ? - posted by "DEVAN M.S." <ms...@gmail.com> on 2014/12/30 12:54:21 UTC, 2 replies.
- SparkContext with error from PySpark - posted by Jaggu <ja...@gmail.com> on 2014/12/30 13:45:57 UTC, 3 replies.
- Is it possible to store graph directly into HDFS? - posted by Jason Hong <be...@gmail.com> on 2014/12/30 14:27:03 UTC, 2 replies.
- Spark Standalone Cluster not correctly configured - posted by frodo777 <ro...@bitmonlab.com> on 2014/12/30 14:59:53 UTC, 0 replies.
- Host Error on EC2 while accessing hdfs from stadalone - posted by Laeeq Ahmed <la...@yahoo.com.INVALID> on 2014/12/30 17:01:25 UTC, 1 replies.
- Spark 1.2 and Mesos 0.21.0 spark.executor.uri issue? - posted by Denny Lee <de...@gmail.com> on 2014/12/30 17:25:07 UTC, 0 replies.
- Shuffle Problems in 1.2.0 - posted by Sven Krasser <kr...@gmail.com> on 2014/12/30 18:49:29 UTC, 2 replies.
- Trying to make spark-jobserver work with yarn - posted by "Fernando O." <fo...@gmail.com> on 2014/12/30 19:23:52 UTC, 1 replies.
- Spark Accumulators exposed as Metrics to Graphite - posted by Łukasz Stefaniak <lu...@gmail.com> on 2014/12/30 20:24:23 UTC, 0 replies.
- Location of logs in local mode - posted by Brett Meyer <Br...@crowdstrike.com> on 2014/12/30 22:30:45 UTC, 0 replies.
- Gradual slow down of the Streaming job (getCallSite at DStream.scala:294) - posted by RK <pr...@yahoo.com.INVALID> on 2014/12/30 22:41:10 UTC, 3 replies.
- Kafka + Spark streaming - posted by SamyaMaiti <sa...@gmail.com> on 2014/12/30 23:19:39 UTC, 2 replies.
- Trouble using MultipleTextOutputFormat with Spark - posted by Arpan Ghosh <ar...@automatic.com> on 2014/12/31 01:00:38 UTC, 0 replies.
- JetS3T settings spark - posted by durga <du...@gmail.com> on 2014/12/31 01:21:22 UTC, 2 replies.
- Spark app performance - posted by Raghavendra Pandey <ra...@gmail.com> on 2014/12/31 04:15:25 UTC, 0 replies.
- How to set local property in beeline connect to the spark thrift server - posted by Xiaoyu Wang <wa...@gmail.com> on 2014/12/31 08:55:49 UTC, 2 replies.
- FlatMapValues - posted by Sanjay Subramanian <sa...@yahoo.com.INVALID> on 2014/12/31 09:12:25 UTC, 7 replies.
- pyspark.daemon not found - posted by Naveen Kumar Pokala <np...@spcapitaliq.com> on 2014/12/31 09:58:15 UTC, 1 replies.
- spark stream + cassandra (execution on event) - posted by Oleg Ruchovets <or...@gmail.com> on 2014/12/31 10:44:23 UTC, 0 replies.
- Exception in thread "main" org.apache.hadoop.ipc.RemoteException: Server IPC version 9 cannot communicate with client version 4 - posted by Hafiz Mujadid <ha...@gmail.com> on 2014/12/31 11:47:04 UTC, 1 replies.
- NoSuchMethodError: com.typesafe.config.Config.getDuration with akka-http/akka-stream - posted by Christophe Billiard <ch...@gmail.com> on 2014/12/31 12:08:17 UTC, 0 replies.
- Fwd: Sample Spark Program Error - posted by Naveen Madhire <vm...@umail.iu.edu> on 2014/12/31 15:38:41 UTC, 3 replies.
- Re: Big performance difference between "client" and "cluster" deployment mode; is this expected? - posted by Sean Owen <so...@cloudera.com> on 2014/12/31 19:21:40 UTC, 2 replies.
- Re: Why the major.minor version of the new hive-exec is 51.0? - posted by Michael Armbrust <mi...@databricks.com> on 2014/12/31 21:57:03 UTC, 1 replies.
- Re: UpdateStateByKey persist to Tachyon - posted by amkcom <am...@gmail.com> on 2014/12/31 23:34:45 UTC, 0 replies.