You are viewing a plain text version of this content. The canonical link for it is here.
- Re: real time Query engine Spark-SQL on Hbase - posted by Corey Nolet <cj...@gmail.com> on 2015/05/01 00:05:32 UTC, 3 replies.
- casting timestamp into long fail in Spark 1.3.1 - posted by Justin Yip <yi...@prediction.io> on 2015/05/01 00:41:38 UTC, 2 replies.
- Spark Streaming Kafka Avro NPE on deserialization of payload - posted by Todd Nist <ts...@gmail.com> on 2015/05/01 00:53:15 UTC, 2 replies.
- Re: DataFrame filter referencing error - posted by ayan guha <gu...@gmail.com> on 2015/05/01 00:54:45 UTC, 1 replies.
- how to pass configuration properties from driver to executor? - posted by Tian Zhang <tz...@yahoo.com> on 2015/05/01 01:26:52 UTC, 2 replies.
- Re: [SPAM] Customized Aggregation Query on Spark SQL - posted by Zhan Zhang <zz...@hortonworks.com> on 2015/05/01 01:55:49 UTC, 1 replies.
- RE: Expert advise needed. (POC is at crossroads) - posted by java8964 <ja...@hotmail.com> on 2015/05/01 02:03:43 UTC, 2 replies.
- Re: How to install spark in spark on yarn mode - posted by Shixiong Zhu <zs...@gmail.com> on 2015/05/01 02:16:51 UTC, 0 replies.
- Re: Enabling Event Log - posted by Shixiong Zhu <zs...@gmail.com> on 2015/05/01 02:25:13 UTC, 2 replies.
- How to add a column to a spark RDD with many columns? - posted by Carter <gy...@hotmail.com> on 2015/05/01 06:55:20 UTC, 3 replies.
- Error when saving as parquet to S3 - posted by Cosmin Cătălin Sanda <co...@gmail.com> on 2015/05/01 07:36:55 UTC, 0 replies.
- Spark SQL ThriftServer Impersonation Support - posted by Night Wolf <ni...@gmail.com> on 2015/05/01 07:56:30 UTC, 1 replies.
- Re: How to group multiple row data ? - posted by Bipin Nag <bi...@gmail.com> on 2015/05/01 08:23:59 UTC, 0 replies.
- Help with publishing to Kafka from Spark Streaming? - posted by Pavan Sudheendra <pa...@gmail.com> on 2015/05/01 08:38:40 UTC, 1 replies.
- Re: Spark on Mesos - posted by Tim Chen <ti...@mesosphere.io> on 2015/05/01 09:18:30 UTC, 4 replies.
- Fwd: Event generator for SPARK-Streaming from csv - posted by anshu shukla <an...@gmail.com> on 2015/05/01 09:30:25 UTC, 2 replies.
- Spark worker error on standalone cluster - posted by "Michael Ryabtsev (Totango)" <mi...@totango.com> on 2015/05/01 11:13:26 UTC, 2 replies.
- NullPointerException with Avro + Spark. - posted by ๏̯͡๏ <ÐΞ€ρ@Ҝ>, de...@gmail.com on 2015/05/01 13:42:05 UTC, 0 replies.
- Driver memory default setting stops background jobs - posted by Andreas Marfurt <an...@gmail.com> on 2015/05/01 14:39:11 UTC, 0 replies.
- Exiting "driver" main() method... - posted by James Carman <ja...@carmanconsulting.com> on 2015/05/01 14:52:52 UTC, 3 replies.
- ClassNotFoundException for Kryo serialization - posted by Akshat Aranya <aa...@gmail.com> on 2015/05/01 17:05:40 UTC, 4 replies.
- spark.logConf with log4j.rootCategory=WARN - posted by roy <rp...@njit.edu> on 2015/05/01 17:41:35 UTC, 1 replies.
- Re: Spark - Hive Metastore MySQL driver - posted by Ted Yu <yu...@gmail.com> on 2015/05/01 20:05:31 UTC, 0 replies.
- [PSA] Use Stack Overflow! - posted by Nick Chammas <ni...@gmail.com> on 2015/05/01 20:29:05 UTC, 0 replies.
- Remoting warning when submitting to cluster - posted by javidelgadillo <jd...@esri.com> on 2015/05/01 20:34:51 UTC, 3 replies.
- Generating version agnostic jar path value for --jars clause - posted by nitinkak001 <ni...@gmail.com> on 2015/05/01 22:06:27 UTC, 0 replies.
- Selecting download for 'hadoop 2.4 and later" - posted by Stephen Boesch <ja...@gmail.com> on 2015/05/01 22:30:52 UTC, 1 replies.
- empty jdbc RDD in spark - posted by Hafiz Mujadid <ha...@gmail.com> on 2015/05/01 22:56:23 UTC, 2 replies.
- Re: Drop a column from the DataFrame. - posted by dsgriffin <ds...@gmail.com> on 2015/05/01 22:57:17 UTC, 3 replies.
- sparkR equivalent to SparkContext.newAPIHadoopRDD? - posted by David Holiday <da...@annaisystems.com> on 2015/05/02 06:35:27 UTC, 0 replies.
- Submit & Kill Spark Application program programmatically from another application - posted by Yijie Shen <he...@gmail.com> on 2015/05/02 07:50:05 UTC, 0 replies.
- to split an RDD to multiple ones? - posted by Yifan LI <ia...@gmail.com> on 2015/05/02 11:00:19 UTC, 2 replies.
- Re: Number of input partitions in SparkContext.sequenceFile - posted by Archit Thakur <ar...@gmail.com> on 2015/05/02 12:31:54 UTC, 0 replies.
- Problem in Standalone Mode - posted by drarse <dr...@gmail.com> on 2015/05/02 13:06:08 UTC, 1 replies.
- Re: Spark - Timeout Issues - OutOfMemoryError - posted by Akhil Das <ak...@sigmoidanalytics.com> on 2015/05/02 16:45:28 UTC, 18 replies.
- com.esotericsoftware.kryo.KryoException: java.lang.IndexOutOfBoundsException: Index: - posted by shahab <sh...@gmail.com> on 2015/05/02 17:57:35 UTC, 6 replies.
- spark filestream problem - posted by Evo Eftimov <ev...@isecc.com> on 2015/05/02 18:06:54 UTC, 3 replies.
- spark filestrea problem - posted by Evo Eftimov <ev...@isecc.com> on 2015/05/02 18:07:41 UTC, 1 replies.
- not getting any mail - posted by Jeetendra Gangele <ga...@gmail.com> on 2015/05/02 18:14:46 UTC, 1 replies.
- Can I group elements in RDD into different groups and let each group share some elements?‏ - posted by Franz Chien <fr...@gmail.com> on 2015/05/02 21:45:23 UTC, 0 replies.
- Re: Can I group elements in RDD into different groups and let each group share some elements? - posted by Olivier Girardot <ss...@gmail.com> on 2015/05/02 23:25:10 UTC, 0 replies.
- Re: directory loader in windows - posted by ayan guha <gu...@gmail.com> on 2015/05/03 04:44:46 UTC, 0 replies.
- Hardware requirements - posted by sherine ahmed <sh...@hotmail.com> on 2015/05/03 09:19:59 UTC, 3 replies.
- Spark distributed SQL: JSON Data set on all worker node - posted by Jai <ja...@gmail.com> on 2015/05/03 11:02:19 UTC, 3 replies.
- PriviledgedActionException- Executor error - posted by podioss <gr...@hotmail.com> on 2015/05/03 11:25:17 UTC, 1 replies.
- Questions about Accumulators - posted by xiazhuchang <hk...@163.com> on 2015/05/03 14:08:44 UTC, 4 replies.
- Re: PySpark: slicing issue with dataframes - posted by Ali Bajwa <al...@gmail.com> on 2015/05/03 19:36:50 UTC, 1 replies.
- How to skip corrupted avro files - posted by Shing Hing Man <ma...@yahoo.com.INVALID> on 2015/05/03 19:57:13 UTC, 2 replies.
- Long GC pauses with Spark SQL 1.3.0 and billion row tables - posted by Nick Travers <n....@gmail.com> on 2015/05/04 07:36:01 UTC, 4 replies.
- spark log analyzer sample - posted by anshu shukla <an...@gmail.com> on 2015/05/04 08:49:52 UTC, 0 replies.
- Spark job concurrency problem - posted by Xi Shen <da...@gmail.com> on 2015/05/04 09:07:48 UTC, 1 replies.
- Spark Mongodb connection - posted by Yasemin Kaya <go...@gmail.com> on 2015/05/04 09:27:51 UTC, 2 replies.
- how to make sure data is partitioned across all workers? - posted by shahab <sh...@gmail.com> on 2015/05/04 09:36:21 UTC, 0 replies.
- Re: Map-Side Join in Spark - posted by ๏̯͡๏ <ÐΞ€ρ@Ҝ>, de...@gmail.com on 2015/05/04 11:41:23 UTC, 0 replies.
- sparksql running slow while joining 2 tables. - posted by lu...@sina.com on 2015/05/04 12:02:07 UTC, 3 replies.
- Re: Unusual filter behaviour on RDD - posted by fawadalam <fa...@gmail.com> on 2015/05/04 14:14:27 UTC, 0 replies.
- mapping JavaRDD to jdbc DataFrame - posted by Lior Chaga <li...@taboola.com> on 2015/05/04 14:16:44 UTC, 1 replies.
- Re: Custom Partitioning Spark - posted by ๏̯͡๏ <ÐΞ€ρ@Ҝ>, de...@gmail.com on 2015/05/04 15:01:57 UTC, 0 replies.
- 回复:Re: sparksql running slow while joining 2 tables. - posted by lu...@sina.com on 2015/05/04 15:07:56 UTC, 2 replies.
- Re: Support for skewed joins in Spark - posted by ๏̯͡๏ <ÐΞ€ρ@Ҝ>, de...@gmail.com on 2015/05/04 15:09:53 UTC, 0 replies.
- spark 1.3.1 - posted by Saurabh Gupta <sa...@semusi.com> on 2015/05/04 15:17:05 UTC, 3 replies.
- Difference ? - posted by ๏̯͡๏ <ÐΞ€ρ@Ҝ>, de...@gmail.com on 2015/05/04 15:20:05 UTC, 0 replies.
- MLLib SVM probability - posted by Robert Musters <ro...@openindex.io> on 2015/05/04 15:22:21 UTC, 2 replies.
- How to deal with code that runs before foreach block in Apache Spark? - posted by Emre Sevinc <em...@gmail.com> on 2015/05/04 15:34:41 UTC, 3 replies.
- Troubling Logging w/Simple Example (spark-1.2.2-bin-hadoop2.4)... - posted by James Carman <ja...@carmanconsulting.com> on 2015/05/04 15:56:44 UTC, 1 replies.
- "java.io.IOException: No space left on device" while doing repartitioning in Spark - posted by shahab <sh...@gmail.com> on 2015/05/04 15:57:03 UTC, 2 replies.
- Re: SparkStream saveAsTextFiles() - posted by anavidad <an...@gmail.com> on 2015/05/04 16:42:18 UTC, 1 replies.
- SparkSQL Nested structure - posted by Giovanni Paolo Gibilisco <gi...@gmail.com> on 2015/05/04 16:49:45 UTC, 1 replies.
- Is LIMIT n in Spark SQL useful? - posted by Yi Zhang <zh...@yahoo.com.INVALID> on 2015/05/04 16:52:48 UTC, 5 replies.
- Python Custom Partitioner - posted by ayan guha <gu...@gmail.com> on 2015/05/04 17:08:42 UTC, 2 replies.
- Parallelize foreach in PySpark with Spark Standalone - posted by kdunn <kd...@gmail.com> on 2015/05/04 17:38:36 UTC, 0 replies.
- AJAX with Apache Spark - posted by Sergio Jiménez Barrio <dr...@gmail.com> on 2015/05/04 21:07:36 UTC, 1 replies.
- "com.datastax.spark" % "spark-streaming_2.10" % "1.1.0" in my build.sbt ?? - posted by Eric Ho <er...@intel.com> on 2015/05/04 21:12:09 UTC, 1 replies.
- No logs from my cluster / worker ... (running DSE 4.6.1) - posted by Eric Ho <er...@intel.com> on 2015/05/04 21:22:42 UTC, 2 replies.
- Re: Spark partitioning question - posted by Imran Rashid <ir...@cloudera.com> on 2015/05/04 21:44:51 UTC, 2 replies.
- Re: ReduceByKey and sorting within partitions - posted by Imran Rashid <ir...@cloudera.com> on 2015/05/04 21:56:41 UTC, 2 replies.
- Re: Extra stage that executes before triggering computation with an action - posted by Imran Rashid <ir...@cloudera.com> on 2015/05/04 22:03:53 UTC, 0 replies.
- Building DAG from log - posted by Giovanni Paolo Gibilisco <gi...@gmail.com> on 2015/05/04 23:08:26 UTC, 0 replies.
- Spark JVM default memory - posted by Vijayasarathy Kannan <kv...@vt.edu> on 2015/05/04 23:24:40 UTC, 4 replies.
- Re: Kryo serialization of classes in additional jars - posted by Imran Rashid <ir...@cloudera.com> on 2015/05/04 23:47:18 UTC, 2 replies.
- Re: spark kryo serialization question - posted by Imran Rashid <ir...@cloudera.com> on 2015/05/04 23:49:12 UTC, 0 replies.
- OOM error with GMMs on 4GB dataset - posted by Vinay Muttineni <vm...@ebay.com> on 2015/05/05 02:16:47 UTC, 1 replies.
- spark Unable to instantiate org.apache.hadoop.hive.metastore.HiveMetaStoreClient - posted by 鹰 <98...@qq.com> on 2015/05/05 02:49:48 UTC, 1 replies.
- 回复:RE: 回复:Re: sparksql running slow while joining_2_tables. - posted by lu...@sina.com on 2015/05/05 03:51:55 UTC, 3 replies.
- 回复:spark Unable to instantiate org.apache.hadoop.hive.metastore.HiveMetaStoreClient - posted by lu...@sina.com on 2015/05/05 03:56:33 UTC, 2 replies.
- sparksql support hive view - posted by lu...@sina.com on 2015/05/05 04:12:12 UTC, 1 replies.
- Nightly builds/releases? - posted by Ankur Chauhan <ac...@brightcove.com> on 2015/05/05 04:25:01 UTC, 3 replies.
- Help with Spark SQL Hash Distribution - posted by Mani <ma...@vt.edu> on 2015/05/05 06:05:05 UTC, 1 replies.
- Re: sparksql running slow while joining_2_tables. - posted by "Cheng, Hao" <ha...@intel.com> on 2015/05/05 07:18:05 UTC, 0 replies.
- Unable to join table across data sources using sparkSQL - posted by Ishwardeep Singh <is...@impetus.co.in> on 2015/05/05 09:20:12 UTC, 5 replies.
- 回复:RE: 回复:spark Unable to instantiate org.apache.hadoop.hive.metastore.HiveMetaStoreClient - posted by 鹰 <98...@qq.com> on 2015/05/05 09:32:39 UTC, 0 replies.
- 回复: spark Unable to instantiate org.apache.hadoop.hive.metastore.HiveMetaStoreClient - posted by 鹰 <98...@qq.com> on 2015/05/05 10:12:40 UTC, 0 replies.
- setting spark configuration properties problem - posted by Hafiz Mujadid <ha...@gmail.com> on 2015/05/05 11:46:44 UTC, 0 replies.
- JAVA for SPARK certification - posted by Gourav Sengupta <go...@gmail.com> on 2015/05/05 11:56:34 UTC, 9 replies.
- example code for current date in spark sql - posted by kiran mavatoor <ki...@yahoo.com.INVALID> on 2015/05/05 11:56:47 UTC, 0 replies.
- Two DataFrames with different schema, unionAll issue. - posted by Wilhelm <ni...@gmail.com> on 2015/05/05 12:24:17 UTC, 2 replies.
- spark sql, creating literal columns in java. - posted by Jan-Paul Bultmann <ja...@me.com> on 2015/05/05 13:15:43 UTC, 1 replies.
- Re: RDD coalesce or repartition by #records or #bytes? - posted by Du Li <li...@yahoo-inc.com.INVALID> on 2015/05/05 13:26:13 UTC, 0 replies.
- 回复:Re: sparksql running slow while joining_2_tables. - posted by lu...@sina.com on 2015/05/05 13:50:22 UTC, 1 replies.
- Spark + Kakfa with directStream - posted by Guillermo Ortiz <ko...@gmail.com> on 2015/05/05 14:46:04 UTC, 1 replies.
- multiple hdfs folder & files input to PySpark - posted by Oleg Ruchovets <or...@gmail.com> on 2015/05/05 14:59:14 UTC, 2 replies.
- How to separate messages of different topics. - posted by Guillermo Ortiz <ko...@gmail.com> on 2015/05/05 15:29:50 UTC, 1 replies.
- Parquet number of partitions - posted by Eric Eijkelenboom <er...@gmail.com> on 2015/05/05 15:56:57 UTC, 3 replies.
- Where does Spark persist RDDs on disk? - posted by Haoliang Quan <ha...@gmail.com> on 2015/05/05 17:07:47 UTC, 1 replies.
- Escaping user input for Hive queries - posted by Yana Kadiyska <ya...@gmail.com> on 2015/05/05 17:15:08 UTC, 0 replies.
- Possible to disable Spark HTTP server ? - posted by roy <rp...@njit.edu> on 2015/05/05 17:41:50 UTC, 1 replies.
- Inserting Nulls - posted by Masf <ma...@gmail.com> on 2015/05/05 18:00:45 UTC, 1 replies.
- Spark applications Web UI at 4040 doesn't exist - posted by "marco.doncel" <ma...@gmail.com> on 2015/05/05 18:19:36 UTC, 0 replies.
- Number of files to load - posted by Rendy Bambang Junior <re...@gmail.com> on 2015/05/05 18:45:43 UTC, 3 replies.
- saveAsTextFile() to save output of Spark program to HDFS - posted by Sudarshan <nj...@gmail.com> on 2015/05/05 21:03:38 UTC, 5 replies.
- Parquet Partition Strategy - how to partition data correctly - posted by Todd Nist <ts...@gmail.com> on 2015/05/05 21:34:41 UTC, 0 replies.
- Maximum Core Utilization - posted by Manu Kaul <ma...@gmail.com> on 2015/05/05 21:55:40 UTC, 2 replies.
- Multilabel Classification in spark - posted by peterg <pe...@garbers.me> on 2015/05/05 22:13:41 UTC, 4 replies.
- Help with datetime comparison in SparkSQL statement ... - posted by "subscriptions@prismalytics.io" <su...@prismalytics.io> on 2015/05/05 22:47:04 UTC, 0 replies.
- Map one RDD into two RDD - posted by Bill Q <bi...@gmail.com> on 2015/05/05 23:42:23 UTC, 11 replies.
- Configuring Number of Nodes with Standalone Scheduler - posted by "Nastooh Avessta (navesta)" <na...@cisco.com> on 2015/05/05 23:49:31 UTC, 0 replies.
- Spark SQL Standalone mode missing parquet? - posted by Manu Mukerji <ma...@gmail.com> on 2015/05/05 23:58:24 UTC, 0 replies.
- [ANNOUNCE] Ending Java 6 support in Spark 1.5 (Sep 2015) - posted by Reynold Xin <rx...@databricks.com> on 2015/05/06 00:25:21 UTC, 0 replies.
- AvroFiles - posted by Pankaj Deshpande <pp...@gmail.com> on 2015/05/06 01:09:38 UTC, 4 replies.
- Re: Join between Streaming data vs Historical Data in spark - posted by Rendy Bambang Junior <re...@gmail.com> on 2015/05/06 01:59:27 UTC, 0 replies.
- MLlib libsvm isssues with data - posted by doyere <do...@doyere.cn> on 2015/05/06 02:59:38 UTC, 1 replies.
- Possible to use hive-config.xml instead of hive-site.xml for HiveContext? - posted by nitinkak001 <ni...@gmail.com> on 2015/05/06 03:25:06 UTC, 1 replies.
- overloaded method constructor Strategy with alternatives - posted by xweb <as...@gmail.com> on 2015/05/06 03:37:51 UTC, 2 replies.
- what does "Container exited with a non-zero exit code 10" means? - posted by felicia <sh...@tsmc.com> on 2015/05/06 03:54:27 UTC, 1 replies.
- Using spark streaming to load data from Kafka to HDFS - posted by Rendy Bambang Junior <re...@gmail.com> on 2015/05/06 06:13:57 UTC, 3 replies.
- 回复:回复:RE: 回复:Re: sparksql running slow while joining_2_tables. - posted by lu...@sina.com on 2015/05/06 08:04:11 UTC, 1 replies.
- Job executed with no data in Spark Straming. - posted by secfree <zz...@gmail.com> on 2015/05/06 09:02:51 UTC, 0 replies.
- Creating topology in spark streaming - posted by anshu shukla <an...@gmail.com> on 2015/05/06 09:53:35 UTC, 8 replies.
- The explanation of input text format using LDA in Spark - posted by Cui xp <li...@gmail.com> on 2015/05/06 10:27:52 UTC, 3 replies.
- Re:MLlib libsvm isssues with data - posted by doyere <do...@doyere.cn> on 2015/05/06 10:42:48 UTC, 0 replies.
- Re: SparkR: filter() function? - posted by himaeda <hi...@deloitte.co.uk> on 2015/05/06 10:55:52 UTC, 1 replies.
- Re: Error in SparkSQL/Scala IDE - posted by Iulian Dragoș <iu...@typesafe.com> on 2015/05/06 11:12:02 UTC, 3 replies.
- Partition Case Class RDD without ParRDDFunctions - posted by Night Wolf <ni...@gmail.com> on 2015/05/06 11:14:26 UTC, 3 replies.
- large volume spark job spends most of the time in AppendOnlyMap.changeValue - posted by Michal Haris <mi...@visualdna.com> on 2015/05/06 11:45:14 UTC, 6 replies.
- spark jobs input/output http request - posted by Saurabh Gupta <sa...@semusi.com> on 2015/05/06 12:30:50 UTC, 0 replies.
- FAILED_TO_UNCOMPRESS(5) errors when fetching shuffle data with sort-based shuffle - posted by Jianshi Huang <ji...@gmail.com> on 2015/05/06 12:45:20 UTC, 2 replies.
- Kryo read method never called before reducing - posted by Florian Hussonnois <fh...@gmail.com> on 2015/05/06 12:53:26 UTC, 0 replies.
- Receiver Fault Tolerance - posted by James King <ja...@gmail.com> on 2015/05/06 14:09:28 UTC, 4 replies.
- No space left on device?? - posted by Yifan LI <ia...@gmail.com> on 2015/05/06 14:35:14 UTC, 5 replies.
- union eatch streaming window into a static rdd and use the static rdd periodicity - posted by lisendong <li...@163.com> on 2015/05/06 14:50:36 UTC, 0 replies.
- Re: How to add jars to standalone pyspark program - posted by mj <jo...@gmail.com> on 2015/05/06 15:43:26 UTC, 1 replies.
- Spark Kryo read method never called before reducing - posted by amine_901 <ch...@gmail.com> on 2015/05/06 16:20:08 UTC, 0 replies.
- how to use rdd.countApprox - posted by Du Li <li...@yahoo-inc.com.INVALID> on 2015/05/06 16:53:53 UTC, 10 replies.
- java.io.IOException: org.apache.spark.SparkException: Failed to get broadcast_2_piece0 - posted by "Wang, Ningjun (LNG-NPV)" <ni...@lexisnexis.com> on 2015/05/06 17:03:05 UTC, 0 replies.
- Re: java.io.IOException: org.apache.spark.SparkException: Failed to get broadcast_2_piece0 - posted by Ted Yu <yu...@gmail.com> on 2015/05/06 17:32:00 UTC, 5 replies.
- Stop Cluster Mode Running App - posted by James King <ja...@gmail.com> on 2015/05/06 18:02:58 UTC, 2 replies.
- Re: Implicit matrix factorization returning different results between spark 1.2.0 and 1.3.0 - posted by Ravi Mody <rm...@gmail.com> on 2015/05/06 18:29:35 UTC, 2 replies.
- Reading large files - posted by Vijayasarathy Kannan <kv...@vt.edu> on 2015/05/06 19:38:03 UTC, 3 replies.
- DataFrame DSL documentation - posted by Gerard Maas <ge...@gmail.com> on 2015/05/06 20:41:40 UTC, 1 replies.
- spark-shell breaks for scala 2.11 (with yarn)? - posted by Koert Kuipers <ko...@tresata.com> on 2015/05/06 21:05:46 UTC, 1 replies.
- question about the TFIDF. - posted by Dan Dong <do...@gmail.com> on 2015/05/06 21:44:31 UTC, 1 replies.
- (Unknown) - posted by anshu shukla <an...@gmail.com> on 2015/05/07 01:21:46 UTC, 0 replies.
- How to specify Worker and Master LOG folders? - posted by "Ulanov, Alexander" <al...@hp.com> on 2015/05/07 01:22:36 UTC, 0 replies.
- Re: - posted by Shixiong Zhu <zs...@gmail.com> on 2015/05/07 01:30:45 UTC, 0 replies.
- How update counter in cassandra - posted by Sergio Jiménez Barrio <dr...@gmail.com> on 2015/05/07 02:35:21 UTC, 1 replies.
- YARN mode startup takes too long (10+ secs) - posted by Taeyun Kim <ta...@innowireless.com> on 2015/05/07 03:02:43 UTC, 4 replies.
- Spark 1.3.1 and Parquet Partitions - posted by vasuki <va...@gmail.com> on 2015/05/07 04:32:25 UTC, 5 replies.
- Spark updateStateByKey fails with class leak when using case classes - resend - posted by rsearle <eg...@verizon.net> on 2015/05/07 04:48:56 UTC, 1 replies.
- How can I force operations to complete and spool to disk - posted by Steve Lewis <lo...@gmail.com> on 2015/05/07 07:16:54 UTC, 2 replies.
- Spark Job triggers second attempt - posted by ๏̯͡๏ <ÐΞ€ρ@Ҝ>, de...@gmail.com on 2015/05/07 08:34:06 UTC, 2 replies.
- Spark does not delete temporary directories - posted by Taeyun Kim <ta...@innowireless.com> on 2015/05/07 08:39:32 UTC, 6 replies.
- User Defined Type (UDT) - posted by Wojtek Jurczyk <wo...@gmail.com> on 2015/05/07 11:10:51 UTC, 4 replies.
- update resource when running spark - posted by Hoai-Thu Vuong <th...@gmail.com> on 2015/05/07 12:22:22 UTC, 0 replies.
- Not maximum CPU usage - posted by Krever <w....@gmail.com> on 2015/05/07 13:11:28 UTC, 0 replies.
- Cached RDD is not evenly split between executors - posted by Night Wolf <ni...@gmail.com> on 2015/05/07 14:16:18 UTC, 1 replies.
- Re: Sort (order by) of the big dataset - posted by Night Wolf <ni...@gmail.com> on 2015/05/07 14:26:19 UTC, 3 replies.
- Re: reduceByKeyAndWindow, but using log timestamps instead of clock seconds - posted by allonsy <lu...@gmail.com> on 2015/05/07 15:41:21 UTC, 0 replies.
- saveAsTable fails on Python with "Unresolved plan found" - posted by Judy Nash <ju...@exchange.microsoft.com> on 2015/05/07 16:26:04 UTC, 2 replies.
- Avro to Parquet ? - posted by ๏̯͡๏ <ÐΞ€ρ@Ҝ>, de...@gmail.com on 2015/05/07 17:29:44 UTC, 2 replies.
- RandomSplit with Spark-ML and Dataframe - posted by Olivier Girardot <o....@lateral-thoughts.com> on 2015/05/07 17:39:05 UTC, 2 replies.
- CompositeInputFormat implementation in Spark - posted by Bill Q <bi...@gmail.com> on 2015/05/07 18:02:07 UTC, 0 replies.
- branch-1.4 scala 2.11 - posted by Koert Kuipers <ko...@tresata.com> on 2015/05/07 18:07:19 UTC, 1 replies.
- history server - posted by Koert Kuipers <ko...@tresata.com> on 2015/05/07 20:00:58 UTC, 8 replies.
- Re: Spark unit test fails - posted by NoWisdom <cg...@gmail.com> on 2015/05/07 20:54:35 UTC, 0 replies.
- Re: saveAsTable fails on Python with "Unresolved plan found" - posted by Michael Armbrust <mi...@databricks.com> on 2015/05/07 21:22:01 UTC, 0 replies.
- Blocking DStream.forEachRDD() - posted by Corey Nolet <cj...@gmail.com> on 2015/05/07 21:30:21 UTC, 1 replies.
- Predict.scala using model for clustering In reference - posted by anshu shukla <an...@gmail.com> on 2015/05/07 21:40:03 UTC, 1 replies.
- AWS-Credentials fails with org.apache.hadoop.fs.s3.S3Exception: FORBIDDEN - posted by in4maniac <sa...@skimlinks.com> on 2015/05/07 21:51:28 UTC, 2 replies.
- Virtualenv pyspark - posted by alemagnani <al...@gmail.com> on 2015/05/07 23:20:30 UTC, 1 replies.
- Re: Loading file content based on offsets into the memory - posted by in4maniac <sa...@skimlinks.com> on 2015/05/07 23:34:26 UTC, 1 replies.
- Getting data into Spark Streaming - posted by Sathaye <sa...@gmail.com> on 2015/05/08 00:59:12 UTC, 1 replies.
- Duplicate entries in output of mllib column similarities - posted by rbolkey <rb...@gmail.com> on 2015/05/08 01:17:47 UTC, 5 replies.
- 回复:RE: 回复:回复:RE: 回复:Re:_sparksql_running_slow_while_joining_2_tables. - posted by lu...@sina.com on 2015/05/08 02:14:02 UTC, 0 replies.
- 回复:回复:RE: 回复:回复:RE: 回复:Re:_sparksql_running_slow_while_joining_2_tables. - posted by lu...@sina.com on 2015/05/08 03:01:56 UTC, 0 replies.
- 回复:回复:回复:RE: 回复:回复:RE: 回复:Re:_sparksql_running_slow_while_joining_2_tables. - posted by lu...@sina.com on 2015/05/08 03:56:20 UTC, 0 replies.
- SparkSQL issue: Spark 1.3.1 + hadoop 2.6 on CDH5.3 with parquet - posted by felicia <sh...@tsmc.com> on 2015/05/08 04:39:24 UTC, 2 replies.
- Re: Spark 1.3 createDataframe error with pandas df - posted by kevindahl <ke...@gmail.com> on 2015/05/08 05:01:09 UTC, 0 replies.
- Dismatch when use sparkSQL insert data into a hive table with dynamic partition datetype - posted by Gerald-G <sh...@gmail.com> on 2015/05/08 05:07:37 UTC, 1 replies.
- Is it possible to set the akka specify properties (akka.extensions) in spark - posted by Terry Hole <hu...@gmail.com> on 2015/05/08 05:19:14 UTC, 4 replies.
- Discretization - posted by spark_user_2015 <li...@adobe.com> on 2015/05/08 06:20:45 UTC, 1 replies.
- Master node memory usage question - posted by Richard Alex Hofer <rh...@andrew.cmu.edu> on 2015/05/08 06:50:19 UTC, 2 replies.
- (无主题) - posted by lu...@sina.com on 2015/05/08 07:53:14 UTC, 1 replies.
- Possible long lineage issue when using DStream to update a normal RDD - posted by yaochunnan <ya...@gmail.com> on 2015/05/08 07:56:58 UTC, 3 replies.
- Spark ThirftServer storage display error on WebUI - posted by "guoqing0629@yahoo.com.hk" <gu...@yahoo.com.hk> on 2015/05/08 09:28:26 UTC, 0 replies.
- updateStateByKey - how to generate a stream of state changes? - posted by minisaw <mi...@gmail.com> on 2015/05/08 09:38:16 UTC, 2 replies.
- SparkStreaming + Flume/PDI+Kafka - posted by "GARCIA MIGUEL, DAVID" <dg...@serikat.es> on 2015/05/08 10:22:41 UTC, 0 replies.
- [SparkSQL] cannot filter by a DateType column - posted by Haopu Wang <HW...@qilinsoft.com> on 2015/05/08 10:36:58 UTC, 2 replies.
- [SQL][Dataframe] Change data source after saveAsParquetFile - posted by Peter Rudenko <pe...@gmail.com> on 2015/05/08 15:02:04 UTC, 3 replies.
- filterRDD and flatMap - posted by hmaeda <hi...@gmail.com> on 2015/05/08 16:36:00 UTC, 0 replies.
- Submit Spark application in cluster mode and supervised - posted by James King <ja...@gmail.com> on 2015/05/08 17:22:31 UTC, 3 replies.
- Cluster mode and supervised app with multiple Masters - posted by James King <ja...@gmail.com> on 2015/05/08 17:35:32 UTC, 0 replies.
- dependencies on java-netlib and jblas - posted by John Niekrasz <jo...@gmail.com> on 2015/05/08 18:04:41 UTC, 2 replies.
- parallelism on binary file - posted by tog <gu...@gmail.com> on 2015/05/08 19:03:54 UTC, 1 replies.
- Cassandra number of Tasks - posted by Vijay Pawnarkar <vi...@gmail.com> on 2015/05/08 20:29:27 UTC, 3 replies.
- Spark Cassandra connector number of Tasks - posted by vijaypawnarkar <vi...@gmail.com> on 2015/05/08 20:42:45 UTC, 1 replies.
- Lambda architecture using Apache Spark - posted by rafac <ra...@hotmail.com> on 2015/05/08 22:48:12 UTC, 1 replies.
- Spark + Kinesis + Stream Name + Cache? - posted by Mike Trienis <mi...@orcsol.com> on 2015/05/08 23:06:57 UTC, 3 replies.
- Spark streaming updating a large window more frequently - posted by Ankur Chauhan <ac...@brightcove.com> on 2015/05/08 23:26:42 UTC, 1 replies.
- CREATE TABLE ignores database when using PARQUET option - posted by Carlos Pereira <cp...@groupon.com> on 2015/05/08 23:41:27 UTC, 4 replies.
- Hash Partitioning and Dataframes - posted by "Daniel, Ronald (ELS-SDG)" <R....@elsevier.com> on 2015/05/08 23:47:05 UTC, 3 replies.
- Spark SQL: STDDEV working in Spark Shell but not in a standalone app - posted by barmaley <ol...@solver.com> on 2015/05/09 01:44:32 UTC, 5 replies.
- Re: Spark SQL and Hive interoperability - posted by jdavidmitchell <jd...@gmail.com> on 2015/05/09 03:32:36 UTC, 1 replies.
- Using Pandas/Scikit Learning in Pyspark - posted by Bin Wang <bi...@gmail.com> on 2015/05/09 06:55:44 UTC, 1 replies.
- spark and binary files - posted by tog <gu...@gmail.com> on 2015/05/09 08:48:34 UTC, 2 replies.
- How to implement an Evaluator for a ML pipeline? - posted by "Stefan H." <tw...@gmx.de> on 2015/05/09 21:15:01 UTC, 2 replies.
- Spark can not access jar from HDFS !! - posted by Ravindra <ra...@gmail.com> on 2015/05/09 21:32:15 UTC, 3 replies.
- Spark streaming closes with Cassandra Conector - posted by Sergio Jiménez Barrio <dr...@gmail.com> on 2015/05/09 23:32:36 UTC, 7 replies.
- custom join using complex keys - posted by Mathieu D <ma...@gmail.com> on 2015/05/10 00:02:19 UTC, 2 replies.
- Spark SQL and java.lang.RuntimeException - posted by Nick Travers <n....@gmail.com> on 2015/05/10 04:34:26 UTC, 1 replies.
- Is the AMP lab done next February? - posted by Justin Pihony <ju...@gmail.com> on 2015/05/10 05:43:24 UTC, 1 replies.
- Re: Spark + Kinesis - posted by Chris Fregly <ch...@fregly.com> on 2015/05/10 06:14:34 UTC, 1 replies.
- Re: JavaKinesisWordCountASLYARN Example not working on EMR - posted by Chris Fregly <ch...@fregly.com> on 2015/05/10 06:18:54 UTC, 0 replies.
- Find KNN in Spark SQL - posted by Dong Li <li...@lidong.net.cn> on 2015/05/10 06:25:42 UTC, 2 replies.
- Does NullWritable can not be used in Spark? - posted by donhoff_h <16...@qq.com> on 2015/05/10 06:58:26 UTC, 2 replies.
- spark streaming and computation - posted by skippi <sk...@gmx.de> on 2015/05/10 11:19:05 UTC, 1 replies.
- EVent generation - posted by anshu shukla <an...@gmail.com> on 2015/05/10 11:51:21 UTC, 4 replies.
- Multiple DataFrames per Parquet file? - posted by Peter Aberline <pe...@gmail.com> on 2015/05/10 16:36:55 UTC, 4 replies.
- Re: Nullable is true for the schema of parquet data - posted by dsgriffin <ds...@gmail.com> on 2015/05/11 04:31:16 UTC, 0 replies.
- Spark on top of YARN Compression in iPython notebook - posted by Bin Wang <bi...@gmail.com> on 2015/05/11 05:42:45 UTC, 0 replies.
- guardian failed, shutting down system - posted by 董帅阳 <91...@qq.com> on 2015/05/11 07:56:14 UTC, 1 replies.
- Python -> SQL (geonames dataset) - posted by Tyler Mitchell <Ty...@actian.com> on 2015/05/11 08:22:09 UTC, 1 replies.
- [SparkSQL 1.4.0] groupBy columns are always nullable? - posted by Haopu Wang <HW...@qilinsoft.com> on 2015/05/11 10:48:30 UTC, 7 replies.
- spark mllib kmeans - posted by Pa Rö <pa...@googlemail.com> on 2015/05/11 14:59:20 UTC, 4 replies.
- how to load some of the files in a dir and monitor new file in that dir in spark streaming without missing? - posted by lisendong <li...@163.com> on 2015/05/11 15:01:30 UTC, 1 replies.
- Reading Nested Fields in DataFrames - posted by Ashish Kumar Singh <as...@gmail.com> on 2015/05/11 16:05:13 UTC, 3 replies.
- can we start a new thread in foreachRDD in spark streaming? - posted by hotdog <li...@163.com> on 2015/05/11 16:08:21 UTC, 1 replies.
- Re: SQL UserDefinedType can't be saved in parquet file when using assembly jar - posted by Jaonary Rabarisoa <ja...@gmail.com> on 2015/05/11 16:39:47 UTC, 2 replies.
- spark : use the global config variables in executors - posted by hotdog <li...@163.com> on 2015/05/11 16:55:19 UTC, 1 replies.
- It takes too long (30 seconds) to create Spark Context with SPARK/YARN - posted by stanley <wa...@yahoo.com> on 2015/05/11 17:15:45 UTC, 1 replies.
- Re: Python -> SQL can't find table (Zeppelin) - posted by Tyler Mitchell <Ty...@actian.com> on 2015/05/11 17:16:33 UTC, 0 replies.
- Does long-lived SparkContext hold on to executor resources? - posted by stanley <wa...@yahoo.com> on 2015/05/11 17:23:54 UTC, 3 replies.
- Re: NoClassDefFoundError for NativeS3FileSystem in pyspark (1.3.1) - posted by steaz <cs...@comcast.net> on 2015/05/11 17:45:45 UTC, 0 replies.
- Unread block data - posted by Guy Needham <gu...@gmail.com> on 2015/05/11 18:03:29 UTC, 0 replies.
- Met a problem when using spark to load parquet files with different version schemas - posted by Wei Yan <yw...@gmail.com> on 2015/05/11 18:59:27 UTC, 4 replies.
- TwitterPopularTags Long Processing Delay - posted by Seyed Majid Zahedi <za...@cs.duke.edu> on 2015/05/11 19:03:09 UTC, 1 replies.
- Re: Getting error running MLlib example with new cluster - posted by Su She <su...@gmail.com> on 2015/05/11 19:59:41 UTC, 1 replies.
- Looking inside the 'mapPartitions' transformation, some confused observations - posted by myasuka <my...@live.com> on 2015/05/11 20:26:21 UTC, 1 replies.
- Stratified sampling with DataFrames - posted by Karthikeyan Muthukumar <mk...@gmail.com> on 2015/05/11 21:32:46 UTC, 1 replies.
- Get a list of temporary RDD tables via Thrift - posted by Judy Nash <ju...@exchange.microsoft.com> on 2015/05/11 21:54:25 UTC, 1 replies.
- Specify Python interpreter - posted by Bin Wang <bi...@gmail.com> on 2015/05/11 22:17:54 UTC, 2 replies.
- Kafka stream fails: java.lang.NoClassDefFound com/yammer/metrics/core/Gauge - posted by Lee McFadden <sp...@gmail.com> on 2015/05/11 22:32:00 UTC, 18 replies.
- Running Spark in local mode seems to ignore local[N] - posted by dgoldenberg <dg...@gmail.com> on 2015/05/11 22:52:58 UTC, 8 replies.
- Spark and RabbitMQ - posted by dgoldenberg <dg...@gmail.com> on 2015/05/11 23:01:12 UTC, 3 replies.
- DStream Union vs. StreamingContext Union - posted by Vadim Bichutskiy <va...@gmail.com> on 2015/05/12 00:49:54 UTC, 9 replies.
- Can standalone cluster manager provide I/O information on worker nodes? - posted by Shiyao Ma <i...@introo.me> on 2015/05/12 02:03:57 UTC, 0 replies.
- Spark SQL ArrayIndexOutOfBoundsException - posted by Mike Frampton <mi...@hotmail.com> on 2015/05/12 04:40:04 UTC, 2 replies.
- How to get Master UI with ZooKeeper HA setup? - posted by Rex Xiong <by...@gmail.com> on 2015/05/12 07:28:49 UTC, 1 replies.
- value toDF is not a member of RDD object - posted by SLiZn Liu <sl...@gmail.com> on 2015/05/12 11:36:09 UTC, 11 replies.
- Why so slow - posted by Jianshi Huang <ji...@gmail.com> on 2015/05/12 12:10:29 UTC, 2 replies.
- Master HA - posted by James King <ja...@gmail.com> on 2015/05/12 13:23:30 UTC, 2 replies.
- Content based filtering - posted by Yasemin Kaya <go...@gmail.com> on 2015/05/12 13:45:08 UTC, 1 replies.
- Reading Real Time Data only from Kafka - posted by James King <ja...@gmail.com> on 2015/05/12 13:45:28 UTC, 16 replies.
- Using sc.HadoopConfiguration in Python - posted by ayan guha <gu...@gmail.com> on 2015/05/12 15:39:08 UTC, 4 replies.
- Re: Re: Sort Shuffle performance issues about using AppendOnlyMap for large data sets - posted by Night Wolf <ni...@gmail.com> on 2015/05/12 16:05:01 UTC, 1 replies.
- How to speed up data ingestion with Spark - posted by dgoldenberg <dg...@gmail.com> on 2015/05/12 16:55:23 UTC, 1 replies.
- Spark Example Project 0.3.0 released - posted by Alex Dean <al...@snowplowanalytics.com> on 2015/05/12 19:58:57 UTC, 0 replies.
- Required settings for permanent HDFS Spark on EC2 - posted by darugar <s...@parand.com> on 2015/05/12 20:15:51 UTC, 0 replies.
- Correct approach to merging data - posted by Anthony Ikeda <an...@gmail.com> on 2015/05/12 21:40:52 UTC, 0 replies.
- [SparkSQL] Partition Autodiscovery (Spark 1.3) - posted by Yana Kadiyska <ya...@gmail.com> on 2015/05/12 22:35:32 UTC, 0 replies.
- Reasons for Pregel being slow - posted by dawiss <da...@gmail.com> on 2015/05/12 22:43:54 UTC, 0 replies.
- kafka + Spark Streaming with checkPointing fails to start with - posted by Ankur Chauhan <ac...@brightcove.com> on 2015/05/12 23:51:21 UTC, 0 replies.
- Fast big data analytics with Spark on Tachyon in Baidu - posted by Haoyuan Li <ha...@gmail.com> on 2015/05/13 01:45:17 UTC, 0 replies.
- Spark on Yarn : Map outputs lifetime ? - posted by Ashwin Shankar <as...@gmail.com> on 2015/05/13 01:50:55 UTC, 1 replies.
- how to set random seed - posted by Charles Hayden <ch...@atigeo.com> on 2015/05/13 02:40:30 UTC, 5 replies.
- question about customize kmeans distance measure - posted by June <zh...@yahoo.com.INVALID> on 2015/05/13 04:23:46 UTC, 0 replies.
- Worker & Core in Spark - posted by "guoqing0629@yahoo.com.hk" <gu...@yahoo.com.hk> on 2015/05/13 04:43:23 UTC, 1 replies.
- how to monitor multi directories in spark streaming task - posted by hotdog <li...@163.com> on 2015/05/13 09:53:41 UTC, 3 replies.
- kafka + Spark Streaming with checkPointing fails to restart - posted by Ankur Chauhan <ac...@brightcove.com> on 2015/05/13 10:53:37 UTC, 5 replies.
- Increase maximum amount of columns for covariance matrix for principal components - posted by Sebastian Alfers <se...@googlemail.com> on 2015/05/13 10:57:38 UTC, 1 replies.
- Re: How to get applicationId for yarn mode(both yarn-client and yarn-cluster mode) - posted by thanhtien522 <th...@gmail.com> on 2015/05/13 11:23:30 UTC, 0 replies.
- Backward compatibility with org.apache.spark.sql.api.java.Row class - posted by Emerson Castañeda <em...@gmail.com> on 2015/05/13 11:48:30 UTC, 1 replies.
- Kafka Direct Approach + Zookeeper - posted by James King <ja...@gmail.com> on 2015/05/13 12:00:59 UTC, 5 replies.
- com.esotericsoftware.kryo.KryoException: java.io.IOException: Stream is corrupted - posted by Yifan LI <ia...@gmail.com> on 2015/05/13 13:01:33 UTC, 1 replies.
- [Spark SQL 1.3.1] data frame saveAsTable returns exception - posted by Ishwardeep Singh <is...@impetus.co.in> on 2015/05/13 13:32:08 UTC, 6 replies.
- Removing FINISHED applications and shuffle data - posted by sayantini <sa...@gmail.com> on 2015/05/13 14:31:13 UTC, 0 replies.
- Building Spark - posted by Heisenberg Bb <hb...@gmail.com> on 2015/05/13 14:57:01 UTC, 2 replies.
- applications are still in progress? - posted by Yifan LI <ia...@gmail.com> on 2015/05/13 15:04:59 UTC, 1 replies.
- Spark and Flink - posted by Pa Rö <pa...@googlemail.com> on 2015/05/13 15:07:52 UTC, 6 replies.
- Kafka + Direct + Zookeeper - posted by James King <ja...@gmail.com> on 2015/05/13 15:10:06 UTC, 2 replies.
- JavaPairRDD - posted by Yasemin Kaya <go...@gmail.com> on 2015/05/13 15:12:56 UTC, 2 replies.
- how to read lz4 compressed data using fileStream of spark streaming? - posted by hotdog <li...@163.com> on 2015/05/13 15:48:46 UTC, 6 replies.
- NullPointerException while creating DataFrame from an S3 Avro Object - posted by Mohammad Tariq <do...@gmail.com> on 2015/05/13 16:02:53 UTC, 0 replies.
- Spark Sorted DataFrame & Repartitioning - posted by Night Wolf <ni...@gmail.com> on 2015/05/13 16:09:33 UTC, 0 replies.
- Worker Spark Port - posted by James King <ja...@gmail.com> on 2015/05/13 16:38:19 UTC, 5 replies.
- force the kafka consumer process to different machines - posted by hotdog <li...@163.com> on 2015/05/13 17:33:46 UTC, 5 replies.
- Spark SQL: preferred syntax for column reference? - posted by Diana Carroll <dc...@cloudera.com> on 2015/05/13 18:55:12 UTC, 3 replies.
- Word2Vec with billion-word corpora - posted by Shilad Sen <ss...@macalester.edu> on 2015/05/13 19:00:11 UTC, 4 replies.
- data schema and serialization format suggestions - posted by Ankur Chauhan <ac...@brightcove.com> on 2015/05/13 19:34:33 UTC, 0 replies.
- PostgreSQL JDBC Classpath Issue - posted by George Adams <g....@gmail.com> on 2015/05/13 21:21:16 UTC, 0 replies.
- Trouble trying to run ./spark-ec2 script - posted by Su She <su...@gmail.com> on 2015/05/13 22:33:12 UTC, 2 replies.
- Problem with current spark - posted by Giovanni Paolo Gibilisco <gi...@gmail.com> on 2015/05/13 23:02:50 UTC, 1 replies.
- spark-streaming whit flume error - posted by 鹰 <98...@qq.com> on 2015/05/14 05:14:50 UTC, 1 replies.
- --jars works in "yarn-client" but not "yarn-cluster" mode, why? - posted by Fengyun RAO <ra...@gmail.com> on 2015/05/14 05:37:30 UTC, 6 replies.
- Spark recovery takes long - posted by NB <nb...@gmail.com> on 2015/05/14 05:49:59 UTC, 0 replies.
- Spark performance in cluster mode using yarn - posted by sachin Singh <sa...@gmail.com> on 2015/05/14 07:02:04 UTC, 2 replies.
- How to run multiple jobs in one sparkcontext from separate threads in pyspark? - posted by MEETHU MATHEW <me...@yahoo.co.in> on 2015/05/14 07:40:05 UTC, 6 replies.
- spark sql hive-shims - posted by Lior Chaga <li...@taboola.com> on 2015/05/14 07:53:20 UTC, 3 replies.
- Unsubscribe - posted by Saurabh Agrawal <sa...@markit.com> on 2015/05/14 09:38:28 UTC, 2 replies.
- [Unit Test Failure] Test org.apache.spark.streaming.JavaAPISuite.testCount failed - posted by kf <wa...@huawei.com> on 2015/05/14 09:55:22 UTC, 3 replies.
- swap tuple - posted by Yasemin Kaya <go...@gmail.com> on 2015/05/14 10:15:01 UTC, 4 replies.
- Build change PSA: Hadoop 2.2 default; -Phadoop-x.y profile recommended for builds - posted by Sean Owen <so...@cloudera.com> on 2015/05/14 11:12:03 UTC, 0 replies.
- how to delete data from table in sparksql - posted by lu...@sina.com on 2015/05/14 12:26:39 UTC, 2 replies.
- SPARKTA: a real-time aggregation engine based on Spark Streaming - posted by dmoralesdf <dm...@stratio.com> on 2015/05/14 15:31:35 UTC, 8 replies.
- reduceByKey - posted by Yasemin Kaya <go...@gmail.com> on 2015/05/14 15:40:43 UTC, 2 replies.
- Multiple Kinesis Streams in a single Streaming job - posted by Erich Ess <er...@simplerelevance.com> on 2015/05/14 18:00:09 UTC, 5 replies.
- Storing a lot of state with updateStateByKey - posted by "krot.vyacheslav" <kr...@gmail.com> on 2015/05/14 18:00:44 UTC, 0 replies.
- store hive metastore on persistent store - posted by jamborta <ja...@gmail.com> on 2015/05/14 19:24:02 UTC, 9 replies.
- Restricting the number of iterations in Mllib Kmeans - posted by Suman Somasundar <su...@oracle.com> on 2015/05/14 21:05:57 UTC, 2 replies.
- spark log field clarification - posted by yanwei <ec...@gmail.com> on 2015/05/14 21:22:16 UTC, 2 replies.
- Per-machine configuration? - posted by mj...@columbus.rr.com on 2015/05/14 21:41:03 UTC, 0 replies.
- Custom Aggregate Function for DataFrame - posted by Justin Yip <yi...@prediction.io> on 2015/05/14 21:49:05 UTC, 3 replies.
- Hive partition table + read using hiveContext + spark 1.3.1 - posted by SamyaMaiti <sa...@gmail.com> on 2015/05/14 22:30:24 UTC, 0 replies.
- textFileStream Question - posted by Vadim Bichutskiy <va...@gmail.com> on 2015/05/14 22:55:38 UTC, 2 replies.
- Data Load - Newbie - posted by Ricardo Goncalves da Silva <ri...@telefonica.com> on 2015/05/14 23:17:40 UTC, 0 replies.
- LogisticRegressionWithLBFGS with large feature set - posted by Pala M Muthaia <mc...@rocketfuelinc.com> on 2015/05/15 00:44:23 UTC, 2 replies.
- [SparkStreaming] Is it possible to delay the start of some DStream in the application? - posted by Haopu Wang <HW...@qilinsoft.com> on 2015/05/15 01:09:08 UTC, 4 replies.
- RE: Is it feasible to keep millions of keys in state of Spark Streaming job for two months? - posted by Haopu Wang <HW...@qilinsoft.com> on 2015/05/15 01:41:15 UTC, 0 replies.
- Spark's Guava pieces cause exceptions in non-trivial deployments - posted by Anton Brazhnyk <an...@genesys.com> on 2015/05/15 01:52:11 UTC, 10 replies.
- Spark Summit 2015 - June 15-17 - Dev list invite - posted by Scott walent <sc...@gmail.com> on 2015/05/15 01:55:55 UTC, 0 replies.
- 回复:textFileStream Question - posted by 董帅阳 <91...@qq.com> on 2015/05/15 04:39:27 UTC, 0 replies.
- question about sparksql caching - posted by sequoiadb <ma...@sequoiadb.com> on 2015/05/15 05:02:22 UTC, 2 replies.
- Spark 1.3.0 -> 1.3.1 produces java.lang.NoSuchFieldError: NO_FILTER - posted by Exie <tf...@prodevelop.com.au> on 2015/05/15 07:21:07 UTC, 2 replies.
- What's the advantage features of Spark SQL(JDBC) - posted by Yi Zhang <zh...@yahoo.com.INVALID> on 2015/05/15 08:29:39 UTC, 4 replies.
- Spark on Mesos vs Yarn - posted by Ankur Chauhan <an...@malloc64.com> on 2015/05/15 08:30:31 UTC, 4 replies.
- Why association with remote system has failed when set master in Spark programmatically - posted by Yi Zhang <zh...@yahoo.com.INVALID> on 2015/05/15 10:06:20 UTC, 2 replies.
- 回复:Re: how to delete data from table in sparksql - posted by lu...@sina.com on 2015/05/15 10:44:31 UTC, 0 replies.
- Grouping and storing unordered time series data stream to HDFS - posted by Nisrina Luthfiyati <ni...@gmail.com> on 2015/05/15 12:10:54 UTC, 3 replies.
- Re: kafka + Spark Streaming with checkPointing fails to start with - posted by Alexander Krasheninnikov <a....@corp.badoo.com> on 2015/05/15 12:38:19 UTC, 0 replies.
- Forbidded : Error Code: 403 - posted by Mohammad Tariq <do...@gmail.com> on 2015/05/15 14:09:38 UTC, 6 replies.
- Hive Skew flag? - posted by Denny Lee <de...@gmail.com> on 2015/05/15 16:37:59 UTC, 0 replies.
- FetchFailedException and MetadataFetchFailedException - posted by rok <ro...@gmail.com> on 2015/05/15 17:07:21 UTC, 4 replies.
- SPARK-4412 regressed? - posted by Yana Kadiyska <ya...@gmail.com> on 2015/05/15 17:19:12 UTC, 2 replies.
- Spark Fair Scheduler for Spark Streaming - 1.2 and beyond - posted by Evo Eftimov <ev...@isecc.com> on 2015/05/15 17:33:43 UTC, 7 replies.
- Re: Spark Job execution time - posted by SamyaMaiti <sa...@gmail.com> on 2015/05/15 19:37:34 UTC, 0 replies.
- Using groupByKey with Spark SQL - posted by Edward Sargisson <ej...@gmail.com> on 2015/05/15 19:48:25 UTC, 1 replies.
- Re: SaveAsTextFile brings down data nodes with IO Exceptions - posted by Puneet Kapoor <pu...@gmail.com> on 2015/05/15 21:03:19 UTC, 3 replies.
- Re: Error communicating with MapOutputTracker - posted by Thomas Gerber <th...@radius.com> on 2015/05/16 00:09:23 UTC, 1 replies.
- Best practice to avoid ambiguous columns in DataFrame.join - posted by Justin Yip <yi...@prediction.io> on 2015/05/16 00:44:40 UTC, 4 replies.
- Broadcast variables can be rebroadcast? - posted by NB <nb...@gmail.com> on 2015/05/16 02:18:11 UTC, 9 replies.
- [spark sql] $ and === can't be recognised in IntelliJ - posted by Yi Zhang <zh...@yahoo.com.INVALID> on 2015/05/16 04:45:02 UTC, 0 replies.
- Join Issue in IntelliJ Idea - posted by Yi Zhang <zh...@yahoo.com.INVALID> on 2015/05/16 05:49:49 UTC, 0 replies.
- How to reshape RDD/Spark DataFrame - posted by macwanjason <ma...@gmail.com> on 2015/05/16 05:52:58 UTC, 1 replies.
- [spark sql] $ and === can't be recognised in IntelliJ - posted by "Yi.Zhang" <zh...@yahoo.com> on 2015/05/16 05:55:55 UTC, 0 replies.
- Spark sql and csv data processing question - posted by Mike Frampton <mi...@hotmail.com> on 2015/05/16 06:17:38 UTC, 1 replies.
- Using Spark SQL in Spark1.2.1 with Hive 0.14 - posted by smazumder <so...@gmail.com> on 2015/05/16 06:29:38 UTC, 0 replies.
- number of executors - posted by xiaohe lan <zo...@gmail.com> on 2015/05/16 09:01:31 UTC, 11 replies.
- Getting the best parameter set back from CrossValidatorModel - posted by Justin Yip <yi...@prediction.io> on 2015/05/16 12:17:14 UTC, 3 replies.
- IF in SQL statement - posted by Antony Mayi <an...@yahoo.com.INVALID> on 2015/05/16 16:21:08 UTC, 1 replies.
- Spark SQL is not able to connect to hive metastore - posted by smazumder <so...@gmail.com> on 2015/05/16 17:14:06 UTC, 4 replies.
- Problem building master on 2.11 - posted by "Fernando O." <fo...@gmail.com> on 2015/05/16 21:09:19 UTC, 0 replies.
- RE: Running Spark/YARN on AWS EMR - Issues finding file on hdfs? - posted by jaredtims <ja...@yahoo.com> on 2015/05/16 22:52:50 UTC, 0 replies.
- Re: zip files submitted with --py-files disappear from hdfs after a while on EMR - posted by jaredtims <ja...@yahoo.com> on 2015/05/16 22:57:58 UTC, 0 replies.
- How can I do pair-wise computation between RDD feature columns? - posted by yaochunnan <ya...@gmail.com> on 2015/05/17 03:13:04 UTC, 0 replies.
- println in spark-shell - posted by xiaohe lan <zo...@gmail.com> on 2015/05/17 11:01:40 UTC, 1 replies.
- Big Data Day LA: FREE Big Data Conference in Los Angeles on June 27, 2015 - posted by Slim Baltagi <sb...@gmail.com> on 2015/05/17 15:47:36 UTC, 0 replies.
- Spark Streaming and reducing latency - posted by dgoldenberg <dg...@gmail.com> on 2015/05/17 15:51:12 UTC, 11 replies.
- Effecient way to fetch all records on a particular node/partition in GraphX - posted by mas <ma...@gmail.com> on 2015/05/17 17:32:32 UTC, 1 replies.
- Trying to understand sc.textFile better - posted by Justin Pihony <ju...@gmail.com> on 2015/05/17 19:01:44 UTC, 0 replies.
- Re: Data partitioning and node tracking in Spark-GraphX - posted by MUHAMMAD AAMIR <ma...@gmail.com> on 2015/05/17 19:55:42 UTC, 0 replies.
- Union of checkpointed RDD in Apache Spark has long (> 10 hour) between-stage latency - posted by Peng Cheng <pc...@uow.edu.au> on 2015/05/17 23:58:35 UTC, 3 replies.
- InferredSchema Example in Spark-SQL - posted by Rajdeep Dua <ra...@gmail.com> on 2015/05/18 02:07:35 UTC, 8 replies.
- Implementing custom metrics under MLPipeline's BinaryClassificationEvaluator - posted by Justin Yip <yi...@prediction.io> on 2015/05/18 07:35:21 UTC, 1 replies.
- Re: StandardScaler failing with OOM errors in PySpark - posted by Xiangrui Meng <me...@gmail.com> on 2015/05/18 08:49:23 UTC, 0 replies.
- Re: bug: numClasses is not a valid argument of LogisticRegressionWithSGD - posted by Xiangrui Meng <me...@gmail.com> on 2015/05/18 08:50:51 UTC, 0 replies.
- Re: MLLib SVMWithSGD is failing for large dataset - posted by Xiangrui Meng <me...@gmail.com> on 2015/05/18 08:55:43 UTC, 0 replies.
- How to debug spark in IntelliJ Idea - posted by "Yi.Zhang" <zh...@yahoo.com> on 2015/05/18 09:37:34 UTC, 0 replies.
- AccessControlException hive table created from spark shell - posted by patcharee <Pa...@uni.no> on 2015/05/18 11:27:34 UTC, 0 replies.
- k-means core function for temporal geo data - posted by Pa Rö <pa...@googlemail.com> on 2015/05/18 11:30:58 UTC, 1 replies.
- NullPointerException when accessing broadcast variable in DStream - posted by hotienvu <ho...@gmail.com> on 2015/05/18 12:07:45 UTC, 0 replies.
- Spark sql error while writing Parquet file- Trying to write more fields than contained in row - posted by "Chandra Mohan, Ananda Vel Murugan" <An...@honeywell.com> on 2015/05/18 12:29:54 UTC, 4 replies.
- Working with slides. How do I know how many times a RDD has been processed? - posted by Guillermo Ortiz <ko...@gmail.com> on 2015/05/18 15:36:14 UTC, 1 replies.
- Processing multiple columns in parallel - posted by Laeeq Ahmed <la...@yahoo.com.INVALID> on 2015/05/18 15:37:56 UTC, 2 replies.
- Spark streaming over a rest API - posted by juandasgandaras <ju...@gmail.com> on 2015/05/18 16:21:23 UTC, 1 replies.
- pass configuration parameters to PySpark job - posted by Oleg Ruchovets <or...@gmail.com> on 2015/05/18 16:26:26 UTC, 1 replies.
- parsedData option - posted by Ricardo Goncalves da Silva <ri...@telefonica.com> on 2015/05/18 16:32:18 UTC, 0 replies.
- py-files (and others?) not properly set up in cluster-mode Spark Yarn job? - posted by Shay Rojansky <ro...@roji.org> on 2015/05/18 18:38:57 UTC, 2 replies.
- org.apache.spark.shuffle.FetchFailedException :: Migration from Spark 1.2 to 1.3 - posted by zia_kayani <zi...@platalytics.com> on 2015/05/18 19:19:31 UTC, 2 replies.
- Spark groupByKey, does it always create at least 1 partition per key? - posted by tomboyle <ic...@gmail.com> on 2015/05/18 19:38:34 UTC, 1 replies.
- Partition number of Spark Streaming Kafka receiver-based approach - posted by Bill Jay <bi...@gmail.com> on 2015/05/19 01:46:57 UTC, 1 replies.
- TwitterUtils on Windows - posted by Justin Pihony <ju...@gmail.com> on 2015/05/19 04:08:27 UTC, 4 replies.
- Spark Streaming graceful shutdown in Spark 1.4 - posted by Dibyendu Bhattacharya <di...@gmail.com> on 2015/05/19 06:43:15 UTC, 8 replies.
- group by and distinct performance issue - posted by "Peer, Oded" <Od...@rsa.com> on 2015/05/19 09:28:30 UTC, 2 replies.
- spark streaming doubt - posted by Shushant Arora <sh...@gmail.com> on 2015/05/19 09:53:27 UTC, 9 replies.
- Spark 1.3.1 Performance Tuning/Patterns for Data Generation Heavy/Throughput Jobs - posted by Night Wolf <ni...@gmail.com> on 2015/05/19 10:36:03 UTC, 1 replies.
- AvroParquetWriter equivalent in Spark 1.3 sqlContext Save or createDataFrame Interfaces? - posted by Ewan Leith <ew...@realitymine.com> on 2015/05/19 10:42:37 UTC, 4 replies.
- How to use spark to access HBase with Security enabled - posted by donhoff_h <16...@qq.com> on 2015/05/19 11:41:33 UTC, 6 replies.
- Spark SQL on large number of columns - posted by madhu phatak <ph...@gmail.com> on 2015/05/19 12:04:20 UTC, 9 replies.
- Reading Binary files in Spark program - posted by Tapan Sharma <ta...@gmail.com> on 2015/05/19 12:27:03 UTC, 6 replies.
- 回复: How to use spark to access HBase with Security enabled - posted by donhoff_h <16...@qq.com> on 2015/05/19 14:23:53 UTC, 4 replies.
- RE: Decision tree: categorical variables - posted by Keerthi <ke...@gmail.com> on 2015/05/19 14:45:40 UTC, 1 replies.
- Hive in IntelliJ - posted by Heisenberg Bb <hb...@gmail.com> on 2015/05/19 15:26:25 UTC, 0 replies.
- Does Python 2.7 have to be installed on every cluster node? - posted by YaoPau <jo...@gmail.com> on 2015/05/19 16:44:43 UTC, 1 replies.
- Re: PySpark Job throwing IOError - posted by "Muralidhar, Nikhil" <Ni...@washpost.com> on 2015/05/19 16:59:18 UTC, 0 replies.
- Multi user setup and saving a DataFrame / RDD to a network exported file system - posted by Tomasz Fruboes <To...@fuw.edu.pl> on 2015/05/19 17:15:47 UTC, 6 replies.
- Windows DOS bug in windows-utils.cmd - posted by Justin Pihony <ju...@gmail.com> on 2015/05/19 17:41:49 UTC, 0 replies.
- Mesos Spark Tasks - Lost - posted by Panagiotis Garefalakis <pa...@gmail.com> on 2015/05/19 17:57:31 UTC, 2 replies.
- Wish for 1.4: upper bound on # tasks in Mesos - posted by Thomas Dudziak <to...@gmail.com> on 2015/05/19 18:39:25 UTC, 4 replies.
- PanTera Big Data Visualization built with Spark - posted by Cyrus Handy <ch...@uncharted.software> on 2015/05/19 19:05:41 UTC, 0 replies.
- Spark Streaming + Kafka failure recovery - posted by Bill Jay <bi...@gmail.com> on 2015/05/19 19:42:15 UTC, 4 replies.
- Problem querying RDD using HiveThriftServer2.startWithContext functionality - posted by fdmitriy <df...@informatica.com> on 2015/05/19 20:00:44 UTC, 0 replies.
- Code error - posted by Ricardo Goncalves da Silva <ri...@telefonica.com> on 2015/05/19 20:59:28 UTC, 2 replies.
- Add to Powered by Spark page - posted by Michal Klos <mi...@gmail.com> on 2015/05/19 21:54:47 UTC, 0 replies.
- Re: question about customize kmeans distance measure - posted by Xiangrui Meng <me...@gmail.com> on 2015/05/19 22:17:23 UTC, 0 replies.
- Spark 1.3 classPath problem - posted by Bill Q <bi...@gmail.com> on 2015/05/19 22:44:23 UTC, 0 replies.
- rdd.sample() methods very slow - posted by "Wang, Ningjun (LNG-NPV)" <ni...@lexisnexis.com> on 2015/05/19 22:44:34 UTC, 6 replies.
- Exception when using CLUSTER BY or ORDER BY - posted by Thomas Dudziak <to...@gmail.com> on 2015/05/19 22:50:58 UTC, 0 replies.
- EOFException using KryoSerializer - posted by Jim Carroll <ji...@gmail.com> on 2015/05/19 22:56:22 UTC, 1 replies.
- Naming an DF aggregated column - posted by Cesar Flores <ce...@gmail.com> on 2015/05/19 23:11:30 UTC, 1 replies.
- How to set the file size for parquet Part - posted by Richard Grossman <ri...@gmail.com> on 2015/05/19 23:36:30 UTC, 1 replies.
- Hive 1.0 support in Spark - posted by Kannan Rajah <kr...@maprtech.com> on 2015/05/20 00:01:39 UTC, 0 replies.
- sparkSQL - Hive metastore connection hangs with MS SQL server - posted by jamborta <ja...@gmail.com> on 2015/05/20 02:09:38 UTC, 0 replies.
- Spark users - posted by Ricardo Goncalves da Silva <ri...@telefonica.com> on 2015/05/20 02:28:56 UTC, 1 replies.
- spark 1.3.1 jars in repo1.maven.org - posted by Edward Sargisson <es...@pobox.com> on 2015/05/20 03:17:33 UTC, 4 replies.
- Spark Job not using all nodes in cluster - posted by Shailesh Birari <sb...@gmail.com> on 2015/05/20 05:16:29 UTC, 2 replies.
- Spark logo license - posted by Justin Pihony <ju...@gmail.com> on 2015/05/20 06:02:14 UTC, 2 replies.
- Hive on Spark VS Spark SQL - posted by "guoqing0629@yahoo.com.hk" <gu...@yahoo.com.hk> on 2015/05/20 07:37:40 UTC, 3 replies.
- Spark Streaming to Kafka - posted by twinkle sachdeva <tw...@gmail.com> on 2015/05/20 07:41:00 UTC, 2 replies.
- java program Get Stuck at broadcasting - posted by allanjie <al...@gmail.com> on 2015/05/20 08:17:36 UTC, 5 replies.
- Is this a good use case for Spark? - posted by jakeheller <ja...@casetext.com> on 2015/05/20 09:38:25 UTC, 1 replies.
- Intermittent difficulties for Worker to contact Master on same machine in standalone - posted by Stephen Boesch <ja...@gmail.com> on 2015/05/20 11:07:00 UTC, 5 replies.
- saveasorcfile on partitioned orc - posted by patcharee <Pa...@uni.no> on 2015/05/20 11:14:15 UTC, 0 replies.
- How to set HBaseConfiguration in Spark - posted by donhoff_h <16...@qq.com> on 2015/05/20 11:21:22 UTC, 1 replies.
- LATERAL VIEW explode issue - posted by kiran mavatoor <ki...@yahoo.com.INVALID> on 2015/05/20 11:57:47 UTC, 2 replies.
- Spark Streaming - Design considerations/Knobs - posted by Hemant Bhanawat <he...@gmail.com> on 2015/05/20 12:40:16 UTC, 4 replies.
- PySpark Logs location - posted by Oleg Ruchovets <or...@gmail.com> on 2015/05/20 13:37:09 UTC, 5 replies.
- Initial job has not accepted any resources - posted by podioss <gr...@hotmail.com> on 2015/05/20 14:59:10 UTC, 0 replies.
- java program got Stuck at broadcasting - posted by allanjie <al...@gmail.com> on 2015/05/20 15:19:23 UTC, 3 replies.
- Re: save column values of DataFrame to text file - posted by allanjie <al...@gmail.com> on 2015/05/20 15:34:53 UTC, 0 replies.
- Re: Incrementally add/remove vertices in GraphX - posted by vzaychik <za...@drexel.edu> on 2015/05/20 17:30:00 UTC, 0 replies.
- IPv6 support - posted by Kevin Liu <ke...@fb.com> on 2015/05/20 19:39:31 UTC, 1 replies.
- FP Growth saveAsTextFile - posted by Eric Tanner <er...@justenough.com> on 2015/05/20 21:16:00 UTC, 2 replies.
- Read multiple files from S3 - posted by lovelylavs <lx...@utdallas.edu> on 2015/05/20 22:15:27 UTC, 1 replies.
- Re: Spark 1.3.1 - SQL Issues - posted by Davies Liu <da...@databricks.com> on 2015/05/20 23:11:55 UTC, 1 replies.
- GradientBoostedTrees.trainRegressor with categoricalFeaturesInfo - posted by Don Drake <do...@gmail.com> on 2015/05/20 23:44:50 UTC, 3 replies.
- Storing data in MySQL from spark hive tables - posted by roni <ro...@gmail.com> on 2015/05/20 23:48:25 UTC, 1 replies.
- Spark Application Dependency Issue - posted by Snehal Nagmote <na...@gmail.com> on 2015/05/21 00:07:20 UTC, 0 replies.
- Compare LogisticRegression results using Mllib with those using other libraries (e.g. statsmodel) - posted by Xin Liu <li...@gmail.com> on 2015/05/21 00:42:24 UTC, 5 replies.
- How to process data in chronological order - posted by roy <rp...@njit.edu> on 2015/05/21 01:03:36 UTC, 1 replies.
- Spatial function in spark - posted by developer developer <de...@gmail.com> on 2015/05/21 03:29:59 UTC, 0 replies.
- Help needed with Py4J - posted by "Addanki, Santosh Kumar" <sa...@sap.com> on 2015/05/21 04:07:47 UTC, 0 replies.
- Re: Help needed with Py4J - posted by Holden Karau <ho...@pigscanfly.ca> on 2015/05/21 04:26:24 UTC, 2 replies.
- Spark build with Hive - posted by "guoqing0629@yahoo.com.hk" <gu...@yahoo.com.hk> on 2015/05/21 05:08:37 UTC, 4 replies.
- Cannot submit SparkPi to Standalone (1.3.1) running on another Server (Both Linux) - posted by Carey Sublette <ca...@gmail.com> on 2015/05/21 06:34:29 UTC, 0 replies.
- Storing spark processed output to Database asynchronously. - posted by Gautam Bajaj <ga...@gmail.com> on 2015/05/21 07:28:28 UTC, 7 replies.
- View all user's application logs in history server - posted by Jianshi Huang <ji...@gmail.com> on 2015/05/21 07:29:05 UTC, 6 replies.
- Unable to use hive queries with constants in predicates - posted by Devarajan Srinivasan <de...@gmail.com> on 2015/05/21 08:10:04 UTC, 1 replies.
- Re: rdd.saveAsTextFile problem - posted by Keerthi <ke...@gmail.com> on 2015/05/21 08:49:40 UTC, 2 replies.
- Question about Serialization in Storage Level - posted by "Jiang, Zhipeng" <zh...@intel.com> on 2015/05/21 09:52:43 UTC, 3 replies.
- Re: [Streaming] Non-blocking recommendation in custom receiver documentation and KinesisReceiver's worker.run blocking calll - posted by Aniket Bhatnagar <an...@gmail.com> on 2015/05/21 10:31:57 UTC, 3 replies.
- DataFrame Column Alias problem - posted by SLiZn Liu <sl...@gmail.com> on 2015/05/21 12:09:34 UTC, 4 replies.
- map reduce ? - posted by Yasemin Kaya <go...@gmail.com> on 2015/05/21 16:13:04 UTC, 0 replies.
- saveAsTextFile() part- files are missing - posted by rroxanaioana <rr...@gmail.com> on 2015/05/21 17:16:18 UTC, 1 replies.
- Spark HistoryServer not coming up - posted by roy <rp...@njit.edu> on 2015/05/21 17:48:17 UTC, 2 replies.
- Query a Dataframe in rdd.map() - posted by ping yan <sh...@gmail.com> on 2015/05/21 19:19:44 UTC, 4 replies.
- Pipelining with Spark - posted by dgoldenberg <dg...@gmail.com> on 2015/05/21 20:45:19 UTC, 0 replies.
- Re: Spark Streaming with Tachyon : Data Loss on Receiver Failure due to WAL error - posted by Tathagata Das <td...@databricks.com> on 2015/05/21 20:54:38 UTC, 1 replies.
- Official Docker container for Spark - posted by tridib <tr...@live.com> on 2015/05/21 21:25:13 UTC, 3 replies.
- Re: Spark Streaming on top of Cassandra? - posted by tshah77 <te...@gmail.com> on 2015/05/21 22:24:13 UTC, 1 replies.
- Re: Connecting to an inmemory database from Spark - posted by tshah77 <te...@gmail.com> on 2015/05/21 22:33:23 UTC, 1 replies.
- Spark MOOC - early access - posted by Marco Shaw <ma...@gmail.com> on 2015/05/21 22:41:21 UTC, 1 replies.
- running spark on yarn - posted by Nathan Kronenfeld <nk...@uncharted.software> on 2015/05/21 23:42:24 UTC, 0 replies.
- foreach plus accumulator Vs mapPartitions performance - posted by ben <de...@gmail.com> on 2015/05/21 23:46:40 UTC, 1 replies.
- foreach vs foreachPartitions - posted by ben <de...@gmail.com> on 2015/05/21 23:50:41 UTC, 0 replies.
- Re: S3NativeFileSystem inefficient implementation when calling sc.textFile - posted by Peng Cheng <pc...@uow.edu.au> on 2015/05/22 00:16:42 UTC, 0 replies.
- Pandas timezone problems - posted by Def_Os <nj...@gmail.com> on 2015/05/22 00:16:58 UTC, 1 replies.
- Re: NegativeArraySizeException when doing joins on skewed data - posted by jstripit <ja...@concur.com> on 2015/05/22 00:47:02 UTC, 0 replies.
- Re: [pyspark] Starting workers in a virtualenv - posted by Davies Liu <da...@databricks.com> on 2015/05/22 03:15:50 UTC, 1 replies.
- Kmeans Labeled Point RDD - posted by anneywarlord <an...@gmail.com> on 2015/05/22 03:19:50 UTC, 1 replies.
- task all finished, while the stage marked finish long time later problem - posted by 邓刚, , 技术中心, , tr...@vipshop.com on 2015/05/22 04:31:49 UTC, 1 replies.
- LDA prediction on new document - posted by Dani Qiu <zo...@gmail.com> on 2015/05/22 05:48:40 UTC, 3 replies.
- Spark Memory management - posted by swaranga <sa...@gmail.com> on 2015/05/22 09:31:47 UTC, 2 replies.
- 回复: 回复: How to use spark to access HBase with Security enabled - posted by donhoff_h <16...@qq.com> on 2015/05/22 09:33:12 UTC, 1 replies.
- MLlib: how to get the best model with only the most significant explanatory variables in LogisticRegressionWithLBFGS or LogisticRegressionWithSGD ? - posted by SparknewUser <me...@gmail.com> on 2015/05/22 10:19:01 UTC, 6 replies.
- Spark with cassandra - posted by lucas <we...@gmail.com> on 2015/05/22 10:21:35 UTC, 0 replies.
- Spark Streaming and Drools - posted by Antonio Giambanco <an...@gmail.com> on 2015/05/22 10:43:28 UTC, 7 replies.
- Partitioning of Dataframes - posted by Karlson <ks...@siberie.de> on 2015/05/22 11:03:18 UTC, 5 replies.
- Re: Issues with constants in Spark HiveQL queries - posted by Skanda <sk...@gmail.com> on 2015/05/22 11:06:21 UTC, 1 replies.
- DataFrame groupBy vs RDD groupBy - posted by gtanguy <g....@gmail.com> on 2015/05/22 12:35:07 UTC, 2 replies.
- Trying to connect to many topics with several DirectConnect - posted by Guillermo Ortiz <ko...@gmail.com> on 2015/05/22 13:50:26 UTC, 3 replies.
- Parallel parameter tuning: distributed execution of MLlib algorithms - posted by Hugo Ferreira <hm...@inesctec.pt> on 2015/05/22 15:15:13 UTC, 1 replies.
- partitioning after extracting from a hive table? - posted by Cesar Flores <ce...@gmail.com> on 2015/05/22 16:02:29 UTC, 1 replies.
- Performance degradation between spark 0.9.3 and 1.3.1 - posted by Shay Seng <sh...@urbanengines.com> on 2015/05/22 18:43:21 UTC, 2 replies.
- Help reading Spark UI tea leaves.. - posted by Shay Seng <sh...@urbanengines.com> on 2015/05/22 18:59:23 UTC, 3 replies.
- How to share a (spring) singleton service with Spark? - posted by Tristan107 <tr...@gmail.com> on 2015/05/22 19:26:06 UTC, 0 replies.
- Why is RDD to PairRDDFunctions only via implicits? - posted by Justin Pihony <ju...@gmail.com> on 2015/05/22 19:26:19 UTC, 2 replies.
- HiveContext fails when querying large external Parquet tables - posted by Andrew Otto <ao...@wikimedia.org> on 2015/05/22 21:51:23 UTC, 2 replies.
- Spark Streaming: all tasks running on one executor (Kinesis + Mongodb) - posted by Mike Trienis <mi...@orcsol.com> on 2015/05/22 22:24:12 UTC, 3 replies.
- spark on Windows 2008 failed to save RDD to windows shared folder - posted by "Wang, Ningjun (LNG-NPV)" <ni...@lexisnexis.com> on 2015/05/22 22:55:24 UTC, 0 replies.
- Re: spark on Windows 2008 failed to save RDD to windows shared folder - posted by Ted Yu <yu...@gmail.com> on 2015/05/22 23:01:36 UTC, 1 replies.
- spark.executor.extraClassPath - Values not picked up by executors - posted by Todd Nist <ts...@gmail.com> on 2015/05/23 00:15:14 UTC, 2 replies.
- Application on standalone cluster never changes state to be stopped - posted by Edward Sargisson <es...@pobox.com> on 2015/05/23 00:22:46 UTC, 0 replies.
- Migrate Relational to Distributed - posted by Brant Seibert <br...@hotmail.com> on 2015/05/23 00:22:46 UTC, 1 replies.
- SparkSQL failing while writing into S3 for 'insert into table' - posted by ogoh <ok...@gmail.com> on 2015/05/23 02:50:13 UTC, 1 replies.
- Dynamic Allocation with Spark Streaming - posted by Saiph Kappa <sa...@gmail.com> on 2015/05/23 03:58:08 UTC, 4 replies.
- SparkSQL query plan to Stage wise breakdown - posted by Pramod Biligiri <pr...@gmail.com> on 2015/05/23 04:50:14 UTC, 1 replies.
- Re: Bigints in pyspark - posted by Davies Liu <da...@databricks.com> on 2015/05/23 05:09:01 UTC, 0 replies.
- 回复: 回复: 回复: How to use spark to access HBase with Security enabled - posted by donhoff_h <16...@qq.com> on 2015/05/23 12:53:23 UTC, 0 replies.
- split function on spark sql created rdd - posted by "Kali.tummala@gmail.com" <Ka...@gmail.com> on 2015/05/23 14:16:44 UTC, 1 replies.
- Is anyone using Amazon EC2? - posted by Joe Wass <jw...@crossref.org> on 2015/05/23 16:20:57 UTC, 4 replies.
- Is anyone using Amazon EC2? (second attempt!) - posted by Joe Wass <jw...@crossref.org> on 2015/05/23 16:24:35 UTC, 2 replies.
- Not able to run SparkPi locally - posted by Sujit Pal <su...@gmail.com> on 2015/05/23 17:14:48 UTC, 1 replies.
- Doubts about SparkSQL - posted by Renato Marroquín Mogrovejo <re...@gmail.com> on 2015/05/23 18:52:37 UTC, 3 replies.
- 回复:spark.executor.extraClassPath - Values not picked up by executors - posted by "wesley.miao" <we...@qq.com> on 2015/05/24 01:41:13 UTC, 0 replies.
- Strange ClassNotFound exeption - posted by boci <bo...@gmail.com> on 2015/05/24 02:05:08 UTC, 3 replies.
- SparkSQL can't read S3 path for hive external table - posted by ogoh <ok...@gmail.com> on 2015/05/24 04:33:32 UTC, 0 replies.
- SparkSQL errors in 1.4 rc when using with Hive 0.12 metastore - posted by Cheolsoo Park <pi...@gmail.com> on 2015/05/24 07:19:47 UTC, 2 replies.
- Spark dramatically slow when I add "saveAsTextFile" - posted by allanjie <al...@gmail.com> on 2015/05/24 10:36:02 UTC, 4 replies.
- how to distributed run a bash shell in spark - posted by lu...@sina.com on 2015/05/24 13:23:00 UTC, 4 replies.
- Help optimizing some spark code - posted by Tal <wu...@gmail.com> on 2015/05/24 19:52:51 UTC, 1 replies.
- Powered by Spark listing - posted by Michael Roberts <mi...@gmail.com> on 2015/05/25 00:40:05 UTC, 0 replies.
- 回复: How to use zookeeper in Spark Streaming - posted by "bit1129@163.com" <bi...@163.com> on 2015/05/25 04:15:40 UTC, 0 replies.
- 回复:Re: how to distributed run a bash shell in spark - posted by lu...@sina.com on 2015/05/25 04:32:26 UTC, 0 replies.
- Re: How to use zookeeper in Spark Streaming - posted by Ted Yu <yu...@gmail.com> on 2015/05/25 04:41:22 UTC, 1 replies.
- Using Spark like a search engine - posted by Сергей Мелехин <cp...@gmail.com> on 2015/05/25 07:58:45 UTC, 5 replies.
- The stage slow when I have for loop inside (Java) - posted by allanjie <al...@gmail.com> on 2015/05/25 08:33:20 UTC, 0 replies.
- Intellij IDEA import spark souce code error - posted by huangzheng <11...@qq.com> on 2015/05/25 08:48:18 UTC, 0 replies.
- Re: Intellij IDEA import spark souce code error - posted by Yi Zhang <zh...@yahoo.com.INVALID> on 2015/05/25 10:35:39 UTC, 0 replies.
- Websphere MQ as a data source for Apache Spark Streaming - posted by umesh9794 <um...@searshc.com> on 2015/05/25 12:13:37 UTC, 4 replies.
- spark sql through java code facing issue - posted by vinayak <vi...@tcs.com> on 2015/05/25 13:30:33 UTC, 0 replies.
- 回复:Re: Re: how to distributed run a bash shell in spark - posted by lu...@sina.com on 2015/05/25 13:30:48 UTC, 0 replies.
- Tasks randomly stall when running on mesos - posted by Reinis Vicups <sp...@orbit-x.de> on 2015/05/25 14:43:37 UTC, 5 replies.
- DataFrame. Conditional aggregation - posted by Masf <ma...@gmail.com> on 2015/05/25 16:24:52 UTC, 5 replies.
- Using Log4j for logging messages inside lambda functions - posted by Spico Florin <sp...@gmail.com> on 2015/05/25 17:05:55 UTC, 3 replies.
- scala.ScalaReflectionException when creating SchemaRDD - posted by vkcelik <vk...@gmail.com> on 2015/05/25 21:01:58 UTC, 1 replies.
- SparkSQL's performance : contacting namenode and datanode to uncessarily check all partitions for a query of specific partitions - posted by ogoh <ok...@gmail.com> on 2015/05/25 21:10:14 UTC, 0 replies.
- Implementing custom RDD in Java - posted by Swaranga Sarma <sa...@gmail.com> on 2015/05/25 23:58:32 UTC, 3 replies.
- 回复:Re: Re: Re: how to distributed run a bash shell in spark - posted by lu...@sina.com on 2015/05/26 04:57:39 UTC, 0 replies.
- Re: Re: is there any easier way to define a custom RDD in Java - posted by swaranga <sa...@gmail.com> on 2015/05/26 05:39:22 UTC, 1 replies.
- Re: Spark SQL High GC time - posted by Nick Travers <n....@gmail.com> on 2015/05/26 07:21:38 UTC, 0 replies.
- Remove COMPLETED applications and shuffle data - posted by sayantini <sa...@gmail.com> on 2015/05/26 08:53:22 UTC, 1 replies.
- Caching parquet table (with GZIP) on Spark 1.3.1 - posted by sh...@tsmc.com on 2015/05/26 09:26:59 UTC, 0 replies.
- Collabrative Filtering - posted by Yasemin Kaya <go...@gmail.com> on 2015/05/26 09:34:29 UTC, 0 replies.
- How does spark manage the memory of executor with multiple tasks - posted by canan chen <cc...@gmail.com> on 2015/05/26 10:02:21 UTC, 9 replies.
- Roadmap for Spark with Kafka on Scala 2.11? - posted by algermissen1971 <al...@icloud.com> on 2015/05/26 10:09:32 UTC, 1 replies.
- spark-streaming-kafka_2.11 not available yet? - posted by Petr Novak <os...@gmail.com> on 2015/05/26 10:14:28 UTC, 1 replies.
- Re: HiveContext test, "Spark Context did not initialize after waiting 10000ms" - posted by Mohammad Islam <mi...@yahoo.com.INVALID> on 2015/05/26 10:14:33 UTC, 1 replies.
- 回复:Re: Re: Re: Re: how to distributed run a bash shell in spark - posted by lu...@sina.com on 2015/05/26 10:30:28 UTC, 1 replies.
- How many executors can I acquire in standalone mode ? - posted by canan chen <cc...@gmail.com> on 2015/05/26 11:34:19 UTC, 3 replies.
- Apache Spark application deployment best practices - posted by lucas1000001 <ma...@gmail.com> on 2015/05/26 12:58:31 UTC, 2 replies.
- SparkR Jobs Hanging in collectPartitions - posted by "Eskilson,Aleksander" <Al...@Cerner.com> on 2015/05/26 16:28:15 UTC, 3 replies.
- Recommended Scala version - posted by Punyashloka Biswal <pu...@gmail.com> on 2015/05/26 16:51:43 UTC, 8 replies.
- DataFrame.explode produces field with wrong type. - posted by Eugene Morozov <fa...@list.ru> on 2015/05/26 17:46:08 UTC, 0 replies.
- process independent columns with same operations - posted by Laeeq Ahmed <la...@yahoo.com.INVALID> on 2015/05/26 18:33:08 UTC, 0 replies.
- Running Javascript from scala spark - posted by marcos rebelo <ol...@gmail.com> on 2015/05/26 19:17:51 UTC, 6 replies.
- PySpark Unknown Opcode Error - posted by Nikhil Muralidhar <nm...@gmail.com> on 2015/05/26 20:21:12 UTC, 1 replies.
- Accumulators in Spark Streaming on UI - posted by Snehal Nagmote <na...@gmail.com> on 2015/05/26 20:23:50 UTC, 1 replies.
- Spark unknown OpCode Error - posted by Nikhil Muralidhar <nm...@gmail.com> on 2015/05/26 20:50:19 UTC, 0 replies.
- Need some Cassandra integration help - posted by Yana Kadiyska <ya...@gmail.com> on 2015/05/26 22:31:28 UTC, 0 replies.
- Building scaladoc using "build/sbt unidoc" failure - posted by Justin Yip <yi...@prediction.io> on 2015/05/27 01:45:03 UTC, 0 replies.
- Spark Streming yarn-cluster Mode Off-heap Memory Is Constantly Growing - posted by Ji ZHANG <zh...@gmail.com> on 2015/05/27 08:21:48 UTC, 8 replies.
- How to give multiple directories as input ? - posted by ๏̯͡๏ <ÐΞ€ρ@Ҝ>, de...@gmail.com on 2015/05/27 08:21:48 UTC, 6 replies.
- Is the executor number fixed during the lifetime of one app ? - posted by canan chen <cc...@gmail.com> on 2015/05/27 08:44:17 UTC, 6 replies.
- Inconsistent behavior with Dataframe Timestamp between 1.3.1 and 1.4.0 - posted by Justin Yip <yi...@prediction.io> on 2015/05/27 09:57:17 UTC, 0 replies.
- Avro CombineInputFormat ? - posted by ๏̯͡๏ <ÐΞ€ρ@Ҝ>, de...@gmail.com on 2015/05/27 10:56:02 UTC, 0 replies.
- [POWERED BY] Please add our organization - posted by Antonio Giambanco <an...@gmail.com> on 2015/05/27 14:24:11 UTC, 0 replies.
- problem for mark stage finished? - posted by 邓刚, , 技术中心, , tr...@vipshop.com on 2015/05/27 14:33:37 UTC, 1 replies.
- Where does partitioning and data loading happen? - posted by Stephen Carman <sc...@coldlight.com> on 2015/05/27 16:22:06 UTC, 1 replies.
- Spark and logging - posted by dgoldenberg <dg...@gmail.com> on 2015/05/27 16:38:31 UTC, 1 replies.
- Decision Trees / Random Forest Multiple Labels - posted by cfusting <cf...@gmail.com> on 2015/05/27 16:47:48 UTC, 0 replies.
- Adding columns to DataFrame - posted by Masf <ma...@gmail.com> on 2015/05/27 17:02:56 UTC, 1 replies.
- hive external metastore connection timeout - posted by jamborta <ja...@gmail.com> on 2015/05/27 17:03:07 UTC, 1 replies.
- How to get the best performance with LogisticRegressionWithSGD? - posted by mélanie gallois <me...@gmail.com> on 2015/05/27 17:18:01 UTC, 4 replies.
- Multilabel classification using logistic regression - posted by peterg <pe...@garbers.me> on 2015/05/27 17:53:29 UTC, 2 replies.
- Re: Spark partition issue with Stanford NLP - posted by vishalvibhandik <vi...@gmail.com> on 2015/05/27 19:23:47 UTC, 1 replies.
- Re: Spark and Stanford CoreNLP - posted by mathewvinoj <vi...@hotmail.com> on 2015/05/27 20:17:06 UTC, 0 replies.
- Invoking Hive UDF programmatically - posted by Punyashloka Biswal <pu...@gmail.com> on 2015/05/27 21:01:52 UTC, 0 replies.
- debug jsonRDD problem? - posted by Michael Stone <ms...@mathom.us> on 2015/05/27 21:33:51 UTC, 4 replies.
- RDD boundaries and triggering processing using tags in the data - posted by David Webber <da...@gmail.com> on 2015/05/27 21:52:57 UTC, 0 replies.
- SF / East Bay Area Stream Processing Meetup next Thursday (6/4) - posted by Siva Jagadeesan <si...@gmail.com> on 2015/05/27 23:35:13 UTC, 0 replies.
- Spark Streaming from Kafka - no receivers and spark.streaming.receiver.maxRate? - posted by dgoldenberg <dg...@gmail.com> on 2015/05/28 01:11:26 UTC, 3 replies.
- Autoscaling Spark cluster based on topic sizes/rate of growth in Kafka or Spark's metrics? - posted by dgoldenberg <dg...@gmail.com> on 2015/05/28 01:21:14 UTC, 15 replies.
- Value for SPARK_EXECUTOR_CORES - posted by Mulugeta Mammo <mu...@gmail.com> on 2015/05/28 02:46:48 UTC, 3 replies.
- Pointing SparkSQL to existing Hive Metadata with data file locations in HDFS - posted by Sanjay Subramanian <sa...@yahoo.com.INVALID> on 2015/05/28 02:52:36 UTC, 4 replies.
- Fwd: Model weights of linear regression becomes abnormal values - posted by Maheshakya Wijewardena <ma...@wso2.com> on 2015/05/28 06:08:47 UTC, 4 replies.
- How to use Eclipse on Windows to build Spark environment? - posted by Nan Xiao <xi...@gmail.com> on 2015/05/28 08:27:06 UTC, 3 replies.
- Adding slaves on spark standalone on ec2 - posted by nizang <ni...@windward.eu> on 2015/05/28 08:59:15 UTC, 3 replies.
- DataFrame nested sctructure selection limit - posted by Eugene Morozov <fa...@list.ru> on 2015/05/28 10:01:20 UTC, 0 replies.
- Spark Cassandra - posted by lucas <we...@gmail.com> on 2015/05/28 10:49:00 UTC, 0 replies.
- Get all servers in security group in bash(ec2) - posted by nizang <ni...@windward.eu> on 2015/05/28 10:52:20 UTC, 1 replies.
- why does "com.esotericsoftware.kryo.KryoException: java.u til.ConcurrentModificationException" happen? - posted by randylu <ra...@gmail.com> on 2015/05/28 11:14:34 UTC, 2 replies.
- SPARK STREAMING PROBLEM - posted by Animesh Baranawal <an...@gmail.com> on 2015/05/28 15:27:36 UTC, 3 replies.
- Spark streaming with kafka - posted by boci <bo...@gmail.com> on 2015/05/28 15:33:37 UTC, 1 replies.
- Soft distinct on data frames. - posted by Jan-Paul Bultmann <ja...@me.com> on 2015/05/28 15:43:59 UTC, 0 replies.
- [Streaming] Configure executor logging on Mesos - posted by Gerard Maas <ge...@gmail.com> on 2015/05/28 15:46:20 UTC, 4 replies.
- Dataframe Partitioning - posted by Masf <ma...@gmail.com> on 2015/05/28 16:02:25 UTC, 1 replies.
- Spark SQL v MemSQL/Voltdb - posted by Ashish Mukherjee <as...@gmail.com> on 2015/05/28 16:48:06 UTC, 3 replies.
- Best practice to update a MongoDB document from Sparks - posted by ni...@free.fr on 2015/05/28 16:49:33 UTC, 0 replies.
- Loading CSV to DataFrame and saving it into Parquet for speedup - posted by M Rez <mm...@gmail.com> on 2015/05/28 17:32:18 UTC, 0 replies.
- PySpark with OpenCV causes python worker to crash - posted by Sam Stoelinga <sa...@gmail.com> on 2015/05/28 17:33:19 UTC, 3 replies.
- Spark1.3.1 build issue with CDH5.4.0 getUnknownFields - posted by Abhishek Tripathi <tr...@gmail.com> on 2015/05/28 19:22:00 UTC, 4 replies.
- Batch aggregation by sliding window + join - posted by "igor.berman" <ig...@gmail.com> on 2015/05/28 19:31:06 UTC, 4 replies.
- Hyperthreading - posted by Mulugeta Mammo <mu...@gmail.com> on 2015/05/28 19:39:42 UTC, 0 replies.
- spark sql lateral view unresolved attribute exception - posted by weoccc <we...@gmail.com> on 2015/05/28 20:23:29 UTC, 0 replies.
- yarn-cluster spark-submit process not dying - posted by Corey Nolet <cj...@gmail.com> on 2015/05/28 20:48:23 UTC, 2 replies.
- Add Custom Aggregate Column to Spark DataFrame - posted by calstad <co...@gmail.com> on 2015/05/28 20:57:07 UTC, 0 replies.
- spark submit debugging - posted by boci <bo...@gmail.com> on 2015/05/28 22:13:56 UTC, 0 replies.
- Adding an indexed column - posted by Cesar Flores <ce...@gmail.com> on 2015/05/28 23:43:58 UTC, 2 replies.
- spark java.io.FileNotFoundException: /user/spark/applicationHistory/application - posted by roy <rp...@njit.edu> on 2015/05/29 00:06:58 UTC, 1 replies.
- UDF accessing hive struct array fails with buffer underflow from kryo - posted by yluo <yl...@groupon.com> on 2015/05/29 00:21:08 UTC, 0 replies.
- spark mlib variance analysis - posted by rafac <ra...@hotmail.com> on 2015/05/29 00:47:44 UTC, 0 replies.
- Twitter Streaming HTTP 401 Error - posted by tracynj <tr...@gmail.com> on 2015/05/29 01:28:44 UTC, 0 replies.
- Registering Custom metrics [Spark-Streaming-monitoring] - posted by Snehal Nagmote <na...@gmail.com> on 2015/05/29 05:45:52 UTC, 0 replies.
- Execption writing on two cassandra tables NoHostAvailableException: All host(s) tried for query failed (no host was tried) - posted by Antonio Giambanco <an...@gmail.com> on 2015/05/29 12:11:06 UTC, 2 replies.
- Spark Executor Memory Usage - posted by Valerii Moisieienko <va...@gmail.com> on 2015/05/29 15:56:47 UTC, 1 replies.
- dataframe cumulative sum - posted by Cesar Flores <ce...@gmail.com> on 2015/05/29 17:09:15 UTC, 1 replies.
- Python implementation of RDD interface - posted by Sven Kreiss <sk...@svenkreiss.com> on 2015/05/29 17:29:50 UTC, 3 replies.
- Anybody using Spark SQL JDBC server with DSE Cassandra? - posted by Mohammed Guller <mo...@glassbeam.com> on 2015/05/29 20:48:34 UTC, 0 replies.
- spark-sql errors - posted by Sanjay Subramanian <sa...@yahoo.com.INVALID> on 2015/05/30 02:29:21 UTC, 1 replies.
- Format RDD/SchemaRDD contents to screen? - posted by Minnow Noir <mi...@gmail.com> on 2015/05/30 02:33:50 UTC, 1 replies.
- Security,authorization and governance - posted by "Phani Yadavilli -X (pyadavil)" <py...@cisco.com> on 2015/05/30 07:33:36 UTC, 0 replies.
- Re: How Broadcast variable works - posted by "bit1129@163.com" <bi...@163.com> on 2015/05/30 08:11:26 UTC, 1 replies.
- I see two countByKey stages - posted by ๏̯͡๏ <ÐΞ€ρ@Ҝ>, de...@gmail.com on 2015/05/30 13:46:18 UTC, 0 replies.
- Why is my performance on local really slow? - posted by Tal <wu...@gmail.com> on 2015/05/30 23:16:23 UTC, 0 replies.
- import CSV file using read.csv - posted by sherine ahmed <sh...@hotmail.com> on 2015/05/30 23:20:41 UTC, 1 replies.
- Re: Question regarding spark data partition and coalesce. Need info on my use case. - posted by firemonk9 <dh...@gmail.com> on 2015/05/31 05:01:21 UTC, 0 replies.
- RDD staleness - posted by Ashish Mukherjee <as...@gmail.com> on 2015/05/31 14:11:01 UTC, 1 replies.
- data localisation in spark - posted by Shushant Arora <sh...@gmail.com> on 2015/05/31 16:24:56 UTC, 0 replies.
- union and reduceByKey wrong shuffle? - posted by "igor.berman" <ig...@gmail.com> on 2015/05/31 17:33:13 UTC, 3 replies.