You are viewing a plain text version of this content. The canonical link for it is here.
- Re: [VOTE] Release Apache Spark 1.4.1 - posted by Reynold Xin <rx...@databricks.com> on 2015/07/01 02:27:23 UTC, 10 replies.
- Re: HyperLogLogUDT - posted by Nick Pentreath <ni...@gmail.com> on 2015/07/01 17:26:24 UTC, 3 replies.
- Re: enum-like types in Spark - posted by Stephen Boesch <ja...@gmail.com> on 2015/07/02 00:53:49 UTC, 1 replies.
- [pyspark] What is the best way to run a minimum unit testing related to our developing module? - posted by Yu Ishikawa <yu...@gmail.com> on 2015/07/02 06:10:16 UTC, 3 replies.
- Size of RDD partitions - posted by "prateek3.14" <pr...@gmail.com> on 2015/07/02 15:24:52 UTC, 0 replies.
- [SPARK-8794] [SQL] PrunedScan problem - posted by Eron Wright <ew...@live.com> on 2015/07/02 18:03:04 UTC, 0 replies.
- A proposal for Test matrix decompositions for speed/stability (SPARK-7210) - posted by Chris Harvey <ct...@gmail.com> on 2015/07/02 18:14:54 UTC, 0 replies.
- Re: Grouping runs of elements in a RDD - posted by Mohit Jaggi <mo...@gmail.com> on 2015/07/02 19:27:43 UTC, 1 replies.
- Differential Equation Spark Solver - posted by jamaica <my...@gmail.com> on 2015/07/03 08:11:56 UTC, 0 replies.
- except vs subtract - posted by Krishna Sankar <ks...@gmail.com> on 2015/07/03 08:54:58 UTC, 2 replies.
- [SparkSQL 1.4.0]The result of SUM(xxx) in SparkSQL is 0.0 but not null when the column xxx is all null - posted by StanZhai <ma...@zhaishidan.cn> on 2015/07/03 08:58:50 UTC, 1 replies.
- SparkSqlSerializer2 - posted by Zoltán Zvara <zo...@gmail.com> on 2015/07/03 16:34:57 UTC, 0 replies.
- Should spark-ec2 get its own repo? - posted by Nicholas Chammas <ni...@gmail.com> on 2015/07/03 19:23:29 UTC, 22 replies.
- [RESULT] [VOTE] Release Apache Spark 1.4.1 - posted by Patrick Wendell <pw...@gmail.com> on 2015/07/03 22:12:21 UTC, 0 replies.
- Can not build master - posted by Tarek Auel <ta...@gmail.com> on 2015/07/03 22:13:19 UTC, 15 replies.
- [VOTE] Release Apache Spark 1.4.1 (RC2) - posted by Patrick Wendell <pw...@gmail.com> on 2015/07/03 22:15:42 UTC, 9 replies.
- Re: Error in invoking a custom StandaloneRecoveryModeFactory in java env (Spark v1.3.0) - posted by Niranda Perera <ni...@gmail.com> on 2015/07/05 13:38:22 UTC, 1 replies.
- [SparkScore]Performance portal for Apache Spark - WW27 - posted by "Huang, Jie" <ji...@intel.com> on 2015/07/06 03:01:07 UTC, 2 replies.
- asf git merge currently not working - posted by Reynold Xin <rx...@databricks.com> on 2015/07/06 22:06:56 UTC, 1 replies.
- Re: Unable to add to roles in JIRA - posted by Sean Owen <so...@cloudera.com> on 2015/07/07 12:05:22 UTC, 3 replies.
- TableScan vs PrunedScan - posted by Gil Vernik <GI...@il.ibm.com> on 2015/07/07 12:12:24 UTC, 1 replies.
- Regarding master node failure - posted by swetha <sw...@gmail.com> on 2015/07/07 18:47:02 UTC, 0 replies.
- [RESULT] [VOTE] Release Apache Spark 1.4.1 (RC2) - posted by Patrick Wendell <pw...@gmail.com> on 2015/07/07 20:59:07 UTC, 0 replies.
- [VOTE] Release Apache Spark 1.4.1 (RC3) - posted by Patrick Wendell <pw...@gmail.com> on 2015/07/07 21:06:58 UTC, 13 replies.
- Data interaction between various RDDs in Spark Streaming - posted by swetha <sw...@gmail.com> on 2015/07/07 21:35:55 UTC, 1 replies.
- spark - redshift !!! - posted by spark user <sp...@yahoo.com.INVALID> on 2015/07/08 00:57:34 UTC, 1 replies.
- thrift server reliability issue - posted by Judy Nash <ju...@exchange.microsoft.com> on 2015/07/08 05:53:20 UTC, 1 replies.
- Spark job hangs when History server events are written to hdfs - posted by Pankaj Arora <Pa...@guavus.com> on 2015/07/08 07:25:57 UTC, 0 replies.
- Re: Spark job hangs when History server events are written to hdfs - posted by Akhil Das <ak...@sigmoidanalytics.com> on 2015/07/08 09:40:06 UTC, 2 replies.
- What steps to take to work on [Spark-8899] issue? - posted by Chandrashekhar Kotekar <sh...@gmail.com> on 2015/07/08 20:40:37 UTC, 4 replies.
- Helpful IntelliJ shortcuts for working with Javadoc / Scaladoc - posted by Josh Rosen <jo...@databricks.com> on 2015/07/09 00:57:38 UTC, 2 replies.
- Code movements from Driver to Workers - posted by Eugene Morozov <fa...@list.ru> on 2015/07/09 01:13:14 UTC, 0 replies.
- Why are all spark deps not shaded to avoid dependency hell? - posted by ankits <an...@gmail.com> on 2015/07/09 04:14:01 UTC, 0 replies.
- [RESULT] [VOTE] Release Apache Spark 1.4.1 (RC3) - posted by Patrick Wendell <pw...@gmail.com> on 2015/07/09 07:53:19 UTC, 0 replies.
- [VOTE] Release Apache Spark 1.4.1 (RC4) - posted by Patrick Wendell <pw...@gmail.com> on 2015/07/09 07:55:16 UTC, 15 replies.
- Spark and Haskell support - posted by "Vasili I. Galchin" <vi...@gmail.com> on 2015/07/09 08:19:36 UTC, 0 replies.
- Questions about Fault tolerance of Spark - posted by 牛兆捷 <nz...@gmail.com> on 2015/07/09 10:19:26 UTC, 2 replies.
- databases currently supported by Spark SQL JDBC - posted by Niranda Perera <ni...@gmail.com> on 2015/07/09 14:09:47 UTC, 1 replies.
- The latest master branch didn't compile with -Phive? - posted by Yijie Shen <he...@gmail.com> on 2015/07/09 16:24:59 UTC, 7 replies.
- spark-ec2 Fails installing ganglia properly in 1.4 - posted by Pradeep Bashyal <pr...@bashyal.com> on 2015/07/09 18:21:06 UTC, 2 replies.
- Are These Issues Suitable for our Senior Project? - posted by emrehan <em...@gmail.com> on 2015/07/09 22:04:34 UTC, 5 replies.
- jenkins downtime 7/13/15, 7am PDT - posted by shane knapp <sk...@berkeley.edu> on 2015/07/09 22:07:39 UTC, 7 replies.
- callJMethod? - posted by "Vasili I. Galchin" <vi...@gmail.com> on 2015/07/10 01:28:06 UTC, 3 replies.
- Re: Spark ThriftServer encounter java.lang.IllegalArgumentException: Unknown auth type: null Allowed values are: [auth-int, auth-conf, auth] - posted by gogototo <wa...@gmail.com> on 2015/07/10 06:57:11 UTC, 0 replies.
- PySpark vs R - posted by "Vasili I. Galchin" <vi...@gmail.com> on 2015/07/10 07:40:53 UTC, 1 replies.
- language-independent RDD Spark core code? - posted by "Vasili I. Galchin" <vi...@gmail.com> on 2015/07/11 04:00:17 UTC, 1 replies.
- Model parallelism with RDD - posted by "Ulanov, Alexander" <al...@hp.com> on 2015/07/11 04:44:32 UTC, 7 replies.
- Foundation policy on releases and Spark nightly builds - posted by Sean Busbey <bu...@cloudera.com> on 2015/07/11 06:34:49 UTC, 19 replies.
- Spark application examples - posted by "Vasili I. Galchin" <vi...@gmail.com> on 2015/07/12 05:57:53 UTC, 0 replies.
- Re: [PySpark DataFrame] When a Row is not a Row - posted by Jerry Lam <ch...@gmail.com> on 2015/07/12 07:32:37 UTC, 1 replies.
- question related partitions of the DataFrame - posted by Gil Vernik <GI...@il.ibm.com> on 2015/07/12 12:05:57 UTC, 2 replies.
- Spark master broken? - posted by René Treffer <rt...@gmail.com> on 2015/07/12 12:49:21 UTC, 3 replies.
- Spark development under Windows - posted by Olivier Delalleau <sh...@keba.be> on 2015/07/12 18:37:15 UTC, 0 replies.
- pyspark.sql.tests: is test_time_with_timezone a flaky test? - posted by Cheolsoo Park <pi...@gmail.com> on 2015/07/12 22:33:42 UTC, 3 replies.
- ./dev/run-tests fail on master - posted by Xiaoyu Ma <hz...@corp.netease.com> on 2015/07/13 05:26:28 UTC, 2 replies.
- [RESULT] [VOTE] Release Apache Spark 1.4.1 (RC4) - posted by Patrick Wendell <pw...@gmail.com> on 2015/07/13 08:18:58 UTC, 0 replies.
- RandomForest evaluator for grid search - posted by Olivier Girardot <o....@lateral-thoughts.com> on 2015/07/13 11:16:29 UTC, 4 replies.
- Contributiona nd choice of langauge - posted by srinivasraghavansr71 <sr...@gmail.com> on 2015/07/13 13:59:53 UTC, 5 replies.
- How to Read Excel file in Spark 1.4 - posted by spark user <sp...@yahoo.com.INVALID> on 2015/07/13 18:22:13 UTC, 2 replies.
- Joining Apache Spark - posted by Animesh Tripathy <a....@gmail.com> on 2015/07/14 00:58:17 UTC, 4 replies.
- [SparkScore] Performance portal for Apache Spark - WW28 - posted by "Huang, Jie" <ji...@intel.com> on 2015/07/14 02:45:34 UTC, 0 replies.
- BlockMatrix multiplication - posted by "Ulanov, Alexander" <al...@hp.com> on 2015/07/14 03:01:17 UTC, 10 replies.
- Spark Core and ways of "talking" to it for enhancing application language support - posted by "Vasili I. Galchin" <vi...@gmail.com> on 2015/07/14 05:51:12 UTC, 2 replies.
- RDD checkpoint - posted by 牛兆捷 <nz...@gmail.com> on 2015/07/14 07:35:39 UTC, 0 replies.
- problems with build of latest the master - posted by Gil Vernik <GI...@il.ibm.com> on 2015/07/14 11:23:15 UTC, 11 replies.
- Regarding sessionization with updateStateByKey - posted by swetha <sw...@gmail.com> on 2015/07/15 00:32:55 UTC, 0 replies.
- Re: Does RDD checkpointing store the entire state in HDFS? - posted by swetha <sw...@gmail.com> on 2015/07/15 01:11:34 UTC, 3 replies.
- RestSubmissionClient Basic Auth - posted by Joel Zambrano <jo...@microsoft.com> on 2015/07/15 01:21:39 UTC, 3 replies.
- PySpark GroupByKey implementation question - posted by Matt Cheah <mc...@palantir.com> on 2015/07/15 04:11:56 UTC, 3 replies.
- Expression.resolved unmatched with the correct values in catalyst? - posted by Takeshi Yamamuro <li...@gmail.com> on 2015/07/15 09:47:34 UTC, 3 replies.
- Spark-SQL parameters like shuffle.partitions should be stored in the lineage - posted by "daniel.mescheder" <da...@realimpactanalytics.com> on 2015/07/15 14:49:19 UTC, 0 replies.
- Record metadata with RDDs and DataFrames - posted by RJ Nowling <rn...@gmail.com> on 2015/07/15 19:31:28 UTC, 3 replies.
- Use of non-standard LIMIT keyword in JDBC tableExists code - posted by Bob Beauchemin <bo...@sqlskills.com> on 2015/07/15 20:29:11 UTC, 2 replies.
- Slight API incompatibility caused by SPARK-4072 - posted by Marcelo Vanzin <va...@cloudera.com> on 2015/07/15 20:47:51 UTC, 5 replies.
- Re: Unable to use dynamicAllocation if spark.executor.instances is set in spark-defaults.conf - posted by "Kelly, Jonathan" <jo...@amazon.com> on 2015/07/15 23:22:38 UTC, 0 replies.
- Announcing Spark 1.4.1! - posted by Patrick Wendell <pw...@gmail.com> on 2015/07/15 23:48:28 UTC, 0 replies.
- S3 Read / Write makes executors deadlocked - posted by Hao Ren <in...@gmail.com> on 2015/07/16 11:39:49 UTC, 1 replies.
- Apache gives exception when running groupby on df temp table - posted by nipun <ib...@gmail.com> on 2015/07/16 14:31:03 UTC, 4 replies.
- why doesn't jenkins like me? - posted by Steve Loughran <st...@hortonworks.com> on 2015/07/16 14:44:05 UTC, 1 replies.
- KryoSerializer gives class cast exception - posted by Eugene Morozov <fa...@list.ru> on 2015/07/16 16:57:24 UTC, 2 replies.
- Hive Table with large number of partitions - posted by Xiaoyu Ma <hz...@corp.netease.com> on 2015/07/17 12:26:06 UTC, 1 replies.
- [discuss] Removing individual commit messages from the squash commit message - posted by Reynold Xin <rx...@databricks.com> on 2015/07/18 09:48:52 UTC, 9 replies.
- Writing to multiple outputs in Spark - posted by Silas Davis <si...@silasdavis.net> on 2015/07/18 18:24:45 UTC, 0 replies.
- If gmail, check sparm - posted by Mridul Muralidharan <mr...@gmail.com> on 2015/07/18 19:25:10 UTC, 2 replies.
- Dynamic resource allocation in Standalone mode - posted by Dogtail Ray <sp...@gmail.com> on 2015/07/19 03:47:18 UTC, 1 replies.
- Compact RDD representation - posted by Сергей Лихоман <se...@gmail.com> on 2015/07/19 19:40:23 UTC, 9 replies.
- KinesisStreamSuite failing in master branch - posted by Ted Yu <yu...@gmail.com> on 2015/07/20 02:32:40 UTC, 5 replies.
- [SparkScore] Performance portal for Apache Spark - WW29 - posted by "Huang, Jie" <ji...@intel.com> on 2015/07/20 02:51:29 UTC, 0 replies.
- What is the reason there is no out of the box sortByValue API? - posted by suyog choudhari <su...@gmail.com> on 2015/07/20 03:14:45 UTC, 0 replies.
- Re: Spark Mesos Dispatcher - posted by Jerry Lam <ch...@gmail.com> on 2015/07/20 04:57:05 UTC, 1 replies.
- countByValue on dataframe with multiple columns - posted by Olivier Girardot <o....@lateral-thoughts.com> on 2015/07/20 11:28:49 UTC, 11 replies.
- Worker memory leaks? - posted by Richard Marscher <rm...@localytics.com> on 2015/07/20 18:56:41 UTC, 3 replies.
- Silly question about building Spark 1.4.1 - posted by Michael Segel <ms...@hotmail.com> on 2015/07/20 21:26:40 UTC, 0 replies.
- Make off-heap store pluggable - posted by Alexey Goncharuk <al...@gmail.com> on 2015/07/20 23:16:56 UTC, 9 replies.
- -Phive-thriftserver when compiling for use in pyspark and JDBC connections - posted by Aaron <aa...@gmail.com> on 2015/07/22 02:27:46 UTC, 0 replies.
- What is the difference between SlowSparkPullRequestBuilder and SparkPullRequestBuilder? - posted by Yu Ishikawa <yu...@gmail.com> on 2015/07/22 07:47:50 UTC, 2 replies.
- Deleted unreleased version 1.6.0 from JIRA by mistake - posted by Cheng Lian <li...@databricks.com> on 2015/07/22 10:35:22 UTC, 0 replies.
- Fixed number of partitions in RangePartitioner - posted by Sergio Ramírez <sr...@ugr.es> on 2015/07/22 11:37:18 UTC, 0 replies.
- Re: PySpark on PyPi - posted by Justin Uang <ju...@gmail.com> on 2015/07/22 18:49:21 UTC, 3 replies.
- Package Release Annoucement: Spark SQL on HBase "Astro" - posted by "Bing Xiao (Bing)" <bi...@huawei.com> on 2015/07/23 01:53:28 UTC, 5 replies.
- PySpark addPyFile for directories - posted by Pedro Rodriguez <sk...@gmail.com> on 2015/07/23 02:22:17 UTC, 0 replies.
- Where to find Spark-project-hive - posted by Xiaoyu Ma <hz...@corp.netease.com> on 2015/07/23 03:57:33 UTC, 2 replies.
- non-deprecation compiler warnings are upgraded to build errors now - posted by Reynold Xin <rx...@databricks.com> on 2015/07/23 06:08:13 UTC, 7 replies.
- Shouldn't SparseVector constructor give error when declared number of elements less than array lenght? - posted by Andrew Vykhodtsev <yo...@gmail.com> on 2015/07/23 12:52:05 UTC, 2 replies.
- Fwd: posts are not accepted - posted by "Mattmann, Chris A (3980)" <ch...@jpl.nasa.gov> on 2015/07/23 22:43:05 UTC, 0 replies.
- Re: [ANNOUNCE] Nightly maven and package builds for Spark - posted by Bharath Ravi Kumar <re...@gmail.com> on 2015/07/24 09:51:41 UTC, 3 replies.
- review SPARK-8730 - posted by Eugen Cepoi <ce...@gmail.com> on 2015/07/24 13:03:44 UTC, 2 replies.
- Policy around backporting bug fixes - posted by Patrick Wendell <pw...@gmail.com> on 2015/07/24 21:57:28 UTC, 1 replies.
- jenkins failing on Kinesis shard limits - posted by Steve Loughran <st...@hortonworks.com> on 2015/07/25 01:05:21 UTC, 4 replies.
- Jenkins HiveCompatibilitySuite Test Failures - posted by Calvin Jia <ji...@gmail.com> on 2015/07/25 01:43:13 UTC, 0 replies.
- Protocol for build breaks - posted by Patrick Wendell <pw...@gmail.com> on 2015/07/25 07:59:52 UTC, 0 replies.
- ReceiverStream SPARK not able to cope up with 20,000 events /sec . - posted by anshu shukla <an...@gmail.com> on 2015/07/25 11:59:08 UTC, 1 replies.
- Parallelism of Custom receiver in spark - posted by anshu shukla <an...@gmail.com> on 2015/07/25 19:43:26 UTC, 0 replies.
- Log For in[put rate value in streaming statistics - posted by anshu shukla <an...@gmail.com> on 2015/07/26 00:34:57 UTC, 0 replies.
- Confidence in implicit factorization - posted by Debasish Das <de...@gmail.com> on 2015/07/26 07:45:17 UTC, 7 replies.
- 回复: Asked to remove non-existent executor exception - posted by Sea <26...@qq.com> on 2015/07/26 18:57:54 UTC, 0 replies.
- Writing streaming data to cassandra creates duplicates - posted by Priya Ch <le...@gmail.com> on 2015/07/26 20:19:54 UTC, 3 replies.
- Re: Asked to remove non-existent executor exception - posted by Mridul Muralidharan <mr...@gmail.com> on 2015/07/27 00:28:19 UTC, 1 replies.
- [SparkScore]Performance portal for Apache Spark - WW30 - posted by "Huang, Jie" <ji...@intel.com> on 2015/07/27 05:10:27 UTC, 0 replies.
- Is `dev/lint-python` broken? - posted by Yu Ishikawa <yu...@gmail.com> on 2015/07/27 15:29:06 UTC, 7 replies.
- Two joins in GraphX Pregel implementation - posted by "Ulanov, Alexander" <al...@hp.com> on 2015/07/27 17:42:58 UTC, 4 replies.
- Ever increasing physical memory for a Spark Application in YARN - posted by Nitin Goyal <ni...@gmail.com> on 2015/07/27 18:08:49 UTC, 0 replies.
- Converting DataFrame to RDD of case class - posted by Vyacheslav Baranov <sl...@gmail.com> on 2015/07/27 20:23:12 UTC, 2 replies.
- "Spree": Live-updating web UI for Spark - posted by Ryan Williams <ry...@gmail.com> on 2015/07/27 23:59:31 UTC, 2 replies.
- dynamically update the master list of a worker or a spark context - posted by Niranda Perera <ni...@gmail.com> on 2015/07/28 07:01:06 UTC, 0 replies.
- Custom UDFs with zero parameters support - posted by Sachith Withana <sw...@gmail.com> on 2015/07/28 11:15:19 UTC, 7 replies.
- DataFrame#rdd doesn't respect DataFrame#cache, slowing down CrossValidator - posted by Justin Uang <ju...@gmail.com> on 2015/07/28 11:36:35 UTC, 4 replies.
- [Spark SQL]Could not read parquet table after recreating it with the same table name - posted by StanZhai <ma...@zhaishidan.cn> on 2015/07/28 14:32:53 UTC, 0 replies.
- ReceiverTrackerSuite failing in master build - posted by Ted Yu <yu...@gmail.com> on 2015/07/28 17:25:17 UTC, 1 replies.
- Generalised Spark-HBase integration - posted by Michal Haris <mi...@visualdna.com> on 2015/07/28 17:59:45 UTC, 9 replies.
- Broadcast variable of size 1 GB fails with negative memory exception - posted by Mike Hynes <91...@gmail.com> on 2015/07/28 19:37:09 UTC, 4 replies.
- update on git timeouts for jenkins builds - posted by shane knapp <sk...@berkeley.edu> on 2015/07/28 20:51:56 UTC, 5 replies.
- Opinion on spark-class script simplification and posix compliance - posted by Félix-Antoine Fortin <fe...@calculquebec.ca> on 2015/07/28 21:13:52 UTC, 1 replies.
- Rebase and Squash Commits to Revise PR? - posted by Meihua Wu <ro...@gmail.com> on 2015/07/28 22:46:48 UTC, 2 replies.
- Reminder about Spark 1.5.0 code freeze deadline of Aug 1st - posted by Reynold Xin <rx...@databricks.com> on 2015/07/29 07:46:43 UTC, 1 replies.
- Spark (1.2) yarn allocator does not remove container request for allocated container, resulting in a bloated ask[] of containers and inefficient resource utilization of cluster resources. - posted by prakhar jauhari <pr...@gmail.com> on 2015/07/29 09:55:23 UTC, 1 replies.
- unit test failure for hive query - posted by JaeSung Jun <ja...@gmail.com> on 2015/07/30 04:02:20 UTC, 1 replies.
- Machine learning unit tests guidelines - posted by "Ulanov, Alexander" <al...@hp.com> on 2015/07/30 10:58:02 UTC, 0 replies.
- UDF Method overloading - posted by Sachith Withana <sw...@gmail.com> on 2015/07/30 12:00:46 UTC, 1 replies.
- Data source aliasing - posted by Joseph Batchik <jo...@gmail.com> on 2015/07/30 18:44:37 UTC, 4 replies.
- FrequentItems in spark-sql-execution-stat - posted by Yucheng <yl...@nyu.edu> on 2015/07/30 22:26:49 UTC, 1 replies.
- Parquet SaveMode.Append Trouble. - posted by satyajit vegesna <sa...@gmail.com> on 2015/07/31 00:26:51 UTC, 1 replies.
- High availability with zookeeper: worker discovery - posted by Christophe Schmitz <co...@gmail.com> on 2015/07/31 04:41:04 UTC, 2 replies.
- add to user list - posted by Sachin Aggarwal <di...@gmail.com> on 2015/07/31 06:23:16 UTC, 1 replies.
- Spark CBO - posted by burakkk <bu...@gmail.com> on 2015/07/31 08:38:00 UTC, 1 replies.
- Came across Spark SQL hang issue with Spark 1.5 Tungsten feature - posted by james <yi...@gmail.com> on 2015/07/31 09:49:42 UTC, 0 replies.
- Re: Came across Spark SQL hang/Error issue with Spark 1.5 Tungsten feature - posted by james <yi...@gmail.com> on 2015/07/31 10:31:30 UTC, 2 replies.
- New Feature Request - posted by Sandeep Giri <sa...@knowbigdata.com> on 2015/07/31 11:11:51 UTC, 2 replies.