You are viewing a plain text version of this content. The canonical link for it is here.
- Re: [VOTE] Release Apache Spark 1.4.0 (RC3) - posted by Sandy Ryza <sa...@cloudera.com> on 2015/06/01 01:34:25 UTC, 11 replies.
- Re: Why is RDD to PairRDDFunctions only via implicits? - posted by Reynold Xin <rx...@databricks.com> on 2015/06/01 06:57:01 UTC, 0 replies.
- spark 1.4 - test-loading 1786 mysql tables / a few TB - posted by René Treffer <rt...@gmail.com> on 2015/06/01 10:10:22 UTC, 4 replies.
- Re: please use SparkFunSuite instead of ScalaTest's FunSuite from now on - posted by Steve Loughran <st...@hortonworks.com> on 2015/06/01 12:40:58 UTC, 3 replies.
- GraphX: New graph operator - posted by Tarek Auel <ta...@gmail.com> on 2015/06/01 17:54:17 UTC, 6 replies.
- Re: [Streaming] Configure executor logging on Mesos - posted by Gerard Maas <ge...@gmail.com> on 2015/06/02 02:28:15 UTC, 0 replies.
- [SQL] Write parquet files under partition directories? - posted by Matt Cheah <mc...@palantir.com> on 2015/06/02 07:21:56 UTC, 3 replies.
- about Spark MLlib StandardScaler's Implementation - posted by RoyGaoVLIS <ro...@zju.edu.cn> on 2015/06/02 10:25:37 UTC, 1 replies.
- Unit tests can generate spurious shutdown messages - posted by Mick Davies <mi...@gmail.com> on 2015/06/02 13:25:26 UTC, 1 replies.
- CSV Support in SparkR - posted by "Eskilson,Aleksander" <Al...@Cerner.com> on 2015/06/02 20:52:47 UTC, 7 replies.
- DataFrame.withColumn very slow when used iteratively? - posted by zsampson <zs...@palantir.com> on 2015/06/02 21:34:55 UTC, 3 replies.
- Possible space improvements to shuffle - posted by John Carrino <jo...@gmail.com> on 2015/06/02 22:50:47 UTC, 2 replies.
- createDataframe from s3 results in error - posted by Ignacio Zendejas <iz...@node.io> on 2015/06/03 00:13:40 UTC, 4 replies.
- [RESULT] [VOTE] Release Apache Spark 1.4.0 (RC3) - posted by Patrick Wendell <pw...@gmail.com> on 2015/06/03 05:51:39 UTC, 0 replies.
- [VOTE] Release Apache Spark 1.4.0 (RC4) - posted by Patrick Wendell <pw...@gmail.com> on 2015/06/03 05:53:32 UTC, 31 replies.
- Stop Master and Slaves without SSH - posted by Devl Devel <de...@gmail.com> on 2015/06/03 11:32:34 UTC, 0 replies.
- MLlib: Anybody working on hierarchical topic models like HLDA? - posted by Lorenz Fischer <lo...@gmail.com> on 2015/06/03 15:43:13 UTC, 4 replies.
- SparkR DataFrame Column Casts esp. from CSV Files - posted by "Eskilson,Aleksander" <Al...@Cerner.com> on 2015/06/03 16:51:50 UTC, 7 replies.
- Cleaning up workers' directories automatically - posted by atalay <at...@yahoo.com> on 2015/06/04 00:01:17 UTC, 0 replies.
- [ANNOUNCE] YARN support in Spark EC2 - posted by Shivaram Venkataraman <sh...@eecs.berkeley.edu> on 2015/06/04 01:32:53 UTC, 0 replies.
- Ivy support in Spark vs. sbt - posted by Marcelo Vanzin <va...@cloudera.com> on 2015/06/04 01:33:42 UTC, 9 replies.
- Anyone facing problem in incremental building of individual project - posted by Meethu Mathew <me...@flytxt.com> on 2015/06/04 12:16:17 UTC, 3 replies.
- Spark Packages: using sbt-spark-package tool with R - posted by Chris Freeman <cf...@alteryx.com> on 2015/06/04 16:58:23 UTC, 0 replies.
- Where is the JIRA filter for new contributers? - posted by Ravi Desai <rd...@gmail.com> on 2015/06/04 17:45:31 UTC, 1 replies.
- Fwd: How to pass system properties in spark ? - posted by Ashwin Shankar <as...@gmail.com> on 2015/06/04 18:33:27 UTC, 0 replies.
- PySpark on PyPi - posted by Olivier Girardot <o....@lateral-thoughts.com> on 2015/06/05 07:45:58 UTC, 3 replies.
- Re: Regarding "Connecting spark to Mesos" documentation - posted by François Garillot <fr...@typesafe.com> on 2015/06/05 12:38:47 UTC, 0 replies.
- Scheduler question: stages with non-arithmetic numbering - posted by Mike Hynes <91...@gmail.com> on 2015/06/05 18:51:33 UTC, 2 replies.
- Fwd: Multi-node Docker based *Spark 1.3.1* clusters on VirtualBox(Mac)/EC2 instance - posted by Anant Chintamaneni <an...@gmail.com> on 2015/06/05 23:03:30 UTC, 0 replies.
- [DISCUSS] Minimize use of MINOR, BUILD, and HOTFIX w/ no JIRA - posted by Patrick Wendell <pw...@gmail.com> on 2015/06/06 18:01:05 UTC, 2 replies.
- Stages with non-arithmetic numbering & Timing metrics in event logs - posted by Mike Hynes <91...@gmail.com> on 2015/06/08 06:12:29 UTC, 8 replies.
- [SparkSQL ] What is Exchange in physical plan for ? - posted by invkrh <in...@gmail.com> on 2015/06/08 15:33:31 UTC, 1 replies.
- [ml] Why all model classes are final? - posted by Peter Rudenko <pe...@gmail.com> on 2015/06/08 18:17:44 UTC, 2 replies.
- [sample code] deeplearning4j for Spark ML (@DeveloperAPI) - posted by Eron Wright <ew...@live.com> on 2015/06/08 18:20:03 UTC, 2 replies.
- SparkR Reading Tables from Hive - posted by "Eskilson,Aleksander" <Al...@Cerner.com> on 2015/06/08 22:38:34 UTC, 2 replies.
- Fwd: pull requests no longer closing by commit messages with "closes #xxxx" - posted by Reynold Xin <rx...@databricks.com> on 2015/06/09 02:59:40 UTC, 0 replies.
- Recreating JIRA SPARK-8142 - posted by Devl Devel <de...@gmail.com> on 2015/06/09 11:28:49 UTC, 0 replies.
- Re: Spark on Mesos vs Yarn - posted by Bharath Ravi Kumar <re...@gmail.com> on 2015/06/10 06:10:04 UTC, 2 replies.
- About akka used in spark - posted by "wangtao (A)" <wa...@huawei.com> on 2015/06/10 07:55:13 UTC, 2 replies.
- 答复: [VOTE] Release Apache Spark 1.4.0 (RC4) - posted by Tao Wang <wa...@huawei.com> on 2015/06/10 08:04:00 UTC, 0 replies.
- [RESULT] [VOTE] Release Apache Spark 1.4.0 (RC4) - posted by Patrick Wendell <pw...@gmail.com> on 2015/06/10 21:13:20 UTC, 0 replies.
- Problem with pyspark on Docker talking to YARN cluster - posted by Ashwin Shankar <as...@gmail.com> on 2015/06/10 22:43:04 UTC, 2 replies.
- Re: Approximate rank-based statistics (median, 95-th percentile, etc.) for Spark - posted by Grega Kešpret <gr...@celtra.com> on 2015/06/10 23:53:44 UTC, 4 replies.
- Jcenter / bintray support for spark packages? - posted by Hector Yee <he...@gmail.com> on 2015/06/11 03:23:57 UTC, 1 replies.
- How to support dependency jars and files on HDFS in standalone cluster mode? - posted by Dong Lei <do...@microsoft.com> on 2015/06/11 05:04:41 UTC, 6 replies.
- [ANNOUNCE] Announcing Spark 1.4 - posted by Patrick Wendell <pw...@gmail.com> on 2015/06/11 18:05:06 UTC, 0 replies.
- When to expect UTF8String? - posted by zsampson <zs...@palantir.com> on 2015/06/12 05:08:07 UTC, 3 replies.
- Contributing to pyspark - posted by Usman Ehtesham <ue...@gmail.com> on 2015/06/12 06:06:39 UTC, 2 replies.
- Re: Spark 1.4: Python API for getting Kafka offsets in direct mode? - posted by Saisai Shao <sa...@gmail.com> on 2015/06/12 07:02:47 UTC, 8 replies.
- A confusing ClassNotFoundException error - posted by Zhiwei Chan <z....@gmail.com> on 2015/06/12 11:04:14 UTC, 2 replies.
- Remove Hadoop 1 support (Hadoop <2.2) for Spark 1.5? - posted by Sean Owen <so...@cloudera.com> on 2015/06/12 12:09:52 UTC, 12 replies.
- Contribution - posted by srinivasraghavansr71 <sr...@gmail.com> on 2015/06/13 05:16:33 UTC, 3 replies.
- [NEW] Debugging test failures on Jenkins - posted by Andrew Or <an...@databricks.com> on 2015/06/13 07:31:30 UTC, 0 replies.
- About HostName display in SparkUI - posted by Sea <26...@qq.com> on 2015/06/13 18:21:44 UTC, 2 replies.
- [SparkStreaming] NPE in DStreamCheckPointData.scala:125 - posted by Haopu Wang <HW...@qilinsoft.com> on 2015/06/15 09:36:06 UTC, 1 replies.
- Spark hangs without notification (broadcasting) - posted by Sergio Ramírez <sr...@ugr.es> on 2015/06/15 11:56:47 UTC, 0 replies.
- Using queueStream - posted by anshu shukla <an...@gmail.com> on 2015/06/15 15:37:22 UTC, 0 replies.
- Re: About HostName display in SparkUI - posted by Sea <26...@qq.com> on 2015/06/15 18:24:20 UTC, 0 replies.
- Problem: Custom Receiver for getting events from a Dynamic Queue - posted by anshu shukla <an...@gmail.com> on 2015/06/15 21:23:15 UTC, 0 replies.
- Random Forest driver memory - posted by Isca Harmatz <po...@gmail.com> on 2015/06/16 06:45:35 UTC, 2 replies.
- Re: Sidebar: issues targeted for 1.4.0 - posted by Sean Owen <so...@cloudera.com> on 2015/06/16 14:24:29 UTC, 5 replies.
- [SparkScore] Performance portal for Apache Spark - posted by "Huang, Jie" <ji...@intel.com> on 2015/06/16 19:27:18 UTC, 2 replies.
- Spark-Shell 2.11 1.4.0-RC-03 does not add jars to class path - posted by Alessandro Baretta <al...@gmail.com> on 2015/06/16 20:44:09 UTC, 0 replies.
- Read/write metrics for jobs which use S3 - posted by abshkmodi <ab...@gmail.com> on 2015/06/17 08:00:12 UTC, 0 replies.
- Implementing and Using a Custom Actor-based Receiver - posted by anshu shukla <an...@gmail.com> on 2015/06/17 11:07:04 UTC, 0 replies.
- Welcoming some new committers - posted by Matei Zaharia <ma...@gmail.com> on 2015/06/18 00:12:53 UTC, 6 replies.
- [SparkR] Have we already had any lint for SparkR? - posted by Yu Ishikawa <yu...@gmail.com> on 2015/06/18 00:20:07 UTC, 2 replies.
- Hive 0.12 support in 1.4.0 ? - posted by Thomas Dudziak <to...@gmail.com> on 2015/06/18 01:18:21 UTC, 1 replies.
- [mllib] Refactoring some spark.mllib model classes in Python not inheriting JavaModelWrapper - posted by Yu Ishikawa <yu...@gmail.com> on 2015/06/18 05:15:46 UTC, 2 replies.
- [MLlib] Contributing algorithm for DP means clustering - posted by Meethu Mathew <me...@flytxt.com> on 2015/06/18 06:58:11 UTC, 0 replies.
- Spark-sql(yarn-client) java.lang.NoClassDefFoundError: org/apache/spark/deploy/yarn/ExecutorLauncher - posted by Sea <26...@qq.com> on 2015/06/18 15:15:40 UTC, 2 replies.
- Latency between the RDD in Streaming - posted by anshu shukla <an...@gmail.com> on 2015/06/18 20:24:53 UTC, 0 replies.
- Increase partition count (repartition) without shuffle - posted by "Ulanov, Alexander" <al...@hp.com> on 2015/06/18 23:26:00 UTC, 2 replies.
- 回复: Spark-sql(yarn-client) java.lang.NoClassDefFoundError: org/apache/spark/deploy/yarn/ExecutorLauncher - posted by Sea <26...@qq.com> on 2015/06/19 05:05:05 UTC, 0 replies.
- [Tungsten] NPE in UnsafeShuffleWriter.java - posted by Peter Rudenko <pe...@gmail.com> on 2015/06/19 20:36:07 UTC, 2 replies.
- Workaround for problems with OS X + JIRA Client - posted by Sean Owen <so...@cloudera.com> on 2015/06/19 20:54:41 UTC, 1 replies.
- Stats on targets for 1.5.0 - posted by Sean Owen <so...@cloudera.com> on 2015/06/19 21:17:12 UTC, 1 replies.
- a - posted by Cubenomy Subscriber <cu...@gmail.com> on 2015/06/19 22:18:14 UTC, 0 replies.
- Impala created parquet tables - posted by Debasish Das <de...@gmail.com> on 2015/06/20 09:21:27 UTC, 0 replies.
- Fwd: Verifying number of workers in Spark Streaming - posted by anshu shukla <an...@gmail.com> on 2015/06/20 16:27:32 UTC, 0 replies.
- Velox Model Server - posted by Debasish Das <de...@gmail.com> on 2015/06/20 17:00:34 UTC, 0 replies.
- [pyspark][mllib] What is the best way to treat int and long int between python2.6/python3.4 and Java? - posted by Yu Ishikawa <yu...@gmail.com> on 2015/06/20 19:29:38 UTC, 0 replies.
- unsafe/compile error - posted by acidghost <an...@gmail.com> on 2015/06/21 12:18:23 UTC, 6 replies.
- JIRA 2344 status (Fuzzy C-Means) - posted by salexln <sa...@gmail.com> on 2015/06/21 18:57:35 UTC, 0 replies.
- Web UI and History Server are Inconsistent; Web UI sometimes cannot process logs - posted by jcai <jo...@yale.edu> on 2015/06/21 21:05:14 UTC, 0 replies.
- [jenkins] ERROR: Publisher 'Publish JUnit test result report' failed: No test report files were found. Configuration error? - posted by Yu Ishikawa <yu...@gmail.com> on 2015/06/21 22:54:19 UTC, 2 replies.
- Force Spark save parquet files with replication factor other than 3 (default one) - posted by "Ulanov, Alexander" <al...@hp.com> on 2015/06/23 03:44:29 UTC, 0 replies.
- custom REST port from spark-defaults.cof - posted by Niranda Perera <ni...@gmail.com> on 2015/06/23 08:03:58 UTC, 1 replies.
- [SparkSQL 1.4]Could not use concat with UDF in where clause - posted by StanZhai <ma...@zhaishidan.cn> on 2015/06/23 10:42:15 UTC, 2 replies.
- OK to add committers active on JIRA to JIRA admin role? - posted by Sean Owen <so...@cloudera.com> on 2015/06/23 10:47:12 UTC, 1 replies.
- HyperLogLogUDT - posted by Nick Pentreath <ni...@gmail.com> on 2015/06/23 11:19:13 UTC, 0 replies.
- Calculating tuple count /input rate with time - posted by anshu shukla <an...@gmail.com> on 2015/06/23 11:49:19 UTC, 1 replies.
- [DataFrame] partitionBy issues - posted by vladio <vl...@palantir.com> on 2015/06/23 20:26:11 UTC, 2 replies.
- how can I write a language "wrapper"? - posted by "Vasili I. Galchin" <vi...@gmail.com> on 2015/06/23 22:35:04 UTC, 7 replies.
- Python UDF performance at large scale - posted by Justin Uang <ju...@gmail.com> on 2015/06/24 00:27:11 UTC, 9 replies.
- [VOTE] Release Apache Spark 1.4.1 - posted by Patrick Wendell <pw...@gmail.com> on 2015/06/24 07:37:29 UTC, 22 replies.
- Loss of data due to congestion - posted by anshu shukla <an...@gmail.com> on 2015/06/24 15:18:35 UTC, 1 replies.
- [GraphX] Graph 500 graph generator - posted by "Carr, J. Ryan" <Ry...@jhuapl.edu> on 2015/06/24 16:55:04 UTC, 1 replies.
- Spark SQL 1.3 Exception - posted by Debasish Das <de...@gmail.com> on 2015/06/24 17:18:39 UTC, 0 replies.
- Re: IPv6 support - posted by Akhil Das <ak...@sigmoidanalytics.com> on 2015/06/24 20:07:21 UTC, 1 replies.
- Problem with version compatibility - posted by jimfcarroll <ji...@gmail.com> on 2015/06/24 20:21:34 UTC, 10 replies.
- Force inner join to shuffle the smallest table - posted by "Ulanov, Alexander" <al...@hp.com> on 2015/06/24 20:35:57 UTC, 4 replies.
- parallelize method v.s. textFile method - posted by xing <eh...@gmail.com> on 2015/06/25 02:59:12 UTC, 3 replies.
- Error in invoking a custom StandaloneRecoveryModeFactory in java env (Spark v1.3.0) - posted by Niranda Perera <ni...@gmail.com> on 2015/06/25 07:57:25 UTC, 2 replies.
- how to implement my own datasource? - posted by 诺铁 <no...@gmail.com> on 2015/06/25 08:25:36 UTC, 4 replies.
- Github spam from naver user - posted by Sean Owen <so...@cloudera.com> on 2015/06/25 12:22:39 UTC, 2 replies.
- Various forks - posted by Iulian Dragoș <iu...@typesafe.com> on 2015/06/25 15:18:52 UTC, 0 replies.
- [SQL] codegen on wide dataset throws StackOverflow - posted by Peter Rudenko <pe...@gmail.com> on 2015/06/25 15:35:41 UTC, 2 replies.
- Verifying Empirically Number of Performance-Heavy Threads and Parallelism - posted by jcai <jo...@yale.edu> on 2015/06/25 18:26:57 UTC, 0 replies.
- Visualize Spark-SQL query plans - posted by Raajay <ra...@gmail.com> on 2015/06/25 18:33:35 UTC, 0 replies.
- External Shuffle service over yarn - posted by yash datta <sa...@gmail.com> on 2015/06/26 08:08:41 UTC, 2 replies.
- Spark for distributed dbms cluster - posted by "louis.hust" <lo...@gmail.com> on 2015/06/26 08:37:52 UTC, 1 replies.
- Time is ugly in Spark Streaming.... - posted by Sea <26...@qq.com> on 2015/06/26 11:06:16 UTC, 4 replies.
- [SparkScore]Performance portal for Apache Spark - WW26 - posted by "Huang, Jie" <ji...@intel.com> on 2015/06/26 13:24:38 UTC, 4 replies.
- 回复: Time is ugly in Spark Streaming.... - posted by Sea <26...@qq.com> on 2015/06/26 14:59:47 UTC, 1 replies.
- R - Scala interface used in Spark? - posted by "Vasili I. Galchin" <vi...@gmail.com> on 2015/06/27 00:19:21 UTC, 7 replies.
- Unable to add to roles in JIRA - posted by Sean Owen <so...@cloudera.com> on 2015/06/28 12:27:11 UTC, 0 replies.
- Question about Spark process and thread - posted by Dogtail Ray <sp...@gmail.com> on 2015/06/28 17:32:40 UTC, 1 replies.
- Spark 1.5.0-SNAPSHOT broken with Scala 2.11 - posted by Alessandro Baretta <al...@gmail.com> on 2015/06/29 03:02:14 UTC, 6 replies.
- Gossip protocol in Master selection - posted by Debasish Das <de...@gmail.com> on 2015/06/29 03:42:38 UTC, 0 replies.
- Re: UnusedStubClass in 1.3.0-rc1 - posted by dobashim <do...@oss.nttdata.co.jp> on 2015/06/29 06:33:56 UTC, 0 replies.
- Dataframes filter by count fails with python API - posted by Andrew Vykhodtsev <yo...@gmail.com> on 2015/06/29 08:57:20 UTC, 1 replies.
- DStream.reduce - posted by Zoltán Zvara <zo...@gmail.com> on 2015/06/30 16:59:21 UTC, 0 replies.
- Grouping runs of elements in a RDD - posted by RJ Nowling <rn...@gmail.com> on 2015/06/30 20:01:57 UTC, 4 replies.