- persist versus checkpoint - posted by Renyi Xiong <re...@gmail.com> on 2016/05/01 00:52:08 UTC, 1 replies.
- MetadataFetchFailedException if executorLost when spark.speculation enabled ? - posted by Renyi Xiong <re...@gmail.com> on 2016/05/01 21:13:33 UTC, 0 replies.
- [ANNOUNCE] Spark branch-2.0 - posted by Reynold Xin <rx...@databricks.com> on 2016/05/02 00:59:52 UTC, 1 replies.
- Cross Validator to work with K-Fold value of 1? - posted by Rahul Tanwani <ta...@gmail.com> on 2016/05/02 09:05:19 UTC, 3 replies.
- Re: spark 2 segfault - posted by Ted Yu <yu...@gmail.com> on 2016/05/02 12:48:51 UTC, 3 replies.
- Re: Requesting feedback for PR for SPARK-11962 - posted by Arun Allamsetty <ar...@gmail.com> on 2016/05/02 17:24:37 UTC, 0 replies.
- Re: [build system] short downtime monday morning (5-2-16), 7-9am PDT - posted by shane knapp <sk...@berkeley.edu> on 2016/05/02 17:26:49 UTC, 3 replies.
- Re: Ever increasing physical memory for a Spark Application in YARN - posted by Daniel Darabos <da...@lynxanalytics.com> on 2016/05/02 17:45:10 UTC, 1 replies.
- Re: Spark streaming Kafka receiver WriteAheadLog question - posted by Renyi Xiong <re...@gmail.com> on 2016/05/02 22:48:11 UTC, 0 replies.
- SQLContext and "stable identifier required" - posted by Koert Kuipers <ko...@tresata.com> on 2016/05/03 18:16:59 UTC, 4 replies.
- Unable To Find Proto Buffer Class Error With RDD - posted by kyle <ch...@gmail.com> on 2016/05/04 01:56:56 UTC, 0 replies.
- Caching behaviour and deserialized size - posted by Adam Roberts <AR...@uk.ibm.com> on 2016/05/04 17:01:37 UTC, 0 replies.
- [build system] short downtime next thursday morning, 5-12-16 @ 8am PDT - posted by shane knapp <sk...@berkeley.edu> on 2016/05/04 18:38:34 UTC, 5 replies.
- TaskSchedulerImpl#initialize - why is rootPool initialized here not while TaskSchedulerImpl is created? - posted by Jacek Laskowski <ja...@japila.pl> on 2016/05/06 07:36:24 UTC, 0 replies.
- Proposal of closing some PRs and maybe some PRs abandoned by its author - posted by Hyukjin Kwon <gu...@gmail.com> on 2016/05/06 15:45:36 UTC, 4 replies.
- CfP 11th Workshop on Virtualization in High-Performance Cloud Computing (VHPC '16) (deadline extended May 20th) - posted by VHPC 16 <vh...@gmail.com> on 2016/05/07 15:09:43 UTC, 0 replies.
- Re: Cache Shuffle Based Operation Before Sort - posted by Ali Tootoonchian <al...@levyx.com> on 2016/05/09 00:17:53 UTC, 2 replies.
- spark 2.0 issue with yarn? - posted by Jesse F Chen <jf...@us.ibm.com> on 2016/05/09 20:24:46 UTC, 7 replies.
- Remote JAR download in client mode - posted by Michael Gummelt <mg...@mesosphere.io> on 2016/05/11 00:52:46 UTC, 1 replies.
- Structured Streaming with Kafka source/sink - posted by Ofir Manor <of...@equalum.io> on 2016/05/11 08:47:55 UTC, 1 replies.
- dataframe udf function will be executed twice when filter on new column created by withColumn - posted by Tony Jin <li...@gmail.com> on 2016/05/11 13:55:08 UTC, 2 replies.
- Adding HDFS read-time metrics per task (RE: SPARK-1683) - posted by Brian Cho <ch...@gmail.com> on 2016/05/11 19:01:32 UTC, 4 replies.
- Shrinking the DataFrame lineage - posted by "Ulanov, Alexander" <al...@hpe.com> on 2016/05/11 19:46:29 UTC, 0 replies.
- Spark Exposing RDD as WebService ? - posted by Senthil Kumar <se...@gmail.com> on 2016/05/12 10:53:49 UTC, 0 replies.
- How Spark SQL correctly connect hive metastore database with Spark 2.0 ? - posted by james <yi...@gmail.com> on 2016/05/12 14:36:41 UTC, 0 replies.
- Spark uses disk instead of memory to store RDD blocks - posted by Alexander Pivovarov <ap...@gmail.com> on 2016/05/12 21:16:30 UTC, 3 replies.
- [discuss] separate API annotation into two components: InterfaceAudience & InterfaceStability - posted by Reynold Xin <rx...@databricks.com> on 2016/05/12 21:29:24 UTC, 9 replies.
- code change for adding takeSample of DataFrame - posted by 段石石 <bu...@gmail.com> on 2016/05/13 06:41:48 UTC, 1 replies.
- HiveContext.refreshTable() missing in spark 2.0 - posted by 汪洋 <ti...@icloud.com> on 2016/05/13 09:47:28 UTC, 1 replies.
- Re: Shrinking the DataFrame lineage - posted by Joseph Bradley <jo...@databricks.com> on 2016/05/13 19:38:20 UTC, 2 replies.
- Nested/Chained case statements generate codegen over 64k exception - posted by Jonathan Gray <jo...@gmail.com> on 2016/05/14 09:29:41 UTC, 2 replies.
- combitedTextFile and CombineTextInputFormat - posted by Alexander Pivovarov <ap...@gmail.com> on 2016/05/15 03:13:44 UTC, 8 replies.
- Spark shuffling OutOfMemoryError Java heap space - posted by Renyi Xiong <re...@gmail.com> on 2016/05/15 18:46:33 UTC, 0 replies.
- PySpark mixed with Jython - posted by Holden Karau <ho...@pigscanfly.ca> on 2016/05/15 21:40:51 UTC, 0 replies.
- Question about enabling some of missing rules. - posted by Hyukjin Kwon <gu...@gmail.com> on 2016/05/16 01:50:55 UTC, 4 replies.
- SBT doesn't pick resource file after clean - posted by dhruve ashar <dh...@gmail.com> on 2016/05/17 18:58:30 UTC, 5 replies.
- Indexing of RDDs and DF in 2.0? - posted by Michael Segel <ms...@hotmail.com> on 2016/05/17 19:48:10 UTC, 0 replies.
- CompileException for spark-sql generated code in 2.0.0-SNAPSHOT - posted by Koert Kuipers <ko...@tresata.com> on 2016/05/17 22:29:52 UTC, 3 replies.
- [vote] Apache Spark 2.0.0-preview release (rc1) - posted by Reynold Xin <rx...@apache.org> on 2016/05/18 05:40:51 UTC, 25 replies.
- Query parsing error for the join query between different database - posted by JaeSung Jun <ja...@gmail.com> on 2016/05/18 13:12:50 UTC, 4 replies.
- PR for In-App Scheduling - posted by Nick White <nw...@palantir.com> on 2016/05/18 14:03:43 UTC, 0 replies.
- SparkR dataframe error - posted by Gayathri Murali <ga...@gmail.com> on 2016/05/19 00:12:50 UTC, 12 replies.
- Spark Security. Generating SSL keystore for each job - posted by ScoRp <ni...@gmail.com> on 2016/05/19 11:09:12 UTC, 0 replies.
- [DISCUSS] Removing or changing maintainer process - posted by Matei Zaharia <ma...@gmail.com> on 2016/05/19 15:34:59 UTC, 6 replies.
- right outer joins on Datasets - posted by Andres Perez <an...@tresata.com> on 2016/05/19 15:48:04 UTC, 4 replies.
- Dataset reduceByKey - posted by Andres Perez <an...@tresata.com> on 2016/05/19 18:12:31 UTC, 2 replies.
- Possible Hive problem with Spark 2.0.0 preview. - posted by Doug Balog <do...@dugos.com> on 2016/05/19 20:56:08 UTC, 0 replies.
- Re: Possible Hive problem with Spark 2.0.0 preview. - posted by Michael Armbrust <mi...@databricks.com> on 2016/05/19 21:44:38 UTC, 3 replies.
- Spark driver and yarn behavior - posted by Shankar Venkataraman <sh...@gmail.com> on 2016/05/19 22:16:19 UTC, 3 replies.
- Quick question on spark performance - posted by Yash Sharma <ya...@gmail.com> on 2016/05/21 00:54:14 UTC, 4 replies.
- spark on kubernetes - posted by Gurvinder Singh <gu...@uninett.no> on 2016/05/21 16:30:08 UTC, 11 replies.
- - posted by 成强 <cq...@qq.com> on 2016/05/22 12:04:12 UTC, 1 replies.
- Using Travis for JDK7/8 compilation and lint-java. - posted by Dongjoon Hyun <do...@apache.org> on 2016/05/22 20:25:04 UTC, 23 replies.
- [VOTE] Removing module maintainer process - posted by Matei Zaharia <ma...@gmail.com> on 2016/05/23 00:34:01 UTC, 8 replies.
- Building spark master failed - posted by Ovidiu-Cristian MARCU <ov...@inria.fr> on 2016/05/23 09:16:56 UTC, 2 replies.
- I will fix SPARK-15477 - posted by 马骉 <ma...@qq.com> on 2016/05/23 09:27:24 UTC, 0 replies.
- Re: I will fix SPARK-15477 - posted by Sean Owen <so...@cloudera.com> on 2016/05/23 11:39:30 UTC, 0 replies.
- Running TPCDSQueryBenchmark results in java.lang.OutOfMemoryError - posted by Ovidiu-Cristian MARCU <ov...@inria.fr> on 2016/05/23 15:58:13 UTC, 3 replies.
- How to map values read from test file to 2 different RDDs - posted by Deepak Sharma <de...@gmail.com> on 2016/05/23 17:05:42 UTC, 0 replies.
- Issue with Spark Streaming UI - posted by Sachin Janani <sj...@snappydata.io> on 2016/05/24 06:42:59 UTC, 1 replies.
- ClassCastException: SomeCaseClass cannot be cast to org.apache.spark.sql.Row - posted by Koert Kuipers <ko...@tresata.com> on 2016/05/24 15:33:29 UTC, 3 replies.
- [ANNOUNCE] Apache Spark 2.0.0-preview release - posted by Reynold Xin <rx...@databricks.com> on 2016/05/25 06:44:36 UTC, 12 replies.
- Cartesian join on RDDs taking too much time - posted by Priya Ch <le...@gmail.com> on 2016/05/25 07:05:12 UTC, 1 replies.
- Cannot build master with sbt - posted by Yiannis Gkoufas <jo...@gmail.com> on 2016/05/25 13:12:37 UTC, 2 replies.
- The 7th and Largest Spark Summit is less than 2 weeks away! - posted by Scott walent <sc...@gmail.com> on 2016/05/25 18:18:09 UTC, 0 replies.
- LiveListenerBus with started and stopped flags? Why both? - posted by Jacek Laskowski <ja...@japila.pl> on 2016/05/25 19:59:35 UTC, 1 replies.
- Re: feedback on dataset api explode - posted by Reynold Xin <rx...@databricks.com> on 2016/05/25 20:20:21 UTC, 1 replies.
- Labeling Jiras - posted by Luciano Resende <lu...@gmail.com> on 2016/05/25 20:48:00 UTC, 6 replies.
- Spark docker image - does that sound useful? - posted by Marcin Tustin <mt...@handybook.com> on 2016/05/25 21:44:57 UTC, 0 replies.
- Merging two datafiles - posted by dvlpr <na...@gmail.com> on 2016/05/26 10:04:31 UTC, 0 replies.
- Spark Job Execution halts during shuffle... - posted by Priya Ch <le...@gmail.com> on 2016/05/26 14:40:52 UTC, 0 replies.
- [RESULT][VOTE] Removing module maintainer process - posted by Matei Zaharia <ma...@gmail.com> on 2016/05/26 18:18:45 UTC, 0 replies.
- How to access the off-heap representation of cached data in Spark 2.0 - posted by "jpivarski@gmail.com" <jp...@gmail.com> on 2016/05/26 21:46:46 UTC, 6 replies.
- RE: JDBC Dialect for saving DataFrame into Vertica Table - posted by Mohammed Guller <mo...@glassbeam.com> on 2016/05/26 22:09:23 UTC, 1 replies.
- changed behavior for csv datasource and quoting in spark 2.0.0-SNAPSHOT - posted by Koert Kuipers <ko...@tresata.com> on 2016/05/26 22:35:56 UTC, 3 replies.
- Creation of SparkML Estimators in Java broken? - posted by Benjii519 <be...@gmail.com> on 2016/05/27 01:54:37 UTC, 2 replies.
- NegativeArraySizeException / segfault - posted by Koert Kuipers <ko...@tresata.com> on 2016/05/27 20:00:17 UTC, 3 replies.
- Spark Streaming - Twitter on Python current status - posted by Ricardo Almeida <ri...@actnowib.com> on 2016/05/28 15:37:33 UTC, 1 replies.
- NLP & Constraint Programming - posted by "Debusmann, Ralph" <ra...@sap.com> on 2016/05/30 11:12:29 UTC, 1 replies.
- Secondary Indexing? - posted by Michael Segel <ms...@hotmail.com> on 2016/05/30 16:08:20 UTC, 1 replies.