- Re: [VOTE] Release Apache Spark 1.5.0 (RC2) - posted by Reynold Xin <rx...@databricks.com> on 2015/09/01 00:52:09 UTC, 7 replies.
- Re: IOError on createDataFrame - posted by Philip <br...@gmail.com> on 2015/09/01 01:25:19 UTC, 0 replies.
- Re: Tungsten off heap memory access for C++ libraries - posted by Reynold Xin <rx...@databricks.com> on 2015/09/01 11:11:12 UTC, 2 replies.
- [SparkR] lint script for SpakrR - posted by Yu Ishikawa <yu...@gmail.com> on 2015/09/01 16:09:01 UTC, 0 replies.
- Resource allocation in SPARK streaming - posted by anshu shukla <an...@gmail.com> on 2015/09/01 19:55:01 UTC, 0 replies.
- [VOTE] Release Apache Spark 1.5.0 (RC3) - posted by Reynold Xin <rx...@databricks.com> on 2015/09/01 22:41:46 UTC, 20 replies.
- Use of UnsafeRow - posted by "Ulanov, Alexander" <al...@hpe.com> on 2015/09/02 01:08:25 UTC, 0 replies.
- [ compress in-memory column storage used in sparksql cache table ] - posted by "Wangchangchun (A)" <wa...@huawei.com> on 2015/09/02 06:23:41 UTC, 0 replies.
- taking an n number of rows from and RDD starting from an index - posted by Niranda Perera <ni...@gmail.com> on 2015/09/02 06:35:13 UTC, 3 replies.
- OOM in spark driver - posted by ankit tyagi <an...@gmail.com> on 2015/09/02 08:03:50 UTC, 2 replies.
- Re: [ compress in-memory column storage used in sparksql cache table ] - posted by Nitin Goyal <ni...@gmail.com> on 2015/09/02 09:58:38 UTC, 1 replies.
- Harmonic centrality in GraphX - posted by Pavel Gladkov <gl...@gmail.com> on 2015/09/02 15:54:51 UTC, 0 replies.
- [HELP] Spark 1.4.1 tasks take ridiculously long time to complete - posted by lankaz <sa...@zomato.com> on 2015/09/03 06:33:04 UTC, 1 replies.
- Spark SQL sort by and collect by in multiple partitions - posted by Niranda Perera <ni...@gmail.com> on 2015/09/03 07:19:59 UTC, 1 replies.
- EOFException on History server reading in progress lz4 - posted by an...@thomsonreuters.com on 2015/09/03 18:32:47 UTC, 0 replies.
- Re: Code generation for GPU - posted by Reynold Xin <rx...@databricks.com> on 2015/09/03 22:37:17 UTC, 7 replies.
- (Spark SQL) partition-scoped UDF - posted by Eron Wright <ew...@live.com> on 2015/09/04 19:08:48 UTC, 3 replies.
- Flaky test in DAGSchedulerSuite? - posted by Cheolsoo Park <pi...@gmail.com> on 2015/09/04 20:09:15 UTC, 4 replies.
- [build system] java package updates on the amplab jenkins workers - posted by shane knapp <sk...@berkeley.edu> on 2015/09/04 23:45:22 UTC, 0 replies.
- Exception in saving MatrixFactorizationModel - posted by Madawa Soysa <ma...@cse.mrt.ac.lk> on 2015/09/05 15:47:19 UTC, 4 replies.
- Detecting configuration problems - posted by Madhu <ma...@madhu.com> on 2015/09/06 17:23:54 UTC, 2 replies.
- groupByKey() and keys with many values - posted by kaklakariada <ch...@gmail.com> on 2015/09/07 10:02:18 UTC, 6 replies.
- Fast Iteration while developing - posted by Justin Uang <ju...@gmail.com> on 2015/09/08 06:02:11 UTC, 2 replies.
- adding jars to the classpath with the relative path to spark home - posted by Niranda Perera <ni...@gmail.com> on 2015/09/08 09:03:19 UTC, 0 replies.
- Pyspark DataFrame TypeError - posted by "Prabeesh K." <pr...@gmail.com> on 2015/09/08 10:45:03 UTC, 2 replies.
- Question on DAGScheduler.getMissingParentStages() - posted by Madhusudanan Kandasamy <ma...@in.ibm.com> on 2015/09/08 17:00:37 UTC, 0 replies.
- Deserializing JSON into Scala objects in Java code - posted by Kevin Chen <kc...@palantir.com> on 2015/09/08 21:46:46 UTC, 4 replies.
- [MLlib] Extensibility of MLlib classes (Word2VecModel etc.) - posted by Maandy <dy...@gmail.com> on 2015/09/09 11:01:12 UTC, 1 replies.
- Did the 1.5 release complete? - posted by Sean Owen <so...@cloudera.com> on 2015/09/09 11:42:51 UTC, 1 replies.
- [ANNOUNCE] Announcing Spark 1.5.0 - posted by Reynold Xin <rx...@databricks.com> on 2015/09/09 11:47:30 UTC, 10 replies.
- looking for a technical reviewer to review a book on Spark - posted by Mohammed Guller <mo...@glassbeam.com> on 2015/09/09 17:36:00 UTC, 2 replies.
- Spark 1.5: How to trigger expression execution through UnsafeRow/TungstenProject - posted by lonikar <lo...@gmail.com> on 2015/09/09 21:31:29 UTC, 2 replies.
- [SparkSQL]Could not alter table in Spark 1.5 use HiveContext - posted by StanZhai <ma...@zhaishidan.cn> on 2015/09/10 08:11:35 UTC, 5 replies.
- Spark 1.5.x: Java files in src/main/scala and vice versa - posted by lonikar <lo...@gmail.com> on 2015/09/10 13:10:12 UTC, 6 replies.
- DF.intersection issue in 1.5 - posted by Nitay Joffe <ni...@actioniq.co> on 2015/09/10 15:16:03 UTC, 1 replies.
- Concurrency issue in SQLExecution.withNewExecutionId - posted by Olivier Toupin <ol...@gmail.com> on 2015/09/10 18:09:18 UTC, 3 replies.
- Re: ClassCastException using DataFrame only when num-executors > 2 ... - posted by Reynold Xin <rx...@databricks.com> on 2015/09/11 00:28:52 UTC, 1 replies.
- Re: MongoDB and Spark - posted by Sandeep Giri <sa...@knowbigdata.com> on 2015/09/11 11:48:56 UTC, 2 replies.
- Spark 1.5.0: setting up debug env - posted by lonikar <lo...@gmail.com> on 2015/09/11 18:54:59 UTC, 0 replies.
- Re: SparkR driver side JNI - posted by Renyi Xiong <re...@gmail.com> on 2015/09/11 19:54:33 UTC, 2 replies.
- New Spark json endpoints - posted by Kevin Chen <kc...@palantir.com> on 2015/09/11 20:30:31 UTC, 5 replies.
- New JavaRDD Inside JavaPairDStream - posted by Rachana Srivastava <Ra...@markmonitor.com> on 2015/09/11 21:09:56 UTC, 1 replies.
- SIGTERM 15 Issue : Spark Streaming for ingesting huge text files using custom Receiver - posted by "Varadhan, Jawahar" <va...@yahoo.com.INVALID> on 2015/09/12 00:02:21 UTC, 0 replies.
- Multithreaded vs Spark Executor - posted by Rachana Srivastava <Ra...@markmonitor.com> on 2015/09/12 05:07:28 UTC, 0 replies.
- Re: SIGTERM 15 Issue : Spark Streaming for ingesting huge text files using custom Receiver - posted by Jörn Franke <jo...@gmail.com> on 2015/09/12 09:32:57 UTC, 0 replies.
- Re: HyperLogLogUDT - posted by Nick Pentreath <ni...@gmail.com> on 2015/09/12 10:07:28 UTC, 8 replies.
- spark dataframe transform JSON to ORC meet “column ambigous exception” - posted by Fengdong Yu <fe...@everstring.com> on 2015/09/12 11:05:43 UTC, 3 replies.
- [Question] ORC - EMRFS Problem - posted by Cazen Lee <ca...@gmail.com> on 2015/09/12 17:07:01 UTC, 0 replies.
- Spark Streaming..Exception - posted by Priya Ch <le...@gmail.com> on 2015/09/12 19:34:33 UTC, 2 replies.
- (send this email to subscribe) - posted by 蒋林 <ye...@163.com> on 2015/09/14 03:43:45 UTC, 1 replies.
- An alternate UI for Spark. - posted by Prashant Sharma <sc...@gmail.com> on 2015/09/14 08:18:36 UTC, 1 replies.
- Unable to acquire memory errors in HiveCompatibilitySuite - posted by Pete Robbins <ro...@gmail.com> on 2015/09/14 13:16:11 UTC, 18 replies.
- JavaRDD using Reflection - posted by Rachana Srivastava <Ra...@markmonitor.com> on 2015/09/14 18:54:25 UTC, 2 replies.
- Data frame with one column - posted by "Ulanov, Alexander" <al...@hpe.com> on 2015/09/14 19:50:29 UTC, 3 replies.
- ML: embed a transformer - posted by Sa...@wellsfargo.com on 2015/09/14 20:48:53 UTC, 2 replies.
- Fwd: JobScheduler: Error generating jobs for time for custom InputDStream - posted by Juan Rodríguez Hortalá <ju...@gmail.com> on 2015/09/14 20:51:24 UTC, 1 replies.
- Spark 1.5.1 release - posted by Reynold Xin <rx...@databricks.com> on 2015/09/14 22:07:35 UTC, 0 replies.
- JDBC Dialect tests - posted by Luciano Resende <lu...@gmail.com> on 2015/09/14 22:34:40 UTC, 2 replies.
- Null Value in DecimalType column of DataFrame - posted by Dirceu Semighini Filho <di...@gmail.com> on 2015/09/14 22:42:07 UTC, 3 replies.
- Enum parameter in ML - posted by "Ulanov, Alexander" <al...@hpe.com> on 2015/09/15 01:31:14 UTC, 7 replies.
- RDD API patterns - posted by sim <si...@swoop.com> on 2015/09/15 01:36:05 UTC, 14 replies.
- And.eval short circuiting - posted by Zack Sampson <zs...@palantir.com> on 2015/09/15 06:12:08 UTC, 6 replies.
- Predicate push-down bug? - posted by Ravi Ravi <i....@gmail.com> on 2015/09/15 19:32:55 UTC, 2 replies.
- pyspark streaming DStream compute - posted by Renyi Xiong <re...@gmail.com> on 2015/09/15 22:46:13 UTC, 1 replies.
- JENKINS: downtime next week, wed and thurs mornings (9-23 and 9-24) - posted by shane knapp <sk...@berkeley.edu> on 2015/09/16 17:40:24 UTC, 6 replies.
- SparkR streaming source code - posted by Renyi Xiong <re...@gmail.com> on 2015/09/16 18:52:57 UTC, 3 replies.
- Communication between executors and drivers - posted by Muhammad Haseeb Javed <11...@seecs.edu.pk> on 2015/09/16 21:44:51 UTC, 0 replies.
- Spark streaming DStream state on worker - posted by Renyi Xiong <re...@gmail.com> on 2015/09/16 21:47:21 UTC, 0 replies.
- how to send additional configuration to the RDD after it was lazily created - posted by Gil Vernik <GI...@il.ibm.com> on 2015/09/17 09:07:50 UTC, 1 replies.
- bug in Worker.scala, ExecutorRunner is not serializable - posted by Huangguowei <hu...@huawei.com> on 2015/09/17 09:46:20 UTC, 3 replies.
- Re: bug in Worker.scala, ExecutorRunner is not serializable - posted by Huangguowei <hu...@huawei.com> on 2015/09/17 10:26:20 UTC, 4 replies.
- QueueStream doesn't support checkpoint makes it difficult to do unit test - posted by Bin Wang <wb...@gmail.com> on 2015/09/17 10:50:25 UTC, 1 replies.
- RDD: Execution and Scheduling - posted by gsvic <vi...@gmail.com> on 2015/09/17 14:23:05 UTC, 5 replies.
- [MLlib] BinaryLogisticRegressionSummary on test set - posted by Hao Ren <in...@gmail.com> on 2015/09/17 17:07:10 UTC, 3 replies.
- Does anyone use ShuffleDependency directly? - posted by Josh Rosen <jo...@databricks.com> on 2015/09/18 22:17:14 UTC, 0 replies.
- One element per node - posted by "Ulanov, Alexander" <al...@hpe.com> on 2015/09/19 00:57:44 UTC, 0 replies.
- Re: One element per node - posted by Feynman Liang <fl...@databricks.com> on 2015/09/19 01:06:01 UTC, 5 replies.
- BUILD SYSTEM: fire and power event at UC berkeley's IST colo, jenkins offline - posted by shane knapp <sk...@berkeley.edu> on 2015/09/19 09:28:56 UTC, 4 replies.
- [SparkSQL]How does spark handle a parquet file in parallel? - posted by StanZhai <ma...@zhaishidan.cn> on 2015/09/19 13:43:28 UTC, 0 replies.
- spark-shell 1.5 doesn't seem to work in local mode - posted by Madhu <ma...@madhu.com> on 2015/09/19 18:14:12 UTC, 4 replies.
- Re: AMP JENKINS - unplanned outage at 1845, ongoing - posted by shane knapp <sk...@berkeley.edu> on 2015/09/19 19:29:36 UTC, 0 replies.
- SparkR installation not working - posted by Devl Devel <de...@gmail.com> on 2015/09/19 21:30:16 UTC, 1 replies.
- Using scala-2.11 when making changes to spark source - posted by Stephen Boesch <ja...@gmail.com> on 2015/09/20 15:18:35 UTC, 2 replies.
- Join operation on DStreams - posted by guoxu1231 <gu...@gmail.com> on 2015/09/21 08:47:56 UTC, 2 replies.
- Hbase Spark streaming issue. - posted by Siva <sb...@gmail.com> on 2015/09/21 09:46:11 UTC, 0 replies.
- Re: passing SparkContext as parameter - posted by Priya Ch <le...@gmail.com> on 2015/09/21 13:27:10 UTC, 2 replies.
- Forecasting Library For Apache Spark - posted by Mohamed Baddar <mo...@badrit.com> on 2015/09/21 13:47:25 UTC, 2 replies.
- Test workflow - blacklist entire suites and run any independently - posted by Adam Roberts <AR...@uk.ibm.com> on 2015/09/21 17:21:15 UTC, 2 replies.
- Spark SQL DataFrame 1.5.0 is extremely slow for take(1) or head() or first() - posted by Jerry Lam <ch...@gmail.com> on 2015/09/21 17:56:15 UTC, 6 replies.
- Unsubscribe - posted by Dulaj Viduranga <vi...@icloud.com> on 2015/09/21 19:15:58 UTC, 1 replies.
- How to modify Hadoop APIs used by Spark? - posted by Dogtail Ray <sp...@gmail.com> on 2015/09/21 22:20:06 UTC, 2 replies.
- DataFrames Aggregate does not spill? - posted by Matt Cheah <mc...@palantir.com> on 2015/09/22 02:34:39 UTC, 2 replies.
- SparkR package path - posted by Hossein <fa...@gmail.com> on 2015/09/22 03:18:50 UTC, 10 replies.
- Why there is no snapshots for 1.5 branch? - posted by Bin Wang <wb...@gmail.com> on 2015/09/22 06:51:04 UTC, 10 replies.
- RowMatrix tallSkinnyQR - ERROR: Second call to constructor of static parser - posted by Sa...@wellsfargo.com on 2015/09/22 17:27:08 UTC, 0 replies.
- Open Issues for Contributors - posted by Pedro Rodriguez <sk...@gmail.com> on 2015/09/22 17:50:26 UTC, 2 replies.
- column identifiers in Spark SQL - posted by Richard Hillegas <rh...@us.ibm.com> on 2015/09/22 19:53:23 UTC, 4 replies.
- Derby version in Spark - posted by Richard Hillegas <rh...@us.ibm.com> on 2015/09/22 22:28:29 UTC, 5 replies.
- Fwd: Parallel collection in driver programs - posted by Andy Huang <an...@servian.com.au> on 2015/09/23 07:03:39 UTC, 0 replies.
- Why Filter return a DataFrame object in DataFrame.scala? - posted by qiuhai <98...@qq.com> on 2015/09/23 07:57:33 UTC, 3 replies.
- using Codahale counters in source - posted by Steve Loughran <st...@hortonworks.com> on 2015/09/23 11:05:22 UTC, 0 replies.
- Checkpoint directory structure - posted by Bin Wang <wb...@gmail.com> on 2015/09/23 12:58:34 UTC, 4 replies.
- RFC: packaging Spark without assemblies - posted by Marcelo Vanzin <va...@cloudera.com> on 2015/09/24 00:13:07 UTC, 3 replies.
- Get only updated RDDs from or after updateStateBykey - posted by Bin Wang <wb...@gmail.com> on 2015/09/24 07:45:01 UTC, 6 replies.
- [VOTE] Release Apache Spark 1.5.1 (RC1) - posted by Reynold Xin <rx...@databricks.com> on 2015/09/24 09:27:25 UTC, 29 replies.
- [Discuss] NOTICE file for transitive "NOTICE"s - posted by Reynold Xin <rx...@databricks.com> on 2015/09/24 19:55:53 UTC, 11 replies.
- How to get the HDFS path for each RDD - posted by Fengdong Yu <fe...@everstring.com> on 2015/09/25 05:12:00 UTC, 9 replies.
- unsubscribe - posted by Nirmal R Kumar <ni...@hotmail.com> on 2015/09/25 12:30:04 UTC, 2 replies.
- Dataframes: PrunedFilteredScan without Spark Side Filtering - posted by Russell Spitzer <ru...@gmail.com> on 2015/09/26 07:02:00 UTC, 1 replies.
- Re: Spark Streaming with Tachyon : Data Loss on Receiver Failure due to WAL error - posted by Dibyendu Bhattacharya <di...@gmail.com> on 2015/09/26 08:49:25 UTC, 0 replies.
- treeAggregate timing / SGD performance with miniBatchFraction < 1 - posted by Mike Hynes <91...@gmail.com> on 2015/09/26 19:20:31 UTC, 2 replies.
- Spark-Kafka Connector issue - posted by Ratika Prasad <rp...@couponsinc.com> on 2015/09/27 06:50:22 UTC, 2 replies.
- using JavaRDD in spark-redis connector - posted by Rohith P <rp...@couponsinc.com> on 2015/09/28 17:31:39 UTC, 1 replies.
- failed to run spark sample on windows - posted by Renyi Xiong <re...@gmail.com> on 2015/09/29 01:36:10 UTC, 4 replies.
- Monitoring tools for spark streaming - posted by Siva <sb...@gmail.com> on 2015/09/29 01:45:14 UTC, 0 replies.
- spark-submit classloader issue... - posted by Rachana Srivastava <Ra...@markmonitor.com> on 2015/09/29 04:01:31 UTC, 0 replies.
- Where are logs for Spark Kafka Yarn on Cloudera - posted by Rachana Srivastava <Ra...@markmonitor.com> on 2015/09/29 08:37:48 UTC, 0 replies.
- Spark Network Module behaviour - posted by sbiookag <sb...@asu.edu> on 2015/09/29 20:36:56 UTC, 0 replies.
- Dynamic DAG use-case for spark streaming. - posted by Archit Thakur <ar...@gmail.com> on 2015/09/29 21:06:26 UTC, 1 replies.
- Too many executors are created - posted by "Ulanov, Alexander" <al...@hpe.com> on 2015/09/29 21:23:53 UTC, 0 replies.
- Hive permanent functions are not available in Spark SQL - posted by Pala M Muthaia <mc...@rocketfuelinc.com.INVALID> on 2015/09/30 00:43:08 UTC, 1 replies.
- CQs on WindowedStream created on running StreamingContext - posted by Yogs <ma...@gmail.com> on 2015/09/30 09:55:02 UTC, 0 replies.
- Task Execution - posted by gsvic <vi...@gmail.com> on 2015/09/30 11:21:44 UTC, 0 replies.