You are viewing a plain text version of this content. The canonical link for it is here.
- GraphX PageRank keeps 3 copies of graph in memory - posted by "Ulanov, Alexander" <al...@hpe.com> on 2015/10/01 00:55:47 UTC, 2 replies.
- Speculatively using spare capacity - posted by Muhammed Uluyol <ul...@umich.edu> on 2015/10/01 04:49:24 UTC, 1 replies.
- Spark 1.6 Release window is not updated in Spark-wiki - posted by Meethu Mathew <me...@flytxt.com> on 2015/10/01 08:20:41 UTC, 3 replies.
- Dataframe nested schema inference from Json without type conflicts - posted by Ewan Leith <ew...@realitymine.com> on 2015/10/01 16:33:11 UTC, 8 replies.
- Re: Task Execution - posted by Rishitesh Mishra <ri...@gmail.com> on 2015/10/01 19:40:15 UTC, 0 replies.
- Re: Tungsten off heap memory access for C++ libraries - posted by Paul Wais <pa...@gmail.com> on 2015/10/02 01:53:09 UTC, 0 replies.
- [ANNOUNCE] Announcing Spark 1.5.1 - posted by Reynold Xin <rx...@databricks.com> on 2015/10/02 04:42:31 UTC, 17 replies.
- [Build] repo1.maven.org: spark libs 1.5.0 for scala 2.10 poms are broken (404) - posted by andy petrella <an...@gmail.com> on 2015/10/02 19:49:34 UTC, 9 replies.
- SparkR dataframe UDF - posted by Renyi Xiong <re...@gmail.com> on 2015/10/02 19:57:39 UTC, 1 replies.
- Re: RowMatrix tallSkinnyQR - ERROR: Second call to constructor of static parser - posted by Joseph Bradley <jo...@databricks.com> on 2015/10/02 21:07:28 UTC, 0 replies.
- Python UDAFs - posted by Justin Uang <ju...@gmail.com> on 2015/10/02 21:20:10 UTC, 2 replies.
- Spark 1.5.1 - Scala 2.10 - Hadoop 1 package is missing from S3 - posted by Nicholas Chammas <ni...@gmail.com> on 2015/10/05 02:17:09 UTC, 10 replies.
- Difference between a task and a job - posted by Guna Prasaad <gu...@gmail.com> on 2015/10/05 12:52:21 UTC, 1 replies.
- StructType has more rows, than corresponding Row has objects. - posted by Eugene Morozov <ev...@gmail.com> on 2015/10/05 13:28:13 UTC, 2 replies.
- spark hive branch location - posted by weoccc <we...@gmail.com> on 2015/10/05 20:03:48 UTC, 2 replies.
- HiveContext in standalone mode: shuffle hang ups - posted by Sa...@wellsfargo.com on 2015/10/05 21:57:07 UTC, 0 replies.
- Re: failure notice - posted by Renyi Xiong <re...@gmail.com> on 2015/10/06 00:03:41 UTC, 3 replies.
- IllegalArgumentException: Size exceeds Integer.MAX_VALUE - posted by Jegan <je...@gmail.com> on 2015/10/06 00:31:52 UTC, 5 replies.
- Re: Dataframes: PrunedFilteredScan without Spark Side Filtering - posted by Russell Spitzer <ru...@gmail.com> on 2015/10/06 01:31:46 UTC, 2 replies.
- FW: Spark error while running in spark mode - posted by Ratika Prasad <rp...@couponsinc.com> on 2015/10/06 08:01:32 UTC, 0 replies.
- How can I access data on RDDs? - posted by jatinganhotra <ja...@gmail.com> on 2015/10/06 08:58:23 UTC, 0 replies.
- Pyspark dataframe read - posted by Blaž Šnuderl <sn...@gmail.com> on 2015/10/06 09:02:41 UTC, 4 replies.
- Re: CQs on WindowedStream created on running StreamingContext - posted by Yogesh Mahajan <ma...@gmail.com> on 2015/10/06 18:59:40 UTC, 0 replies.
- Adding Spark Testing functionality - posted by Holden Karau <ho...@pigscanfly.ca> on 2015/10/07 00:12:00 UTC, 3 replies.
- multiple count distinct in SQL/DataFrame? - posted by Reynold Xin <rx...@databricks.com> on 2015/10/07 02:51:17 UTC, 4 replies.
- What is the difference between ml.classification.LogisticRegression and mllib.classification.LogisticRegressionWithLBFGS - posted by YiZhi Liu <ja...@gmail.com> on 2015/10/07 08:47:57 UTC, 4 replies.
- Spark standalone hangup during shuffle flatMap or explode in cluster - posted by Sa...@wellsfargo.com on 2015/10/07 19:23:20 UTC, 0 replies.
- Understanding code/closure shipment to Spark workers‏ - posted by Arijit <ar...@live.com> on 2015/10/08 01:47:14 UTC, 3 replies.
- SparkSQL: First query execution is always slower than subsequent queries - posted by Lloyd Haris <ll...@gmail.com> on 2015/10/08 03:16:43 UTC, 1 replies.
- RowNumber in HiveContext returns null, negative numbers or huge - posted by Sa...@wellsfargo.com on 2015/10/08 15:22:56 UTC, 1 replies.
- Scala 2.11 builds broken/ Can the PR build run also 2.11? - posted by Iulian Dragoș <iu...@typesafe.com> on 2015/10/08 15:40:03 UTC, 14 replies.
- Build spark 1.5.1 branch fails - posted by Chester Chen <ch...@alpinenow.com> on 2015/10/08 19:35:17 UTC, 6 replies.
- Compiling Spark with a local hadoop profile - posted by sbiookag <sb...@asu.edu> on 2015/10/08 20:22:57 UTC, 3 replies.
- spark over drill - posted by Pranay Tonpay <pt...@gmail.com> on 2015/10/08 20:51:41 UTC, 1 replies.
- passing a AbstractFunction1 to sparkContext().runJob instead of a Closure - posted by Niranda Perera <ni...@gmail.com> on 2015/10/09 09:25:23 UTC, 0 replies.
- sbt test error -- "Could not reserve enough space" - posted by Robert Dodier <ro...@gmail.com> on 2015/10/09 18:41:42 UTC, 1 replies.
- Operations with cached RDD - posted by "Ulanov, Alexander" <al...@hpe.com> on 2015/10/10 01:35:37 UTC, 2 replies.
- Re: Too many executors are created - posted by Akhil Das <ak...@sigmoidanalytics.com> on 2015/10/11 10:38:37 UTC, 0 replies.
- No speedup in MultiLayerPerceptronClassifier with increase in number of cores - posted by Disha Shrivastava <di...@gmail.com> on 2015/10/11 14:27:44 UTC, 5 replies.
- yarn-cluster mode throwing NullPointerException - posted by Rachana Srivastava <Ra...@markmonitor.com> on 2015/10/12 05:49:52 UTC, 1 replies.
- taking the heap dump when an executor goes OOM - posted by Niranda Perera <ni...@gmail.com> on 2015/10/12 07:45:55 UTC, 1 replies.
- SparkSQL can not extract values from UDT (like VectorUDT) - posted by Hao Ren <in...@gmail.com> on 2015/10/12 16:54:35 UTC, 0 replies.
- Regarding SPARK JIRA ID-10286 - posted by "Jagadeesan A.S." <li...@gmail.com> on 2015/10/12 17:56:48 UTC, 1 replies.
- Flaky Jenkins tests? - posted by Meihua Wu <ro...@gmail.com> on 2015/10/12 22:24:34 UTC, 5 replies.
- Live UI - posted by Jakob Odersky <jo...@gmail.com> on 2015/10/12 23:36:16 UTC, 2 replies.
- a few major changes / improvements for Spark 1.6 - posted by Reynold Xin <rx...@databricks.com> on 2015/10/13 00:28:38 UTC, 0 replies.
- SPARK-10617 - posted by Alex Rovner <al...@magnetic.com> on 2015/10/13 04:21:09 UTC, 0 replies.
- How to split one RDD to small ones according to its key's value - posted by "张志强(旺轩)" <zz...@alibaba-inc.com> on 2015/10/13 10:16:30 UTC, 1 replies.
- Getting started - posted by _abhishek <ab...@iitg.ernet.in> on 2015/10/13 14:49:27 UTC, 1 replies.
- Spark Event Listener - posted by Jakob Odersky <jo...@gmail.com> on 2015/10/14 01:29:05 UTC, 3 replies.
- [Streaming] join events in last 10 minutes - posted by Daniel Li <da...@gmail.com> on 2015/10/14 02:14:24 UTC, 1 replies.
- When does python program started in pyspark - posted by canan chen <cc...@gmail.com> on 2015/10/14 04:50:47 UTC, 2 replies.
- Is "mllib" no longer Experimental? - posted by Sean Owen <so...@cloudera.com> on 2015/10/14 11:13:15 UTC, 1 replies.
- Contributing Receiver based Low Level Kafka Consumer from Spark-Packages to Apache Spark Project - posted by Dibyendu Bhattacharya <di...@gmail.com> on 2015/10/14 12:16:08 UTC, 0 replies.
- Re: [SQL] Memory leak with spark streaming and spark sql in spark 1.5.1 - posted by Reynold Xin <rx...@databricks.com> on 2015/10/14 20:39:53 UTC, 1 replies.
- If you use Spark 1.5 and disabled Tungsten mode ... - posted by Reynold Xin <rx...@databricks.com> on 2015/10/14 21:00:37 UTC, 14 replies.
- Status of SBT Build - posted by Jakob Odersky <jo...@gmail.com> on 2015/10/14 21:13:01 UTC, 2 replies.
- Strange spark problems among different versions - posted by zhaoxia <zh...@gmail.com> on 2015/10/14 22:05:40 UTC, 0 replies.
- Gradient Descent with large model size - posted by "Ulanov, Alexander" <al...@hpe.com> on 2015/10/14 22:18:42 UTC, 6 replies.
- SPARK_MASTER_IP actually expects a DNS name, not IP address - posted by Nicholas Chammas <ni...@gmail.com> on 2015/10/15 04:10:41 UTC, 10 replies.
- Should enforce the uniqueness of field name in DataFrame ? - posted by Jeff Zhang <zj...@gmail.com> on 2015/10/15 04:26:30 UTC, 4 replies.
- PMML export for LinearRegressionModel - posted by Fazlan Nazeem <fa...@wso2.com> on 2015/10/15 11:25:43 UTC, 5 replies.
- MLlib Contribution - posted by Kybe67 <be...@gmail.com> on 2015/10/15 16:58:54 UTC, 1 replies.
- Network-related environemental problem when running JDBCSuite - posted by Richard Hillegas <rh...@us.ibm.com> on 2015/10/15 18:47:22 UTC, 4 replies.
- Building Spark - posted by Annabel Melongo <me...@yahoo.com.INVALID> on 2015/10/16 00:45:28 UTC, 3 replies.
- Insight into Spark Packages - posted by jeff saremi <je...@hotmail.com> on 2015/10/16 17:43:47 UTC, 1 replies.
- Spark Implicit Functions - posted by Bill Bejeck <bb...@gmail.com> on 2015/10/16 23:06:05 UTC, 2 replies.
- flaky test "map stage submission with multiple shared stages and failures" - posted by Reynold Xin <rx...@databricks.com> on 2015/10/17 22:42:05 UTC, 0 replies.
- Streaming and storing to Google Cloud Storage or S3 - posted by vonnagy <iv...@vadio.com> on 2015/10/18 04:23:06 UTC, 1 replies.
- Checkpointing RDD calls the job twice? - posted by jatinganhotra <ja...@gmail.com> on 2015/10/18 05:40:43 UTC, 0 replies.
- test failed due to OOME - posted by Ted Yu <yu...@gmail.com> on 2015/10/18 16:54:06 UTC, 4 replies.
- streaming test failure - posted by Ted Yu <yu...@gmail.com> on 2015/10/18 18:56:34 UTC, 0 replies.
- ShuffledHashJoin Possible Issue - posted by gsvic <vi...@gmail.com> on 2015/10/18 21:55:04 UTC, 3 replies.
- Spark SQL: what does an exclamation mark mean in the plan? - posted by Xiao Li <ga...@gmail.com> on 2015/10/19 08:38:57 UTC, 2 replies.
- Guaranteed processing orders of each batch in Spark Streaming - posted by Renjie Liu <li...@gmail.com> on 2015/10/19 08:58:10 UTC, 1 replies.
- Spark driver reducing total executors count even when Dynamic Allocation is disabled. - posted by prakhar jauhari <pr...@gmail.com> on 2015/10/19 09:51:21 UTC, 3 replies.
- Re: Haskell language Spark support - posted by weymouth <we...@umich.edu> on 2015/10/19 10:34:41 UTC, 0 replies.
- Unable to run applications on spark in standalone cluster mode - posted by Rohith Parameshwara <rp...@couponsinc.com> on 2015/10/19 14:05:51 UTC, 0 replies.
- Re: Unable to run applications on spark in standalone cluster mode - posted by Jean-Baptiste Onofré <jb...@nanthrax.net> on 2015/10/19 14:16:45 UTC, 1 replies.
- failed mesos task loses executor - posted by Adrian Bridgett <ad...@opensignal.com> on 2015/10/19 14:41:56 UTC, 0 replies.
- Building Spark w/ 1.8 and binary incompatibilities - posted by Iulian Dragoș <iu...@typesafe.com> on 2015/10/19 15:19:12 UTC, 0 replies.
- BUILD SYSTEM: amp-jenkins-worker-05 offline - posted by shane knapp <sk...@berkeley.edu> on 2015/10/19 18:39:09 UTC, 7 replies.
- Problem using User Defined Predicate pushdown with core RDD and parquet - UDP class not found - posted by Vladimir Vladimirov <sm...@gmail.com> on 2015/10/20 01:38:07 UTC, 0 replies.
- Problem building Spark - posted by Annabel Melongo <me...@yahoo.com.INVALID> on 2015/10/20 03:59:00 UTC, 2 replies.
- MapStatus too large for drvier - posted by yaoqin <ya...@huawei.com> on 2015/10/20 08:59:17 UTC, 3 replies.
- Ability to offer initial coefficients in ml.LogisticRegression - posted by YiZhi Liu <ja...@gmail.com> on 2015/10/20 09:34:48 UTC, 3 replies.
- BUILD SYSTEM: builds are OOMing the jenkins workers, investigating. also need to reboot amp-jenkins-worker-06 - posted by shane knapp <sk...@berkeley.edu> on 2015/10/21 00:24:07 UTC, 5 replies.
- Set numExecutors by sparklaunch - posted by "qinggangwang7@gmail.com" <qi...@gmail.com> on 2015/10/21 04:10:44 UTC, 1 replies.
- Exception when using cosh - posted by Shagun Sodhani <ss...@gmail.com> on 2015/10/21 11:30:47 UTC, 3 replies.
- SPARK_DRIVER_MEMORY doc wrong - posted by tyronecai <ty...@163.com> on 2015/10/21 11:46:17 UTC, 1 replies.
- FW: Spark Streaming scheduler delay VS driver.cores - posted by Adrian Tanase <at...@adobe.com> on 2015/10/21 13:45:14 UTC, 0 replies.
- Bringing up JDBC Tests to trunk - posted by Luciano Resende <lu...@gmail.com> on 2015/10/21 22:16:33 UTC, 1 replies.
- Possible bug on Spark Yarn Client (1.5.1) during kerberos mode ? - posted by Chester Chen <ch...@alpinenow.com> on 2015/10/21 23:33:12 UTC, 9 replies.
- repartitionAndSortWithinPartitions task shuffle phase is very slow - posted by 周千昊 <qh...@apache.org> on 2015/10/22 11:02:10 UTC, 7 replies.
- Trouble creating JIRA issue - posted by Richard Marscher <rm...@localytics.com> on 2015/10/22 18:38:35 UTC, 1 replies.
- Spark.Executor.Cores question - posted by mkhaitman <ma...@chango.com> on 2015/10/23 21:05:55 UTC, 4 replies.
- slightly more informative error message in MLUtils.loadLibSVMFile - posted by Robert Dodier <ro...@gmail.com> on 2015/10/23 21:43:42 UTC, 0 replies.
- [VOTE] Release Apache Spark 1.5.2 (RC1) - posted by Reynold Xin <rx...@databricks.com> on 2015/10/25 08:07:07 UTC, 11 replies.
- Adding support for truncate operator - posted by Shagun Sodhani <ss...@gmail.com> on 2015/10/25 17:01:31 UTC, 2 replies.
- spark-sql / apache-drill / jboss-tiied - posted by Pranay Tonpay <pt...@gmail.com> on 2015/10/25 18:05:15 UTC, 1 replies.
- Duplicate (?) code paths to handle Executor failures - posted by Kay Ousterhout <ke...@eecs.berkeley.edu> on 2015/10/26 00:40:58 UTC, 0 replies.
- Loading Files from HDFS Incurs Network Communication - posted by Jinfeng Li <li...@gmail.com> on 2015/10/26 09:57:20 UTC, 1 replies.
- Spark Implementation of XGBoost - posted by Meihua Wu <ro...@gmail.com> on 2015/10/26 19:42:53 UTC, 7 replies.
- Exception when using some aggregate operators - posted by Shagun Sodhani <ss...@gmail.com> on 2015/10/27 11:19:44 UTC, 15 replies.
- Pickle Spark DataFrame - posted by agg212 <ag...@cs.brown.edu> on 2015/10/27 21:47:08 UTC, 2 replies.
- Filter applied on merged Parquet shemsa with new column fails. - posted by Hyukjin Kwon <gu...@gmail.com> on 2015/10/28 03:11:29 UTC, 1 replies.
- Task not serializable exception - posted by Rohith Parameshwara <rp...@couponsinc.com> on 2015/10/28 04:41:23 UTC, 0 replies.
- Re: using JavaRDD in spark-redis connector - posted by Rohith P <rp...@couponsinc.com> on 2015/10/28 05:07:25 UTC, 0 replies.
- sample or takeSample or ?? - posted by "张志强(旺轩)" <zz...@alibaba-inc.com> on 2015/10/29 06:15:21 UTC, 0 replies.
- Guidance to get started - posted by Aaska Shah <aa...@gmail.com> on 2015/10/29 11:23:50 UTC, 0 replies.
- Fwd: [jira] [Created] (HADOOP-12527) Upgrade Avro dependency to 1.7.7 - posted by Steve Loughran <st...@hortonworks.com> on 2015/10/29 11:42:20 UTC, 0 replies.
- want to contribute - posted by Aadi Thakar <th...@gmail.com> on 2015/10/29 11:43:23 UTC, 1 replies.
- Spark streaming - failed recovery from checkpoint - posted by Adrian Tanase <at...@adobe.com> on 2015/10/29 12:04:04 UTC, 1 replies.
- Maintaining overall cumulative data in Spark Streaming - posted by Sandeep Giri <sa...@knowbigdata.com> on 2015/10/29 23:08:42 UTC, 4 replies.
- Getting Started - posted by Saurabh Shah <sh...@gmail.com> on 2015/10/30 12:25:16 UTC, 0 replies.
- Off-heap storage and dynamic allocation - posted by Justin Uang <ju...@gmail.com> on 2015/10/30 17:13:44 UTC, 0 replies.
- Spark 1.6 Release Schedule - posted by Michael Armbrust <mi...@databricks.com> on 2015/10/31 12:25:50 UTC, 1 replies.
- SparkLauncher#setJavaHome does not set JAVA_HOME in child process - posted by gus <gu...@gmail.com> on 2015/10/31 13:20:31 UTC, 1 replies.