You are viewing a plain text version of this content. The canonical link for it is here.
- Re: Export BLAS module on Spark MLlib - posted by DB Tsai <db...@dbtsai.com> on 2015/12/01 01:02:41 UTC, 0 replies.
- Re: Problem in running MLlib SVM - posted by Joseph Bradley <jo...@databricks.com> on 2015/12/01 01:33:37 UTC, 3 replies.
- Re: Grid search with Random Forest - posted by Joseph Bradley <jo...@databricks.com> on 2015/12/01 01:34:02 UTC, 6 replies.
- FOSDEM 2016 - take action by 4th of December 2015 - posted by Roman Shaposhnik <rv...@apache.org> on 2015/12/01 07:30:21 UTC, 0 replies.
- Re: How to add 1.5.2 support to ec2/spark_ec2.py ? - posted by Alexander Pivovarov <ap...@gmail.com> on 2015/12/01 08:38:37 UTC, 4 replies.
- Re: Bringing up JDBC Tests to trunk - posted by Jacek Laskowski <ja...@japila.pl> on 2015/12/01 10:50:05 UTC, 2 replies.
- query on SVD++ - posted by "张志强(旺轩)" <zz...@alibaba-inc.com> on 2015/12/02 04:21:42 UTC, 3 replies.
- (Unknown) - posted by Alexander Pivovarov <ap...@gmail.com> on 2015/12/02 05:14:52 UTC, 0 replies.
- Python API for Association Rules - posted by caiquermarques95 <ca...@gmail.com> on 2015/12/02 13:51:05 UTC, 2 replies.
- IntelliJ license for committers? - posted by Sean Owen <so...@cloudera.com> on 2015/12/02 16:47:09 UTC, 7 replies.
- [VOTE] Release Apache Spark 1.6.0 (RC1) - posted by Michael Armbrust <mi...@databricks.com> on 2015/12/02 21:26:53 UTC, 16 replies.
- When to cut RCs - posted by Sean Owen <so...@cloudera.com> on 2015/12/02 21:28:30 UTC, 6 replies.
- [build system] jenkins downtime, thursday 12/10/15 7am PDT - posted by shane knapp <sk...@berkeley.edu> on 2015/12/03 04:20:37 UTC, 8 replies.
- Re: Multiplication on decimals in a dataframe query - posted by Akhil Das <ak...@sigmoidanalytics.com> on 2015/12/03 07:32:25 UTC, 0 replies.
- Re: A proposal for Spark 2.0 - posted by Sean Owen <so...@cloudera.com> on 2015/12/03 09:47:27 UTC, 17 replies.
- Spark Streaming Kafka - DirectKafkaInputDStream: Using the new Kafka Consumer API - posted by Mario Ds Briggs <ma...@in.ibm.com> on 2015/12/03 18:30:38 UTC, 5 replies.
- SparkStreaming is failing to process Kafka jobs under load.... - posted by Rachana Srivastava <Ra...@markmonitor.com> on 2015/12/03 22:42:28 UTC, 1 replies.
- Quick question regarding Maven and Spark Assembly jar - posted by Matt Cheah <mc...@palantir.com> on 2015/12/04 02:27:40 UTC, 2 replies.
- Spark doesn't unset HADOOP_CONF_DIR when testing ? - posted by Jeff Zhang <zj...@gmail.com> on 2015/12/04 03:40:01 UTC, 2 replies.
- [ML] Missing documentation for the IndexToString feature transformer - posted by Benjamin Fradet <be...@gmail.com> on 2015/12/05 14:02:34 UTC, 2 replies.
- Returning numpy types from udfs - posted by Justin Uang <ju...@gmail.com> on 2015/12/05 16:03:39 UTC, 2 replies.
- How to debug Spark source using IntelliJ/ Eclipse - posted by jatinganhotra <ja...@gmail.com> on 2015/12/06 03:57:21 UTC, 1 replies.
- Shared memory between C++ process and Spark - posted by Jia <ja...@gmail.com> on 2015/12/06 21:43:24 UTC, 12 replies.
- mlib compilation errors - posted by "wei.zhu@kaiyuandao.com" <we...@kaiyuandao.com> on 2015/12/07 09:43:04 UTC, 0 replies.
- java.lang.OutOfMemoryError: Java heap space - posted by "Jagadeesan A.S." <li...@gmail.com> on 2015/12/07 14:55:40 UTC, 0 replies.
- Re: Fastest way to build Spark from scratch - posted by Jakob Odersky <jo...@gmail.com> on 2015/12/07 20:07:25 UTC, 6 replies.
- Data and Model Parallelism in MLPC - posted by Disha Shrivastava <di...@gmail.com> on 2015/12/08 09:42:53 UTC, 5 replies.
- 回复: mlib compilation errors - posted by "wei.zhu@kaiyuandao.com" <we...@kaiyuandao.com> on 2015/12/08 09:44:25 UTC, 0 replies.
- Failed to generate predicate Error when using dropna - posted by Chang Ya-Hsuan <su...@gmail.com> on 2015/12/08 10:25:53 UTC, 2 replies.
- Filte the null before InnerJoin to solve the problem of data skew - posted by vector <79...@qq.com> on 2015/12/08 14:58:51 UTC, 0 replies.
- I filed SPARK-12233 - posted by Fengdong Yu <fe...@everstring.com> on 2015/12/09 06:23:25 UTC, 0 replies.
- Differences between Spark APIs for Hadoop 1.x and Hadoop 2.x in terms of performance, progress reporting and IO metrics. - posted by Hyukjin Kwon <gu...@gmail.com> on 2015/12/09 10:01:12 UTC, 2 replies.
- SQL language vs DataFrame API - posted by Cristian O <cr...@googlemail.com> on 2015/12/09 17:34:06 UTC, 6 replies.
- Specifying Scala types when calling methods from SparkR - posted by Chris Freeman <cf...@alteryx.com> on 2015/12/09 19:11:22 UTC, 4 replies.
- DStream not initialized SparkException - posted by Renyi Xiong <re...@gmail.com> on 2015/12/09 21:45:45 UTC, 2 replies.
- Re: let spark streaming sample come to stop - posted by Renyi Xiong <re...@gmail.com> on 2015/12/09 22:09:36 UTC, 0 replies.
- Cause of akka.pattern.AskTimeoutException - posted by Rachana Srivastava <Ra...@markmonitor.com> on 2015/12/09 22:45:00 UTC, 0 replies.
- A bug in Spark standalone? Worker registration and deregistration - posted by Jacek Laskowski <ja...@japila.pl> on 2015/12/10 09:22:18 UTC, 4 replies.
- Re: releasing Spark 1.4.2 - posted by Inosh Goonewardena <in...@gmail.com> on 2015/12/10 12:10:06 UTC, 0 replies.
- Does RDD[Type1, Iterable[Type2]] split into multiple partitions? - posted by JaeSung Jun <ja...@gmail.com> on 2015/12/10 14:19:53 UTC, 1 replies.
- Spark Streaming Kinesis - DynamoDB Streams compatability - posted by Nick Pentreath <ni...@gmail.com> on 2015/12/10 16:04:27 UTC, 0 replies.
- coalesce at DataFrame missing argument for shuffle. - posted by Hyukjin Kwon <gu...@gmail.com> on 2015/12/11 07:56:20 UTC, 1 replies.
- JIRA: Wrong dates from imported JIRAs - posted by Lars Francke <la...@gmail.com> on 2015/12/11 08:40:34 UTC, 5 replies.
- A very Minor typo in the Spark paper - posted by Fengdong Yu <fe...@everstring.com> on 2015/12/11 09:17:12 UTC, 0 replies.
- Re: Spark streaming with Kinesis broken? - posted by Nick Pentreath <ni...@gmail.com> on 2015/12/11 11:07:30 UTC, 1 replies.
- Maven build against Hadoop 2.4 times out - posted by Ted Yu <yu...@gmail.com> on 2015/12/11 19:27:51 UTC, 5 replies.
- Multi-core support per task in Spark - posted by Zhan Zhang <zz...@hortonworks.com> on 2015/12/11 19:46:05 UTC, 1 replies.
- [VOTE] Release Apache Spark 1.6.0 (RC2) - posted by Michael Armbrust <mi...@databricks.com> on 2015/12/12 18:39:21 UTC, 23 replies.
- Doc readiness vs releasability - posted by Sean Owen <so...@cloudera.com> on 2015/12/13 08:50:49 UTC, 0 replies.
- [SparkR] Any reason why saveDF's mode is append by default ? - posted by Jeff Zhang <zj...@gmail.com> on 2015/12/14 08:58:09 UTC, 2 replies.
- Dev Environment (again) - posted by Al Pivonka <al...@gmail.com> on 2015/12/14 16:22:05 UTC, 0 replies.
- BIRCH clustering algorithm - posted by Dženan Softić <dz...@gmail.com> on 2015/12/14 16:56:58 UTC, 2 replies.
- [build system] brief downtime right now - posted by shane knapp <sk...@berkeley.edu> on 2015/12/14 19:31:33 UTC, 6 replies.
- SparkML algos limitations question. - posted by Eugene Morozov <ev...@gmail.com> on 2015/12/14 19:52:22 UTC, 2 replies.
- Secondary Indexing of RDDs? - posted by Michael Segel <ms...@hotmail.com> on 2015/12/14 19:58:46 UTC, 2 replies.
- Re: Problem using User Defined Predicate pushdown with core RDD and parquet - UDP class not found - posted by chao chu <ch...@gmail.com> on 2015/12/15 05:24:48 UTC, 0 replies.
- status of 2.11 support? - posted by Sachin Aggarwal <di...@gmail.com> on 2015/12/15 07:29:55 UTC, 1 replies.
- spark with label nodes in yarn - posted by "张志强(旺轩)" <zz...@alibaba-inc.com> on 2015/12/15 10:23:27 UTC, 9 replies.
- java.lang.NoSuchMethodError while saving a random forest model Spark version 1.5 - posted by Rachana Srivastava <Ra...@markmonitor.com> on 2015/12/16 00:23:03 UTC, 2 replies.
- ​Spark 1.6 - H​ive remote metastore not working - posted by syepes <sy...@gmail.com> on 2015/12/16 00:31:55 UTC, 4 replies.
- security testing on spark ? - posted by Judy Nash <ju...@exchange.microsoft.com> on 2015/12/16 02:16:21 UTC, 0 replies.
- does spark really support label expr like && or || ? - posted by Allen Zhang <al...@126.com> on 2015/12/16 09:32:04 UTC, 8 replies.
- A bug in Spark ML? NoSuchElementException while using RandomForest for regression. - posted by Eugene Morozov <ev...@gmail.com> on 2015/12/16 13:16:26 UTC, 0 replies.
- Update to Spar Mesos docs possibly? LIBPROCESS_IP needs to be set for client mode - posted by Aaron <aa...@gmail.com> on 2015/12/16 14:00:55 UTC, 9 replies.
- RandomForestModel Save is throwing NoSuchMethodError with Spark Version 1.5x - posted by Rachana Srivastava <Ra...@markmonitor.com> on 2015/12/16 22:01:28 UTC, 0 replies.
- [VOTE] Release Apache Spark 1.6.0 (RC3) - posted by Michael Armbrust <mi...@databricks.com> on 2015/12/16 22:32:14 UTC, 38 replies.
- Spark basicOperators - posted by sara mustafa <en...@gmail.com> on 2015/12/17 02:57:52 UTC, 1 replies.
- How do we convert a Dataset includes timestamp columns to RDD? - posted by Yu Ishikawa <yu...@gmail.com> on 2015/12/17 08:35:28 UTC, 2 replies.
- implict ClassTag in KafkaUtils - posted by Hao Ren <in...@gmail.com> on 2015/12/17 14:49:24 UTC, 2 replies.
- Re: security testing on spark ? - posted by Akhil Das <ak...@sigmoidanalytics.com> on 2015/12/18 10:23:44 UTC, 0 replies.
- Is there any way to select columns of Dataset in addition to the combination of `expr` and `as`? - posted by Yu Ishikawa <yu...@gmail.com> on 2015/12/19 01:48:06 UTC, 0 replies.
- 回复: [VOTE] Release Apache Spark 1.6.0 (RC3) - posted by Ricky <49...@qq.com> on 2015/12/20 13:32:39 UTC, 0 replies.
- Spark fails after 6000s because of akka - posted by Alexander Pivovarov <ap...@gmail.com> on 2015/12/20 19:42:57 UTC, 5 replies.
- [Spark SQL] SQLContext getOrCreate incorrect behaviour - posted by Jerry Lam <ch...@gmail.com> on 2015/12/20 23:59:48 UTC, 9 replies.
- Expression/LogicalPlan dichotomy in Spark SQL Catalyst - posted by Roland Reumerman <Ro...@mendix.com> on 2015/12/21 11:26:01 UTC, 2 replies.
- pyspark streaming 1.6 mapWithState? - posted by Renyi Xiong <re...@gmail.com> on 2015/12/21 18:52:50 UTC, 0 replies.
- Re: Tungsten gives unexpected results when selecting null elements in array - posted by PierreB <pi...@realimpactanalytics.com> on 2015/12/21 21:45:43 UTC, 3 replies.
- [VOTE] Release Apache Spark 1.6.0 (RC4) - posted by Michael Armbrust <mi...@databricks.com> on 2015/12/22 21:10:46 UTC, 25 replies.
- value of sc.defaultParallelism - posted by Chang Ya-Hsuan <su...@gmail.com> on 2015/12/23 17:09:38 UTC, 0 replies.
- Kafka consumer: Upgrading to use the the new Java Consumer - posted by eugene miretsky <eu...@gmail.com> on 2015/12/23 22:27:03 UTC, 1 replies.
- confused behavior about pyspark.sql, Row, schema, and createDataFrame - posted by Chang Ya-Hsuan <su...@gmail.com> on 2015/12/24 05:19:27 UTC, 0 replies.
- Re: Downloading Hadoop from s3://spark-related-packages/ - posted by Nicholas Chammas <ni...@gmail.com> on 2015/12/24 06:59:35 UTC, 2 replies.
- [DAGScheduler] resubmitFailedStages, failedStages.clear() and submitStage - posted by Jacek Laskowski <ja...@japila.pl> on 2015/12/24 14:19:13 UTC, 1 replies.
- Shuffle Write Size - posted by gsvic <vi...@gmail.com> on 2015/12/24 17:53:49 UTC, 1 replies.
- latest Spark build error - posted by salexln <sa...@gmail.com> on 2015/12/25 07:51:41 UTC, 4 replies.
- How can I get the column data based on specific column name and then stored these data in array or list ? - posted by zml张明磊 <mi...@Ctrip.com> on 2015/12/25 08:35:11 UTC, 1 replies.
- 答复: How can I get the column data based on specific column name and then stored these data in array or list ? - posted by zml张明磊 <mi...@Ctrip.com> on 2015/12/25 08:44:04 UTC, 2 replies.
- 答复: 答复: How can I get the column data based on specific column name and then stored these data in array or list ? - posted by zml张明磊 <mi...@Ctrip.com> on 2015/12/25 09:07:40 UTC, 1 replies.
- recurring test failures against hadoop-2.4 profile - posted by Ted Yu <yu...@gmail.com> on 2015/12/25 22:48:26 UTC, 0 replies.
- Akka with Spark - posted by Disha Shrivastava <di...@gmail.com> on 2015/12/26 19:08:33 UTC, 7 replies.
- ERROR server.TThreadPoolServer: Error occurred during processing of message - posted by Dasun Hegoda <da...@gmail.com> on 2015/12/27 06:10:10 UTC, 0 replies.
- what is the best way to debug spark / mllib? - posted by salexln <sa...@gmail.com> on 2015/12/27 09:20:43 UTC, 4 replies.
- running lda in spark throws exception - posted by Li Li <fa...@gmail.com> on 2015/12/28 04:26:43 UTC, 4 replies.
- Catalyst Class Cast Exception - posted by sara mustafa <en...@gmail.com> on 2015/12/28 21:04:16 UTC, 4 replies.
- RDD[Vector] Immutability issue - posted by salexln <sa...@gmail.com> on 2015/12/28 21:36:33 UTC, 7 replies.
- Spark streaming 1.6.0-RC4 NullPointerException using mapWithState - posted by Jan Uyttenhove <ja...@insidin.com> on 2015/12/29 12:42:57 UTC, 4 replies.
- Partitioning of RDD across worker machines - posted by Disha Shrivastava <di...@gmail.com> on 2015/12/29 12:57:58 UTC, 1 replies.
- Is there any way to stop a jenkins build - posted by Herman van Hövell tot Westerflier <hv...@questtec.nl> on 2015/12/29 18:56:06 UTC, 5 replies.
- IndentationCheck of checkstyle - posted by Ted Yu <yu...@gmail.com> on 2015/12/30 06:36:05 UTC, 5 replies.
- problem with reading source code-pull out nondeterministic expresssions - posted by 汪洋 <ti...@icloud.com> on 2015/12/30 07:57:28 UTC, 2 replies.
- New processes / tools for changing dependencies in Spark - posted by Josh Rosen <jo...@databricks.com> on 2015/12/30 21:52:29 UTC, 0 replies.
- Automated close of PR's ? - posted by Mridul Muralidharan <mr...@gmail.com> on 2015/12/31 04:00:54 UTC, 6 replies.