You are viewing a plain text version of this content. The canonical link for it is here.
- deterministic D-Stream mode - posted by forough <fo...@gmail.com> on 2014/11/01 09:53:11 UTC, 0 replies.
- Re: Surprising Spark SQL benchmark - posted by "Arthur.hk.chan@gmail.com" <ar...@gmail.com> on 2014/11/01 10:00:21 UTC, 13 replies.
- Changes to Spark's networking subsystem - posted by Patrick Wendell <pw...@gmail.com> on 2014/11/01 21:17:10 UTC, 0 replies.
- OOM when making bins in BinaryClassificationMetrics ? - posted by Sean Owen <so...@cloudera.com> on 2014/11/02 18:34:49 UTC, 2 replies.
- sbt scala compiler crashes on spark-sql - posted by Imran Rashid <im...@therashids.com> on 2014/11/03 04:25:46 UTC, 5 replies.
- branch-1.2 has been cut - posted by Patrick Wendell <pw...@gmail.com> on 2014/11/03 09:55:20 UTC, 1 replies.
- Re: matrix factorization cross validation - posted by Debasish Das <de...@gmail.com> on 2014/11/04 00:20:30 UTC, 0 replies.
- MatrixFactorizationModel predict(Int, Int) API - posted by Debasish Das <de...@gmail.com> on 2014/11/04 02:25:14 UTC, 6 replies.
- Spark shuffle consolidateFiles performance degradation numbers - posted by Matt Cheah <mc...@palantir.com> on 2014/11/04 02:26:03 UTC, 4 replies.
- Spark shuffle consolidateFiles performance degradation quantification - posted by Matt Cheah <mc...@palantir.com> on 2014/11/04 03:05:06 UTC, 0 replies.
- Re: [MLlib] Contributing Algorithm for Outlier Detection - posted by Ashutosh <as...@iiitb.org> on 2014/11/04 16:22:21 UTC, 15 replies.
- Hadoop configuration for checkpointing - posted by Cody Koeninger <co...@koeninger.org> on 2014/11/04 18:34:30 UTC, 3 replies.
- [ANN] Spark resources searchable - posted by Otis Gospodnetic <ot...@gmail.com> on 2014/11/04 21:12:52 UTC, 0 replies.
- Build fails on master (f90ad5d) - posted by Alessandro Baretta <al...@gmail.com> on 2014/11/04 23:08:23 UTC, 8 replies.
- Issues with AbstractParams - posted by Debasish Das <de...@gmail.com> on 2014/11/05 01:42:04 UTC, 3 replies.
- src/main/resources/kv1.txt not found in example of HiveFromSpark - posted by Qiuzhuang Lian <qi...@gmail.com> on 2014/11/05 04:13:34 UTC, 1 replies.
- Appropriate way to add a debug flag - posted by "Ganelin, Ilya" <Il...@capitalone.com> on 2014/11/05 17:02:35 UTC, 2 replies.
- Re: Breaking the previous large-scale sort record with Spark - posted by Reynold Xin <rx...@databricks.com> on 2014/11/06 00:11:42 UTC, 1 replies.
- [VOTE] Designating maintainers for some Spark components - posted by Matei Zaharia <ma...@gmail.com> on 2014/11/06 02:31:58 UTC, 68 replies.
- create_image.sh contains broken hadoop web link - posted by Nicholas Chammas <ni...@gmail.com> on 2014/11/06 04:36:46 UTC, 4 replies.
- [Classloading] Strange class loading issue - posted by Matt Cheah <mc...@palantir.com> on 2014/11/06 04:52:55 UTC, 0 replies.
- 回复: [VOTE] Designating maintainers for some Spark components - posted by witgo <wi...@qq.com> on 2014/11/06 08:30:44 UTC, 0 replies.
- About implicit rddToPairRDDFunctions - posted by Shixiong Zhu <zs...@gmail.com> on 2014/11/06 12:12:37 UTC, 4 replies.
- JIRA + PR backlog - posted by Sean Owen <so...@cloudera.com> on 2014/11/06 13:13:01 UTC, 6 replies.
- Implementing TinkerPop on top of GraphX - posted by "York, Brennon" <Br...@capitalone.com> on 2014/11/06 20:34:33 UTC, 16 replies.
- Using partitioning to speed up queries in Shark - posted by Gordon Benjamin <go...@gmail.com> on 2014/11/06 23:01:49 UTC, 1 replies.
- Wrong temp directory when compressing before sending text file to S3 - posted by Gary Malouf <ma...@gmail.com> on 2014/11/06 23:10:21 UTC, 1 replies.
- Python3 and spark 1.1.0 - posted by catchmonster <sk...@gmail.com> on 2014/11/07 00:01:04 UTC, 1 replies.
- proposal / discuss: multiple Serializers within a SparkContext? - posted by Sandy Ryza <sa...@cloudera.com> on 2014/11/07 10:05:34 UTC, 3 replies.
- How spark/*/Storage/BlockManagerMaster.askDriverWithReply() responds to various query messages - posted by rapelly kartheek <ka...@gmail.com> on 2014/11/07 11:11:18 UTC, 1 replies.
- Bind exception while running FlumeEventCount - posted by Jeniba Johnson <Je...@lntinfotech.com> on 2014/11/07 13:04:26 UTC, 9 replies.
- Replacing Spark's native scheduler with Sparrow - posted by Nicholas Chammas <ni...@gmail.com> on 2014/11/08 00:05:27 UTC, 13 replies.
- Should new YARN shuffle service work with "yarn-alpha"? - posted by Sean Owen <so...@cloudera.com> on 2014/11/08 08:43:03 UTC, 5 replies.
- Re: EC2 clusters ready in launch time + 30 seconds - posted by Nicholas Chammas <ni...@gmail.com> on 2014/11/08 09:38:16 UTC, 0 replies.
- MLlib related query - posted by Manu Kaul <ma...@gmail.com> on 2014/11/08 10:37:01 UTC, 1 replies.
- [RESULT] [VOTE] Designating maintainers for some Spark components - posted by Matei Zaharia <ma...@gmail.com> on 2014/11/09 04:28:20 UTC, 0 replies.
- getting exception when trying to build spark from master - posted by Sadhan Sood <sa...@gmail.com> on 2014/11/10 22:42:29 UTC, 3 replies.
- Http client dependency conflict when using AWS - posted by Cody Koeninger <co...@koeninger.org> on 2014/11/10 23:05:30 UTC, 0 replies.
- Spark 1.1.1 release - posted by Andrew Or <an...@databricks.com> on 2014/11/10 23:17:10 UTC, 2 replies.
- thrift jdbc server probably running queries as hive query - posted by Sadhan Sood <sa...@gmail.com> on 2014/11/11 01:29:27 UTC, 4 replies.
- Checkpoint bugs in GraphX - posted by Xu Lijie <li...@gmail.com> on 2014/11/11 03:19:03 UTC, 3 replies.
- Discuss how to do checkpoint more efficently - posted by Xu Lijie <li...@gmail.com> on 2014/11/11 04:32:12 UTC, 0 replies.
- Terasort example - posted by Ewan Higgs <ew...@ugent.be> on 2014/11/11 14:03:40 UTC, 3 replies.
- Partition caching taking too long - posted by Sadhan Sood <sa...@gmail.com> on 2014/11/12 00:38:10 UTC, 0 replies.
- [NOTICE] [BUILD] Minor changes to Spark's build - posted by Patrick Wendell <pw...@gmail.com> on 2014/11/12 06:47:09 UTC, 14 replies.
- Spark-Submit issues - posted by Jeniba Johnson <Je...@lntinfotech.com> on 2014/11/12 08:09:05 UTC, 3 replies.
- Too many failed collects when trying to cache a table in SparkSQL - posted by Sadhan Sood <sa...@gmail.com> on 2014/11/12 18:31:12 UTC, 2 replies.
- Cache sparkSql data without uncompressing it in memory - posted by Sadhan Sood <sa...@gmail.com> on 2014/11/13 00:16:32 UTC, 3 replies.
- [VOTE] Release Apache Spark 1.1.1 (RC1) - posted by Andrew Or <an...@databricks.com> on 2014/11/13 05:34:47 UTC, 16 replies.
- Problems with spark.locality.wait - posted by MaChong <ma...@sina.com> on 2014/11/13 08:55:37 UTC, 6 replies.
- Join operator in PySpark - posted by 夏俊鸾 <xi...@gmail.com> on 2014/11/13 14:07:02 UTC, 1 replies.
- TimSort in 1.2 - posted by Debasish Das <de...@gmail.com> on 2014/11/14 01:19:30 UTC, 1 replies.
- RE: Spark- How can I run MapReduce only on one partition in an RDD? - posted by "Ganelin, Ilya" <Il...@capitalone.com> on 2014/11/14 01:33:53 UTC, 0 replies.
- 回复:Re: Problems with spark.locality.wait - posted by MaChong <ma...@sina.com> on 2014/11/14 06:26:38 UTC, 0 replies.
- 1gb file processing...task doesn't launch on all the node...Unseen exception - posted by Priya Ch <le...@gmail.com> on 2014/11/14 12:47:26 UTC, 1 replies.
- Skipping Bad Records in Spark - posted by Qiuzhuang Lian <qi...@gmail.com> on 2014/11/14 16:28:35 UTC, 1 replies.
- Spark & Hadoop 2.5.1 - posted by Corey Nolet <cj...@gmail.com> on 2014/11/14 16:43:49 UTC, 4 replies.
- Has anyone else observed this build break? - posted by Patrick Wendell <pw...@gmail.com> on 2014/11/14 21:17:54 UTC, 7 replies.
- mvn or sbt for studying and developing Spark? - posted by "Yiming (John) Zhang" <sd...@gmail.com> on 2014/11/16 02:41:41 UTC, 13 replies.
- Regarding RecordReader of spark - posted by Vibhanshu Prasad <vi...@gmail.com> on 2014/11/16 12:22:35 UTC, 2 replies.
- send currentJars and currentFiles to exetutor with actor? - posted by scwf <wa...@huawei.com> on 2014/11/16 13:24:57 UTC, 1 replies.
- If first batch fails, does Streaming JobGenerator.stop() hang? - posted by Sean Owen <so...@cloudera.com> on 2014/11/16 15:12:27 UTC, 0 replies.
- Is there a way for scala compiler to catch unserializable app code? - posted by jay vyas <ja...@gmail.com> on 2014/11/17 01:12:21 UTC, 2 replies.
- [ANNOUNCE] Spark 1.2.0 Release Preview Posted - posted by Patrick Wendell <pw...@gmail.com> on 2014/11/17 10:42:47 UTC, 6 replies.
- Quantile regression in tree models - posted by Alessandro Baretta <al...@gmail.com> on 2014/11/17 20:11:01 UTC, 5 replies.
- [VOTE][RESULT] Release Apache Spark 1.1.1 (RC1) - posted by Andrew Or <an...@databricks.com> on 2014/11/17 23:41:13 UTC, 0 replies.
- matrix computation in spark - posted by liaoyuxi <li...@huawei.com> on 2014/11/18 04:24:20 UTC, 3 replies.
- Using sampleByKey - posted by Debasish Das <de...@gmail.com> on 2014/11/18 04:32:13 UTC, 5 replies.
- 答复: matrix computation in spark - posted by liaoyuxi <li...@huawei.com> on 2014/11/18 07:50:23 UTC, 0 replies.
- Intro to using IntelliJ to debug SPARK-1.1 Apps with mvn/sbt (for beginners) - posted by "Yiming (John) Zhang" <sd...@gmail.com> on 2014/11/19 05:00:18 UTC, 4 replies.
- Apache infra github sync down - posted by Patrick Wendell <pw...@gmail.com> on 2014/11/19 07:24:55 UTC, 2 replies.
- Help needed to publish SizeEstimator as separate library - posted by madhu phatak <ph...@gmail.com> on 2014/11/19 11:57:32 UTC, 0 replies.
- Build break - posted by Patrick Wendell <pw...@gmail.com> on 2014/11/19 23:09:29 UTC, 0 replies.
- [VOTE] Release Apache Spark 1.1.1 (RC2) - posted by Andrew Or <an...@databricks.com> on 2014/11/19 23:51:39 UTC, 23 replies.
- Too many open files error - posted by Qiuzhuang Lian <qi...@gmail.com> on 2014/11/20 05:15:24 UTC, 2 replies.
- [important] jenkins down - posted by shane knapp <sk...@berkeley.edu> on 2014/11/20 19:21:01 UTC, 1 replies.
- Spark Streaming Metrics - posted by Gerard Maas <ge...@gmail.com> on 2014/11/20 21:25:58 UTC, 2 replies.
- Re: Eliminate copy while sending data : any Akka experts here ? - posted by Shixiong Zhu <zs...@gmail.com> on 2014/11/21 04:14:09 UTC, 2 replies.
- sbt publish-local fails, missing spark-network-common - posted by pedrorodriguez <sk...@gmail.com> on 2014/11/21 04:53:47 UTC, 4 replies.
- Spark development with IntelliJ - posted by Patrick Wendell <pw...@gmail.com> on 2014/11/21 08:42:42 UTC, 0 replies.
- Why Executor Deserialize Time takes more than 300ms? - posted by Xuelin Cao <xu...@yahoo.com> on 2014/11/21 15:12:13 UTC, 3 replies.
- Automated github closing of issues is not working - posted by Patrick Wendell <pw...@gmail.com> on 2014/11/21 21:24:29 UTC, 1 replies.
- Troubleshooting JVM OOM during Spark Unit Tests - posted by Nicholas Chammas <ni...@gmail.com> on 2014/11/21 22:50:20 UTC, 2 replies.
- How spark and hive integrate in long term? - posted by Zhan Zhang <zh...@gmail.com> on 2014/11/21 23:51:23 UTC, 7 replies.
- java.lang.OutOfMemoryError at simple local test - posted by rzykov <rz...@gmail.com> on 2014/11/22 15:23:41 UTC, 2 replies.
- Notes on writing complex spark applications - posted by "Evan R. Sparks" <ev...@gmail.com> on 2014/11/23 17:55:45 UTC, 5 replies.
- 2 spark streaming questions - posted by tian zhang <tz...@yahoo.com.INVALID> on 2014/11/24 06:31:28 UTC, 0 replies.
- Time taken to merge Spark PR's? - posted by "York, Brennon" <Br...@capitalone.com> on 2014/11/24 15:31:22 UTC, 1 replies.
- [VOTE][RESULT] Release Apache Spark 1.1.1 (RC2) - posted by Andrew Or <an...@databricks.com> on 2014/11/24 20:22:59 UTC, 0 replies.
- [SparkSQL] Why this AttributeReference.exprId is not setted? - posted by EarthsonLu <Ea...@gmail.com> on 2014/11/25 03:08:11 UTC, 0 replies.
- Re: [SparkSQL][Solved] Why this AttributeReference.exprId is not setted? - posted by EarthsonLu <Ea...@gmail.com> on 2014/11/25 03:31:04 UTC, 0 replies.
- java.io.IOException: sendMessageReliably failed without being ACK'd - posted by xukun <xu...@huawei.com> on 2014/11/25 14:43:11 UTC, 0 replies.
- java.util.concurrent.TimeoutException: Futures timed out after [10000 milliseconds] - posted by xukun <xu...@huawei.com> on 2014/11/25 15:00:41 UTC, 0 replies.
- How to resolve Spark site issues? - posted by "York, Brennon" <Br...@capitalone.com> on 2014/11/25 20:12:23 UTC, 4 replies.
- Re: How to do broadcast join in SparkSQL - posted by Jianshi Huang <ji...@gmail.com> on 2014/11/26 07:13:55 UTC, 1 replies.
- [mllib] useFeatureScaling likes hardcode in LogisticRegressionWithLBFGS and is not comprehensive for users. - posted by Yanbo Liang <ya...@gmail.com> on 2014/11/26 10:39:06 UTC, 3 replies.
- Fwd: How the sequence of blockManagerId's are constructed in spark/*/storage/blockManagerMasterActor.getPeers()? - posted by rapelly kartheek <ka...@gmail.com> on 2014/11/27 07:24:08 UTC, 0 replies.
- Standalone scheduling - document inconsistent - posted by Praveen Sripati <pr...@gmail.com> on 2014/11/27 12:47:24 UTC, 1 replies.
- [mllib] Which is the correct package to add a new algorithm? - posted by Yu Ishikawa <yu...@gmail.com> on 2014/11/27 15:41:58 UTC, 1 replies.
- Creating a SchemaRDD from an existing API - posted by Niranda Perera <ni...@wso2.com> on 2014/11/28 07:31:13 UTC, 1 replies.
- [VOTE] Release Apache Spark 1.2.0 (RC1) - posted by Patrick Wendell <pw...@gmail.com> on 2014/11/29 06:16:56 UTC, 9 replies.
- Trouble testing after updating to latest master - posted by "Ganelin, Ilya" <Il...@capitalone.com> on 2014/11/30 04:29:42 UTC, 3 replies.
- Spurious test failures, testing best practices - posted by Ryan Williams <ry...@gmail.com> on 2014/11/30 23:39:28 UTC, 0 replies.