- Reply: Where to put "local" data files? - posted by guxiaobo1982 <gu...@qq.com> on 2014/01/01 08:40:32 UTC, 1 replies.
- Re: How to map each line to (line number, line)? - posted by "K. Shankari" <sh...@eecs.berkeley.edu> on 2014/01/01 10:35:52 UTC, 2 replies.
- Upper limit of broadcast variables size - posted by Aureliano Buendia <bu...@gmail.com> on 2014/01/01 12:46:11 UTC, 0 replies.
- Not able to understand Exception. - posted by Archit Thakur <ar...@gmail.com> on 2014/01/01 17:22:05 UTC, 2 replies.
- Re: Reply: Reply: Any best practice for hardware configuration for the master server in standalone cluster mode? - posted by Sriram Ramachandrasekaran <sr...@gmail.com> on 2014/01/01 18:00:47 UTC, 0 replies.
- Re: Running Spark jar on EC2 - posted by Jeff Higgens <je...@gmail.com> on 2014/01/02 00:36:23 UTC, 2 replies.
- What's the advantage of running Spark inside the YARN cluster? - posted by guxiaobo1982 <gu...@qq.com> on 2014/01/02 03:54:58 UTC, 0 replies.
- Re: Reply: Reply: Any best practice for hardware configuration for the master server in standalone cluster mode? - posted by guxiaobo1982 <gu...@qq.com> on 2014/01/02 05:50:30 UTC, 0 replies.
- Where can I find more information about the R interface for Spark? - posted by guxiaobo1982 <gu...@qq.com> on 2014/01/02 05:55:16 UTC, 3 replies.
- Reply: Reply: Reply: Any best practice for hardware configuration for the master server in standalone cluster mode? - posted by jasonliu <ja...@gmail.com> on 2014/01/02 06:55:00 UTC, 1 replies.
- Spark context jar confusions - posted by Aureliano Buendia <bu...@gmail.com> on 2014/01/02 11:40:46 UTC, 12 replies.
- Initial job has not accepted any resources; check your cluster UI to ensure that workers are registered and have sufficient memory - posted by Archit Thakur <ar...@gmail.com> on 2014/01/02 12:31:49 UTC, 3 replies.
- the spark worker assignment Question? - posted by lihu <li...@gmail.com> on 2014/01/02 13:53:57 UTC, 12 replies.
- rdd.saveAsTextFile problem - posted by Philip Ogren <ph...@oracle.com> on 2014/01/02 18:22:36 UTC, 6 replies.
- Executor metrics in spark application - posted by Issac Buenrostro <bu...@ooyala.com> on 2014/01/02 20:18:10 UTC, 0 replies.
- Standalone spark cluster dead nodes - posted by Debasish Das <de...@gmail.com> on 2014/01/02 22:38:15 UTC, 1 replies.
- Spark Matrix Factorization - posted by Debasish Das <de...@gmail.com> on 2014/01/03 00:16:33 UTC, 10 replies.
- How to deal with multidimensional keys? - posted by Aureliano Buendia <bu...@gmail.com> on 2014/01/03 00:23:57 UTC, 2 replies.
- cache eviction - posted by Erik Selin <ty...@gmail.com> on 2014/01/03 05:19:07 UTC, 1 replies.
- www.spark-project.org down? - posted by Nan Zhu <zh...@gmail.com> on 2014/01/03 06:20:12 UTC, 1 replies.
- a question about "take" - posted by Chen Jin <ka...@gmail.com> on 2014/01/03 07:02:27 UTC, 0 replies.
- Is spark-env.sh supposed to be stateless? - posted by Andrew Ash <an...@andrewash.com> on 2014/01/03 07:33:44 UTC, 2 replies.
- NoSuchMethodError: org.apache.commons.io.IOUtils.closeQuietly with cdh4 binary - posted by Roshan Nair <ro...@indix.com> on 2014/01/03 08:35:21 UTC, 6 replies.
- Shark compile - posted by "leosandylh@gmail.com" <le...@gmail.com> on 2014/01/03 10:07:35 UTC, 0 replies.
- Issue with sortByKey. - posted by Archit Thakur <ar...@gmail.com> on 2014/01/03 10:39:59 UTC, 4 replies.
- Turning kryo on does not decrease binary output - posted by Aureliano Buendia <bu...@gmail.com> on 2014/01/03 17:48:40 UTC, 19 replies.
- Data locality during Spark RDD creation - posted by Debasish Das <de...@gmail.com> on 2014/01/03 20:34:23 UTC, 1 replies.
- What version of protobuf should I be using? - posted by Shay Seng <sh...@1618labs.com> on 2014/01/03 21:11:45 UTC, 2 replies.
- Re: Unable to connect spark 0.8.1 (built for hadoop 2.2.0) to connect to mesos 0.14.2 - posted by Matei Zaharia <ma...@gmail.com> on 2014/01/04 06:14:10 UTC, 1 replies.
- Re: examples not build successfully - posted by Matei Zaharia <ma...@gmail.com> on 2014/01/04 06:15:07 UTC, 0 replies.
- SequenceFileRDDFunctions cannot be used output of spark package - posted by Aureliano Buendia <bu...@gmail.com> on 2014/01/04 16:42:35 UTC, 3 replies.
- Will JVM be reused? - posted by Archit Thakur <ar...@gmail.com> on 2014/01/04 20:36:24 UTC, 7 replies.
- Troubles with the Spark-EC2 stuff - posted by Guillaume Pitel <gu...@exensa.com> on 2014/01/04 21:54:37 UTC, 3 replies.
- ADD_JARS doesn't properly work for spark-shell - posted by Aureliano Buendia <bu...@gmail.com> on 2014/01/05 01:46:08 UTC, 10 replies.
- State of spark on scala 2.10 - posted by Aureliano Buendia <bu...@gmail.com> on 2014/01/05 04:11:41 UTC, 9 replies.
- how to bind spark-master to the public IP of EC2 - posted by Nan Zhu <zh...@gmail.com> on 2014/01/05 14:45:38 UTC, 2 replies.
- debug standalone Spark jobs? - posted by Nan Zhu <zh...@gmail.com> on 2014/01/05 16:06:20 UTC, 6 replies.
- hdfs replication on saving RDD - posted by Kapil Malik <km...@adobe.com> on 2014/01/05 16:20:43 UTC, 1 replies.
- Why does saveAfObjectFile() serialize Array[T] instead of T? - posted by Aureliano Buendia <bu...@gmail.com> on 2014/01/05 20:52:30 UTC, 0 replies.
- Cannot start spark master - posted by danoomistmatiste <kk...@yahoo.com> on 2014/01/06 04:37:15 UTC, 4 replies.
- A new Spark Web UI - posted by Romain Rigaux <ro...@gmail.com> on 2014/01/06 17:44:27 UTC, 4 replies.
- Worker hangs with 100% CPU in Standalone cluster - posted by Grega Kešpret <gr...@celtra.com> on 2014/01/06 17:44:58 UTC, 5 replies.
- shared variable and ALS in mllib - posted by Nan Zhu <zh...@gmail.com> on 2014/01/06 18:17:56 UTC, 2 replies.
- Temp hdfs files picked up by textFileStream dstream - posted by Chris Regnier <c....@oculusinfo.com> on 2014/01/06 22:16:58 UTC, 1 replies.
- How to access global kryo instance? - posted by Aureliano Buendia <bu...@gmail.com> on 2014/01/07 01:36:37 UTC, 5 replies.
- How to make Spark merge the output file? - posted by Nan Zhu <zh...@gmail.com> on 2014/01/07 04:56:53 UTC, 4 replies.
- NoSuchMethodError running Spark on YARN - posted by Sandy Ryza <sa...@cloudera.com> on 2014/01/07 05:51:32 UTC, 2 replies.
- is it forgotten to document how to set SPARK_WORKER_DIR? - posted by Nan Zhu <zh...@gmail.com> on 2014/01/07 07:13:48 UTC, 2 replies.
- Problems with broadcast large datastructure - posted by Sebastian Schelter <ss...@apache.org> on 2014/01/07 09:55:35 UTC, 10 replies.
- split a RDD by pencetage - posted by redocpot <ju...@gmail.com> on 2014/01/07 12:08:50 UTC, 0 replies.
- How to time transformations and provide more detailed progress report? - posted by Aureliano Buendia <bu...@gmail.com> on 2014/01/07 18:00:02 UTC, 1 replies.
- How to map index from another dataset - posted by Xiaoli Li <li...@gmail.com> on 2014/01/07 18:25:32 UTC, 0 replies.
- Creating DStream windows that start at specific times - posted by Chris Regnier <c....@oculusinfo.com> on 2014/01/07 22:06:51 UTC, 0 replies.
- Spark on Yarn classpath problems - posted by Eric Kimbrel <le...@gmail.com> on 2014/01/08 04:16:21 UTC, 3 replies.
- EC2 scripts documentations lacks how to actually run applications - posted by Aureliano Buendia <bu...@gmail.com> on 2014/01/08 04:21:00 UTC, 4 replies.
- ship MatrixFactorizationModel with each partition? - posted by Nan Zhu <zh...@gmail.com> on 2014/01/08 04:23:48 UTC, 3 replies.
- Spark SequenceFile Java API Repeat Key Values - posted by Michael Quinlan <mq...@gmail.com> on 2014/01/08 05:32:59 UTC, 5 replies.
- Re: Why does sortByKey launch cluster job? - posted by Andrew Ash <an...@andrewash.com> on 2014/01/08 05:47:56 UTC, 3 replies.
- shark not able to connect to spark master - posted by danoomistmatiste <kk...@yahoo.com> on 2014/01/08 06:53:46 UTC, 0 replies.
- newbie : java.lang.OutOfMemoryError: Java heap space - posted by Vipul Pandey <vi...@gmail.com> on 2014/01/08 08:13:31 UTC, 0 replies.
- Re: newbie : java.lang.OutOfMemoryError: Java heap space - posted by Prashant Sharma <sc...@gmail.com> on 2014/01/08 08:31:16 UTC, 3 replies.
- Dying workers since migration to 0.8.1 - posted by Guillaume Pitel <gu...@exensa.com> on 2014/01/08 14:28:17 UTC, 4 replies.
- native-lzo / gpl lib - posted by "leosandylh@gmail.com" <le...@gmail.com> on 2014/01/08 16:02:36 UTC, 1 replies.
- confusion on RDD usage in MatrixFactorizationModel (master branch) - posted by Nan Zhu <zh...@gmail.com> on 2014/01/08 16:38:15 UTC, 1 replies.
- WARN ClusterScheduler: Initial job has not accepted any resources; check your cluster UI to ensure that workers are registered and have sufficient memory - posted by Aureliano Buendia <bu...@gmail.com> on 2014/01/08 17:31:33 UTC, 13 replies.
- How to confirm serializer type on workers? - posted by Aureliano Buendia <bu...@gmail.com> on 2014/01/08 21:23:49 UTC, 1 replies.
- performance - posted by Yann Luppo <Ya...@LiveNation.com> on 2014/01/08 22:49:21 UTC, 6 replies.
- what paper is the L2 regularization based on? - posted by Walrus theCat <wa...@gmail.com> on 2014/01/08 23:23:58 UTC, 8 replies.
- shark shell (shark-withinfo) not starting - posted by danoomistmatiste <kk...@yahoo.com> on 2014/01/09 00:33:09 UTC, 0 replies.
- Lost of vertex when running a Bagel based program in non-local mode - posted by 杨强 <ya...@ict.ac.cn> on 2014/01/09 10:45:31 UTC, 0 replies.
- How to increase spark.worker.timeout? - posted by Jyun-Fan Tsai <jy...@gmail.com> on 2014/01/09 10:47:31 UTC, 0 replies.
- Not able to connect to master from a worker on 2 different machine - posted by rishi <ri...@knoldus.com> on 2014/01/09 13:27:05 UTC, 0 replies.
- some problems - posted by "leosandylh@gmail.com" <le...@gmail.com> on 2014/01/09 14:31:03 UTC, 0 replies.
- hadoop files in Python - posted by Diana Carroll <dc...@cloudera.com> on 2014/01/09 17:15:04 UTC, 1 replies.
- log4j question - posted by Shay Seng <sh...@1618labs.com> on 2014/01/09 19:49:34 UTC, 3 replies.
- General Spark question (streaming) - posted by Ognen Duzlevski <og...@nengoiksvelzud.com> on 2014/01/09 20:07:21 UTC, 1 replies.
- Error Handling on calling saveAsHadoopDataset - posted by Kanwaldeep <ka...@gmail.com> on 2014/01/09 21:46:35 UTC, 1 replies.
- PLEASE HELP: ./shark-withinfo not connecting to spark master - posted by danoomistmatiste <kk...@yahoo.com> on 2014/01/09 22:23:20 UTC, 0 replies.
- Re: PLEASE HELP: ./shark-withinfo not connecting to spark master - posted by Andrew Ash <an...@andrewash.com> on 2014/01/09 23:08:50 UTC, 3 replies.
- is saveAsTextFile in java uses buffered I/O streams? - posted by Hu...@Dell.com on 2014/01/09 23:54:36 UTC, 1 replies.
- An open HDFS connection fails RDD.take() - posted by Mingyu Kim <mk...@palantir.com> on 2014/01/10 01:25:05 UTC, 3 replies.
- Spark streaming on YARN? - posted by Mike Percy <mp...@apache.org> on 2014/01/10 02:29:53 UTC, 11 replies.
- possible major memory problem in saveAsTextFile - posted by Hu...@Dell.com on 2014/01/10 02:53:23 UTC, 0 replies.
- Re: graphx merge for scala 2.9 - posted by Denny Lee <de...@gmail.com> on 2014/01/10 06:27:30 UTC, 2 replies.
- Getting java.netUnknownHostException - posted by Rishi <ri...@knoldus.com> on 2014/01/10 13:57:22 UTC, 3 replies.
- some problems about shark on spark - posted by "leosandylh@gmail.com" <le...@gmail.com> on 2014/01/10 15:29:48 UTC, 0 replies.
- Default Storage Level in Spark - posted by mharwida <ma...@yahoo.com> on 2014/01/10 17:38:43 UTC, 0 replies.
- Please help: possible major memory problem in saveAsTextFile - posted by Hu...@Dell.com on 2014/01/10 18:26:01 UTC, 2 replies.
- Help needed. Not sure how to reduceByKey works in spark - posted by suman bharadwaj <su...@gmail.com> on 2014/01/10 20:01:11 UTC, 1 replies.
- SparkPi example exceeds virtual memory limits (yarn) - posted by Eric Kimbrel <le...@gmail.com> on 2014/01/10 20:37:33 UTC, 0 replies.
- Please help: virtualization type 'hvm' when I try to launch ec2 ssd instance - posted by Chen Jin <ka...@gmail.com> on 2014/01/10 20:47:08 UTC, 6 replies.
- Windows of windowed streams not displaying the expected results - posted by Chris Regnier <c....@oculusinfo.com> on 2014/01/11 00:05:42 UTC, 0 replies.
- org.apache.hadoop.hive.ql.metadata.HiveException: java.lang.NumberFormatException: For input string: "0L" - posted by danoomistmatiste <kk...@yahoo.com> on 2014/01/11 04:33:58 UTC, 0 replies.
- Fwd: some problems about shark on spark - posted by "leosandylh@gmail.com" <le...@gmail.com> on 2014/01/11 08:47:56 UTC, 0 replies.
- build shark with hive 0.11/0.12 - posted by "leosandylh@gmail.com" <le...@gmail.com> on 2014/01/11 12:12:23 UTC, 0 replies.
- shark cli set spark java system params - posted by "leosandylh@gmail.com" <le...@gmail.com> on 2014/01/11 13:02:12 UTC, 0 replies.
- Development version error on sbt compile publish-local - posted by Shing Hing Man <ma...@yahoo.com> on 2014/01/11 21:49:57 UTC, 0 replies.
- Re: Development version error on sbt compile publish-local - posted by Patrick Wendell <pw...@gmail.com> on 2014/01/12 01:31:50 UTC, 3 replies.
- Problem running example GroupByTest from scala command line - posted by Shing Hing Man <ma...@yahoo.com> on 2014/01/12 13:21:28 UTC, 2 replies.
- Spark on google compute engine - posted by Aureliano Buendia <bu...@gmail.com> on 2014/01/13 05:00:31 UTC, 9 replies.
- Stalling during large iterative PySpark jobs - posted by Jeremy Freeman <fr...@gmail.com> on 2014/01/13 05:45:19 UTC, 4 replies.
- Pb building: jettty-server - Connection timed out - posted by Nicolas Seyvet <se...@yahoo.com> on 2014/01/13 10:46:06 UTC, 0 replies.
- Spark writing to disk when there's enough memory?! - posted by mharwida <ma...@yahoo.com> on 2014/01/13 13:24:44 UTC, 2 replies.
- Can you help me ? - posted by "leosandylh@gmail.com" <le...@gmail.com> on 2014/01/13 17:29:48 UTC, 0 replies.
- Re: Running Spark on Mesos - posted by deric <ba...@gmail.com> on 2014/01/13 18:19:24 UTC, 4 replies.
- Unable to load native-hadoop library - posted by Aureliano Buendia <bu...@gmail.com> on 2014/01/13 18:28:16 UTC, 2 replies.
- yarn SPARK_CLASSPATH - posted by Eric Kimbrel <le...@gmail.com> on 2014/01/13 22:29:51 UTC, 7 replies.
- squestion on using spark parallelism vs using num partitions in spark api - posted by Hu...@Dell.com on 2014/01/14 02:17:44 UTC, 3 replies.
- Occasional failed tasks - posted by Aureliano Buendia <bu...@gmail.com> on 2014/01/14 02:38:49 UTC, 0 replies.
- Controlling hadoop block size - posted by Aureliano Buendia <bu...@gmail.com> on 2014/01/14 03:29:11 UTC, 5 replies.
- Cloud interface for Spark/Shark/PySpark - posted by Tristan Zajonc <tr...@senseplatform.com> on 2014/01/14 06:12:58 UTC, 0 replies.
- Reply: squestion on using spark parallelism vs using num partitions in spark api - posted by Huangguowei <hu...@huawei.com> on 2014/01/14 13:42:56 UTC, 0 replies.
- Shark runtime error - posted by Kishore kumar <ki...@techdigita.in> on 2014/01/14 14:10:24 UTC, 0 replies.
- Akka error kills workers in standalone mode - posted by vuakko <ni...@gmail.com> on 2014/01/14 15:09:41 UTC, 4 replies.
- master attempted to re-register the worker and then took all workers as unregistered - posted by Nan Zhu <zh...@gmail.com> on 2014/01/15 02:53:20 UTC, 1 replies.
- Anyone know hot to submit spark job to yarn in java code? - posted by John Zhao <jz...@alpinenow.com> on 2014/01/15 19:25:09 UTC, 3 replies.
- Please help: change $SPARK_HOME/work directory for spark applications - posted by Chen Jin <ka...@gmail.com> on 2014/01/15 20:03:58 UTC, 1 replies.
- Join us for AMP Camp 4 on February 11 and enter to win a free Strata pass - posted by Scott walent <sc...@gmail.com> on 2014/01/15 21:22:28 UTC, 0 replies.
- Exception in thread "DAGScheduler" scala.MatchError: None (of class scala.None$) - posted by Soren Macbeth <so...@yieldbot.com> on 2014/01/15 23:19:12 UTC, 7 replies.
- libraryDependencies configuration is different for sbt assembly vs sbt run - posted by kamatsuoka <ke...@gmail.com> on 2014/01/15 23:29:27 UTC, 0 replies.
- Reading files on a cluster / shared file system - posted by Ognen Duzlevski <og...@nengoiksvelzud.com> on 2014/01/16 00:56:52 UTC, 2 replies.
- jarOfClass method no found in SparkContext - posted by arjun biswas <ar...@gmail.com> on 2014/01/16 01:14:13 UTC, 4 replies.
- Master and worker nodes in standalone deployment - posted by Manoj Samel <ma...@gmail.com> on 2014/01/16 05:32:30 UTC, 3 replies.
- Consistency between RDD's and Native File System - posted by SaiPrasanna <sa...@siemens.com> on 2014/01/16 10:42:22 UTC, 8 replies.
- How does shuffle work in spark ? - posted by suman bharadwaj <su...@gmail.com> on 2014/01/16 11:33:40 UTC, 7 replies.
- MLlib linking error Mac OS X - posted by Nick Pentreath <ni...@gmail.com> on 2014/01/16 13:10:09 UTC, 0 replies.
- Expect only DirectTaskResults when using LocalScheduler - posted by Nick Pentreath <ni...@gmail.com> on 2014/01/16 13:37:37 UTC, 0 replies.
- Help Getting started with Standalone Mode (Akka port requirements?) - posted by Yana Kadiyska <ya...@gmail.com> on 2014/01/16 17:58:28 UTC, 0 replies.
- Spark does not retry a failed task due to hdfs io error - posted by Aureliano Buendia <bu...@gmail.com> on 2014/01/16 19:48:24 UTC, 0 replies.
- python on YARN - posted by Diana Carroll <dc...@cloudera.com> on 2014/01/16 22:28:33 UTC, 0 replies.
- SparkR developer release - posted by Shivaram Venkataraman <sh...@eecs.berkeley.edu> on 2014/01/16 23:14:40 UTC, 2 replies.
- Re : SparkR developer release - posted by "andy.petrella@gmail.com" <an...@gmail.com> on 2014/01/16 23:47:02 UTC, 0 replies.
- get CPU Metrics from spark - posted by tdeng <td...@twitter.com> on 2014/01/17 02:26:53 UTC, 1 replies.
- cannot run sbt/sbt assembly - posted by Kal El <pi...@yahoo.com> on 2014/01/17 13:04:10 UTC, 1 replies.
- FileNotFoundException on distinct()? - posted by Ryan Compton <co...@gmail.com> on 2014/01/18 00:02:48 UTC, 9 replies.
- SparkException: Expect only DirectTaskResults when using localScheduler() - posted by Hu...@Dell.com on 2014/01/18 01:34:36 UTC, 5 replies.
- Lzo + Protobuf - posted by Vipul Pandey <vi...@gmail.com> on 2014/01/18 01:45:56 UTC, 8 replies.
- Spark job failing on cluster - posted by Ognen Duzlevski <og...@nengoiksvelzud.com> on 2014/01/18 14:49:41 UTC, 3 replies.
- OOM - Help Optimizing Local Job - posted by Brad Ruderman <br...@gmail.com> on 2014/01/19 01:58:06 UTC, 9 replies.
- Which of the hadoop file formats are supported by Spark ? - posted by Manoj Samel <ma...@gmail.com> on 2014/01/19 06:47:19 UTC, 3 replies.
- Do RDD actions run only on driver ? - posted by Manoj Samel <ma...@gmail.com> on 2014/01/19 07:34:22 UTC, 3 replies.
- Time frame / features in spark 0.9 release ? - posted by Manoj Samel <ma...@gmail.com> on 2014/01/19 08:03:17 UTC, 1 replies.
- Problems with Spark (Lost executor error) - posted by Ognen Duzlevski <og...@nengoiksvelzud.com> on 2014/01/19 13:08:30 UTC, 0 replies.
- Quality of documentation (rant) - posted by Ognen Duzlevski <og...@nengoiksvelzud.com> on 2014/01/19 13:52:29 UTC, 15 replies.
- Print in JavaNetworkWordCount - posted by Eduardo Costa Alfaia <e....@studenti.unibs.it> on 2014/01/20 11:02:27 UTC, 2 replies.
- TorrentBroadcast + persist = bug - posted by Milos Nikolic <mi...@gmail.com> on 2014/01/20 12:22:22 UTC, 1 replies.
- ExternalAppendOnlyMap throw no such element - posted by guojc <gu...@gmail.com> on 2014/01/20 13:22:09 UTC, 5 replies.
- Loss was due to java.lang.ClassNotFoundException java.lang.ClassNotFoundException: scala.None$ error when mysql-async is add in build.sbt - posted by Richard Siebeling <rs...@gmail.com> on 2014/01/20 13:29:47 UTC, 1 replies.
- Spark Master on Hadoop Job Tracker? - posted by mharwida <ma...@yahoo.com> on 2014/01/20 19:14:25 UTC, 3 replies.
- SPARK protocol buffer issue. Need Help - posted by suman bharadwaj <su...@gmail.com> on 2014/01/20 22:05:11 UTC, 2 replies.
- Gathering exception stack trace - posted by Mingyu Kim <mk...@palantir.com> on 2014/01/20 22:51:35 UTC, 0 replies.
- spark-shell on standalone cluster gives error " no mesos in java.library.path" - posted by Manoj Samel <ma...@gmail.com> on 2014/01/21 00:14:11 UTC, 1 replies.
- Error: Could not find or load main class org.apache.spark.executor.CoarseGrainedExecutorBackend - posted by Hu...@Dell.com on 2014/01/21 01:44:06 UTC, 1 replies.
- RDD action hangs on a standalone mode cluster - posted by Manoj Samel <ma...@gmail.com> on 2014/01/21 06:02:27 UTC, 1 replies.
- How to perform multi dimensional reduction in spark? - posted by Aureliano Buendia <bu...@gmail.com> on 2014/01/21 06:42:03 UTC, 2 replies.
- Forcing RDD computation with something else than count() ? - posted by Guillaume Pitel <gu...@exensa.com> on 2014/01/21 10:02:06 UTC, 5 replies.
- Want some Metrics - posted by Pankaj Mittal <pa...@livestream.com> on 2014/01/21 10:54:28 UTC, 1 replies.
- How to clean up jars on worker nodes - posted by Mingyu Kim <mk...@palantir.com> on 2014/01/21 12:24:28 UTC, 0 replies.
- How to stop a streaming job - posted by prabeesh k <pr...@gmail.com> on 2014/01/21 13:59:01 UTC, 0 replies.
- Spark on private network - posted by goi cto <go...@gmail.com> on 2014/01/21 14:22:53 UTC, 1 replies.
- Lazy evaluation of RDD data transformation - posted by DB Tsai <db...@alpinenow.com> on 2014/01/21 20:32:00 UTC, 3 replies.
- Shark 0.8.1-rc0 not able to connect to master spark 0.8.1-incubating HADOOP=2.2.0 - posted by Andre Kuhnen <an...@gmail.com> on 2014/01/21 23:22:11 UTC, 0 replies.
- spark.default.parallelism - posted by Ognen Duzlevski <og...@nengoiksvelzud.com> on 2014/01/21 23:27:36 UTC, 4 replies.
- Unsubscribe - posted by sawan ruparel <sa...@live.com> on 2014/01/22 03:29:46 UTC, 0 replies.
- Re: reading LZO compressed file in spark - posted by Vipul Pandey <vi...@gmail.com> on 2014/01/22 07:56:54 UTC, 1 replies.
- Re: TorrentBroadcast + persist = bug - posted by Mosharaf Chowdhury <mo...@gmail.com> on 2014/01/22 09:58:49 UTC, 2 replies.
- a newbee trying to compile and execute examples from 0.9.0-incubating-SNAPSHOT - posted by Alonso Isidoro Roman <al...@gmail.com> on 2014/01/22 13:05:57 UTC, 0 replies.
- Running K-Means on a cluster setup - posted by Kal El <pi...@yahoo.com> on 2014/01/22 15:35:32 UTC, 11 replies.
- Problem with newAPIHadoopFile - posted by chadi jaber <ch...@hotmail.com> on 2014/01/22 16:17:16 UTC, 0 replies.
- make-distribution.sh error org.apache.hadoop#hadoop-client;2.0.0: not found - posted by Manoj Samel <ma...@gmail.com> on 2014/01/22 19:09:43 UTC, 1 replies.
- Running make-distribution.sh .. compilation errors in streaming/api/java/JavaPairDStream.scala - posted by Manoj Samel <ma...@gmail.com> on 2014/01/22 19:22:50 UTC, 4 replies.
- Union of 2 RDD's only returns the first one - posted by Aureliano Buendia <bu...@gmail.com> on 2014/01/22 20:12:43 UTC, 3 replies.
- why is it so slow to run sbt/sbt assembly in my machine? - posted by dachuan <hd...@gmail.com> on 2014/01/22 20:15:23 UTC, 4 replies.
- KRYO usage details: Need Help - posted by suman bharadwaj <su...@gmail.com> on 2014/01/22 21:27:04 UTC, 2 replies.
- How to use cluster for large set of linux files - posted by Manoj Samel <ma...@gmail.com> on 2014/01/22 21:37:22 UTC, 5 replies.
- Using persistent hdfs on spark ec2 instanes - posted by Aureliano Buendia <bu...@gmail.com> on 2014/01/22 23:40:09 UTC, 7 replies.
- Spark does not retry failed tasks initiated by hadoop - posted by Aureliano Buendia <bu...@gmail.com> on 2014/01/23 01:04:55 UTC, 2 replies.
- Handling occasional bad data ... - posted by Manoj Samel <ma...@gmail.com> on 2014/01/23 05:04:59 UTC, 3 replies.
- Location and memory allocations for master / worker nodes - posted by Manoj Samel <ma...@gmail.com> on 2014/01/23 05:16:11 UTC, 1 replies.
- DStream foreachRdd not working in standalone cluster mode - posted by Sourav Chandra <so...@livestream.com> on 2014/01/23 10:01:34 UTC, 6 replies.
- Advices if your worker die often - posted by Guillaume Pitel <gu...@exensa.com> on 2014/01/23 11:56:39 UTC, 3 replies.
- Is SparkContext.stop() optional or required? - posted by Mingyu Kim <mk...@palantir.com> on 2014/01/23 20:16:48 UTC, 2 replies.
- Exception in thread "DAGScheduler" java.lang.OutOfMemoryError: GC overhead limit exceeded - posted by Manoj Samel <ma...@gmail.com> on 2014/01/23 20:46:21 UTC, 3 replies.
- data within batchduration in RDD of a Dstream reliable? - posted by aecc <al...@gmail.com> on 2014/01/23 21:12:19 UTC, 0 replies.
- Time window size in Spark Streaming - posted by Ricky Ho <ri...@yahoo.com> on 2014/01/23 21:32:31 UTC, 0 replies.
- heterogeneous cluster - problems setting spark.executor.memory - posted by Yadid Ayzenberg <ya...@media.mit.edu> on 2014/01/23 21:34:05 UTC, 0 replies.
- Giraph Vs SPARK - posted by suman bharadwaj <su...@gmail.com> on 2014/01/23 22:10:16 UTC, 6 replies.
- Too many RDD partititons ??? - posted by Manoj Samel <ma...@gmail.com> on 2014/01/23 22:18:14 UTC, 1 replies.
- RE: Problem with newAPIHadoopFile CDH4.3 - posted by chadi jaber <ch...@hotmail.com> on 2014/01/23 22:37:15 UTC, 0 replies.
- Submitting job to Yarn's ResourceManager - posted by DB Tsai <db...@alpinenow.com> on 2014/01/24 02:11:41 UTC, 1 replies.
- .intersection() method on RDDs? - posted by Andrew Ash <an...@andrewash.com> on 2014/01/24 02:18:34 UTC, 15 replies.
- executor failed, cannot find compute-classpath.sh - posted by kyocum <ky...@illumina.com> on 2014/01/24 03:16:04 UTC, 2 replies.
- Hash Join in Spark - posted by rose <ro...@yahoo.com> on 2014/01/24 09:31:52 UTC, 0 replies.
- Spark Scheduler - posted by Sai Prasanna <an...@gmail.com> on 2014/01/24 10:15:21 UTC, 2 replies.
- Lazy Scheduling - posted by Sai Prasanna <an...@gmail.com> on 2014/01/24 10:15:47 UTC, 2 replies.
- Non-deterministic behavior in spark - posted by Ognen Duzlevski <og...@nengoiksvelzud.com> on 2014/01/24 11:46:07 UTC, 7 replies.
- What I am missing from configuration? - posted by Dana Tontea <dt...@cylex.ro> on 2014/01/24 11:53:32 UTC, 1 replies.
- Does foreach operation increase rdd lineage? - posted by guojc <gu...@gmail.com> on 2014/01/24 12:51:19 UTC, 5 replies.
- Division of work between master, worker, executor and driver - posted by Manoj Samel <ma...@gmail.com> on 2014/01/24 18:59:49 UTC, 4 replies.
- How to create RDD over hashmap? - posted by Manoj Samel <ma...@gmail.com> on 2014/01/24 21:56:48 UTC, 14 replies.
- Pyspark job not starting - posted by lobdellb <lo...@gmail.com> on 2014/01/24 22:43:55 UTC, 1 replies.
- Suggestion for ec2 script - posted by Mingyu Kim <mk...@palantir.com> on 2014/01/24 23:19:45 UTC, 2 replies.
- Running spark driver inside a servlet - posted by Kapil Malik <km...@adobe.com> on 2014/01/24 23:35:44 UTC, 2 replies.
- Can I share the RDD between multiprocess - posted by "D.Y Feng" <yy...@gmail.com> on 2014/01/25 03:06:32 UTC, 8 replies.
- real world streaming code - posted by dachuan <hd...@gmail.com> on 2014/01/25 04:28:18 UTC, 2 replies.
- subscribe - posted by Shafaq <s....@gmail.com> on 2014/01/25 09:41:02 UTC, 0 replies.
- Spark connecting to wrong Filesystem.uri - posted by mharwida <ma...@yahoo.com> on 2014/01/25 20:44:07 UTC, 0 replies.
- ClassNotFoundException with simple Spark job on cluster - posted by zhanif <zh...@gmail.com> on 2014/01/26 00:13:42 UTC, 2 replies.
- how to set SPARK_WORKER_INSTANCES and SPARK_WORKER_CORES otpimally - posted by Chen Jin <ka...@gmail.com> on 2014/01/26 01:28:02 UTC, 2 replies.
- Moving from java.util.HashMap to org.apache.spark.util.AppendOnlyMap.scala - posted by Archit Thakur <ar...@gmail.com> on 2014/01/26 11:03:52 UTC, 0 replies.
- GroupByKey implementation. - posted by Archit Thakur <ar...@gmail.com> on 2014/01/26 21:22:51 UTC, 0 replies.
- Inaccurate Estimates from LinearRegressionWithSGD - posted by herbps10 <hp...@geneseo.edu> on 2014/01/27 02:35:56 UTC, 1 replies.
- s3n > 5GB - posted by kamatsuoka <ke...@gmail.com> on 2014/01/27 03:18:09 UTC, 3 replies.
- Cannot get Hadoop dependencies - posted by Kal El <pi...@yahoo.com> on 2014/01/27 14:24:06 UTC, 3 replies.
- Purpose of the HTTP Server? - posted by Heiko Braun <ik...@googlemail.com> on 2014/01/27 15:15:00 UTC, 3 replies.
- Problems while moving from 0.8.0 to 0.8.1 - posted by Archit Thakur <ar...@gmail.com> on 2014/01/27 15:44:16 UTC, 0 replies.
- updateStateByKey Question - posted by Craig Vanderborgh <cr...@gmail.com> on 2014/01/27 16:24:29 UTC, 0 replies.
- Sporadic "IOException: Class not found" in ClosureCleaner - posted by John Salvatier <js...@gmail.com> on 2014/01/28 00:39:52 UTC, 0 replies.
- SparkStreaming not read hadoop configuration from its sparkContext on Stand Alone mode? - posted by robin_up <ro...@gmail.com> on 2014/01/28 04:59:56 UTC, 0 replies.
- What could be the cause of this Streaming error - posted by Ashish Rangole <ar...@gmail.com> on 2014/01/28 07:00:07 UTC, 2 replies.
- setting partitioners with hadoop rdds - posted by Imran Rashid <im...@quantifind.com> on 2014/01/28 08:57:59 UTC, 3 replies.
- Exception in serialization hangs saving-to-disk - posted by Ionized <io...@gmail.com> on 2014/01/28 11:26:10 UTC, 2 replies.
- computeStats() in MLUtils will cause Nan (not a number) error - posted by yinxusen <yi...@gmail.com> on 2014/01/28 14:22:38 UTC, 2 replies.
- cannot read file from HDFS - posted by Kal El <pi...@yahoo.com> on 2014/01/28 14:29:59 UTC, 3 replies.
- Problem with running Spark over Mesos in fine-grained mode - posted by Marek Wiewiorka <ma...@gmail.com> on 2014/01/28 16:07:56 UTC, 2 replies.
- unsubscribe - posted by Stanley Burnitt <St...@huawei.com> on 2014/01/28 19:17:23 UTC, 1 replies.
- RDD and Partition - posted by David Thomas <dt...@gmail.com> on 2014/01/28 20:35:40 UTC, 11 replies.
- Spark for searching text - posted by subacini Arunkumar <su...@gmail.com> on 2014/01/29 01:29:12 UTC, 0 replies.
- Distributed Shared Access of Cached RDD's - posted by "Annamalai, Sai IN BLR STS" <sa...@siemens.com> on 2014/01/29 04:07:10 UTC, 2 replies.
- Spark Fault-tolerant question - posted by nowfats <no...@gmail.com> on 2014/01/29 04:33:20 UTC, 3 replies.
- Cassandra composite and simple keys - posted by Anton B <to...@gmail.com> on 2014/01/29 08:40:47 UTC, 2 replies.
- Problem with flatmap. - posted by Archit Thakur <ar...@gmail.com> on 2014/01/29 09:34:39 UTC, 6 replies.
- Row order of RDDs - posted by Mingyu Kim <mk...@palantir.com> on 2014/01/29 10:18:20 UTC, 0 replies.
- defining classes in Spark REPL - posted by Luca Rosellini <lu...@stratio.com> on 2014/01/29 15:04:05 UTC, 0 replies.
- Spark and Scala Worksheet - posted by Stevo Slavić <ss...@gmail.com> on 2014/01/29 16:38:40 UTC, 1 replies.
- SparkR dev preview package errors when other packages are loaded - posted by Justin Lent <ju...@gmail.com> on 2014/01/29 17:51:55 UTC, 5 replies.
- Non-interactive job fails to copy local variable to remote machines - posted by Michael Diamant <di...@gmail.com> on 2014/01/29 21:35:07 UTC, 3 replies.
- Question on Scalability - posted by David Thomas <dt...@gmail.com> on 2014/01/30 01:50:56 UTC, 1 replies.
- SparkR: filter() function? - posted by Justin Lent <ju...@gmail.com> on 2014/01/30 02:45:49 UTC, 2 replies.
- Please Help: Amplab Benchmark Performance - posted by Chen Jin <ka...@gmail.com> on 2014/01/30 05:10:36 UTC, 4 replies.
- Streaming files as a whole - posted by Mayur Rustagi <ma...@gmail.com> on 2014/01/30 11:08:55 UTC, 2 replies.
- Stream RDD to local disk - posted by Andrew Ash <an...@andrewash.com> on 2014/01/30 11:21:13 UTC, 2 replies.
- RDD[URI] - posted by Philip Ogren <ph...@oracle.com> on 2014/01/30 17:18:12 UTC, 4 replies.
- Python API Performance - posted by nileshc <ni...@nileshc.com> on 2014/01/30 17:30:01 UTC, 6 replies.
- Seattle Spark Meetup - posted by Denny Lee <de...@gmail.com> on 2014/01/30 18:21:00 UTC, 0 replies.
- Re: Source code JavaNetworkWordcount - posted by Tathagata Das <ta...@gmail.com> on 2014/01/30 20:15:06 UTC, 0 replies.
- various questions about yarn-standalone vs. yarn-client - posted by Philip Ogren <ph...@oracle.com> on 2014/01/30 20:21:03 UTC, 0 replies.
- spark-shell with yarn, runs beyond virtual memory limits - posted by Eric Kimbrel <er...@soteradefense.com> on 2014/01/30 20:58:34 UTC, 0 replies.
- Spark + MongoDB - posted by Sampo Niskanen <sa...@wellmo.com> on 2014/01/30 21:36:22 UTC, 1 replies.
- CQL3 Example (Scala Noobie Question) - posted by Brian O'Neill <bo...@alumni.brown.edu> on 2014/01/31 01:15:07 UTC, 0 replies.
- Kyro serialization slow and runs OOM - posted by Vipul Pandey <vi...@gmail.com> on 2014/01/31 02:00:50 UTC, 0 replies.
- Configuring distributed caching with Spark and YARN - posted by Paul Schooss <pa...@gmail.com> on 2014/01/31 04:01:55 UTC, 0 replies.
- Is there a way to get the current progress of the job? - posted by DB Tsai <db...@alpinenow.com> on 2014/01/31 06:32:49 UTC, 0 replies.
- SLF4J error and log4j error - posted by Sai Prasanna <an...@gmail.com> on 2014/01/31 06:36:26 UTC, 2 replies.
- MLLib Sparse Input - posted by jshao <ja...@gmail.com> on 2014/01/31 19:49:35 UTC, 1 replies.
- Connecting to remote Spark cluster using Java+Maven - posted by Guillermo Cabrera <gu...@gmail.com> on 2014/01/31 21:39:39 UTC, 0 replies.
- Spark app gets slower as it gets executed more times - posted by Aureliano Buendia <bu...@gmail.com> on 2014/01/31 23:31:20 UTC, 0 replies.
- Single application using all the cores - preventing other applications from running - posted by Timothee Besset <tt...@ttimo.net> on 2014/01/31 23:42:10 UTC, 3 replies.