You are viewing a plain text version of this content. The canonical link for it is here.
- Re: Spark streaming on ec2 - posted by Nicholas Chammas <ni...@gmail.com> on 2014/03/01 00:08:12 UTC, 0 replies.
- java.net.SocketException on reduceByKey() in pyspark - posted by "nicholas.chammas" <ni...@gmail.com> on 2014/03/01 00:33:38 UTC, 3 replies.
- Re: JVM error - posted by Mohit Singh <mo...@gmail.com> on 2014/03/01 01:55:57 UTC, 1 replies.
- Lazyoutput format in spark - posted by Mohit Singh <mo...@gmail.com> on 2014/03/01 02:18:36 UTC, 1 replies.
- Re: Trying to connect to spark from within a web server - posted by Nathan Kronenfeld <nk...@oculusinfo.com> on 2014/03/01 03:01:22 UTC, 0 replies.
- Connection Refused When Running SparkPi Locally - posted by Benny Thompson <be...@gmail.com> on 2014/03/01 03:18:01 UTC, 1 replies.
- How to provide a custom Comparator to sortByKey? - posted by Tao Xiao <xi...@gmail.com> on 2014/03/01 03:19:06 UTC, 0 replies.
- Using jeromq instead of akka wrapped zeromq for spark streaming - posted by Aureliano Buendia <bu...@gmail.com> on 2014/03/01 03:48:27 UTC, 0 replies.
- error in streaming word count API? - posted by Aaron Kimball <ak...@gmail.com> on 2014/03/01 05:46:17 UTC, 4 replies.
- Create a new object in pyspark map function - posted by "Kaden(Xiaozhe) Wang" <wa...@umn.edu> on 2014/03/01 07:51:51 UTC, 0 replies.
- RDDToTable - posted by subacini Arunkumar <su...@gmail.com> on 2014/03/01 07:52:32 UTC, 1 replies.
- Re: Error reading HDFS file using spark 0.9.0 / hadoop 2.2.0 - incompatible protobuf 2.5 and 2.4.1 - posted by Prasad <ra...@gmail.com> on 2014/03/01 08:37:09 UTC, 9 replies.
- Spark properties setting doesn't take effect - posted by hequn8128 <ch...@gmail.com> on 2014/03/01 10:30:43 UTC, 0 replies.
- Where does println output go? - posted by David Thomas <dt...@gmail.com> on 2014/03/01 22:49:24 UTC, 1 replies.
- spark-ec2 login expects at least 1 slave - posted by "nicholas.chammas" <ni...@gmail.com> on 2014/03/02 01:47:10 UTC, 7 replies.
- NoSuchMethodError in KafkaReciever - posted by venki-kratos <ve...@thekratos.com> on 2014/03/02 05:26:32 UTC, 2 replies.
- OutOfMemoryError when loading input file - posted by Yonathan Perez <yo...@gmail.com> on 2014/03/02 07:29:51 UTC, 1 replies.
- Python 2.7 + numpy break sortByKey() - posted by "nicholas.chammas" <ni...@gmail.com> on 2014/03/02 07:50:20 UTC, 3 replies.
- Unable to load realm info from SCDynamicStore - posted by xiiik <xi...@qq.com> on 2014/03/02 09:40:47 UTC, 1 replies.
- Spark 0.9.0 - local mode - sc.addJar problem (bug?) - posted by Pierre B <pi...@realimpactanalytics.com> on 2014/03/02 15:48:14 UTC, 3 replies.
- Incrementally add/remove vertices in GraphX - posted by Deepak Nulu <de...@gmail.com> on 2014/03/02 23:38:35 UTC, 13 replies.
- flatten RDD[RDD[T]] - posted by Cosmin Radoi <co...@gmail.com> on 2014/03/03 02:37:13 UTC, 1 replies.
- Help with groupByKey - posted by David Thomas <dt...@gmail.com> on 2014/03/03 04:04:37 UTC, 3 replies.
- Re: Job initialization performance of Spark standalone mode vs YARN - posted by polkosity <po...@gmail.com> on 2014/03/03 06:41:52 UTC, 12 replies.
- Error: Could not find or load main class org.apache.spark.repl.Main on GitBash - posted by goi cto <go...@gmail.com> on 2014/03/03 11:06:24 UTC, 0 replies.
- Problem with "delete spark temp dir" on spark 0.8.1 - posted by goi cto <go...@gmail.com> on 2014/03/03 11:34:16 UTC, 4 replies.
- Beginners Hadoop question - posted by goi cto <go...@gmail.com> on 2014/03/03 12:10:53 UTC, 3 replies.
- Blog : Why Apache Spark is a Crossover Hit for Data Scientists - posted by Sean Owen <so...@cloudera.com> on 2014/03/03 18:12:44 UTC, 0 replies.
- pyspark crash on mesos - posted by bmiller1 <bm...@cs.berkeley.edu> on 2014/03/03 19:21:37 UTC, 4 replies.
- Missing Spark URL after staring the master - posted by Bin Wang <bi...@gmail.com> on 2014/03/03 20:00:28 UTC, 6 replies.
- o.a.s.u.Vector instances for equality - posted by Oleksandr Olgashko <al...@gmail.com> on 2014/03/03 21:23:48 UTC, 2 replies.
- filter operation in pyspark - posted by Mohit Singh <mo...@gmail.com> on 2014/03/04 01:13:32 UTC, 1 replies.
- Shuffle Files - posted by Usman Ghani <us...@platfora.com> on 2014/03/04 07:45:44 UTC, 1 replies.
- RE: Actors and sparkcontext actions - posted by Suraj Satishkumar Sheth <su...@adobe.com> on 2014/03/04 10:20:03 UTC, 3 replies.
- Problem with Spark on Mesos - posted by juanpedromoreno <ju...@gmail.com> on 2014/03/04 13:20:12 UTC, 0 replies.
- RDD Manipulation in Scala. - posted by trottdw <tr...@gmail.com> on 2014/03/04 14:06:12 UTC, 2 replies.
- Fwd: [Scikit-learn-general] Spark+sklearn sprint outcome ? - posted by Nick Pentreath <ni...@gmail.com> on 2014/03/04 16:05:02 UTC, 0 replies.
- sstream.foreachRDD - posted by Adrian Mocanu <am...@verticalscope.com> on 2014/03/04 17:18:58 UTC, 1 replies.
- how to add rddID to tuples in DStream - posted by Adrian Mocanu <am...@verticalscope.com> on 2014/03/04 19:08:00 UTC, 0 replies.
- trying to understand job cancellation - posted by Koert Kuipers <ko...@tresata.com> on 2014/03/04 23:02:50 UTC, 14 replies.
- Spark Streaming Maven Build - posted by Bin Wang <bi...@gmail.com> on 2014/03/05 00:48:19 UTC, 0 replies.
- Word Count on Mesos Cluster - posted by juanpedromoreno <ju...@gmail.com> on 2014/03/05 11:20:58 UTC, 1 replies.
- pyspark: Importing other py-files in PYTHONPATH - posted by Anders Bennehag <an...@tajitsu.com> on 2014/03/05 13:34:50 UTC, 1 replies.
- pyspark and Python virtual enviroments - posted by Christian <ch...@gmail.com> on 2014/03/05 13:54:31 UTC, 2 replies.
- Unable to redirect Spark logs to slf4j - posted by Sergey Parhomenko <sp...@gmail.com> on 2014/03/05 14:12:13 UTC, 6 replies.
- Problem with HBase external table on freshly created EMR cluster - posted by Philip Limbeck <ph...@gmail.com> on 2014/03/05 15:56:41 UTC, 3 replies.
- Re: Explain About Logs NetworkWordcount.scala - posted by eduardocalfaia <e....@unibs.it> on 2014/03/05 16:23:16 UTC, 4 replies.
- disconnected from cluster; reconnecting gives java.net.BindException - posted by Nicholas Chammas <ni...@gmail.com> on 2014/03/05 19:01:32 UTC, 2 replies.
- Spark Worker crashing and Master not seeing recovered worker - posted by Rob Povey <ro...@maana.io> on 2014/03/05 19:27:58 UTC, 1 replies.
- PIG to SPARK - posted by suman bharadwaj <su...@gmail.com> on 2014/03/05 19:29:59 UTC, 2 replies.
- Running spark 0.9 on mesos 0.15 - posted by elyast <lu...@gmail.com> on 2014/03/05 23:51:28 UTC, 0 replies.
- Job aborted: Spark cluster looks down - posted by Christian <ch...@gmail.com> on 2014/03/05 23:57:08 UTC, 4 replies.
- Is there a way to control where RDD partition physically go to? - posted by Yishu Lin <yi...@gmail.com> on 2014/03/06 01:00:19 UTC, 0 replies.
- need someone to help clear some questions. - posted by qingyang li <li...@gmail.com> on 2014/03/06 09:20:48 UTC, 5 replies.
- Re: Implementing a custom Spark shell - posted by Sampo Niskanen <sa...@wellmo.com> on 2014/03/06 10:28:42 UTC, 0 replies.
- Re: Mesos Scheduler - posted by deric <ba...@gmail.com> on 2014/03/06 11:31:04 UTC, 0 replies.
- NPE when create cache table with where condition in subquery - posted by "leosandylh@gmail.com" <le...@gmail.com> on 2014/03/06 11:47:08 UTC, 0 replies.
- Re: Kryo serialization does not compress - posted by pradeeps8 <sr...@gmail.com> on 2014/03/06 11:58:24 UTC, 2 replies.
- Building spark with native library support - posted by Alan Burlison <Al...@oracle.com> on 2014/03/06 18:04:43 UTC, 5 replies.
- Access SBT with proxy - posted by Mayur Rustagi <ma...@gmail.com> on 2014/03/06 19:08:30 UTC, 3 replies.
- Streaming JSON string from REST Api in Spring - posted by sonyjv <so...@yahoo.com> on 2014/03/06 19:21:20 UTC, 4 replies.
- major Spark performance problem - posted by "Livni, Dana" <da...@intel.com> on 2014/03/06 20:49:29 UTC, 4 replies.
- Pig on Spark - posted by Sameer Tilak <ss...@live.com> on 2014/03/06 22:11:28 UTC, 11 replies.
- Re: NoSuchMethodError - Akka - Props - posted by Deepak Nulu <de...@gmail.com> on 2014/03/07 00:45:10 UTC, 2 replies.
- Running actions in loops - posted by Ognen Duzlevski <og...@nengoiksvelzud.com> on 2014/03/07 04:39:28 UTC, 5 replies.
- Please remove me from the mail list.//Re: NoSuchMethodError - Akka - Props - posted by "Qiuxin (robert)" <qi...@huawei.com> on 2014/03/07 07:42:10 UTC, 0 replies.
- how to get size of rdd in memery - posted by qingyang li <li...@gmail.com> on 2014/03/07 09:09:55 UTC, 4 replies.
- Class not found in Kafka-Stream due to multi-thread without correct ClassLoader? - posted by Aries Kong <ar...@gmail.com> on 2014/03/07 13:52:25 UTC, 2 replies.
- java.lang.ClassNotFoundException in spark 0.9.0, shark 0.9.0 (pre-release) and hadoop 2.2.0 - posted by pradeeps8 <sr...@gmail.com> on 2014/03/07 17:56:08 UTC, 1 replies.
- Setting properties in core-site.xml for Spark and Hadoop to access - posted by Nicholas Chammas <ni...@gmail.com> on 2014/03/07 18:32:30 UTC, 2 replies.
- Can anyone offer any insight at all? - posted by Ognen Duzlevski <og...@nengoiksvelzud.com> on 2014/03/07 19:41:43 UTC, 4 replies.
- Help connecting to the cluster - posted by Yana Kadiyska <ya...@gmail.com> on 2014/03/07 21:29:36 UTC, 1 replies.
- [BLOG] Spark on Cassandra w/ Calliope - posted by Brian O'Neill <bo...@alumni.brown.edu> on 2014/03/07 21:48:06 UTC, 4 replies.
- exception while running pi example on yarn cluster - posted by Venkata siva kamesh Bhallamudi <ka...@gmail.com> on 2014/03/08 11:24:42 UTC, 1 replies.
- Spark Training Scripts (from Strata 09/13) Ec2 Deployment scripts having errors - posted by Stephen Boesch <ja...@gmail.com> on 2014/03/08 15:26:31 UTC, 1 replies.
- sequenceFile and groupByKey - posted by Kane <ka...@gmail.com> on 2014/03/09 06:30:15 UTC, 3 replies.
- Aggregators in GraphX - posted by Sebastian Schelter <ss...@apache.org> on 2014/03/09 11:29:23 UTC, 0 replies.
- State of spark docker script - posted by Aureliano Buendia <bu...@gmail.com> on 2014/03/09 17:33:10 UTC, 2 replies.
- Spark on YARN use only one node - posted by Assaf <as...@Intel.com> on 2014/03/09 22:10:43 UTC, 0 replies.
- no stdout output from worker - posted by "Sen, Ranjan [USA]" <Se...@bah.com> on 2014/03/09 22:32:22 UTC, 1 replies.
- TriangleCount & Shortest Path under Spark - posted by yxzhao <yx...@ualr.edu> on 2014/03/09 23:52:17 UTC, 4 replies.
- CDH5b2, Spark 0.9.0 and shark - posted by danoomistmatiste <kk...@yahoo.com> on 2014/03/10 02:01:50 UTC, 0 replies.
- Sbt Permgen - posted by Koert Kuipers <ko...@tresata.com> on 2014/03/10 02:54:41 UTC, 3 replies.
- Re: [External] Re: no stdout output from worker - posted by "Sen, Ranjan [USA]" <Se...@bah.com> on 2014/03/10 07:33:20 UTC, 1 replies.
- Re: [External] Re: no stdout output from worker - posted by Sourav Chandra <so...@livestream.com> on 2014/03/10 07:37:02 UTC, 2 replies.
- what is shark's mailiing list? - posted by qingyang li <li...@gmail.com> on 2014/03/10 08:06:26 UTC, 1 replies.
- subscribe - posted by hequn cheng <ch...@gmail.com> on 2014/03/10 10:32:26 UTC, 2 replies.
- Unsubscribe - posted by Aditya Devarakonda <de...@gmail.com> on 2014/03/10 10:57:35 UTC, 3 replies.
- Using flume to create stream for spark streaming. - posted by Ravi Hemnani <ra...@gmail.com> on 2014/03/10 11:26:18 UTC, 0 replies.
- Log Analyze - posted by Eduardo Costa Alfaia <e....@unibs.it> on 2014/03/10 17:02:12 UTC, 0 replies.
- "Too many open files" exception on reduceByKey - posted by Matthew Cheah <ma...@gmail.com> on 2014/03/10 18:41:02 UTC, 3 replies.
- Room for rent in Aptos - posted by arjun biswas <ar...@gmail.com> on 2014/03/10 18:51:18 UTC, 3 replies.
- Re-distribute cache on new slave nodes for better performance - posted by Praveen Rachabattuni <pr...@gmail.com> on 2014/03/10 19:06:14 UTC, 0 replies.
- Custom RDD - posted by David Thomas <dt...@gmail.com> on 2014/03/10 20:30:42 UTC, 2 replies.
- Java example of using broadcast - posted by "Sen, Ranjan [USA]" <Se...@bah.com> on 2014/03/10 22:30:13 UTC, 0 replies.
- test - posted by Yishu Lin <yi...@gmail.com> on 2014/03/10 22:47:33 UTC, 3 replies.
- computation slows down 10x because of cached RDDs - posted by Koert Kuipers <ko...@tresata.com> on 2014/03/10 23:18:41 UTC, 8 replies.
- SPARK_JAVA_OPTS not picked up by the application - posted by Linlin <li...@gmail.com> on 2014/03/10 23:47:55 UTC, 9 replies.
- How to create RDD from Java in-memory data? - posted by wallacemann <wa...@bandpage.com> on 2014/03/11 01:26:51 UTC, 6 replies.
- if there is shark 0.9 build can be download? - posted by qingyang li <li...@gmail.com> on 2014/03/11 02:29:02 UTC, 0 replies.
- Re: Sharing SparkContext - posted by abhinav chowdary <ab...@gmail.com> on 2014/03/11 02:49:12 UTC, 4 replies.
- is spark 0.9.0 HA? - posted by qingyang li <li...@gmail.com> on 2014/03/11 02:57:55 UTC, 2 replies.
- how to use the log4j for the standalone app - posted by lihu <li...@gmail.com> on 2014/03/11 04:36:34 UTC, 2 replies.
- Using s3 instead of broadcast - posted by Aureliano Buendia <bu...@gmail.com> on 2014/03/11 05:58:42 UTC, 0 replies.
- Block - posted by David Thomas <dt...@gmail.com> on 2014/03/11 07:06:06 UTC, 2 replies.
- pyspark broadcast error - posted by Brad Miller <bm...@eecs.berkeley.edu> on 2014/03/11 07:12:07 UTC, 0 replies.
- building spark over proxy - posted by hades dark <ha...@gmail.com> on 2014/03/11 07:14:23 UTC, 1 replies.
- Reading sequencefile - posted by Jaonary Rabarisoa <ja...@gmail.com> on 2014/03/11 09:54:49 UTC, 2 replies.
- Powered By Spark Page -- Companies & Organizations - posted by Christoph Böhm <li...@gmx.net> on 2014/03/11 10:47:58 UTC, 1 replies.
- (Unknown) - posted by Gino Mathews <gi...@thinkpalm.com> on 2014/03/11 11:07:55 UTC, 1 replies.
- OpenCV + Spark : Where to put System.loadLibrary ? - posted by Jaonary Rabarisoa <ja...@gmail.com> on 2014/03/11 12:06:50 UTC, 3 replies.
- Spark Applicaion (Stages) UI does not recognize line number - posted by "orly.lampert" <or...@kaltura.com> on 2014/03/11 12:59:25 UTC, 0 replies.
- How to set task number in a container - posted by hequn cheng <ch...@gmail.com> on 2014/03/11 13:57:40 UTC, 0 replies.
- Spark stand alone cluster mode - posted by Gino Mathews <gi...@thinkpalm.com> on 2014/03/11 14:02:22 UTC, 0 replies.
- Re: Spark stand alone cluster mode - posted by Yana Kadiyska <ya...@gmail.com> on 2014/03/11 14:42:45 UTC, 0 replies.
- Computation time increasing every super-step - posted by Alessandro Lulli <al...@gmail.com> on 2014/03/11 16:07:03 UTC, 0 replies.
- NO SUCH METHOD EXCEPTION - posted by "Jeyaraj, Arockia R (Arockia)" <ar...@verizon.com> on 2014/03/11 16:19:42 UTC, 1 replies.
- Re: Out of memory on large RDDs - posted by Domen Grabec <do...@celtra.com> on 2014/03/11 16:35:43 UTC, 4 replies.
- RDD.saveAs... - posted by Koert Kuipers <ko...@tresata.com> on 2014/03/11 17:06:27 UTC, 1 replies.
- Pyspark Memory Woes - posted by Aaron Olson <aa...@shopify.com> on 2014/03/11 18:11:18 UTC, 4 replies.
- Spark usage patterns and questions - posted by Sourav Chandra <so...@livestream.com> on 2014/03/11 19:09:29 UTC, 1 replies.
- unsubscribe - posted by Abhishek Pratap <ap...@sagebase.org> on 2014/03/11 20:15:22 UTC, 2 replies.
- is spark.cleaner.ttl safe? - posted by Michael Allman <ms...@allman.ms> on 2014/03/11 21:58:36 UTC, 3 replies.
- Applications for Spark on HDFS - posted by Paul Schooss <pa...@gmail.com> on 2014/03/11 23:09:11 UTC, 2 replies.
- possible bug in Spark's ALS implementation... - posted by Michael Allman <ms...@allman.ms> on 2014/03/11 23:18:43 UTC, 21 replies.
- how to config worker HA - posted by qingyang li <li...@gmail.com> on 2014/03/12 05:11:40 UTC, 1 replies.
- Are all transformations lazy? - posted by David Thomas <dt...@gmail.com> on 2014/03/12 05:49:29 UTC, 6 replies.
- What is the difference between map and flatMap - posted by goi cto <go...@gmail.com> on 2014/03/12 12:50:29 UTC, 3 replies.
- [re-cont] map and flatMap - posted by andy petrella <an...@gmail.com> on 2014/03/12 15:06:56 UTC, 2 replies.
- spark config params conventions - posted by Koert Kuipers <ko...@tresata.com> on 2014/03/12 17:10:18 UTC, 3 replies.
- Is it common in spark to broadcast a 10 gb variable? - posted by Aureliano Buendia <bu...@gmail.com> on 2014/03/12 18:08:47 UTC, 9 replies.
- Changing number of workers for benchmarking purposes - posted by Pierre Borckmans <pi...@realimpactanalytics.com> on 2014/03/12 18:18:40 UTC, 4 replies.
- NLP with Spark - posted by shankark <sh...@gmail.com> on 2014/03/12 19:10:24 UTC, 3 replies.
- building Spark docs - posted by Diana Carroll <dc...@cloudera.com> on 2014/03/12 19:39:17 UTC, 1 replies.
- Unable to start a stand app in scala on a spark cluster - posted by Jaonary Rabarisoa <ja...@gmail.com> on 2014/03/12 22:55:32 UTC, 0 replies.
- Re: Lzo + Protobuf - posted by Vipul Pandey <vi...@gmail.com> on 2014/03/13 01:10:14 UTC, 0 replies.
- Local Standalone Application and shuffle spills - posted by Fabrizio Milo aka misto <mi...@gmail.com> on 2014/03/13 02:22:09 UTC, 1 replies.
- How to monitor the communication process? - posted by moxiecui <mo...@gmail.com> on 2014/03/13 10:04:29 UTC, 1 replies.
- Spark temp dir (spark.local.dir) - posted by Tsai Li Ming <ma...@ltsai.com> on 2014/03/13 10:16:45 UTC, 4 replies.
- Spark Java example using external Jars - posted by dmpour23 <dm...@gmail.com> on 2014/03/13 10:46:59 UTC, 3 replies.
- Large shuffle RDD - posted by Domen Grabec <do...@celtra.com> on 2014/03/13 12:31:03 UTC, 1 replies.
- How to solve : java.io.NotSerializableException: org.apache.hadoop.io.Text ? - posted by Jaonary Rabarisoa <ja...@gmail.com> on 2014/03/13 13:36:30 UTC, 1 replies.
- JVM memory in local threading (SparkLR example) - posted by Tsai Li Ming <ma...@ltsai.com> on 2014/03/13 14:55:11 UTC, 0 replies.
- sample data for pagerank? - posted by Diana Carroll <dc...@cloudera.com> on 2014/03/13 15:13:22 UTC, 2 replies.
- RDD partition task number - posted by Jaonary Rabarisoa <ja...@gmail.com> on 2014/03/13 15:49:54 UTC, 0 replies.
- parson json within rdd's filter() - posted by Ognen Duzlevski <og...@nengoiksvelzud.com> on 2014/03/13 16:04:52 UTC, 6 replies.
- How to work with ReduceByKey? - posted by goi cto <go...@gmail.com> on 2014/03/13 17:56:44 UTC, 1 replies.
- Re: SparkContext startup time out - posted by velvia <ve...@gmail.com> on 2014/03/13 18:11:41 UTC, 2 replies.
- links for the old versions are broken - posted by Walrus theCat <wa...@gmail.com> on 2014/03/13 18:52:59 UTC, 3 replies.
- Kafka in Yarn - posted by aecc <al...@gmail.com> on 2014/03/13 18:58:31 UTC, 0 replies.
- Reading back a sorted RDD - posted by Aureliano Buendia <bu...@gmail.com> on 2014/03/13 19:27:36 UTC, 0 replies.
- Round Robin Partitioner - posted by David Thomas <dt...@gmail.com> on 2014/03/13 19:50:25 UTC, 1 replies.
- combining operations elegantly - posted by Koert Kuipers <ko...@tresata.com> on 2014/03/13 21:39:07 UTC, 5 replies.
- best practices for pushing an RDD into a database - posted by Nicholas Chammas <ni...@gmail.com> on 2014/03/13 22:05:13 UTC, 5 replies.
- NoClassFound Errors with Streaming Twitter - posted by Paul Schooss <pa...@gmail.com> on 2014/03/14 00:38:07 UTC, 0 replies.
- Help vote for Spark talks at the Hadoop Summit - posted by Patrick Wendell <pw...@gmail.com> on 2014/03/14 01:07:17 UTC, 0 replies.
- Are there any plans to develop Graphx Streaming? - posted by Qi Song <so...@gmail.com> on 2014/03/14 08:19:40 UTC, 1 replies.
- Realtime counting job with reading access from flatmappers - posted by Dirk Weissenborn <di...@gmail.com> on 2014/03/14 13:01:48 UTC, 0 replies.
- Fwd: Accessing HDFS file on CDH4.4 through Spark - posted by Pariksheet Barapatre <pb...@gmail.com> on 2014/03/14 18:40:49 UTC, 0 replies.
- slf4j and log4j loop - posted by Adrian Mocanu <am...@verticalscope.com> on 2014/03/14 21:04:30 UTC, 3 replies.
- How to run a jar against spark - posted by Chengi Liu <ch...@gmail.com> on 2014/03/14 23:13:59 UTC, 0 replies.
- new user question on using scala collections inside RDDs - posted by Peter <th...@yahoo.com> on 2014/03/15 03:12:53 UTC, 1 replies.
- spark-streaming - posted by Nathan Kronenfeld <nk...@oculusinfo.com> on 2014/03/15 04:57:24 UTC, 1 replies.
- Can two spark applications share rdd? - posted by 林武康 <vb...@gmail.com> on 2014/03/15 07:29:33 UTC, 0 replies.
- Spark join for skewed dataset - posted by Debasish Das <de...@gmail.com> on 2014/03/16 04:58:27 UTC, 0 replies.
- Task Recalculate or toal failure due to fectchError - posted by guojc <gu...@gmail.com> on 2014/03/16 10:45:08 UTC, 0 replies.
- Contributing pyspark ports - posted by Krakna H <sh...@gmail.com> on 2014/03/16 13:59:40 UTC, 1 replies.
- [Powered by] Yandex Islands powered by Spark - posted by Egor Pahomov <pa...@gmail.com> on 2014/03/16 14:48:27 UTC, 1 replies.
- Separating classloader management from SparkContexts - posted by Punya Biswal <pb...@palantir.com> on 2014/03/16 16:09:24 UTC, 3 replies.
- Maximum memory limits - posted by Debasish Das <de...@gmail.com> on 2014/03/16 19:40:01 UTC, 5 replies.
- How to kill a spark app ? - posted by Debasish Das <de...@gmail.com> on 2014/03/16 19:59:19 UTC, 6 replies.
- Running Spark on a single machine - posted by goi cto <go...@gmail.com> on 2014/03/16 22:38:45 UTC, 3 replies.
- Machine Learning on streaming data - posted by Nasir Khan <na...@gmail.com> on 2014/03/17 01:56:43 UTC, 3 replies.
- Re: worker keeps getting disassociated upon a failed job spark version 0.90 - posted by yukang chen <cy...@gmail.com> on 2014/03/17 08:08:42 UTC, 1 replies.
- Spark shell exits after 1 min - posted by Sai Prasanna <an...@gmail.com> on 2014/03/17 09:02:25 UTC, 2 replies.
- Question about RDD creations in Spark - posted by 王永春 <yo...@audaque.com> on 2014/03/17 10:39:53 UTC, 0 replies.
- Log analyzer and other Spark tools - posted by Roman Pastukhov <me...@gmail.com> on 2014/03/17 15:35:34 UTC, 2 replies.
- example of non-line oriented input data? - posted by Diana Carroll <dc...@cloudera.com> on 2014/03/17 16:18:50 UTC, 20 replies.
- Efficiently map external array data (OpenCV) to spark array - posted by Jaonary Rabarisoa <ja...@gmail.com> on 2014/03/17 16:59:56 UTC, 0 replies.
- Running spark examples - posted by Chengi Liu <ch...@gmail.com> on 2014/03/17 17:55:11 UTC, 2 replies.
- sbt assembly fails - posted by Chengi Liu <ch...@gmail.com> on 2014/03/17 18:25:23 UTC, 7 replies.
- java.lang.NullPointerException met when computing new RDD or use .count - posted by anny9699 <an...@gmail.com> on 2014/03/17 18:36:55 UTC, 3 replies.
- is collect exactly-once? - posted by Adrian Mocanu <am...@verticalscope.com> on 2014/03/17 21:14:43 UTC, 1 replies.
- Problem when execute spark-shell - posted by Yexi Jiang <ye...@gmail.com> on 2014/03/17 22:10:30 UTC, 3 replies.
- inexplicable exceptions in Spark 0.7.3 - posted by Walrus theCat <wa...@gmail.com> on 2014/03/18 00:17:53 UTC, 2 replies.
- Trouble getting hadoop and spark run along side on my vm - posted by Shivani Rao <ra...@gmail.com> on 2014/03/18 02:44:44 UTC, 0 replies.
- Spark 0.9.0-incubation + Apache Hadoop 2.2.0 + YARN encounter Compression codec com.hadoop.compression.lzo.LzoCodec not found - posted by Andrew Lee <al...@altiscale.com> on 2014/03/18 05:00:45 UTC, 1 replies.
- Apache Spark 0.9.0 Build Error - posted by wapisani <wa...@mtu.edu> on 2014/03/18 05:06:24 UTC, 5 replies.
- Running spark examples/scala scripts - posted by Pariksheet Barapatre <pb...@gmail.com> on 2014/03/18 07:37:56 UTC, 3 replies.
- Feed KMeans algorithm with a row major matrix - posted by Jaonary Rabarisoa <ja...@gmail.com> on 2014/03/18 10:46:21 UTC, 1 replies.
- Connect Exception Error in spark interactive shell... - posted by Sai Prasanna <an...@gmail.com> on 2014/03/18 10:59:57 UTC, 4 replies.
- KryoSerializer return null when deserialize Task obj in Executor - posted by 林武康 <vb...@gmail.com> on 2014/03/18 15:20:04 UTC, 0 replies.
- [spark] New article on spark & scalaz-stream (& a bit of ML) - posted by Pascal Voitot Dev <pa...@gmail.com> on 2014/03/18 17:33:02 UTC, 0 replies.
- Re: spark-shell fails - posted by psteckler <st...@stecksoft.com> on 2014/03/18 19:45:07 UTC, 1 replies.
- Maven repo for Spark pre-built with CDH4? - posted by Punya Biswal <pb...@palantir.com> on 2014/03/18 21:55:47 UTC, 1 replies.
- Regarding Successive operation on elements and recursively - posted by yh18190 <yh...@gmail.com> on 2014/03/18 23:11:50 UTC, 0 replies.
- Access original filename in a map function - posted by Uri Laserson <la...@cloudera.com> on 2014/03/19 01:12:35 UTC, 0 replies.
- Pyspark worker memory - posted by Jim Blomo <ji...@gmail.com> on 2014/03/19 02:17:28 UTC, 5 replies.
- Re: There is an error in Graphx - posted by ankurdave <an...@gmail.com> on 2014/03/19 02:49:18 UTC, 1 replies.
- Spark enables us to process Big Data on an ARM cluster !! - posted by Chanwit Kaewkasi <ch...@gmail.com> on 2014/03/19 03:36:00 UTC, 8 replies.
- Re: Unable to read HDFS file -- SimpleApp.java - posted by Prasad <ra...@gmail.com> on 2014/03/19 08:47:55 UTC, 0 replies.
- What's the lifecycle of an rdd? Can I control it? - posted by 林武康 <vb...@gmail.com> on 2014/03/19 09:40:04 UTC, 5 replies.
- Joining two HDFS files in in Spark - posted by Chhaya Vishwakarma <Ch...@lntinfotech.com> on 2014/03/19 09:57:18 UTC, 2 replies.
- Spark worker threads waiting - posted by Domen Grabec <do...@celtra.com> on 2014/03/19 10:40:57 UTC, 6 replies.
- Hadoop Input Format - newAPIHadoopFile - posted by Pariksheet Barapatre <pb...@gmail.com> on 2014/03/19 11:29:45 UTC, 4 replies.
- How to distribute external executable (script) with Spark ? - posted by Jaonary Rabarisoa <ja...@gmail.com> on 2014/03/19 11:57:57 UTC, 2 replies.
- Is shutting down of SparkContext optional? - posted by Roman Pastukhov <me...@gmail.com> on 2014/03/19 15:45:48 UTC, 0 replies.
- Transitive dependency incompatibility - posted by Jaka Jančar <ja...@kubje.org> on 2014/03/19 16:30:56 UTC, 3 replies.
- partitioning via groupByKey - posted by Adrian Mocanu <am...@verticalscope.com> on 2014/03/19 17:32:43 UTC, 1 replies.
- closure scope & Serialization - posted by Domen Grabec <do...@celtra.com> on 2014/03/19 18:04:53 UTC, 0 replies.
- workers die with AssociationError - posted by Eric Kimbrel <er...@soteradefense.com> on 2014/03/19 20:33:56 UTC, 0 replies.
- spark 0.8 examples in local mode - posted by maxpar <hl...@gmail.com> on 2014/03/19 20:56:36 UTC, 1 replies.
- how to sort within DStream or merge consecutive RDDs - posted by Adrian Mocanu <am...@verticalscope.com> on 2014/03/19 21:39:26 UTC, 0 replies.
- saveAsTextFile() failing for large datasets - posted by Soila Pertet Kavulya <sk...@gmail.com> on 2014/03/20 00:26:00 UTC, 0 replies.
- in SF until Friday - posted by Nicholas Chammas <ni...@gmail.com> on 2014/03/20 00:34:54 UTC, 1 replies.
- Shark does not give any results with SELECT count(*) command - posted by qingyang li <li...@gmail.com> on 2014/03/20 02:57:44 UTC, 7 replies.
- 答复: What's the lifecycle of an rdd? Can I control it? - posted by 林武康 <vb...@gmail.com> on 2014/03/20 04:30:17 UTC, 0 replies.
- PySpark worker fails with IOError Broken Pipe - posted by Nicholas Chammas <ni...@gmail.com> on 2014/03/20 06:12:49 UTC, 2 replies.
- Relation between DStream and RDDs - posted by Sanjay Awatramani <sa...@yahoo.com> on 2014/03/20 06:20:52 UTC, 9 replies.
- Largest input data set observed for Spark. - posted by Usman Ghani <us...@platfora.com> on 2014/03/20 08:23:45 UTC, 5 replies.
- Hadoop streaming like feature for Spark - posted by Jaonary Rabarisoa <ja...@gmail.com> on 2014/03/20 09:04:42 UTC, 2 replies.
- Cassandra CQL read/write from spark using Java - [Remoting] Remoting error: [Startup timed out] - posted by sonyjv <so...@yahoo.com> on 2014/03/20 10:17:12 UTC, 1 replies.
- Error while reading from HDFS Simple application - posted by Laeeq Ahmed <la...@yahoo.com> on 2014/03/20 14:39:37 UTC, 0 replies.
- Accessing the reduce key - posted by Surendranauth Hiraman <su...@velos.io> on 2014/03/20 14:48:31 UTC, 6 replies.
- graphx samples in Java - posted by David Soroko <ds...@attivio.com> on 2014/03/20 15:36:49 UTC, 0 replies.
- Reload RDD saved with saveAsObjectFile - posted by Jaonary Rabarisoa <ja...@gmail.com> on 2014/03/20 16:01:42 UTC, 1 replies.
- is sorting necessary after join of sorted RDD - posted by Adrian Mocanu <am...@verticalscope.com> on 2014/03/20 17:00:24 UTC, 0 replies.
- sort order after reduceByKey / groupByKey - posted by Ameet Kini <am...@gmail.com> on 2014/03/20 20:20:22 UTC, 2 replies.
- Flume Corrupted Stream Error - posted by bbuild11 <be...@hotmail.com> on 2014/03/20 21:07:08 UTC, 0 replies.
- DStream spark paper - posted by Adrian Mocanu <am...@verticalscope.com> on 2014/03/20 21:36:23 UTC, 1 replies.
- Sprak Job stuck - posted by "mohit.goyal" <mo...@guavus.com> on 2014/03/21 06:24:16 UTC, 1 replies.
- Does RDD.saveAsObjectFile appends or create a new file ? - posted by Jaonary Rabarisoa <ja...@gmail.com> on 2014/03/21 10:59:00 UTC, 0 replies.
- Persist streams to text files - posted by gaganbm <ga...@gmail.com> on 2014/03/21 11:42:33 UTC, 0 replies.
- Re: How to use FlumeInputDStream in spark cluster? - posted by Ravi Hemnani <ra...@gmail.com> on 2014/03/21 13:31:14 UTC, 5 replies.
- N-Fold validation and RDD partitions - posted by Jaonary Rabarisoa <ja...@gmail.com> on 2014/03/21 14:32:34 UTC, 6 replies.
- Parallelizing job execution - posted by Ognen Duzlevski <og...@nengoiksvelzud.com> on 2014/03/21 14:43:38 UTC, 0 replies.
- Sliding Window operations do not work as documented - posted by Sanjay Awatramani <sa...@yahoo.com> on 2014/03/21 15:33:53 UTC, 2 replies.
- assumption data must fit in memory per reducer - posted by Koert Kuipers <ko...@tresata.com> on 2014/03/21 15:35:32 UTC, 0 replies.
- Spark executor paths - posted by deric <ba...@gmail.com> on 2014/03/21 18:05:28 UTC, 0 replies.
- Re: SequenceFileRDDFunctions cannot be used output of spark package - posted by "deenar.toraskar" <de...@db.com> on 2014/03/21 18:53:39 UTC, 7 replies.
- Spark and Hadoop cluster - posted by Sameer Tilak <ss...@live.com> on 2014/03/21 19:19:31 UTC, 2 replies.
- Spark streaming kafka _output_ - posted by Benjamin Black <b...@b3k.us> on 2014/03/21 23:58:10 UTC, 0 replies.
- How to save as a single file efficiently? - posted by Aureliano Buendia <bu...@gmail.com> on 2014/03/22 00:04:36 UTC, 4 replies.
- Shark Table for >22 columns - posted by subacini Arunkumar <su...@gmail.com> on 2014/03/22 00:53:35 UTC, 1 replies.
- pySpark memory usage - posted by Jim Blomo <ji...@gmail.com> on 2014/03/22 02:18:44 UTC, 5 replies.
- unable to build spark - sbt/sbt: line 50: killed - posted by Bharath Bhushan <ma...@outlook.com> on 2014/03/22 05:49:46 UTC, 0 replies.
- distinct on huge dataset - posted by Kane <ka...@gmail.com> on 2014/03/22 06:45:55 UTC, 15 replies.
- Yet another question on saving RDD into files - posted by Jaonary Rabarisoa <ja...@gmail.com> on 2014/03/22 10:49:36 UTC, 0 replies.
- 答复: unable to build spark - sbt/sbt: line 50: killed - posted by 林武康 <vb...@gmail.com> on 2014/03/22 13:03:28 UTC, 2 replies.
- Configuring shuffle write directory - posted by Tsai Li Ming <ma...@ltsai.com> on 2014/03/22 17:11:51 UTC, 5 replies.
- Kmeans example reduceByKey slow - posted by Tsai Li Ming <ma...@ltsai.com> on 2014/03/23 11:15:23 UTC, 7 replies.
- sbt/sbt assembly fails with ssl certificate error - posted by Bharath Bhushan <ma...@outlook.com> on 2014/03/23 13:38:51 UTC, 5 replies.
- error loading large files in PySpark 0.9.0 - posted by Jeremy Freeman <fr...@gmail.com> on 2014/03/23 18:11:26 UTC, 2 replies.
- No space left on device exception - posted by Ognen Duzlevski <og...@nengoiksvelzud.com> on 2014/03/23 23:06:36 UTC, 12 replies.
- Problem with SparkR - posted by Jacques Basaldúa <ja...@dybot.com> on 2014/03/24 00:48:45 UTC, 1 replies.
- is it possible to access the inputsplit in Spark directly? - posted by hwpstorage <hw...@gmail.com> on 2014/03/24 02:41:05 UTC, 0 replies.
- How many partitions is my RDD split into? - posted by Nicholas Chammas <ni...@gmail.com> on 2014/03/24 05:24:04 UTC, 7 replies.
- spark executor/driver log files management - posted by Sourav Chandra <so...@livestream.com> on 2014/03/24 08:20:39 UTC, 3 replies.
- GC overhead limit exceeded in Spark-interactive shell - posted by Sai Prasanna <an...@gmail.com> on 2014/03/24 08:24:45 UTC, 5 replies.
- Re: Java API - Serialization Issue - posted by santhoma <sa...@yahoo.com> on 2014/03/24 10:36:45 UTC, 2 replies.
- Re: java.io.NotSerializableException Of dependent Java lib. - posted by santhoma <sa...@yahoo.com> on 2014/03/24 10:41:19 UTC, 1 replies.
- RDD usage - posted by Chieh-Yen <r0...@csie.ntu.edu.tw> on 2014/03/24 11:13:38 UTC, 2 replies.
- mapPartitions use case - posted by Jaonary Rabarisoa <ja...@gmail.com> on 2014/03/24 13:57:03 UTC, 1 replies.
- Akka error with largish job (works fine for smaller versions) - posted by Nathan Kronenfeld <nk...@oculusinfo.com> on 2014/03/24 14:13:02 UTC, 2 replies.
- Problem starting worker processes in standalone mode - posted by Yonathan Perez <yo...@gmail.com> on 2014/03/24 16:28:28 UTC, 1 replies.
- remove duplicates - posted by Adrian Mocanu <am...@verticalscope.com> on 2014/03/24 17:44:59 UTC, 0 replies.
- distinct in data frame in spark - posted by Chengi Liu <ch...@gmail.com> on 2014/03/24 18:21:37 UTC, 1 replies.
- Comparing GraphX and GraphLab - posted by Niko Stahl <r....@gmail.com> on 2014/03/24 18:59:26 UTC, 4 replies.
- quick start guide: building a standalone scala program - posted by Diana Carroll <dc...@cloudera.com> on 2014/03/24 20:44:41 UTC, 18 replies.
- [bug?] streaming window unexpected behaviour - posted by Adrian Mocanu <am...@verticalscope.com> on 2014/03/24 21:12:41 UTC, 4 replies.
- question about partitions - posted by Walrus theCat <wa...@gmail.com> on 2014/03/24 21:28:18 UTC, 3 replies.
- Writing RDDs to HDFS - posted by Ognen Duzlevski <og...@nengoiksvelzud.com> on 2014/03/24 22:00:00 UTC, 6 replies.
- Splitting RDD and Grouping together to perform computation - posted by yh18190 <yh...@gmail.com> on 2014/03/25 00:59:48 UTC, 0 replies.
- Cluster taking a long time with not much activity (or so I think) - posted by Vipul Pandey <vi...@gmail.com> on 2014/03/25 01:13:45 UTC, 4 replies.
- Re: Splitting RDD and Grouping together to perform computation - posted by Walrus theCat <wa...@gmail.com> on 2014/03/25 01:16:15 UTC, 11 replies.
- coalescing RDD into equally sized partitions - posted by Walrus theCat <wa...@gmail.com> on 2014/03/25 01:20:19 UTC, 3 replies.
- 答复: RDD usage - posted by 林武康 <vb...@gmail.com> on 2014/03/25 02:45:42 UTC, 1 replies.
- 答复: 答复: RDD usage - posted by 林武康 <vb...@gmail.com> on 2014/03/25 04:11:26 UTC, 2 replies.
- Spark Streaming ZeroMQ Java Example - posted by goofy real <go...@gmail.com> on 2014/03/25 07:28:44 UTC, 1 replies.
- graph.persist error - posted by moxing <do...@alibaba-inc.com> on 2014/03/25 09:42:35 UTC, 0 replies.
- How to set environment variable for a spark job - posted by santhoma <sa...@yahoo.com> on 2014/03/25 09:59:35 UTC, 5 replies.
- tracking resource usage for spark-shell commands - posted by Bharath Bhushan <ma...@outlook.com> on 2014/03/25 11:04:34 UTC, 2 replies.
- Worker Threads Vs Spark Executor Memory - posted by "Annamalai, Sai IN BLR STS" <sa...@siemens.com> on 2014/03/25 13:34:08 UTC, 0 replies.
- Change print() in JavaNetworkWordCount - posted by Eduardo Costa Alfaia <e....@unibs.it> on 2014/03/25 13:37:02 UTC, 4 replies.
- tuple as keys in pyspark show up reversed - posted by Friso van Vollenhoven <f....@gmail.com> on 2014/03/25 13:53:17 UTC, 1 replies.
- K-means faster on Mahout then on Spark - posted by Egor Pahomov <pa...@gmail.com> on 2014/03/25 14:20:02 UTC, 4 replies.
- Running a task once on each executor - posted by "deenar.toraskar" <de...@db.com> on 2014/03/25 18:03:07 UTC, 9 replies.
- Using an external jar in the driver, in yarn-standalone mode. - posted by Julien Carme <ju...@gmail.com> on 2014/03/25 18:05:55 UTC, 6 replies.
- ClassCastException when using saveAsTextFile - posted by Niko Stahl <r....@gmail.com> on 2014/03/25 18:55:11 UTC, 1 replies.
- Implementation problem with Streaming - posted by Sanjay Awatramani <sa...@yahoo.com> on 2014/03/25 19:04:43 UTC, 1 replies.
- Static ports for fileserver and httpbroadcast in Spark driver - posted by Guillermo Cabrera2 <gc...@us.ibm.com> on 2014/03/25 21:46:33 UTC, 0 replies.
- Spark 0.9.1 - How to run bin/spark-class with my own hadoop jar files? - posted by Andrew Lee <al...@hotmail.com> on 2014/03/25 21:47:05 UTC, 2 replies.
- [BLOG] Shark on Cassandra - posted by Brian O'Neill <bo...@alumni.brown.edu> on 2014/03/26 02:17:31 UTC, 0 replies.
- [BLOG] : Shark on Cassandra - posted by Brian O'Neill <bo...@alumni.brown.edu> on 2014/03/26 02:18:43 UTC, 2 replies.
- Building Spark 0.9.x for CDH5 with mrv1 installation (Protobuf 2.5 upgrade) - posted by Gary Malouf <ma...@gmail.com> on 2014/03/26 02:42:19 UTC, 1 replies.
- any distributed cache mechanism available in spark ? - posted by santhoma <sa...@yahoo.com> on 2014/03/26 06:35:25 UTC, 0 replies.
- Spark executor memory & relationship with worker threads - posted by Sai Prasanna <an...@gmail.com> on 2014/03/26 06:36:38 UTC, 0 replies.
- Re: rdd.saveAsTextFile problem - posted by gaganbm <ga...@gmail.com> on 2014/03/26 06:43:22 UTC, 1 replies.
- ALS memory limits - posted by Debasish Das <de...@gmail.com> on 2014/03/26 07:06:25 UTC, 0 replies.
- RDD Collect returns empty arrays - posted by gaganbm <ga...@gmail.com> on 2014/03/26 07:26:59 UTC, 0 replies.
- Spark Streaming - Shared hashmaps - posted by Bryan Bryan <br...@gmail.com> on 2014/03/26 09:19:54 UTC, 2 replies.
- Distributed running in Spark Interactive shell - posted by Sai Prasanna <an...@gmail.com> on 2014/03/26 13:54:38 UTC, 8 replies.
- java.lang.ClassNotFoundException - posted by Jaonary Rabarisoa <ja...@gmail.com> on 2014/03/26 14:42:48 UTC, 9 replies.
- Re: spark-shell on standalone cluster gives error " no mesos in java.library.path" - posted by Christoph Böhm <li...@gmx.net> on 2014/03/26 15:52:10 UTC, 0 replies.
- Shark drop table partitions - posted by vinay Bajaj <vb...@gmail.com> on 2014/03/26 15:57:29 UTC, 0 replies.
- closures & moving averages (state) - posted by Adrian Mocanu <am...@verticalscope.com> on 2014/03/26 16:34:31 UTC, 2 replies.
- streaming questions - posted by Diana Carroll <dc...@cloudera.com> on 2014/03/26 19:08:32 UTC, 4 replies.
- interleave partitions? - posted by Walrus theCat <wa...@gmail.com> on 2014/03/26 19:11:30 UTC, 1 replies.
- Spark Streaming + Kafka + Mesos/Marathon strangeness - posted by Scott Clasen <sc...@gmail.com> on 2014/03/26 20:18:19 UTC, 8 replies.
- YARN problem using an external jar in worker nodes Inbox x - posted by Sung Hwan Chung <co...@cs.stanford.edu> on 2014/03/26 21:17:49 UTC, 5 replies.
- Announcing Spark SQL - posted by Michael Armbrust <mi...@databricks.com> on 2014/03/26 22:58:25 UTC, 24 replies.
- All pairs shortest paths? - posted by Ryan Compton <co...@gmail.com> on 2014/03/26 23:04:50 UTC, 4 replies.
- Spark preferred compression format - posted by Debasish Das <de...@gmail.com> on 2014/03/27 04:33:17 UTC, 0 replies.
- Not getting it - posted by lannyripple <la...@gmail.com> on 2014/03/27 05:34:31 UTC, 3 replies.
- Re: - posted by Sonal Goyal <so...@gmail.com> on 2014/03/27 08:36:54 UTC, 3 replies.
- Re: Configuring distributed caching with Spark and YARN - posted by santhoma <sa...@yahoo.com> on 2014/03/27 08:58:33 UTC, 1 replies.
- Run spark on mesos remotely - posted by Wush Wu <wu...@bridgewell.com> on 2014/03/27 09:09:16 UTC, 3 replies.
- java.lang.NoClassDefFoundError: org/apache/spark/util/Vector - posted by Kal El <pi...@yahoo.com> on 2014/03/27 09:57:45 UTC, 0 replies.
- Spark Pipe wrapException - posted by ''癫、砜' <29...@qq.com> on 2014/03/27 13:07:45 UTC, 0 replies.
- WikipediaPageRank Data Set - posted by Niko Stahl <r....@gmail.com> on 2014/03/27 14:45:46 UTC, 3 replies.
- spark streaming: what is awaitTermination()? - posted by Diana Carroll <dc...@cloudera.com> on 2014/03/27 15:02:40 UTC, 1 replies.
- function state lost when next RDD is processed - posted by Adrian Mocanu <am...@verticalscope.com> on 2014/03/27 15:04:14 UTC, 4 replies.
- spark streaming and the spark shell - posted by Diana Carroll <dc...@cloudera.com> on 2014/03/27 16:02:58 UTC, 7 replies.
- Spark powered wikipedia analysis and exploration - posted by Guillaume Pitel <gu...@exensa.com> on 2014/03/27 16:12:02 UTC, 0 replies.
- GC overhead limit exceeded - posted by Sai Prasanna <an...@gmail.com> on 2014/03/27 16:21:33 UTC, 8 replies.
- StreamingContext.transform on a DStream - posted by Adrian Mocanu <am...@verticalscope.com> on 2014/03/27 16:57:00 UTC, 1 replies.
- KafkaInputDStream mapping of partitions to tasks - posted by Scott Clasen <sc...@gmail.com> on 2014/03/27 19:09:16 UTC, 12 replies.
- how to create a DStream from bunch of RDDs - posted by Adrian Mocanu <am...@verticalscope.com> on 2014/03/27 21:17:13 UTC, 0 replies.
- RDD[Array] question - posted by Walrus theCat <wa...@gmail.com> on 2014/03/28 00:17:31 UTC, 0 replies.
- Using ProtoBuf 2.5 for messages with Spark Streaming - posted by Kanwaldeep <ka...@gmail.com> on 2014/03/28 02:41:39 UTC, 1 replies.
- ArrayIndexOutOfBoundsException in ALS.implicit - posted by bearrito <j....@gmail.com> on 2014/03/28 04:38:47 UTC, 1 replies.
- Replicating RDD elements - posted by David Thomas <dt...@gmail.com> on 2014/03/28 04:54:24 UTC, 2 replies.
- Setting SPARK_MEM higher than available memory in driver - posted by Tsai Li Ming <ma...@ltsai.com> on 2014/03/28 06:48:34 UTC, 2 replies.
- Exception on simple pyspark script - posted by idanzalz <id...@gmail.com> on 2014/03/28 08:59:24 UTC, 1 replies.
- spark.akka.frameSize setting problem - posted by lihu <li...@gmail.com> on 2014/03/28 09:02:43 UTC, 0 replies.
- Strange behavior of RDD.cartesian - posted by Jaonary Rabarisoa <ja...@gmail.com> on 2014/03/28 10:58:21 UTC, 3 replies.
- How does Spark handle executor down? RDD in this executor will be recomputed automatically? - posted by colt_colt <we...@hotmail.com> on 2014/03/28 11:25:09 UTC, 0 replies.
- Re: How does Spark handle executor down? RDD in this executor will be recomputed automatically? - posted by Sonal Goyal <so...@gmail.com> on 2014/03/28 12:11:21 UTC, 0 replies.
- groupByKey is taking more time - posted by "mohit.goyal" <mo...@guavus.com> on 2014/03/28 12:24:40 UTC, 0 replies.
- Guidelines for Spark Cluster Sizing - posted by Sonal Goyal <so...@gmail.com> on 2014/03/28 12:25:08 UTC, 0 replies.
- streaming: code to simulate a network socket data source - posted by Diana Carroll <dc...@cloudera.com> on 2014/03/28 14:08:15 UTC, 0 replies.
- Do all classes involving RDD operation need to be registered? - posted by anny9699 <an...@gmail.com> on 2014/03/28 17:37:09 UTC, 5 replies.
- 2 weeks until the deadline - Spark Summit call for submissions. - posted by Scott walent <sc...@gmail.com> on 2014/03/28 20:05:05 UTC, 0 replies.
- Mutable tagging RDD rows ? - posted by Sung Hwan Chung <co...@cs.stanford.edu> on 2014/03/29 01:16:55 UTC, 3 replies.
- working with MultiTableInputFormat - posted by "Livni, Dana" <da...@intel.com> on 2014/03/29 08:42:47 UTC, 0 replies.
- Zip or map elements to create new RDD - posted by yh18190 <yh...@gmail.com> on 2014/03/29 12:57:47 UTC, 2 replies.
- How to index each map operation???? - posted by yh18190 <yh...@gmail.com> on 2014/03/29 14:05:18 UTC, 0 replies.
- Limiting number of reducers performance implications - posted by Matthew Cheah <ma...@gmail.com> on 2014/03/30 00:58:38 UTC, 0 replies.
- SQL on Spark - Shark or SparkSQL - posted by Manoj Samel <ma...@gmail.com> on 2014/03/30 03:48:45 UTC, 2 replies.
- Cross validation is missing in machine learning examples - posted by Aureliano Buendia <bu...@gmail.com> on 2014/03/30 07:25:53 UTC, 1 replies.
- Can we convert scala.collection.ArrayBuffer[(Int,Double)] to org.spark.RDD[(Int,Double]) - posted by yh18190 <yh...@gmail.com> on 2014/03/30 12:22:54 UTC, 1 replies.
- Error in SparkSQL Example - posted by Manoj Samel <ma...@gmail.com> on 2014/03/30 16:31:30 UTC, 3 replies.
- Shouldn't the UNION of SchemaRDDs produce SchemaRDD ? - posted by Manoj Samel <ma...@gmail.com> on 2014/03/30 16:56:32 UTC, 3 replies.
- SparkSQL "where" with BigDecimal type gives stacktrace - posted by Manoj Samel <ma...@gmail.com> on 2014/03/30 19:16:44 UTC, 4 replies.
- Spark webUI - application details page - posted by David Thomas <dt...@gmail.com> on 2014/03/30 19:30:47 UTC, 1 replies.
- Spark-ec2 setup is getting slower and slower - posted by Aureliano Buendia <bu...@gmail.com> on 2014/03/31 00:12:36 UTC, 1 replies.
- Re: [shark-users] SQL on Spark - Shark or SparkSQL - posted by Matei Zaharia <ma...@gmail.com> on 2014/03/31 04:35:04 UTC, 2 replies.
- groupBy RDD does not have grouping column ? - posted by Manoj Samel <ma...@gmail.com> on 2014/03/31 06:52:11 UTC, 2 replies.
- batching the output - posted by Vipul Pandey <vi...@gmail.com> on 2014/03/31 07:11:48 UTC, 0 replies.
- Re: Task not serializable? - posted by Daniel Liu <hp...@gmail.com> on 2014/03/31 10:14:20 UTC, 0 replies.
- java.lang.ClassNotFoundException - spark on mesos - posted by Bharath Bhushan <ma...@outlook.com> on 2014/03/31 15:16:19 UTC, 5 replies.
- yarn.application.classpath in yarn-site.xml - posted by Dan <zs...@gmail.com> on 2014/03/31 16:27:12 UTC, 0 replies.
- Best practices: Parallelized write to / read from S3 - posted by Nicholas Chammas <ni...@gmail.com> on 2014/03/31 17:49:05 UTC, 4 replies.
- network wordcount example - posted by eric perler <er...@hotmail.com> on 2014/03/31 18:18:28 UTC, 1 replies.
- Calling Spark enthusiasts in NYC - posted by Andy Konwinski <an...@gmail.com> on 2014/03/31 19:28:57 UTC, 12 replies.
- how spark dstream handles congestion? - posted by Dong Mo <mo...@gmail.com> on 2014/03/31 20:05:02 UTC, 2 replies.
- Calling Spahk enthusiasts in Boston - posted by Nicholas Chammas <ni...@gmail.com> on 2014/03/31 20:52:47 UTC, 1 replies.