You are viewing a plain text version of this content. The canonical link for it is here.
- Re: Could not compute split, block not found - posted by Tobias Pfeiffer <tg...@preferred.jp> on 2014/07/01 02:53:29 UTC, 3 replies.
- Re: TaskNotSerializable when invoking KMeans.run - posted by Jaideep Dhok <ja...@inmobi.com> on 2014/07/01 05:07:18 UTC, 0 replies.
- Re: History Server renered page not suitable for load balancing - posted by elyast <lu...@gmail.com> on 2014/07/01 06:02:43 UTC, 0 replies.
- Re: little confused about SPARK_JAVA_OPTS alternatives - posted by elyast <lu...@gmail.com> on 2014/07/01 06:09:38 UTC, 0 replies.
- Re: spark job stuck when running on mesos fine grained mode - posted by elyast <lu...@gmail.com> on 2014/07/01 06:13:10 UTC, 0 replies.
- Re: Spark 1.0 and Logistic Regression Python Example - posted by Xiangrui Meng <me...@gmail.com> on 2014/07/01 06:13:31 UTC, 1 replies.
- Re: org.jboss.netty.channel.ChannelException: Failed to bind to: master/1xx.xx..xx:0 - posted by MEETHU MATHEW <me...@yahoo.co.in> on 2014/07/01 08:59:44 UTC, 2 replies.
- Re: Serialization of objects - posted by Aaron Davidson <il...@gmail.com> on 2014/07/01 09:08:22 UTC, 0 replies.
- build spark assign version number myself? - posted by majian <ma...@nq.com> on 2014/07/01 09:21:44 UTC, 2 replies.
- issue with running example code - posted by Gurvinder Singh <gu...@uninett.no> on 2014/07/01 09:28:23 UTC, 2 replies.
- Questions about disk IOs - posted by Charles Li <li...@gmail.com> on 2014/07/01 09:55:46 UTC, 4 replies.
- Question about VD and ED - posted by Bin WU <bw...@connect.ust.hk> on 2014/07/01 10:49:59 UTC, 1 replies.
- RSpark installation on Windows - posted by Stuti Awasthi <st...@hcl.com> on 2014/07/01 11:31:10 UTC, 0 replies.
- Spark Streaming question batch size - posted by Laeeq Ahmed <la...@yahoo.com> on 2014/07/01 11:34:43 UTC, 2 replies.
- Error: UnionPartition cannot be cast to org.apache.spark.rdd.HadoopPartition - posted by Honey Joshi <ho...@ideata-analytics.com> on 2014/07/01 11:41:35 UTC, 4 replies.
- Window Size - posted by Laeeq Ahmed <la...@yahoo.com> on 2014/07/01 11:51:41 UTC, 0 replies.
- java.io.FileNotFoundException: http:///broadcast_1 - posted by Honey Joshi <ho...@ideata-analytics.com> on 2014/07/01 12:15:47 UTC, 0 replies.
- SparkKMeans.scala from examples will show: NoClassDefFoundError: breeze/linalg/Vector - posted by Wanda Hawk <wa...@yahoo.com> on 2014/07/01 13:39:57 UTC, 7 replies.
- Failed to launch Worker - posted by MEETHU MATHEW <me...@yahoo.co.in> on 2014/07/01 14:38:52 UTC, 0 replies.
- Re: Failed to launch Worker - posted by Akhil Das <ak...@sigmoidanalytics.com> on 2014/07/01 14:44:25 UTC, 3 replies.
- Re: Changing log level of spark - posted by Philip Limbeck <ph...@gmail.com> on 2014/07/01 15:20:33 UTC, 2 replies.
- difference between worker and slave nodes - posted by aminn_524 <am...@yahoo.com> on 2014/07/01 15:21:36 UTC, 0 replies.
- Spark 1.0: Unable to Read LZO Compressed File - posted by "Uddin, Nasir M." <nu...@dtcc.com> on 2014/07/01 16:15:37 UTC, 1 replies.
- Spark Summit 2014 Day 2 Video Streams? - posted by Aditya Varun Chadha <ad...@gmail.com> on 2014/07/01 18:37:17 UTC, 5 replies.
- spark streaming rate limiting from kafka - posted by Chen Song <ch...@gmail.com> on 2014/07/01 18:57:01 UTC, 13 replies.
- Re: Improving Spark multithreaded performance? - posted by Kyle Ellrott <ke...@soe.ucsc.edu> on 2014/07/01 19:01:08 UTC, 0 replies.
- Re: Re: spark table to hive table - posted by John Omernik <jo...@omernik.com> on 2014/07/01 20:16:35 UTC, 1 replies.
- PySpark Driver from Jython - posted by Surendranauth Hiraman <su...@velos.io> on 2014/07/01 20:31:38 UTC, 0 replies.
- Spark SQL : Join throws exception - posted by Subacini B <su...@gmail.com> on 2014/07/01 21:06:58 UTC, 2 replies.
- why is toBreeze private everywhere in mllib? - posted by Koert Kuipers <ko...@tresata.com> on 2014/07/01 21:17:59 UTC, 1 replies.
- [ANNOUNCE] Flambo - A Clojure DSL for Apache Spark - posted by Soren Macbeth <so...@yieldbot.com> on 2014/07/01 21:31:54 UTC, 1 replies.
- spark-submit script and spark.files.userClassPathFirst - posted by _soumya_ <so...@gmail.com> on 2014/07/02 00:01:36 UTC, 0 replies.
- slf4j multiple bindings - posted by Bill Jay <bi...@gmail.com> on 2014/07/02 01:19:08 UTC, 0 replies.
- Lost TID: Loss was due to fetch failure from BlockManagerId - posted by Mohammed Guller <mo...@glassbeam.com> on 2014/07/02 02:57:09 UTC, 3 replies.
- Re: multiple passes in mapPartitions - posted by Chris Fregly <ch...@fregly.com> on 2014/07/02 04:19:26 UTC, 1 replies.
- Re: Fw: How Spark Choose Worker Nodes for respective HDFS block - posted by Chris Fregly <ch...@fregly.com> on 2014/07/02 04:31:56 UTC, 0 replies.
- Re: Help understanding spark.task.maxFailures - posted by Mayur Rustagi <ma...@gmail.com> on 2014/07/02 08:28:06 UTC, 0 replies.
- Re: spark streaming counter metrics - posted by Mayur Rustagi <ma...@gmail.com> on 2014/07/02 08:32:29 UTC, 1 replies.
- Re: Help alleviating OOM errors - posted by Mayur Rustagi <ma...@gmail.com> on 2014/07/02 08:40:03 UTC, 2 replies.
- Re: Callbacks on freeing up of RDDs - posted by Mayur Rustagi <ma...@gmail.com> on 2014/07/02 08:42:23 UTC, 0 replies.
- Configure and run external process with RDD.pipe - posted by Jaonary Rabarisoa <ja...@gmail.com> on 2014/07/02 09:01:43 UTC, 0 replies.
- Re: Serializer or Out-of-Memory issues? - posted by Mayur Rustagi <ma...@gmail.com> on 2014/07/02 09:17:56 UTC, 0 replies.
- Help: WARN AbstractNioSelector: Unexpected exception in the selector loop. java.lang.OutOfMemoryError: Java heap space - posted by innowireless TaeYun Kim <ta...@innowireless.co.kr> on 2014/07/02 10:58:14 UTC, 2 replies.
- Is it possible to run HiveThriftServer2 based on SparkSQL in YARN now? - posted by 田毅 <ti...@asiainfo.com> on 2014/07/02 10:59:05 UTC, 0 replies.
- Where to set proxy in order to run ./install-dev.sh for SparkR - posted by Stuti Awasthi <st...@hcl.com> on 2014/07/02 13:04:53 UTC, 1 replies.
- How to use groupByKey and CqlPagingInputFormat - posted by Martin Gammelsæter <ma...@gmail.com> on 2014/07/02 13:36:09 UTC, 4 replies.
- java options for spark-1.0.0 - posted by Wanda Hawk <wa...@yahoo.com> on 2014/07/02 13:50:25 UTC, 2 replies.
- java.io.FileNotFoundException: shuffle - posted by nit <ni...@gmail.com> on 2014/07/02 14:38:37 UTC, 0 replies.
- Custom Serialization - posted by Andrea Esposito <an...@gmail.com> on 2014/07/02 14:59:11 UTC, 0 replies.
- Re: How to terminate job from the task code? - posted by Piotr Kołaczkowski <pk...@datastax.com> on 2014/07/02 15:30:28 UTC, 0 replies.
- [mllib] strange/buggy results with RidgeRegressionWithSGD - posted by Eustache DIEMERT <eu...@diemert.fr> on 2014/07/02 16:11:39 UTC, 6 replies.
- Stream into Parquet Table - posted by prashant amar <am...@gmail.com> on 2014/07/02 18:27:58 UTC, 0 replies.
- Re: Execution stalls in LogisticRegressionWithSGD - posted by Bharath Ravi Kumar <re...@gmail.com> on 2014/07/02 18:34:19 UTC, 10 replies.
- Re: streaming questions - posted by mcampbell <mi...@gmail.com> on 2014/07/02 18:36:16 UTC, 0 replies.
- Run spark unit test on Windows 7 - posted by Konstantin Kudryavtsev <ku...@gmail.com> on 2014/07/02 18:38:33 UTC, 9 replies.
- installing spark 1 on hadoop 1 - posted by Imran Akbar <im...@infoscoutinc.com> on 2014/07/02 18:47:04 UTC, 2 replies.
- Worker can not find custom KryoRegistrator - posted by dash <bs...@nd.edu> on 2014/07/02 19:03:40 UTC, 1 replies.
- Spark, Logging Issues: slf4j or log4j - posted by Shivani Rao <ra...@gmail.com> on 2014/07/02 19:11:20 UTC, 0 replies.
- NullPointerException on ExternalAppendOnlyMap - posted by Konstantin Kudryavtsev <ku...@gmail.com> on 2014/07/02 19:22:32 UTC, 1 replies.
- LIMIT with offset in SQL queries - posted by durin <ma...@simon-schaefer.net> on 2014/07/02 22:37:52 UTC, 3 replies.
- MLLib : Math on Vector and Matrix - posted by Thunder Stumpges <th...@gmail.com> on 2014/07/02 23:00:51 UTC, 6 replies.
- Kafka - streaming from multiple topics - posted by Sergey Malov <sm...@collective.com> on 2014/07/02 23:47:34 UTC, 4 replies.
- Use Spark Streaming to update result whenever data come - posted by Bill Jay <bi...@gmail.com> on 2014/07/03 00:09:51 UTC, 8 replies.
- Spark SQL - groupby - posted by Subacini B <su...@gmail.com> on 2014/07/03 00:34:42 UTC, 2 replies.
- Shark Vs Spark SQL - posted by Subacini B <su...@gmail.com> on 2014/07/03 00:53:14 UTC, 5 replies.
- reduceByKey Not Being Called by Spark Streaming - posted by "Dan H." <dc...@gmail.com> on 2014/07/03 01:01:16 UTC, 1 replies.
- Spark Streaming- Input from Kafka, output to HBase - posted by JiajiaJing <jj...@gmail.com> on 2014/07/03 01:12:11 UTC, 0 replies.
- AWS Credentials for private S3 reads - posted by Brian Gawalt <bg...@gmail.com> on 2014/07/03 01:17:11 UTC, 4 replies.
- RE: write event logs with YARN - posted by Andrew Lee <al...@hotmail.com> on 2014/07/03 01:49:36 UTC, 3 replies.
- Enable Parsing Failed or Incompleted jobs on HistoryServer (YARN mode) - posted by Andrew Lee <al...@hotmail.com> on 2014/07/03 02:01:46 UTC, 2 replies.
- RDD join: composite keys - posted by Sameer Tilak <ss...@live.com> on 2014/07/03 02:12:22 UTC, 1 replies.
- Re: Spark job tracker. - posted by abhiguruvayya <sh...@gmail.com> on 2014/07/03 02:35:56 UTC, 13 replies.
- Visualize task distribution in cluster - posted by Tobias Pfeiffer <tg...@preferred.jp> on 2014/07/03 02:58:51 UTC, 1 replies.
- Re: Distribute data from Kafka evenly on cluster - posted by Tobias Pfeiffer <tg...@preferred.jp> on 2014/07/03 04:58:39 UTC, 3 replies.
- One question about RDD.zip function when trying Naive Bayes - posted by x <wa...@gmail.com> on 2014/07/03 05:23:22 UTC, 3 replies.
- Spark S3 LZO input files - posted by hassan <He...@gmail.com> on 2014/07/03 08:21:25 UTC, 0 replies.
- Re: Integrate Spark Editor with Hue for source compiled installation of spark/spark-jobServer - posted by Sunita Arvind <su...@gmail.com> on 2014/07/03 08:42:37 UTC, 0 replies.
- Case class in java - posted by Kevin Jung <it...@samsung.com> on 2014/07/03 10:31:12 UTC, 3 replies.
- Which version of Hive support Spark & Shark - posted by Ravi Prasad <ra...@gmail.com> on 2014/07/03 11:29:59 UTC, 1 replies.
- hdfs short circuit - posted by "Jahagirdar, Madhu" <ma...@philips.com> on 2014/07/03 12:50:14 UTC, 0 replies.
- Reading text file vs streaming text files - posted by M Singh <ma...@yahoo.com> on 2014/07/03 15:04:32 UTC, 2 replies.
- matchError:null in ALS.train - posted by Honey Joshi <ho...@ideata-analytics.com> on 2014/07/03 15:12:32 UTC, 2 replies.
- Running the BroadcastTest.scala with TorrentBroadcastFactory in a standalone cluster - posted by jackxucs <ja...@gmail.com> on 2014/07/03 16:48:24 UTC, 2 replies.
- reading compress lzo files - posted by Gurvinder Singh <gu...@uninett.no> on 2014/07/03 18:24:27 UTC, 9 replies.
- spark text processing - posted by M Singh <ma...@yahoo.com> on 2014/07/03 18:40:35 UTC, 0 replies.
- Spark Streaming Error Help -> ERROR actor.OneForOneStrategy: key not found: - posted by jschindler <jo...@utexas.edu> on 2014/07/03 19:56:55 UTC, 1 replies.
- Anaconda Spark AMI - posted by Benjamin Zaitlen <qu...@gmail.com> on 2014/07/03 20:54:33 UTC, 3 replies.
- Spark logging strategy on YARN - posted by Kostiantyn Kudriavtsev <ku...@gmail.com> on 2014/07/03 21:26:48 UTC, 1 replies.
- Sample datasets for MLlib and Graphx - posted by AlexanderRiggers <al...@gmail.com> on 2014/07/04 00:25:26 UTC, 3 replies.
- (Unknown) - posted by Steven Cox <sc...@renci.org> on 2014/07/04 03:21:13 UTC, 2 replies.
- No FileSystem for scheme: hdfs - posted by Steven Cox <sc...@renci.org> on 2014/07/04 03:45:11 UTC, 6 replies.
- Re: graphx Joining two VertexPartitions with different indexes is slow. - posted by Ankur Dave <an...@gmail.com> on 2014/07/04 04:45:37 UTC, 6 replies.
- SparkSQL with Streaming RDD - posted by Chang Lim <ch...@gmail.com> on 2014/07/04 06:58:17 UTC, 0 replies.
- OFF_HEAP storage level - posted by Ajay Srivastava <a_...@yahoo.com> on 2014/07/04 08:19:20 UTC, 2 replies.
- Re: Spark Streaming on top of Cassandra? - posted by zarzyk <k....@gmail.com> on 2014/07/04 09:33:04 UTC, 2 replies.
- RE: Spark with HBase - posted by "N.Venkata Naga Ravi" <nv...@hotmail.com> on 2014/07/04 10:19:18 UTC, 1 replies.
- Spark SQL user defined functions - posted by Martin Gammelsæter <ma...@gmail.com> on 2014/07/04 10:59:32 UTC, 8 replies.
- Graphx traversal and merge interesting edges - posted by HHB <hi...@gmail.com> on 2014/07/04 11:02:38 UTC, 5 replies.
- spark and mesos issue - posted by Gurvinder Singh <gu...@uninett.no> on 2014/07/04 11:05:49 UTC, 1 replies.
- Spark memory optimization - posted by Igor Pernek <ig...@pernek.net> on 2014/07/04 11:06:43 UTC, 4 replies.
- sparck Stdout and stderr - posted by aminn_524 <am...@yahoo.com> on 2014/07/04 12:14:43 UTC, 0 replies.
- Unable to run Spark 1.0 SparkPi on HDP 2.0 - posted by Konstantin Kudryavtsev <ku...@gmail.com> on 2014/07/04 14:16:44 UTC, 14 replies.
- window analysis with Spark and Spark streaming - posted by alessandro finamore <al...@polito.it> on 2014/07/04 15:46:26 UTC, 7 replies.
- SQL FIlter of tweets (json) running on Disk - posted by Abel Coronado Iruegas <ac...@gmail.com> on 2014/07/04 16:49:08 UTC, 3 replies.
- DynamoDB input source - posted by Ian Wilkinson <ia...@me.com> on 2014/07/04 17:28:17 UTC, 8 replies.
- pyspark + yarn: how everything works. - posted by Egor Pahomov <pa...@gmail.com> on 2014/07/04 19:39:33 UTC, 0 replies.
- Java sample for using cassandra-driver-spark - posted by M Singh <ma...@yahoo.com> on 2014/07/05 00:48:57 UTC, 2 replies.
- classnotfound error due to groupByKey - posted by Joe L <se...@yahoo.com> on 2014/07/05 01:26:26 UTC, 0 replies.
- Spark streaming kafka cost long time at "take at DStream.scala:586" - posted by xiemeilong <xi...@gmail.com> on 2014/07/05 06:31:01 UTC, 0 replies.
- How to parallelize model fitting with different cross-validation folds? - posted by sparkuser2345 <hm...@gmail.com> on 2014/07/05 10:35:34 UTC, 5 replies.
- Spark 1.0 failed on HDP 2.0 with absurd exception - posted by Konstantin Kudryavtsev <ku...@gmail.com> on 2014/07/05 11:48:05 UTC, 1 replies.
- taking top k values of rdd - posted by Koert Kuipers <ko...@tresata.com> on 2014/07/05 19:16:44 UTC, 5 replies.
- Addind and subtracting workers on Spark EC2 cluster - posted by Robert James <sr...@gmail.com> on 2014/07/06 16:10:40 UTC, 1 replies.
- Controlling amount of data sent to slaves - posted by asylvest <ad...@gmail.com> on 2014/07/07 02:29:10 UTC, 0 replies.
- Data loading to Parquet using spark - posted by Shaikh Riyaz <sh...@gmail.com> on 2014/07/07 02:30:30 UTC, 2 replies.
- SparkSQL - Partitioned Parquet - posted by Raffael Marty <ra...@pixlcloud.com> on 2014/07/07 07:00:06 UTC, 1 replies.
- Dense to sparse vector converter - posted by "Ulanov, Alexander" <al...@hp.com> on 2014/07/07 09:37:35 UTC, 1 replies.
- Broadcast variable in Spark Java application - posted by Praveen R <pr...@sigmoidanalytics.com> on 2014/07/07 09:41:55 UTC, 1 replies.
- which Spark package(wrt. graphX) I should install to do graph computation on cluster? - posted by Yifan LI <ia...@gmail.com> on 2014/07/07 13:51:45 UTC, 0 replies.
- spark-submit conflicts with dependencies - posted by Robert James <sr...@gmail.com> on 2014/07/07 15:00:59 UTC, 0 replies.
- Possible bug in Spark Streaming :: TextFileStream - posted by Luis Ángel Vicente Sánchez <la...@gmail.com> on 2014/07/07 16:11:45 UTC, 3 replies.
- Pig 0.13, Spark, Spork - posted by Bertrand Dechoux <de...@gmail.com> on 2014/07/07 16:51:30 UTC, 3 replies.
- Comparative study - posted by sa...@accenture.com on 2014/07/07 17:07:02 UTC, 22 replies.
- Control number of tasks per stage - posted by Konstantin Kudryavtsev <ku...@gmail.com> on 2014/07/07 17:25:49 UTC, 1 replies.
- tiers of caching - posted by Koert Kuipers <ko...@tresata.com> on 2014/07/07 19:02:10 UTC, 2 replies.
- spark-assembly libraries conflict with needed libraries - posted by Robert James <sr...@gmail.com> on 2014/07/07 19:31:31 UTC, 6 replies.
- spark-assembly libraries conflict with application libraries - posted by Robert James <sr...@gmail.com> on 2014/07/07 19:32:20 UTC, 0 replies.
- Error while launching spark cluster manaually - posted by Sameer Tilak <ss...@live.com> on 2014/07/07 20:14:46 UTC, 0 replies.
- Issues in opening UI when running Spark Streaming in YARN - posted by Yan Fang <ya...@gmail.com> on 2014/07/07 20:20:40 UTC, 6 replies.
- Which is the best way to get a connection to an external database per task in Spark Streaming? - posted by Juan Rodríguez Hortalá <ju...@gmail.com> on 2014/07/07 20:40:49 UTC, 6 replies.
- SparkSQL with sequence file RDDs - posted by Gary Malouf <ma...@gmail.com> on 2014/07/07 21:36:31 UTC, 5 replies.
- Re: NoSuchMethodError in KafkaReciever - posted by mcampbell <mi...@gmail.com> on 2014/07/07 22:40:30 UTC, 1 replies.
- Spark shell error messages and app exit issues - posted by Sameer Tilak <ss...@live.com> on 2014/07/07 23:05:09 UTC, 0 replies.
- acl for spark ui - posted by Koert Kuipers <ko...@tresata.com> on 2014/07/07 23:37:29 UTC, 0 replies.
- Cannot create dir in Tachyon when running Spark with OFF_HEAP caching (FileDoesNotExistException) - posted by Teng Long <lo...@gmail.com> on 2014/07/07 23:52:54 UTC, 1 replies.
- memory leak query - posted by Michael Lewis <le...@me.com> on 2014/07/08 00:50:18 UTC, 1 replies.
- Re: master attempted to re-register the worker and then took all workers as unregistered - posted by Nan Zhu <zh...@gmail.com> on 2014/07/08 01:16:30 UTC, 3 replies.
- The number of cores vs. the number of executors - posted by innowireless TaeYun Kim <ta...@innowireless.co.kr> on 2014/07/08 02:42:46 UTC, 1 replies.
- usage question for saprk run on YARN - posted by Cheng Ju Chuang <Ch...@symantec.com> on 2014/07/08 02:44:15 UTC, 1 replies.
- Re: how to set spark.executor.memory and heap size - posted by Alex Gaudio <ad...@gmail.com> on 2014/07/08 03:13:58 UTC, 0 replies.
- Powered By Spark: Can you please add our org? - posted by Alex Gaudio <ad...@gmail.com> on 2014/07/08 03:19:38 UTC, 2 replies.
- the Pre-built packages for CDH4 can not support yarn ? - posted by ch huang <ju...@gmail.com> on 2014/07/08 03:37:09 UTC, 1 replies.
- Is the order of messages guaranteed in a DStream? - posted by Yan Fang <ya...@gmail.com> on 2014/07/08 03:49:54 UTC, 1 replies.
- Re: Pig 0.13, Spark, Spork - posted by 张包峰 <pe...@qq.com> on 2014/07/08 04:04:50 UTC, 3 replies.
- Spark RDD Disk Persistance - posted by "Jahagirdar, Madhu" <ma...@philips.com> on 2014/07/08 04:15:41 UTC, 1 replies.
- Help for the large number of the input data files - posted by innowireless TaeYun Kim <ta...@innowireless.co.kr> on 2014/07/08 04:38:51 UTC, 1 replies.
- Spark Installation - posted by Srikrishna S <sr...@gmail.com> on 2014/07/08 05:07:28 UTC, 7 replies.
- Seattle Spark Meetup slides: xPatterns, Fun Things, and Machine Learning Streams - next is Interactive OLAP - posted by Denny Lee <de...@gmail.com> on 2014/07/08 05:41:30 UTC, 0 replies.
- 答复: Spark RDD Disk Persistance - posted by "Lizhengbing (bing, BIPA)" <zh...@huawei.com> on 2014/07/08 08:39:15 UTC, 0 replies.
- Error and doubts in using Mllib Naive bayes for text clasification - posted by Rahul Bhojwani <ra...@gmail.com> on 2014/07/08 08:44:39 UTC, 3 replies.
- Spark SQL registerAsTable requires a Java Class? - posted by Ionized <io...@gmail.com> on 2014/07/08 08:48:33 UTC, 3 replies.
- Spark: All masters are unresponsive! - posted by Sameer Tilak <ss...@live.com> on 2014/07/08 08:51:19 UTC, 6 replies.
- error when spark access hdfs with Kerberos enable - posted by 许晓炜 <xu...@qiyi.com> on 2014/07/08 10:40:39 UTC, 0 replies.
- "NoSuchElementException: key not found" when changing the window lenght and interval in Spark Streaming - posted by Juan Rodríguez Hortalá <ju...@gmail.com> on 2014/07/08 11:20:05 UTC, 2 replies.
- Task's "Scheduler Delay" in web ui - posted by haopu <hw...@qilinsoft.com> on 2014/07/08 13:14:14 UTC, 0 replies.
- Spark MapReduce job to work with Hive - posted by Darq Moth <da...@gmail.com> on 2014/07/08 13:46:08 UTC, 0 replies.
- Terminal freeze during SVM - posted by AlexanderRiggers <al...@gmail.com> on 2014/07/08 14:14:41 UTC, 6 replies.
- Disabling SparkContext WebUI on port 4040, accessing information programatically? - posted by Martin Gammelsæter <ma...@gmail.com> on 2014/07/08 14:58:58 UTC, 4 replies.
- Is MLlib NaiveBayes implementation for Spark 0.9.1 correct? - posted by Rahul Bhojwani <ra...@gmail.com> on 2014/07/08 16:20:12 UTC, 2 replies.
- How to incorporate the new data in the MLlib-NaiveBayes model along with predicting? - posted by Rahul Bhojwani <ra...@gmail.com> on 2014/07/08 16:35:31 UTC, 2 replies.
- got java.lang.AssertionError when run sbt/sbt compile - posted by bai阿蒙 <sm...@hotmail.com> on 2014/07/08 17:25:31 UTC, 0 replies.
- Scheduling in spark - posted by rapelly kartheek <ka...@gmail.com> on 2014/07/08 18:11:13 UTC, 4 replies.
- java.lang.OutOfMemoryError (java.lang.OutOfMemoryError: GC overhead limit exceeded) - posted by Konstantin Kudryavtsev <ku...@gmail.com> on 2014/07/08 18:27:27 UTC, 3 replies.
- Please add Talend to "Powered By Spark" page - posted by Daniel Kulp <dk...@talend.com> on 2014/07/08 19:05:41 UTC, 0 replies.
- how to convert RDD to PairRDDFunctions ? - posted by Konstantin Kudryavtsev <ku...@gmail.com> on 2014/07/08 19:25:38 UTC, 2 replies.
- Join two Spark Streaming - posted by Bill Jay <bi...@gmail.com> on 2014/07/08 20:03:09 UTC, 3 replies.
- Further details on spark cluster set up - posted by Sameer Tilak <ss...@live.com> on 2014/07/08 20:05:12 UTC, 0 replies.
- CoarseGrainedExecutorBackend: Driver Disassociated - posted by Sameer Tilak <ss...@live.com> on 2014/07/08 20:52:54 UTC, 5 replies.
- Re: error when spark access hdfs with Kerberos enable - posted by Marcelo Vanzin <va...@cloudera.com> on 2014/07/08 21:04:28 UTC, 4 replies.
- Error: Could not delete temporary files. - posted by Rahul Bhojwani <ra...@gmail.com> on 2014/07/08 21:17:24 UTC, 9 replies.
- [Spark SQL]: Convert SchemaRDD back to RDD - posted by Pierre B <pi...@realimpactanalytics.com> on 2014/07/08 21:43:20 UTC, 2 replies.
- Re: got java.lang.AssertionError when run sbt/sbt compile - posted by Xiangrui Meng <me...@gmail.com> on 2014/07/08 22:09:34 UTC, 0 replies.
- OutOfMemory : Java heap space error - posted by Rahul Bhojwani <ra...@gmail.com> on 2014/07/08 22:26:35 UTC, 0 replies.
- spark-1.0.0-rc11 2f1dc868 spark-shell not honoring --properties-file option? - posted by Andrew Lee <al...@hotmail.com> on 2014/07/09 00:17:00 UTC, 1 replies.
- Spark-streaming-kafka error - posted by Bill Jay <bi...@gmail.com> on 2014/07/09 01:18:06 UTC, 2 replies.
- issues with ./bin/spark-shell for standalone mode - posted by Mikhail Strebkov <st...@gmail.com> on 2014/07/09 01:29:18 UTC, 5 replies.
- Spark Streaming using File Stream in Java - posted by Aravind <ar...@gmail.com> on 2014/07/09 02:33:06 UTC, 3 replies.
- Spark Streaming and Storm - posted by "xichen_tju@126" <xi...@126.com> on 2014/07/09 03:17:16 UTC, 2 replies.
- Purpose of spark-submit? - posted by Robert James <sr...@gmail.com> on 2014/07/09 03:22:14 UTC, 14 replies.
- Requirements for Spark cluster - posted by Robert James <sr...@gmail.com> on 2014/07/09 03:24:05 UTC, 3 replies.
- slower worker node in the cluster - posted by haopu <hw...@qilinsoft.com> on 2014/07/09 04:37:34 UTC, 0 replies.
- Document page load fault - posted by binbinbin915 <bi...@live.cn> on 2014/07/09 04:46:52 UTC, 1 replies.
- Kryo is slower, and the size saving is minimal - posted by innowireless TaeYun Kim <ta...@innowireless.co.kr> on 2014/07/09 07:11:59 UTC, 2 replies.
- spark Driver - posted by amin mohebbi <am...@yahoo.com> on 2014/07/09 07:28:32 UTC, 4 replies.
- Standalone cluster on Windows - posted by Chitturi Padma <le...@gmail.com> on 2014/07/09 08:22:55 UTC, 0 replies.
- Need advice to create an objectfile of set of images from Spark - posted by Jaonary Rabarisoa <ja...@gmail.com> on 2014/07/09 08:47:22 UTC, 2 replies.
- How to clear the list of Completed Appliations in Spark web UI? - posted by Haopu Wang <HW...@qilinsoft.com> on 2014/07/09 09:04:23 UTC, 2 replies.
- how to host the drive node - posted by aminn_524 <am...@yahoo.com> on 2014/07/09 09:42:15 UTC, 0 replies.
- How to host spark driver - posted by amin mohebbi <am...@yahoo.com> on 2014/07/09 09:43:18 UTC, 0 replies.
- TaskContext stageId = 0 - posted by silvermast <vt...@paxata.com> on 2014/07/09 09:58:32 UTC, 1 replies.
- Re: Why doesn't the driver node do any work? - posted by aminn_524 <am...@yahoo.com> on 2014/07/09 10:01:52 UTC, 0 replies.
- "Initial job has not accepted any resources" means many things - posted by Martin Gammelsæter <ma...@gmail.com> on 2014/07/09 10:29:31 UTC, 0 replies.
- How to run a job on all workers? - posted by silvermast <vt...@paxata.com> on 2014/07/09 11:08:54 UTC, 0 replies.
- Error using MLlib-NaiveBayes : "Matrices are not aligned" - posted by Rahul Bhojwani <ra...@gmail.com> on 2014/07/09 11:24:59 UTC, 0 replies.
- Filtering data during the read - posted by Konstantin Kudryavtsev <ku...@gmail.com> on 2014/07/09 11:45:25 UTC, 1 replies.
- FW: memory question - posted by mi...@barclays.com on 2014/07/09 12:31:04 UTC, 1 replies.
- Docker Scripts - posted by dmpour23 <dm...@gmail.com> on 2014/07/09 12:31:50 UTC, 0 replies.
- Spark SQL - java.lang.NoClassDefFoundError: Could not initialize class $line10.$read$ - posted by "gileny@gmail.com" <gi...@gmail.com> on 2014/07/09 13:28:00 UTC, 2 replies.
- Cassandra driver Spark question - posted by RodrigoB <ro...@aspect.com> on 2014/07/09 15:24:14 UTC, 5 replies.
- how to convert JavaDStream to JavaRDD - posted by Madabhattula Rajesh Kumar <mr...@gmail.com> on 2014/07/09 15:28:46 UTC, 1 replies.
- Re: controlling the time in spark-streaming - posted by Laeeq Ahmed <la...@yahoo.com> on 2014/07/09 15:56:32 UTC, 0 replies.
- RDD Cleanup - posted by premdass <pr...@yahoo.co.in> on 2014/07/09 16:03:54 UTC, 4 replies.
- Re: SparkSQL registerAsTable - No TypeTag available Error - posted by premdass <pr...@yahoo.co.in> on 2014/07/09 16:05:04 UTC, 0 replies.
- Spark on Yarn: Connecting to Existing Instance - posted by John Omernik <jo...@omernik.com> on 2014/07/09 17:31:32 UTC, 6 replies.
- Mechanics of passing functions to Spark? - posted by Seref Arikan <se...@gmail.com> on 2014/07/09 18:09:57 UTC, 0 replies.
- Error with Stream Kafka Kryo - posted by richiesgr <ri...@gmail.com> on 2014/07/09 18:25:33 UTC, 0 replies.
- Apache Spark, Hadoop 2.2.0 without Yarn Integration - posted by "Nick R. Katsipoulakis" <ka...@cs.pitt.edu> on 2014/07/09 18:27:57 UTC, 2 replies.
- Spark 0.9.1 implementation of MLlib-NaiveBayes is having bug. - posted by Rahul Bhojwani <ra...@gmail.com> on 2014/07/09 18:57:54 UTC, 1 replies.
- Compilation error in Spark 1.0.0 - posted by Silvina Caíno Lores <si...@gmail.com> on 2014/07/09 19:14:52 UTC, 1 replies.
- Spark Streaming - two questions about the streamingcontext - posted by Yan Fang <ya...@gmail.com> on 2014/07/09 20:22:14 UTC, 2 replies.
- How should I add a jar? - posted by Nick Chammas <ni...@gmail.com> on 2014/07/09 20:44:27 UTC, 3 replies.
- Spark streaming - tasks and stages continue to be generated when using reduce by key - posted by M Singh <ma...@yahoo.com> on 2014/07/09 20:51:03 UTC, 7 replies.
- Some question about SQL and streaming - posted by "hsy541@gmail.com" <hs...@gmail.com> on 2014/07/09 21:21:22 UTC, 8 replies.
- SPARK_CLASSPATH Warning - posted by "Nick R. Katsipoulakis" <ka...@cs.pitt.edu> on 2014/07/09 22:45:07 UTC, 1 replies.
- Understanding how to install in HDP - posted by Abel Coronado Iruegas <ac...@gmail.com> on 2014/07/09 23:06:00 UTC, 2 replies.
- Number of executors change during job running - posted by Bill Jay <bi...@gmail.com> on 2014/07/10 01:05:18 UTC, 15 replies.
- Cannot submit to a Spark Application to a remote cluster Spark 1.0 - posted by Aris Vlasakakis <ar...@vlasakakis.com> on 2014/07/10 01:18:49 UTC, 5 replies.
- CoarseGrainedExecutorBackend: Driver Disassociated‏ - posted by Sameer Tilak <ss...@live.com> on 2014/07/10 01:24:56 UTC, 0 replies.
- Pyspark, references to different rdds being overwritten to point to the same rdd, different results when using .cache() - posted by nimbus <ni...@radius.com> on 2014/07/10 02:25:03 UTC, 0 replies.
- spark1.0 principal component analysis - posted by fintis <fi...@gmail.com> on 2014/07/10 02:46:45 UTC, 1 replies.
- Re: error in creating external table - posted by Du Li <li...@yahoo-inc.com> on 2014/07/10 02:58:03 UTC, 0 replies.
- Spark Streaming - What does Spark Streaming checkpoint? - posted by Yan Fang <ya...@gmail.com> on 2014/07/10 03:11:21 UTC, 0 replies.
- Restarting a Streaming Context - posted by Nick Chammas <ni...@gmail.com> on 2014/07/10 03:11:51 UTC, 3 replies.
- Re: executor failed, cannot find compute-classpath.sh - posted by cjwang <cj...@cjwang.us> on 2014/07/10 03:42:04 UTC, 5 replies.
- Map Function does not seem to be executing over RDD - posted by Raza Rehman <ra...@gmail.com> on 2014/07/10 03:46:36 UTC, 1 replies.
- Does MLlib Naive Bayes implementation incorporates Laplase smoothing? - posted by Rahul Bhojwani <ra...@gmail.com> on 2014/07/10 07:55:44 UTC, 5 replies.
- A Task failed with java.lang.ArrayIndexOutOfBoundsException at com.ning.compress.lzf.impl.UnsafeChunkDecoder.copyOverlappingLong - posted by innowireless TaeYun Kim <ta...@innowireless.co.kr> on 2014/07/10 10:13:29 UTC, 0 replies.
- All of the tasks have been completed but the Stage is still shown as "Active"? - posted by Haopu Wang <HW...@qilinsoft.com> on 2014/07/10 10:21:18 UTC, 7 replies.
- KMeans code is rubbish - posted by Wanda Hawk <wa...@yahoo.com> on 2014/07/10 10:44:20 UTC, 10 replies.
- Getting Persistent Connection using socketStream? - posted by kytay <ka...@gmail.com> on 2014/07/10 11:47:22 UTC, 3 replies.
- running scrapy (or any other scraper) on the cluster? - posted by mrm <ma...@skimlinks.com> on 2014/07/10 11:58:34 UTC, 0 replies.
- RDD registerAsTable gives error on regular scala class records - posted by Kefah Issa <ke...@freesoft.jo> on 2014/07/10 14:39:04 UTC, 1 replies.
- Difference between collect() and take(n) - posted by MEETHU MATHEW <me...@yahoo.co.in> on 2014/07/10 14:50:22 UTC, 0 replies.
- EC2 Cluster script. Shark install fails - posted by Jason H <ja...@developer.net.nz> on 2014/07/10 14:51:10 UTC, 2 replies.
- SparkSQL - Language Integrated query - OR clause and IN clause - posted by premdass <pr...@yahoo.co.in> on 2014/07/10 15:15:09 UTC, 3 replies.
- Re: Yay for 1.0.0! EC2 Still has problems. - posted by nit <ni...@gmail.com> on 2014/07/10 17:14:20 UTC, 1 replies.
- Potential bugs in SparkSQL - posted by Jerry Lam <ch...@gmail.com> on 2014/07/10 17:15:20 UTC, 5 replies.
- GraphX: how to specify partition strategy? - posted by Yifan LI <ia...@gmail.com> on 2014/07/10 17:20:33 UTC, 1 replies.
- How to read saved model? - posted by rohitspujari <rp...@hortonworks.com> on 2014/07/10 17:36:08 UTC, 0 replies.
- SPARKSQL problem with implementing Scala's Product interface - posted by yadid <ya...@media.mit.edu> on 2014/07/10 18:02:18 UTC, 3 replies.
- Recommended pipeline automation tool? Oozie? - posted by "k.tham" <ke...@gmail.com> on 2014/07/10 19:20:00 UTC, 8 replies.
- Running Spark on Yarn vs Mesos - posted by "k.tham" <ke...@gmail.com> on 2014/07/10 19:21:43 UTC, 0 replies.
- Stateful RDDs? - posted by Sargun Dhillon <sa...@sargun.me> on 2014/07/10 19:25:51 UTC, 1 replies.
- Difference between SparkSQL and shark - posted by "hsy541@gmail.com" <hs...@gmail.com> on 2014/07/10 19:50:33 UTC, 2 replies.
- sparkStaging - posted by Koert Kuipers <ko...@tresata.com> on 2014/07/10 20:50:22 UTC, 0 replies.
- Generic Interface between RDD and DStream - posted by mshah <sh...@gmail.com> on 2014/07/10 20:53:04 UTC, 3 replies.
- How are the executors used in Spark Streaming in terms of receiver and driver program? - posted by Yan Fang <ya...@gmail.com> on 2014/07/10 20:59:52 UTC, 6 replies.
- Use of the SparkContext.hadoopRDD function in Scala code - posted by "Nick R. Katsipoulakis" <ka...@cs.pitt.edu> on 2014/07/10 21:13:25 UTC, 0 replies.
- How to RDD.take(middle 10 elements) - posted by Nick Chammas <ni...@gmail.com> on 2014/07/10 21:53:37 UTC, 1 replies.
- writing FLume data to HDFS - posted by "Sundaram, Muthu X." <Mu...@sabre.com> on 2014/07/10 22:36:01 UTC, 4 replies.
- Using HQL is terribly slow: Potential Performance Issue - posted by Jerry Lam <ch...@gmail.com> on 2014/07/10 23:08:04 UTC, 6 replies.
- What version of twitter4j should I use with Spark Streaming? - posted by Nick Chammas <ni...@gmail.com> on 2014/07/10 23:42:53 UTC, 1 replies.
- Multiple SparkContexts with different configurations in same JVM - posted by Philip Ogren <ph...@oracle.com> on 2014/07/11 00:27:49 UTC, 0 replies.
- incorrect labels being read by MLUtils.loadLabeledData() - posted by SK <sk...@gmail.com> on 2014/07/11 00:28:44 UTC, 1 replies.
- SparkR failed to connect to the master - posted by cjwang <cj...@cjwang.us> on 2014/07/11 01:03:15 UTC, 3 replies.
- Submitting to a cluster behind a VPN, configuring different IP address - posted by Aris Vlasakakis <ar...@vlasakakis.com> on 2014/07/11 01:04:12 UTC, 1 replies.
- Streaming. Cannot get socketTextStream to receive anything. - posted by kytay <ka...@gmail.com> on 2014/07/11 04:41:20 UTC, 11 replies.
- Wanna know more about Pyspark Internals - posted by Baofeng Zhang <pe...@qq.com> on 2014/07/11 05:49:40 UTC, 1 replies.
- Spark Streaming with Kafka NoClassDefFoundError - posted by Dilip <di...@hotmail.com> on 2014/07/11 06:39:25 UTC, 7 replies.
- Spark summit 2014 videos ? - posted by Ajay Srivastava <a_...@yahoo.com> on 2014/07/11 07:12:39 UTC, 0 replies.
- KMeans for large training data - posted by durin <ma...@simon-schaefer.net> on 2014/07/11 10:53:19 UTC, 6 replies.
- Iteration question - posted by Nathan Kronenfeld <nk...@oculusinfo.com> on 2014/07/11 15:54:18 UTC, 1 replies.
- Categorical Features for K-Means Clustering - posted by Wen Phan <we...@mac.com> on 2014/07/11 16:07:26 UTC, 2 replies.
- Spark Streaming timing considerations - posted by Laeeq Ahmed <la...@yahoo.com> on 2014/07/11 16:44:02 UTC, 8 replies.
- Re: Using CQLSSTableWriter to batch load data from Spark to Cassandra. - posted by Rohit Rai <ro...@tuplejump.com> on 2014/07/11 17:18:25 UTC, 0 replies.
- Databricks demo - posted by Debasish Das <de...@gmail.com> on 2014/07/11 18:09:35 UTC, 0 replies.
- Re: RDD join, index key: composite keys - posted by marspoc <so...@gmail.com> on 2014/07/11 18:26:11 UTC, 0 replies.
- Re: Spark Streaming RDD to Shark table - posted by patwhite <pa...@synata.com> on 2014/07/11 20:45:31 UTC, 0 replies.
- Top N predictions - posted by Rich Kroll <ri...@modernizingmedicine.com> on 2014/07/11 20:48:06 UTC, 1 replies.
- Job getting killed - posted by Srikrishna S <sr...@gmail.com> on 2014/07/11 21:01:54 UTC, 0 replies.
- MLlib feature request - posted by Joseph Feng <jo...@creditkarma.com> on 2014/07/11 21:04:53 UTC, 1 replies.
- not getting output from socket connection - posted by Walrus theCat <wa...@gmail.com> on 2014/07/11 22:25:08 UTC, 5 replies.
- Spark Questions - posted by Gonzalo Zarza <go...@globant.com> on 2014/07/11 22:35:38 UTC, 3 replies.
- Spark groupBy operation is only assigned 2 executors - posted by Bill Jay <bi...@gmail.com> on 2014/07/11 22:41:55 UTC, 0 replies.
- pyspark sc.parallelize running OOM with smallish data - posted by Mohit Jaggi <mo...@gmail.com> on 2014/07/11 23:00:03 UTC, 3 replies.
- How to separate a subset of an RDD by day? - posted by bdamos <am...@adobe.com> on 2014/07/11 23:19:13 UTC, 7 replies.
- Graphx : optimal partitions for a graph and error in logs - posted by ShreyanshB <sh...@gmail.com> on 2014/07/11 23:23:40 UTC, 6 replies.
- Decision tree classifier in MLlib - posted by SK <sk...@gmail.com> on 2014/07/11 23:25:48 UTC, 3 replies.
- Streaming training@ Spark Summit 2014 - posted by SK <sk...@gmail.com> on 2014/07/11 23:57:37 UTC, 6 replies.
- Re: confirm subscribe to user@spark.apache.org - posted by Veeranagouda Mukkanagoudar <ve...@gmail.com> on 2014/07/12 00:25:43 UTC, 0 replies.
- try JDBC server - posted by Nan Zhu <zh...@gmail.com> on 2014/07/12 01:02:53 UTC, 1 replies.
- Linkage error - duplicate class definition - posted by _soumya_ <so...@gmail.com> on 2014/07/12 01:29:40 UTC, 0 replies.
- spark ui on yarn - posted by Koert Kuipers <ko...@tresata.com> on 2014/07/12 01:42:44 UTC, 4 replies.
- ML classifier and data format for dataset with variable number of features - posted by SK <sk...@gmail.com> on 2014/07/12 02:12:23 UTC, 1 replies.
- Announcing Spark 1.0.1 - posted by Patrick Wendell <pw...@gmail.com> on 2014/07/12 03:35:18 UTC, 5 replies.
- Akka Client disconnected - posted by Srikrishna S <sr...@gmail.com> on 2014/07/12 17:50:25 UTC, 3 replies.
- Putting block rdd failed when running example svm on large data - posted by crater <cq...@ucmerced.edu> on 2014/07/12 20:10:15 UTC, 3 replies.
- Confused by groupByKey() and the default partitioner - posted by Guanhua Yan <gh...@lanl.gov> on 2014/07/12 21:20:09 UTC, 3 replies.
- Scalability issue in Spark with SparkPageRank example - posted by "lokesh.gidra" <lo...@gmail.com> on 2014/07/12 23:43:18 UTC, 0 replies.
- Stopping StreamingContext does not kill receiver - posted by Nick Chammas <ni...@gmail.com> on 2014/07/13 00:03:39 UTC, 3 replies.
- Convert from RDD[Object] to RDD[Array[Object]] - posted by Parthus <pe...@gmail.com> on 2014/07/13 03:03:59 UTC, 2 replies.
- Supported SQL syntax in Spark SQL - posted by Nick Chammas <ni...@gmail.com> on 2014/07/13 04:16:55 UTC, 7 replies.
- Large Task Size? - posted by Kyle Ellrott <ke...@soe.ucsc.edu> on 2014/07/13 04:27:49 UTC, 6 replies.
- Repeated data item search with Spark SQL(1.0.1) - posted by anyweil <we...@gmail.com> on 2014/07/13 08:16:26 UTC, 10 replies.
- can't print DStream after reduce - posted by Walrus theCat <wa...@gmail.com> on 2014/07/13 10:03:12 UTC, 13 replies.
- Re: Nested Query With Spark SQL(1.0.1) - posted by anyweil <we...@gmail.com> on 2014/07/13 13:11:56 UTC, 4 replies.
- Error in JavaKafkaWordCount.java example - posted by Mahebub Sayyed <ma...@gmail.com> on 2014/07/13 15:43:40 UTC, 1 replies.
- Problem reading in LZO compressed files - posted by Ognen Duzlevski <og...@gmail.com> on 2014/07/13 16:49:02 UTC, 4 replies.
- SparkSql newbie problems with nested selects - posted by Andy Davidson <An...@SantaCruzIntegration.com> on 2014/07/13 21:43:08 UTC, 2 replies.
- Task serialized size dependent on size of RDD? - posted by Sébastien Rainville <se...@gmail.com> on 2014/07/13 22:32:38 UTC, 0 replies.
- Ideal core count within a single JVM - posted by "lokesh.gidra" <lo...@gmail.com> on 2014/07/14 01:03:28 UTC, 7 replies.
- Possible bug in ClientBase.scala? - posted by Ron Gonzalez <zl...@yahoo.com> on 2014/07/14 03:49:18 UTC, 3 replies.
- Catalyst dependency on Spark Core - posted by Aniket Bhatnagar <an...@gmail.com> on 2014/07/14 05:51:02 UTC, 5 replies.
- SPARK S3 LZO input; worker stuck - posted by hassan <He...@gmail.com> on 2014/07/14 06:16:19 UTC, 1 replies.
- mapPartitionsWithIndex - posted by Madhura <da...@gmail.com> on 2014/07/14 08:26:26 UTC, 2 replies.
- Error when testing with large sparse svm - posted by crater <cq...@ucmerced.edu> on 2014/07/14 09:15:14 UTC, 13 replies.
- spark1.0.1 catalyst transform filter not push down - posted by victor sheng <vi...@gmail.com> on 2014/07/14 12:42:59 UTC, 2 replies.
- sbt + idea + test - posted by boci <bo...@gmail.com> on 2014/07/14 12:50:06 UTC, 0 replies.
- Running Spark on Microsoft Azure HDInsight - posted by Niek Tax <ni...@gmail.com> on 2014/07/14 13:00:12 UTC, 2 replies.
- Spark SQL 1.0.1 error on reading fixed length byte array - posted by Pei-Lun Lee <pl...@appier.com> on 2014/07/14 13:17:28 UTC, 5 replies.
- Error in spark: Exception in thread "delete Spark temp dir" - posted by Rahul Bhojwani <ra...@gmail.com> on 2014/07/14 14:16:31 UTC, 0 replies.
- Can we get a spark context inside a mapper - posted by Rahul Bhojwani <ra...@gmail.com> on 2014/07/14 15:22:41 UTC, 5 replies.
- Spark Streaming Json file groupby function - posted by srinivas <ku...@gmail.com> on 2014/07/14 19:59:40 UTC, 18 replies.
- Trouble with spark-ec2 script: --ebs-vol-size - posted by Ben Horner <be...@atigeo.com> on 2014/07/14 19:59:57 UTC, 4 replies.
- Gradient Boosted Machines - posted by Daniel Bendavid <da...@creditkarma.com> on 2014/07/14 20:24:18 UTC, 0 replies.
- Memory & compute-intensive tasks - posted by Ravi Pandya <ra...@iecommerce.com> on 2014/07/14 22:09:12 UTC, 9 replies.
- Spark 1.0.1 EC2 - Launching Applications - posted by Josh Happoldt <jo...@trueffect.com> on 2014/07/14 22:12:52 UTC, 1 replies.
- Client application that calls Spark and receives an MLlib *model* Scala Object, not just result - posted by Aris Vlasakakis <ar...@vlasakakis.com> on 2014/07/14 22:27:19 UTC, 2 replies.
- How to kill running spark yarn application - posted by "hsy541@gmail.com" <hs...@gmail.com> on 2014/07/14 23:08:35 UTC, 7 replies.
- Parsing Json object definition spanning multiple lines - posted by SK <sk...@gmail.com> on 2014/07/14 23:55:27 UTC, 0 replies.
- import org.apache.spark.streaming.twitter._ in Shell - posted by durin <ma...@simon-schaefer.net> on 2014/07/15 00:47:07 UTC, 12 replies.
- Change when loading/storing String data using Parquet - posted by Michael Armbrust <mi...@databricks.com> on 2014/07/15 00:55:59 UTC, 0 replies.
- SQL + streaming - posted by "hsy541@gmail.com" <hs...@gmail.com> on 2014/07/15 01:06:55 UTC, 10 replies.
- Spark-Streaming collect/take functionality. - posted by "jon.burns" <jo...@uleth.ca> on 2014/07/15 01:11:55 UTC, 2 replies.
- running spark from intellj - posted by jamborta <ja...@gmail.com> on 2014/07/15 02:17:22 UTC, 1 replies.
- SPARK_WORKER_PORT (standalone cluster) - posted by jay vyas <ja...@gmail.com> on 2014/07/15 04:01:14 UTC, 1 replies.
- jsonRDD: NoSuchMethodError - posted by SK <sk...@gmail.com> on 2014/07/15 04:06:29 UTC, 3 replies.
- Re: ---cores option in spark-shell - posted by cjwang <cj...@cjwang.us> on 2014/07/15 04:10:15 UTC, 1 replies.
- count on RDD yields NoClassDefFoundError on 1.0.1 - posted by Nicholas Chammas <ni...@gmail.com> on 2014/07/15 04:12:12 UTC, 6 replies.
- RACK_LOCAL Tasks Failed to finish - posted by 洪奇 <qi...@alibaba-inc.com> on 2014/07/15 04:55:19 UTC, 1 replies.
- branch-1.0-jdbc on EC2? - posted by billk <bi...@qlik.com> on 2014/07/15 04:57:56 UTC, 0 replies.
- Re: hdfs replication on saving RDD - posted by valgrind_girl <12...@qq.com> on 2014/07/15 05:06:55 UTC, 3 replies.
- truly bizarre behavior with local[n] on Spark 1.0.1 - posted by Walrus theCat <wa...@gmail.com> on 2014/07/15 05:27:10 UTC, 4 replies.
- KMeansModel Construtor error - posted by Rohit Pujari <rp...@hortonworks.com> on 2014/07/15 07:41:47 UTC, 2 replies.
- Eclipse Spark plugin and sample Scala projects - posted by buntu <bu...@gmail.com> on 2014/07/15 08:08:42 UTC, 0 replies.
- Spark SQL throws ClassCastException on first try; works on second - posted by Nick Chammas <ni...@gmail.com> on 2014/07/15 08:16:55 UTC, 2 replies.
- 答复:RACK_LOCAL Tasks Failed to finish - posted by 洪奇 <qi...@alibaba-inc.com> on 2014/07/15 08:20:01 UTC, 0 replies.
- ALS on EC2 - posted by Srikrishna S <sr...@gmail.com> on 2014/07/15 08:23:06 UTC, 1 replies.
- Spark Streaming w/ tshark exception problem on EC2 - posted by Gianluca Privitera <gi...@studio.unibo.it> on 2014/07/15 11:17:13 UTC, 1 replies.
- Kryo NoSuchMethodError on Spark 1.0.0 standalone - posted by jfowkes <ma...@gmail.com> on 2014/07/15 11:24:59 UTC, 0 replies.
- Re: "the default GraphX graph-partition strategy on multicore machine"? - posted by Yifan LI <ia...@gmail.com> on 2014/07/15 12:06:58 UTC, 6 replies.
- shared object between threads - posted by Wanda Hawk <wa...@yahoo.com> on 2014/07/15 13:23:08 UTC, 1 replies.
- Need help on spark Hbase - posted by Madabhattula Rajesh Kumar <mr...@gmail.com> on 2014/07/15 15:47:53 UTC, 9 replies.
- Store one to many relation ship in parquet file with spark sql - posted by Jaonary Rabarisoa <ja...@gmail.com> on 2014/07/15 16:12:51 UTC, 1 replies.
- persistence state of an RDD - posted by Nathan Kronenfeld <nk...@oculusinfo.com> on 2014/07/15 16:31:23 UTC, 2 replies.
- Ambiguous references to id : what does it mean ? - posted by Jaonary Rabarisoa <ja...@gmail.com> on 2014/07/15 17:52:49 UTC, 3 replies.
- Driver cannot receive StatusUpdate message for FINISHED - posted by 林武康 <vb...@gmail.com> on 2014/07/15 18:15:04 UTC, 0 replies.
- Re: NotSerializableException in Spark Streaming - posted by Nicholas Chammas <ni...@gmail.com> on 2014/07/15 18:21:39 UTC, 3 replies.
- Error while running Spark SQL join when using Spark 1.0.1 - posted by Keith Simmons <ke...@gmail.com> on 2014/07/15 19:49:23 UTC, 11 replies.
- How does Spark speculation prevent duplicated work? - posted by Mingyu Kim <mk...@palantir.com> on 2014/07/15 19:55:24 UTC, 7 replies.
- Spark Performance Bench mark - posted by Malligarjunan S <ma...@aerifymedia.com> on 2014/07/15 20:09:52 UTC, 0 replies.
- MLLib - Regularized logistic regression in python - posted by fjeg <fr...@gmail.com> on 2014/07/15 20:12:47 UTC, 5 replies.
- Count distinct with groupBy usage - posted by buntu <bu...@gmail.com> on 2014/07/15 20:14:01 UTC, 8 replies.
- Question on Apache Spark custom InputFormat Integration - posted by "Nick R. Katsipoulakis" <ka...@cs.pitt.edu> on 2014/07/15 20:24:46 UTC, 0 replies.
- Spark Performance issue - posted by Malligarjunan S <ma...@aerifymedia.com> on 2014/07/15 20:39:43 UTC, 0 replies.
- count vs countByValue in for/yield - posted by Ognen Duzlevski <og...@gmail.com> on 2014/07/15 21:23:38 UTC, 1 replies.
- Re: getting ClassCastException on collect() - posted by _soumya_ <so...@gmail.com> on 2014/07/15 21:31:06 UTC, 0 replies.
- parallel stages? - posted by Wei Tan <wt...@us.ibm.com> on 2014/07/15 21:38:42 UTC, 3 replies.
- Help with Json array parsing - posted by SK <sk...@gmail.com> on 2014/07/15 21:56:06 UTC, 1 replies.
- Multiple streams at the same time - posted by gorenuru <go...@gmail.com> on 2014/07/15 22:50:30 UTC, 7 replies.
- Retrieve dataset of Big Data Benchmark - posted by Tom <th...@gmail.com> on 2014/07/15 23:10:15 UTC, 4 replies.
- can't get jobs to run on cluster (enough memory and cpus are available on worker) - posted by Matt Work Coarr <ma...@gmail.com> on 2014/07/15 23:43:49 UTC, 7 replies.
- Spark misconfigured? Small input split sizes in shark query - posted by David Rosenstrauch <da...@darose.net> on 2014/07/15 23:58:39 UTC, 0 replies.
- No parallelism in map transformation - posted by Roch Denis <rd...@exostatic.com> on 2014/07/16 04:37:38 UTC, 1 replies.
- Spark 1.0.1 akka connection refused - posted by Kevin Jung <it...@samsung.com> on 2014/07/16 04:44:21 UTC, 2 replies.
- Re: Kyro deserialisation error - posted by Hao Wang <wh...@gmail.com> on 2014/07/16 05:36:52 UTC, 8 replies.
- Can Spark stack scale to petabyte scale without performance degradation? - posted by Rohit Pujari <rp...@hortonworks.com> on 2014/07/16 05:47:08 UTC, 3 replies.
- executor-cores vs. num-executors - posted by innowireless TaeYun Kim <ta...@innowireless.co.kr> on 2014/07/16 06:57:01 UTC, 3 replies.
- akka disassociated on GC - posted by Makoto Yui <yu...@gmail.com> on 2014/07/16 07:48:31 UTC, 4 replies.
- Error: No space left on device - posted by Chris DuBois <ch...@gmail.com> on 2014/07/16 08:35:54 UTC, 13 replies.
- How does Apache Spark handles system failure when deployed in YARN? - posted by Matthias Kricke <Ma...@mgm-tp.com> on 2014/07/16 09:21:05 UTC, 2 replies.
- Spark Streaming, external windowing? - posted by Sargun Dhillon <sa...@sargun.me> on 2014/07/16 09:56:36 UTC, 2 replies.
- Server IPC version 7 cannot communicate with client version 4 with Spark Streaming 1.0.0 in Java and CH4 quickstart in local mode - posted by Juan Rodríguez Hortalá <ju...@gmail.com> on 2014/07/16 11:32:49 UTC, 2 replies.
- Reading file header in Spark - posted by Silvina Caíno Lores <si...@gmail.com> on 2014/07/16 12:01:57 UTC, 2 replies.
- Read all the columns from a file in spark sql - posted by pandees waran <pa...@gmail.com> on 2014/07/16 14:24:24 UTC, 2 replies.
- Problem running Spark shell (1.0.0) on EMR - posted by Ian Wilkinson <ia...@me.com> on 2014/07/16 15:10:12 UTC, 1 replies.
- Simple record matching using Spark SQL - posted by Sarath Chandra <sa...@algofusiontech.com> on 2014/07/16 15:21:07 UTC, 16 replies.
- Re: Re: how to construct a ClassTag object as a method parameter in Java - posted by balvisio <ba...@mit.edu> on 2014/07/16 15:53:07 UTC, 0 replies.
- Gradient Boosting Decision Trees - posted by Pedro Silva <jp...@gmail.com> on 2014/07/16 18:08:47 UTC, 2 replies.
- Errors accessing hdfs while in local mode - posted by Chris DuBois <ch...@gmail.com> on 2014/07/16 18:20:03 UTC, 3 replies.
- running Spark App on Yarn produces: Exception in thread "main" java.lang.NoSuchFieldException: DEFAULT_YARN_APPLICATION_CLASSPATH - posted by Andrew Milkowski <am...@gmail.com> on 2014/07/16 18:45:47 UTC, 7 replies.
- using multiple dstreams together (spark streaming) - posted by Walrus theCat <wa...@gmail.com> on 2014/07/16 19:08:32 UTC, 7 replies.
- ClassNotFoundException: $line11.$read$ when loading an HDFS text file with SparkQL in spark-shell - posted by Svend <sv...@gmail.com> on 2014/07/16 19:31:04 UTC, 6 replies.
- Difference among batchDuration, windowDuration, slideDuration - posted by "hsy541@gmail.com" <hs...@gmail.com> on 2014/07/16 20:28:15 UTC, 5 replies.
- SaveAsTextFile of RDD taking much time - posted by sudiprc <su...@gmail.com> on 2014/07/17 00:02:58 UTC, 0 replies.
- Release date for new pyspark - posted by Paul Wais <pw...@yelp.com> on 2014/07/17 01:03:08 UTC, 4 replies.
- Spark Streaming timestamps - posted by Bill Jay <bi...@gmail.com> on 2014/07/17 02:39:15 UTC, 5 replies.
- spark-ec2 script with Tachyon - posted by nit <ni...@gmail.com> on 2014/07/17 02:54:44 UTC, 0 replies.
- Use Spark with HBase' HFileOutputFormat - posted by Jianshi Huang <ji...@gmail.com> on 2014/07/17 03:44:13 UTC, 0 replies.
- Kmeans - posted by amin mohebbi <am...@yahoo.com> on 2014/07/17 05:16:31 UTC, 2 replies.
- spark building error - posted by Jack Yang <ji...@uow.edu.au> on 2014/07/17 06:21:34 UTC, 1 replies.
- jar changed on src filesystem - posted by cmti95035 <cm...@gmail.com> on 2014/07/17 07:49:50 UTC, 4 replies.
- can we insert and update with spark sql - posted by "Hu, Leo" <le...@sap.com> on 2014/07/17 08:12:56 UTC, 2 replies.
- Using RDD in RDD transformation - posted by tbin <tb...@foxmail.com> on 2014/07/17 08:23:21 UTC, 0 replies.
- preservesPartitioning - posted by Kamal Banga <ba...@gmail.com> on 2014/07/17 09:02:32 UTC, 1 replies.
- Pysparkshell are not listing in the web UI while running - posted by MEETHU MATHEW <me...@yahoo.co.in> on 2014/07/17 10:05:28 UTC, 2 replies.
- class after join - posted by Luis Guerra <lu...@gmail.com> on 2014/07/17 10:15:08 UTC, 3 replies.
- Speeding up K-Means Clustering - posted by Ravishankar Rajagopalan <vi...@gmail.com> on 2014/07/17 10:48:58 UTC, 3 replies.
- Bad Digest error while doing aws s3 put - posted by lmk <la...@gmail.com> on 2014/07/17 10:57:57 UTC, 3 replies.
- Apache kafka + spark + Parquet - posted by Mahebub Sayyed <ma...@gmail.com> on 2014/07/17 11:19:17 UTC, 5 replies.
- Spark scheduling with Capacity scheduler - posted by Konstantin Kudryavtsev <ku...@gmail.com> on 2014/07/17 11:33:05 UTC, 2 replies.
- GraphX Pragel implementation - posted by Arun Kumar <to...@gmail.com> on 2014/07/17 12:54:40 UTC, 7 replies.
- Re: Getting in local shell - posted by newbee88 <fo...@gmail.com> on 2014/07/17 14:53:19 UTC, 0 replies.
- Seattle Spark Meetup: Evan Chan's Interactive OLAP Queries with Spark and Cassandra - posted by Denny Lee <de...@gmail.com> on 2014/07/17 16:00:14 UTC, 0 replies.
- Equivalent functions for NVL() and CASE expressions in Spark SQL - posted by pandees waran <pa...@gmail.com> on 2014/07/17 16:26:45 UTC, 1 replies.
- Is there a way to get previous/other keys' state in Spark Streaming? - posted by Yan Fang <ya...@gmail.com> on 2014/07/17 18:14:19 UTC, 7 replies.
- Error while running example/scala application using spark-submit - posted by ShanxT <ma...@gmail.com> on 2014/07/17 19:16:58 UTC, 4 replies.
- Need help on Spark UDF (Join) Performance tuning . - posted by S Malligarjunan <sm...@yahoo.com> on 2014/07/17 19:17:02 UTC, 3 replies.
- Custom Metrics Sink - posted by jjaffe <jj...@marinsoftware.com> on 2014/07/17 20:56:58 UTC, 0 replies.
- Include permalinks in mail footer - posted by Nick Chammas <ni...@gmail.com> on 2014/07/17 21:59:38 UTC, 3 replies.
- An abstraction over Spark - posted by Andrea Esposito <an...@gmail.com> on 2014/07/17 22:16:44 UTC, 0 replies.
- replacement for SPARK_LIBRARY_PATH ? - posted by Eric Friedman <er...@gmail.com> on 2014/07/17 22:25:07 UTC, 2 replies.
- unserializable object in Spark Streaming context - posted by Yan Fang <ya...@gmail.com> on 2014/07/17 22:37:18 UTC, 8 replies.
- how to pass extra Java opts to workers for spark streaming jobs - posted by Chen Song <ch...@gmail.com> on 2014/07/17 23:05:26 UTC, 5 replies.
- Large scale ranked recommendation - posted by "m3.sharma" <sh...@umn.edu> on 2014/07/18 00:32:39 UTC, 10 replies.
- Error with spark-submit - posted by ranjanp <pi...@hotmail.com> on 2014/07/18 00:32:40 UTC, 0 replies.
- Error with spark-submit (formatting corrected) - posted by ranjanp <pi...@hotmail.com> on 2014/07/18 00:57:26 UTC, 4 replies.
- Spark Streaming - posted by Guangle Fan <fa...@gmail.com> on 2014/07/18 01:41:24 UTC, 2 replies.
- Hive From Spark - posted by JiajiaJing <jj...@gmail.com> on 2014/07/18 02:48:21 UTC, 7 replies.
- iScala or Scala-notebook - posted by ericjohnston1989 <er...@gmail.com> on 2014/07/18 04:59:14 UTC, 2 replies.
- Cannot connect to hive metastore - posted by linkpatrickliu <li...@live.com> on 2014/07/18 05:52:59 UTC, 1 replies.
- Last step of processing is using too much memory. - posted by Roch Denis <rd...@exostatic.com> on 2014/07/18 07:14:04 UTC, 2 replies.
- spark1.0.1 spark sql error java.lang.NoClassDefFoundError: Could not initialize class $line11.$read$ - posted by Victor Sheng <vi...@gmail.com> on 2014/07/18 07:39:24 UTC, 11 replies.
- submit failure in standalone mode - posted by "Hu, Leo" <le...@sap.com> on 2014/07/18 07:57:34 UTC, 2 replies.
- error from DecisonTree Training: - posted by Jack Yang <ji...@uow.edu.au> on 2014/07/18 08:52:09 UTC, 1 replies.
- data locality - posted by Haopu Wang <HW...@qilinsoft.com> on 2014/07/18 09:35:07 UTC, 6 replies.
- incompatible local class serialVersionUID with spark & Shark - posted by Megane1994 <le...@yahoo.fr> on 2014/07/18 11:52:55 UTC, 1 replies.
- concurrent jobs - posted by Haopu Wang <HW...@qilinsoft.com> on 2014/07/18 12:06:26 UTC, 1 replies.
- TreeNodeException: No function to evaluate expression. type: AttributeReference, tree: id#0 on GROUP BY - posted by Martin Gammelsæter <ma...@gmail.com> on 2014/07/18 12:11:42 UTC, 2 replies.
- spark sql left join gives KryoException: Buffer overflow - posted by Pei-Lun Lee <pl...@appier.com> on 2014/07/18 13:05:14 UTC, 3 replies.
- What is shuffle spill to memory? - posted by Sébastien Rainville <se...@gmail.com> on 2014/07/18 13:09:13 UTC, 1 replies.
- Dividing tasks among Spark workers - posted by Madhura <da...@gmail.com> on 2014/07/18 14:57:32 UTC, 3 replies.
- Python: saving/reloading RDD - posted by Roch Denis <rd...@exostatic.com> on 2014/07/18 17:39:44 UTC, 3 replies.
- Job aborted due to stage failure: TID x failed for unknown reasons - posted by Shannon Quinn <sq...@gatech.edu> on 2014/07/18 20:30:39 UTC, 1 replies.
- Spark Streaming with long batch / window duration - posted by aaronjosephs <aa...@placeiq.com> on 2014/07/18 21:09:38 UTC, 3 replies.
- Visualization/Summary tools for Spark Streaming data - posted by Subodh Nijsure <su...@sigsensetech.com> on 2014/07/18 22:03:17 UTC, 1 replies.
- Reading Avro Sequence Files - posted by tcg <Gu...@bah.com> on 2014/07/18 22:47:10 UTC, 1 replies.
- Re: NullPointerException When Reading Avro Sequence Files - posted by aaronjosephs <aa...@placeiq.com> on 2014/07/18 22:55:44 UTC, 6 replies.
- Broadcasting a set in PySpark - posted by Vedant Dhandhania <ve...@retentionscience.com> on 2014/07/18 23:56:22 UTC, 2 replies.
- Running Spark/YARN on AWS EMR - Issues finding file on hdfs? - posted by _soumya_ <so...@gmail.com> on 2014/07/19 00:44:22 UTC, 0 replies.
- BUG in spark-ec2 script (--ebs-vol-size) and workaround... - posted by Ben Horner <be...@atigeo.com> on 2014/07/19 01:56:18 UTC, 1 replies.
- Java null pointer exception while saving hadoop file - posted by durga <du...@gmail.com> on 2014/07/19 02:34:44 UTC, 2 replies.
- Graphx : Perfomance comparison over cluster - posted by ShreyanshB <sh...@gmail.com> on 2014/07/19 04:14:10 UTC, 5 replies.
- SparkSQL operator priority - posted by Christos Kozanitis <ko...@berkeley.edu> on 2014/07/19 05:04:31 UTC, 2 replies.
- SeattleSparkMeetup: Spark at eBay - Troubleshooting the everyday issues - posted by Denny Lee <de...@gmail.com> on 2014/07/19 06:58:46 UTC, 0 replies.
- Task not serializable: java.io.NotSerializableException: org.apache.spark.SparkContext - posted by lihu <li...@gmail.com> on 2014/07/19 09:31:10 UTC, 3 replies.
- registerAsTable can't be compiled - posted by junius <ju...@gmail.com> on 2014/07/19 09:39:57 UTC, 1 replies.
- Uber jar with SBT - posted by boci <bo...@gmail.com> on 2014/07/19 15:30:49 UTC, 3 replies.
- Need help with coalesce - posted by Madhura <da...@gmail.com> on 2014/07/19 17:28:59 UTC, 0 replies.
- Real-time segmentation with SPARK - posted by Mahesh Govind <ma...@redknee.com> on 2014/07/19 18:20:51 UTC, 0 replies.
- Caching issue with msg: RDD block could not be dropped from memory as it does not exist - posted by rindra <ri...@gmail.com> on 2014/07/19 22:01:13 UTC, 3 replies.
- java.net.ConnectException: Connection timed out - posted by Soren Macbeth <so...@yieldbot.com> on 2014/07/19 22:36:45 UTC, 0 replies.
- Out of any idea - posted by boci <bo...@gmail.com> on 2014/07/19 23:39:27 UTC, 3 replies.
- Debugging spark - posted by Ruchir Jha <ru...@gmail.com> on 2014/07/20 01:01:48 UTC, 0 replies.
- Spark 1.0.1 SQL on 160 G parquet file (snappy compressed, made by cloudera impala), 23 core and 60G mem / node, yarn-client mode, always failed - posted by chutium <te...@gmail.com> on 2014/07/20 01:10:50 UTC, 9 replies.
- spark1.0.1 & hadoop2.2.0 issue - posted by "Hu, Leo" <le...@sap.com> on 2014/07/20 03:16:47 UTC, 2 replies.
- Launching with m3.2xlarge instances: /mnt and /mnt2 mounted on 7gb drive - posted by Chris DuBois <ch...@gmail.com> on 2014/07/20 10:22:13 UTC, 4 replies.
- JDBC Connections / newbie question - posted by Ahmed Ibrahim <ao...@itscorpmd.com> on 2014/07/20 20:18:08 UTC, 0 replies.
- RDD.pipe(...) - posted by jay vyas <ja...@gmail.com> on 2014/07/20 22:09:43 UTC, 1 replies.
- JDBCRDD / Example - posted by Ahmed Ibrahim <ao...@itscorpmd.com> on 2014/07/21 03:27:50 UTC, 1 replies.
- What does @developerApi means? - posted by 我是will <kh...@qq.com> on 2014/07/21 05:46:42 UTC, 1 replies.
- which kind of BlockId should I use? - posted by william <kh...@qq.com> on 2014/07/21 07:25:40 UTC, 1 replies.
- 回复: What does @developerApi means? - posted by william <kh...@qq.com> on 2014/07/21 07:52:52 UTC, 0 replies.
- 回复: which kind of BlockId should I use? - posted by william <kh...@qq.com> on 2014/07/21 07:53:45 UTC, 0 replies.
- LabeledPoint with weight - posted by Jiusheng Chen <ch...@gmail.com> on 2014/07/21 09:58:54 UTC, 1 replies.
- Data row-operation processing advice - posted by Brian Cohn <bc...@gmail.com> on 2014/07/21 12:04:06 UTC, 0 replies.
- Can't see any thing one the storage panel of application UI - posted by binbinbin915 <bi...@live.cn> on 2014/07/21 12:46:25 UTC, 1 replies.
- java.lang.OutOfMemoryError: GC overhead limit exceeded - posted by Yifan LI <ia...@gmail.com> on 2014/07/21 14:48:21 UTC, 2 replies.
- DROP IF EXISTS still throws exception about "table does not exist"? - posted by Nan Zhu <zh...@gmail.com> on 2014/07/21 16:10:25 UTC, 3 replies.
- Is there anyone who use streaming join to filter spam as guide mentioned? - posted by hawkwang <wa...@gmail.com> on 2014/07/21 16:23:30 UTC, 2 replies.
- Why spark-submit command hangs? - posted by Sam Liu <li...@sina.com> on 2014/07/21 16:47:06 UTC, 4 replies.
- Is deferred execution of multiple RDDs ever coming? - posted by Harry Brundage <ha...@shopify.com> on 2014/07/21 17:23:27 UTC, 0 replies.
- gain access to persisted rdd - posted by mrm <ma...@skimlinks.com> on 2014/07/21 17:37:08 UTC, 3 replies.
- Give more Java Heap Memory on Standalone mode - posted by "Nick R. Katsipoulakis" <ka...@cs.pitt.edu> on 2014/07/21 18:35:12 UTC, 3 replies.
- relationship of RDD[Array[String]] to Array[Array[String]] - posted by Philip Ogren <ph...@oracle.com> on 2014/07/21 19:01:58 UTC, 3 replies.
- Re: LiveListenerBus throws exception and weird web UI bug - posted by mrm <ma...@skimlinks.com> on 2014/07/21 19:23:41 UTC, 1 replies.
- RDD pipe partitionwise - posted by Jaonary Rabarisoa <ja...@gmail.com> on 2014/07/21 19:51:47 UTC, 0 replies.
- launching a spark cluster in ec2 from within an application - posted by "M@" <ma...@gmail.com> on 2014/07/21 21:04:12 UTC, 0 replies.
- Does spark streaming fit to our application - posted by srinivas <ku...@gmail.com> on 2014/07/21 22:54:50 UTC, 0 replies.
- broadcast variable get cleaned by ContextCleaner unexpectedly ? - posted by Nan Zhu <zh...@gmail.com> on 2014/07/21 23:29:37 UTC, 5 replies.
- Spark Partitioner vs Cassandra Partitioner - posted by Marcelo Elias Del Valle <ma...@s1mbi0se.com.br> on 2014/07/22 00:16:16 UTC, 0 replies.
- unable to create rdd with pyspark newAPIHadoopRDD - posted by umeshdangat <um...@gmail.com> on 2014/07/22 01:10:24 UTC, 0 replies.
- Re: error from DecisonTree Training: - posted by Xiangrui Meng <me...@gmail.com> on 2014/07/22 01:30:58 UTC, 1 replies.
- Re: How to map each line to (line number, line)? - posted by Andrew Ash <an...@andrewash.com> on 2014/07/22 01:40:07 UTC, 0 replies.
- 答复: LiveListenerBus throws exception and weird web UI bug - posted by "余根茂(木艮)" <ge...@alibaba-inc.com> on 2014/07/22 02:15:42 UTC, 0 replies.
- Joining by timestamp. - posted by durga <du...@gmail.com> on 2014/07/22 02:41:08 UTC, 6 replies.
- saveAsSequenceFile for DStream - posted by Barnaby <bf...@outlook.com> on 2014/07/22 03:06:42 UTC, 2 replies.
- Understanding Spark - posted by omergul123 <om...@gmail.com> on 2014/07/22 03:49:28 UTC, 0 replies.
- defaultMinPartitions in textFile - posted by "Wang, Jensen" <je...@sap.com> on 2014/07/22 04:18:15 UTC, 2 replies.
- Question about initial message in graphx - posted by Bin WU <bw...@connect.ust.hk> on 2014/07/22 05:05:23 UTC, 1 replies.
- new error for me - posted by Nathan Kronenfeld <nk...@oculusinfo.com> on 2014/07/22 05:35:22 UTC, 1 replies.
- Re: Executor metrics in spark application - posted by Denes <te...@outlook.com> on 2014/07/22 08:01:43 UTC, 5 replies.
- number of "Cached Partitions" v.s. "Total Partitions" - posted by Haopu Wang <HW...@qilinsoft.com> on 2014/07/22 09:09:18 UTC, 1 replies.
- Spark over graphviz (SPARK-1015, SPARK-975) - posted by jay vyas <ja...@gmail.com> on 2014/07/22 14:57:56 UTC, 0 replies.
- collect() on small group of Avro files causes plain NullPointerException - posted by Sparky <Gu...@bah.com> on 2014/07/22 15:01:13 UTC, 1 replies.
- Re: collect() on small list causes NullPointerException - posted by Sparky <Gu...@bah.com> on 2014/07/22 15:18:28 UTC, 0 replies.
- hadoop version - posted by mrm <ma...@skimlinks.com> on 2014/07/22 16:07:47 UTC, 2 replies.
- the implications of some items in webUI - posted by Yifan LI <ia...@gmail.com> on 2014/07/22 16:08:20 UTC, 1 replies.
- Using case classes as keys does not seem to work. - posted by Gerard Maas <ge...@gmail.com> on 2014/07/22 16:20:37 UTC, 4 replies.
- Spark Streaming - How to save all items in batchs from beginning to a single stream rdd? - posted by hawkwang <wa...@gmail.com> on 2014/07/22 16:59:31 UTC, 0 replies.
- Tranforming flume events using Spark transformation functions - posted by "Sundaram, Muthu X." <Mu...@sabre.com> on 2014/07/22 17:24:11 UTC, 2 replies.
- Re: Spark Streaming source from Amazon Kinesis - posted by Chris Fregly <ch...@fregly.com> on 2014/07/22 18:30:11 UTC, 1 replies.
- Spark sql with hive table running on Yarn-cluster mode - posted by Jenny Zhao <li...@gmail.com> on 2014/07/22 19:19:05 UTC, 1 replies.
- Spark app vs SparkSQL app - posted by buntu <bu...@gmail.com> on 2014/07/22 19:34:01 UTC, 0 replies.
- combineByKey at ShuffledDStream.scala - posted by Bill Jay <bi...@gmail.com> on 2014/07/22 20:05:55 UTC, 2 replies.
- Very wierd behavior - posted by Nathan Kronenfeld <nk...@oculusinfo.com> on 2014/07/22 21:04:38 UTC, 1 replies.
- Need info on log4j.properties for apache spark. - posted by abhiguruvayya <sh...@gmail.com> on 2014/07/22 21:42:52 UTC, 1 replies.
- Spark Streaming: no job has started yet - posted by Bill Jay <bi...@gmail.com> on 2014/07/22 21:52:59 UTC, 2 replies.
- What if there are large, read-only variables shared by all map functions? - posted by Parthus <pe...@gmail.com> on 2014/07/22 21:54:03 UTC, 2 replies.
- How to do an interactive Spark SQL - posted by "hsy541@gmail.com" <hs...@gmail.com> on 2014/07/23 01:03:49 UTC, 7 replies.
- Spark clustered client - posted by Asaf Lahav <as...@gmail.com> on 2014/07/23 01:20:39 UTC, 1 replies.
- How could I start new spark cluster with hadoop2.0.2 - posted by durga <du...@gmail.com> on 2014/07/23 02:37:10 UTC, 3 replies.
- streaming window not behaving as advertised (v1.0.1) - posted by Alan Ngai <al...@opsclarity.com> on 2014/07/23 03:01:59 UTC, 4 replies.
- Where is the "PowerGraph abstraction" - posted by shijiaxin <sh...@gmail.com> on 2014/07/23 05:22:16 UTC, 1 replies.
- Re: Spark 0.9.1 core dumps on Mesos 0.18.0 - posted by Dale Johnson <da...@ebay.com> on 2014/07/23 08:53:11 UTC, 2 replies.
- "spark.streaming.unpersist" and "spark.cleaner.ttl" - posted by Haopu Wang <HW...@qilinsoft.com> on 2014/07/23 09:00:05 UTC, 4 replies.
- Spark deployed by Cloudera Manager - posted by Debasish Das <de...@gmail.com> on 2014/07/23 09:08:27 UTC, 5 replies.
- Use of SPARK_DAEMON_JAVA_OPTS - posted by MEETHU MATHEW <me...@yahoo.co.in> on 2014/07/23 10:04:10 UTC, 1 replies.
- spark-shell -- running into ArrayIndexOutOfBoundsException - posted by buntu <bu...@gmail.com> on 2014/07/23 10:52:21 UTC, 2 replies.
- Spark execution plan - posted by Luis Guerra <lu...@gmail.com> on 2014/07/23 11:03:30 UTC, 2 replies.
- driver memory - posted by mrm <ma...@skimlinks.com> on 2014/07/23 12:29:52 UTC, 2 replies.
- Re: java.lang.StackOverflowError when calling count() - posted by lalit1303 <la...@sigmoidanalytics.com> on 2014/07/23 13:13:54 UTC, 1 replies.
- Down-scaling Spark on EC2 cluster - posted by Shubhabrata <ma...@gmail.com> on 2014/07/23 15:06:15 UTC, 4 replies.
- Configuring Spark Memory - posted by Martin Goodson <ma...@skimlinks.com> on 2014/07/23 15:10:39 UTC, 8 replies.
- Cluster submit mode - only supported on Yarn? - posted by Chris Schneider <ch...@christopher-schneider.com> on 2014/07/23 16:39:53 UTC, 2 replies.
- Workarounds for accessing sequence file data via PySpark? - posted by Gary Malouf <ma...@gmail.com> on 2014/07/23 16:42:12 UTC, 1 replies.
- Have different reduce key than mapper key - posted by soumick86 <sd...@dstsystems.com> on 2014/07/23 17:22:12 UTC, 1 replies.
- Re: wholeTextFiles not working with HDFS - posted by kmader <ke...@gmail.com> on 2014/07/23 17:25:32 UTC, 1 replies.
- Lost executors - posted by Eric Friedman <er...@gmail.com> on 2014/07/23 17:27:54 UTC, 3 replies.
- akka 2.3.x? - posted by Lee Mighdoll <le...@underneath.ca> on 2014/07/23 18:16:14 UTC, 3 replies.
- error: bad symbolic reference. A signature in SparkContext.class refers to term io in package org.apache.hadoop which is not available - posted by Sameer Tilak <ss...@live.com> on 2014/07/23 19:01:48 UTC, 3 replies.
- why there is only getString(index) but no getString(columnName) in catalyst.expressions.Row.scala ? - posted by chutium <te...@gmail.com> on 2014/07/23 19:37:41 UTC, 0 replies.
- spark-submit to remote master fails - posted by didi <di...@gmail.com> on 2014/07/23 19:40:49 UTC, 1 replies.
- Convert raw data files to Parquet format - posted by buntu <bu...@gmail.com> on 2014/07/23 20:09:15 UTC, 5 replies.
- Help in merging a RDD agaisnt itself using the V of a (K,V). - posted by Roch Denis <rd...@exostatic.com> on 2014/07/23 20:21:49 UTC, 4 replies.
- spark github source build error - posted by "m3.sharma" <sh...@umn.edu> on 2014/07/23 20:23:55 UTC, 2 replies.
- Error in History UI - Seeing stdout/stderr - posted by balvisio <ba...@mit.edu> on 2014/07/23 21:01:51 UTC, 0 replies.
- Spark cluster spanning multiple data centers - posted by Ray Qiu <ra...@gmail.com> on 2014/07/23 21:30:01 UTC, 0 replies.
- using shapeless in spark to optimize data layout in memory - posted by Koert Kuipers <ko...@tresata.com> on 2014/07/23 22:04:09 UTC, 0 replies.
- Get Spark Streaming timestamp - posted by Bill Jay <bi...@gmail.com> on 2014/07/23 23:39:18 UTC, 2 replies.
- Announcing Spark 0.9.2 - posted by Xiangrui Meng <me...@gmail.com> on 2014/07/24 00:16:03 UTC, 0 replies.
- persistent HDFS instance for cluster restarts/destroys - posted by durga <du...@gmail.com> on 2014/07/24 00:26:27 UTC, 2 replies.
- streaming sequence files? - posted by Barnaby <bf...@outlook.com> on 2014/07/24 03:43:31 UTC, 4 replies.
- spark streaming actor receiver doesn't play well with kryoserializer - posted by Alan Ngai <al...@opsclarity.com> on 2014/07/24 12:09:00 UTC, 4 replies.
- Spark Function setup and cleanup - posted by Yosi Botzer <yo...@gmail.com> on 2014/07/24 12:32:36 UTC, 5 replies.
- Starting with spark - posted by Sameer Sayyed <sa...@gmail.com> on 2014/07/24 12:53:27 UTC, 5 replies.
- save to HDFS - posted by lmk <la...@gmail.com> on 2014/07/24 12:54:42 UTC, 4 replies.
- GraphX for pyspark? - posted by Eric Friedman <er...@gmail.com> on 2014/07/24 16:32:23 UTC, 0 replies.
- Spark got stuck with a loop - posted by Denis RP <qq...@gmail.com> on 2014/07/24 17:53:41 UTC, 1 replies.
- rdd.saveAsTextFile blows up - posted by Eric Friedman <er...@gmail.com> on 2014/07/24 19:30:35 UTC, 2 replies.
- GraphX canonical conflation issues - posted by e5c <el...@gmail.com> on 2014/07/24 19:52:39 UTC, 0 replies.
- continuing processing when errors occur - posted by Art Peel <fo...@gmail.com> on 2014/07/24 20:12:52 UTC, 2 replies.
- Getting the number of slaves - posted by Nicolas Mai <ni...@gmail.com> on 2014/07/24 20:16:58 UTC, 6 replies.
- Spark Training at Scala By the Bay with Databricks, Fast Tracl to Scala - posted by Alexy Khrabrov <al...@scalable.pro> on 2014/07/24 20:29:42 UTC, 0 replies.
- Emacs Setup Anyone? - posted by Steve Nunez <sn...@hortonworks.com> on 2014/07/24 21:14:17 UTC, 2 replies.
- Kmeans: set initial centers explicitly - posted by SK <sk...@gmail.com> on 2014/07/24 21:39:14 UTC, 1 replies.
- KMeans: expensiveness of large vectors - posted by durin <ma...@simon-schaefer.net> on 2014/07/24 23:30:24 UTC, 7 replies.
- cache changes precision - posted by Ron Gonzalez <zl...@yahoo.com> on 2014/07/24 23:41:15 UTC, 2 replies.
- mapToPair vs flatMapToPair vs flatMap function usage. - posted by abhiguruvayya <sh...@gmail.com> on 2014/07/25 00:41:08 UTC, 2 replies.
- Need help, got java.lang.ExceptionInInitializerError in Yarn-Client/Cluster mode - posted by Jianshi Huang <ji...@gmail.com> on 2014/07/25 06:24:37 UTC, 5 replies.
- actor serialization error - posted by Alan Ngai <al...@opsclarity.com> on 2014/07/25 07:31:50 UTC, 0 replies.
- Hadoop client protocol mismatch with spark 1.0.1, cdh3u5 - posted by Bharath Ravi Kumar <re...@gmail.com> on 2014/07/25 12:12:59 UTC, 6 replies.
- Strange exception on coalesce() - posted by innowireless TaeYun Kim <ta...@innowireless.co.kr> on 2014/07/25 14:34:17 UTC, 3 replies.
- EOFException when I list all files in hdfs directory - posted by Sparky <Gu...@bah.com> on 2014/07/25 15:22:21 UTC, 4 replies.
- How to pass additional options to Mesos when submitting job? - posted by Krisztián Szűcs <sz...@gmail.com> on 2014/07/25 16:31:07 UTC, 0 replies.
- NMF implementaion is Spark - posted by Aureliano Buendia <bu...@gmail.com> on 2014/07/25 16:38:04 UTC, 1 replies.
- Support for Percentile and Variance Aggregation functions in Spark with HiveContext - posted by vi...@socialinfra.net on 2014/07/25 17:06:48 UTC, 1 replies.
- Initial job has not accepted any resources (but workers are in UI) - posted by Ed Sweeney <ed...@falkonry.com> on 2014/07/25 17:08:42 UTC, 2 replies.
- sharing spark context among machines - posted by myxjtu <my...@yahoo.com> on 2014/07/25 17:54:10 UTC, 0 replies.
- Issue submitting spark job to yarn - posted by Ron Gonzalez <zl...@yahoo.com> on 2014/07/25 18:36:28 UTC, 1 replies.
- Using Spark Streaming with Kafka 0.7.2 - posted by maddenpj <ma...@gmail.com> on 2014/07/25 19:16:21 UTC, 2 replies.
- Re: Are all transformations lazy? - posted by Rico <ri...@gmail.com> on 2014/07/25 19:21:13 UTC, 0 replies.
- sparkcontext stop and then start again - posted by Mohit Jaggi <mo...@gmail.com> on 2014/07/25 19:21:50 UTC, 1 replies.
- Kryo Issue on Spark 1.0.1, Mesos 0.18.2 - posted by Gary Malouf <ma...@gmail.com> on 2014/07/25 20:27:56 UTC, 1 replies.
- saveAsTextFiles file not found exception - posted by Bill Jay <bi...@gmail.com> on 2014/07/25 22:32:06 UTC, 2 replies.
- Spark SQL and Hive tables - posted by Sameer Tilak <ss...@live.com> on 2014/07/25 23:25:01 UTC, 12 replies.
- SparkSQL extensions - posted by Christos Kozanitis <ko...@berkeley.edu> on 2014/07/26 12:32:17 UTC, 3 replies.
- How can I integrate spark cluster into my own program without using spark-submit? - posted by "Lizhengbing (bing, BIPA)" <zh...@huawei.com> on 2014/07/26 14:28:55 UTC, 0 replies.
- Fwd: Exception : org.apache.hadoop.hive.ql.metadata.HiveException: Unable to fetch table - posted by Bilna Govind <bi...@gmail.com> on 2014/07/26 15:00:33 UTC, 1 replies.
- Help using streaming from Spark Shell - posted by Yana Kadiyska <ya...@gmail.com> on 2014/07/26 16:50:13 UTC, 1 replies.
- Lot of object serialization even with MEMORY_ONLY - posted by "lokesh.gidra" <lo...@gmail.com> on 2014/07/26 21:53:45 UTC, 0 replies.
- "Spilling in-memory..." messages in log even with MEMORY_ONLY - posted by "lokesh.gidra" <lo...@gmail.com> on 2014/07/26 22:22:08 UTC, 9 replies.
- graphx cached partitions wont go away - posted by Koert Kuipers <ko...@tresata.com> on 2014/07/26 22:44:56 UTC, 1 replies.
- Re: SparkContext startup time out - posted by Anand Avati <av...@gluster.org> on 2014/07/27 04:30:42 UTC, 0 replies.
- Spark MLlib vs BIDMach Benchmark - posted by DB Tsai <db...@dbtsai.com> on 2014/07/27 05:30:45 UTC, 3 replies.
- MLlib NNLS implementation is buggy, returning wrong solutions - posted by Aureliano Buendia <bu...@gmail.com> on 2014/07/27 20:06:51 UTC, 3 replies.
- Maximum jobs finish very soon, some of them take longer time. - posted by Sarthak Dash <da...@gmail.com> on 2014/07/28 02:20:19 UTC, 0 replies.
- Spark as a application library vs infra - posted by Mayur Rustagi <ma...@gmail.com> on 2014/07/28 03:32:44 UTC, 4 replies.
- spark checkpoint details - posted by Madabhattula Rajesh Kumar <mr...@gmail.com> on 2014/07/28 06:09:42 UTC, 0 replies.
- subscribe - posted by James Todd <ja...@gmail.com> on 2014/07/28 08:31:17 UTC, 0 replies.
- Re: Hadoop Input Format - newAPIHadoopFile - posted by chang cheng <my...@gmail.com> on 2014/07/28 09:15:49 UTC, 0 replies.
- VertexPartition and ShippableVertexPartition - posted by shijiaxin <sh...@gmail.com> on 2014/07/28 09:41:37 UTC, 2 replies.
- Confusing behavior of newAPIHadoopFile - posted by chang cheng <my...@gmail.com> on 2014/07/28 10:02:14 UTC, 9 replies.
- [Spark 1.0.1][SparkSQL] reduce stage of shuffle is slow。 - posted by Earthson <Ea...@gmail.com> on 2014/07/28 10:52:42 UTC, 4 replies.
- NotSerializableException exception while using TypeTag in Scala 2.10 - posted by Aniket Bhatnagar <an...@gmail.com> on 2014/07/28 12:33:34 UTC, 0 replies.
- sbt directory missed - posted by redocpot <ju...@gmail.com> on 2014/07/28 17:15:24 UTC, 4 replies.
- Debugging "Task not serializable" - posted by Juan Rodríguez Hortalá <ju...@gmail.com> on 2014/07/28 17:51:07 UTC, 4 replies.
- Re: Fraud management system implementation - posted by Sandy Ryza <sa...@cloudera.com> on 2014/07/28 18:13:40 UTC, 1 replies.
- akka.tcp://spark@localhost:7077/user/MapOutputTracker akka.actor.ActorNotFound - posted by Andrew Milkowski <am...@gmail.com> on 2014/07/28 20:42:35 UTC, 1 replies.
- how to publish spark inhouse? - posted by Koert Kuipers <ko...@tresata.com> on 2014/07/28 21:05:06 UTC, 9 replies.
- javasparksql Hbase - posted by Madabhattula Rajesh Kumar <mr...@gmail.com> on 2014/07/28 21:12:51 UTC, 0 replies.
- Spark java.lang.AbstractMethodError - posted by Alex Minnaar <am...@verticalscope.com> on 2014/07/28 21:39:44 UTC, 1 replies.
- Issues on spark-shell and spark-submit behave differently on spark-defaults.conf parameter spark.eventLog.dir - posted by Andrew Lee <al...@hotmail.com> on 2014/07/28 21:40:26 UTC, 2 replies.
- Re: Spark streaming vs. spark usage - posted by Nathan Kronenfeld <nk...@oculusinfo.com> on 2014/07/28 21:53:50 UTC, 2 replies.
- zip two RDD in pyspark - posted by lllll <li...@gmail.com> on 2014/07/28 21:58:54 UTC, 2 replies.
- Posterior probability in PySpark (MLLib) models - posted by Vedant Dhandhania <ve...@retentionscience.com> on 2014/07/29 02:17:25 UTC, 0 replies.
- ssh connection refused - posted by sparking <re...@gmail.com> on 2014/07/29 02:17:33 UTC, 1 replies.
- How true is this about spark streaming? - posted by Rohit Pujari <rp...@hortonworks.com> on 2014/07/29 02:37:30 UTC, 3 replies.
- evaluating classification accuracy - posted by SK <sk...@gmail.com> on 2014/07/29 03:07:38 UTC, 3 replies.
- hdfs.BlockMissingException on Iterator.hasNext() in mapPartitionsWithIndex() - posted by innowireless TaeYun Kim <ta...@innowireless.co.kr> on 2014/07/29 03:27:08 UTC, 0 replies.
- The function of ClosureCleaner.clean - posted by "Wang, Jensen" <je...@sap.com> on 2014/07/29 05:28:06 UTC, 2 replies.
- Reading hdf5 formats with pyspark - posted by Mohit Singh <mo...@gmail.com> on 2014/07/29 06:05:26 UTC, 1 replies.
- HiveContext is creating metastore warehouse locally instead of in hdfs - posted by nikroy16 <ni...@gmail.com> on 2014/07/29 06:51:28 UTC, 6 replies.
- Joining spark user group - posted by jitendra shelar <ji...@gmail.com> on 2014/07/29 07:40:17 UTC, 2 replies.
- SparkSQL can not use SchemaRDD from Hive - posted by Kevin Jung <it...@samsung.com> on 2014/07/29 07:47:14 UTC, 3 replies.
- [GraphX] How to access a vertex via vertexId? - posted by Bin <wu...@126.com> on 2014/07/29 11:40:09 UTC, 6 replies.
- SPARK OWLQN Exception: Iteration Stage is so slow - posted by John Wu <jo...@zamplus.com> on 2014/07/29 11:51:42 UTC, 1 replies.
- UpdatestateByKey assumptions - posted by RodrigoB <ro...@aspect.com> on 2014/07/29 13:08:54 UTC, 1 replies.
- Avro Schema + GenericRecord to HadoopRDD - posted by "Laird, Benjamin" <Be...@capitalone.com> on 2014/07/29 17:00:37 UTC, 2 replies.
- Unit Testing (JUnit) with Spark - posted by soumick86 <sd...@dstsystems.com> on 2014/07/29 17:29:48 UTC, 5 replies.
- Job using Spark for Machine Learning - posted by Martin Goodson <ma...@skimlinks.com> on 2014/07/29 17:34:12 UTC, 1 replies.
- the pregel operator of graphx throws NullPointerException - posted by Denis RP <qq...@gmail.com> on 2014/07/29 18:16:00 UTC, 4 replies.
- GraphX Connected Components - posted by Jeffrey Picard <jp...@columbia.edu> on 2014/07/29 19:27:34 UTC, 5 replies.
- Empty RDD after LzoTextInputFormat in newAPIHadoopFile - posted by Ivoirians <kv...@gmail.com> on 2014/07/29 20:05:47 UTC, 0 replies.
- python project like spark-jobserver? - posted by Chris Grier <gr...@imchris.org> on 2014/07/29 20:09:01 UTC, 0 replies.
- Using countApproxDistinct in pyspark - posted by Diederik <dv...@gmail.com> on 2014/07/29 20:45:45 UTC, 1 replies.
- Spark and Flume integration - do I understand this correctly? - posted by dapooley <da...@gmail.com> on 2014/07/29 21:13:04 UTC, 3 replies.
- Example standalone app error! - posted by Alex Minnaar <am...@verticalscope.com> on 2014/07/29 22:01:12 UTC, 3 replies.
- java.io.StreamCorruptedException: invalid type code: 00 - posted by Alexis Roos <al...@gmail.com> on 2014/07/29 23:53:59 UTC, 1 replies.
- How to submit Pyspark job in mesos? - posted by daijia <ji...@intsig.com> on 2014/07/30 03:42:54 UTC, 4 replies.
- How do you debug a PythonException? - posted by Nick Chammas <ni...@gmail.com> on 2014/07/30 03:56:30 UTC, 6 replies.
- How to specify the job to run on the specific nodes(machines) in the hadoop yarn cluster? - posted by adu <du...@hzduozhun.com> on 2014/07/30 05:45:03 UTC, 2 replies.
- Is it possible to read file head in each partition? - posted by Fengyun RAO <ra...@gmail.com> on 2014/07/30 06:02:51 UTC, 5 replies.
- why a machine learning application run slowly on the spark cluster - posted by Tan Tim <un...@gmail.com> on 2014/07/30 06:15:59 UTC, 7 replies.
- Logging in Spark through YARN. - posted by Archit Thakur <ar...@gmail.com> on 2014/07/30 08:37:48 UTC, 2 replies.
- Converting matrix format - posted by Chengi Liu <ch...@gmail.com> on 2014/07/30 08:39:40 UTC, 2 replies.
- Spark Streaming : CassandraRDD not getting refreshed with new rows in column family - posted by Praful CJ <cj...@gmail.com> on 2014/07/30 09:04:48 UTC, 0 replies.
- Spark & Ooyala Job Server - posted by nightwolf <ni...@gmail.com> on 2014/07/30 09:12:32 UTC, 0 replies.
- NotSerializableException - posted by Ron Gonzalez <zl...@yahoo.com> on 2014/07/30 09:19:23 UTC, 0 replies.
- spark.shuffle.consolidateFiles seems not working - posted by Jianshi Huang <ji...@gmail.com> on 2014/07/30 10:01:06 UTC, 4 replies.
- spark.scheduler.pool seems not working in spark streaming - posted by liuwei <st...@126.com> on 2014/07/30 10:43:42 UTC, 2 replies.
- Is there a way to write spark RDD to Avro files - posted by Fengyun RAO <ra...@gmail.com> on 2014/07/30 11:14:32 UTC, 3 replies.
- Initialize custom serializer on YARN - posted by Anthony F <af...@gmail.com> on 2014/07/30 14:40:53 UTC, 0 replies.
- Streaming on different store types - posted by Flavio Pompermaier <po...@okkam.it> on 2014/07/30 16:45:29 UTC, 1 replies.
- Re: Spark 0.9.1 - saveAsTextFile() exception: _temporary doesn't exist! - posted by Andrew Ash <an...@andrewash.com> on 2014/07/30 16:54:11 UTC, 0 replies.
- the EC2 setup script often will not allow me to SSH into my machines. Ideas? - posted by William Cox <wi...@distilnetworks.com> on 2014/07/30 17:22:38 UTC, 4 replies.
- Worker logs - posted by Ruchir Jha <ru...@gmail.com> on 2014/07/30 19:04:33 UTC, 1 replies.
- Keep state inside map function - posted by Kevin <ke...@gmail.com> on 2014/07/30 19:07:47 UTC, 5 replies.
- Implementing percentile through top Vs take - posted by Bharath Ravi Kumar <re...@gmail.com> on 2014/07/30 20:07:42 UTC, 2 replies.
- Do I need to know Scala to take full advantage of spark? - posted by Majid Azimi <az...@yahoo.com> on 2014/07/30 20:15:49 UTC, 1 replies.
- Partioner to process data in the same order for each key - posted by Venkat Subramanian <vs...@gmail.com> on 2014/07/30 20:32:18 UTC, 0 replies.
- Re: Decision Tree requires regression LabeledPoint - posted by SK <sk...@gmail.com> on 2014/07/30 21:18:29 UTC, 0 replies.
- Number of partitions and Number of concurrent tasks - posted by Darin McBeath <dd...@yahoo.com> on 2014/07/30 21:56:20 UTC, 4 replies.
- Re: Spark Streaming Checkpoint: SparkContext is not serializable class - posted by RodrigoB <ro...@aspect.com> on 2014/07/30 21:58:11 UTC, 0 replies.
- Re: Spark SQL JDBC Connectivity - posted by Venkat Subramanian <vs...@gmail.com> on 2014/07/30 22:04:44 UTC, 1 replies.
- Installing Spark 1.0.1 - posted by cetaylor <co...@hp.com> on 2014/07/30 23:35:42 UTC, 0 replies.
- Data from Mysql using JdbcRDD - posted by srinivas <ku...@gmail.com> on 2014/07/31 00:55:26 UTC, 3 replies.
- Spark fault tolerance after a executor failure. - posted by Sung Hwan Chung <co...@cs.stanford.edu> on 2014/07/31 01:02:49 UTC, 0 replies.
- A task gets stuck after following messages in the std error log: - posted by Sung Hwan Chung <co...@cs.stanford.edu> on 2014/07/31 01:29:44 UTC, 0 replies.
- Spark Deployment Patterns - Automated Deployment & Performance Testing - posted by nightwolf <ni...@gmail.com> on 2014/07/31 01:51:45 UTC, 1 replies.
- Deploying spark applications from within Eclipse? - posted by nunarob <ro...@nunahealth.com> on 2014/07/31 02:20:52 UTC, 1 replies.
- RDD.coalesce got compilation error - posted by Jianshi Huang <ji...@gmail.com> on 2014/07/31 03:45:02 UTC, 2 replies.
- Index calculation will cause integer overflow of numPartitions > 10362 in sortByKey - posted by Jianshi Huang <ji...@gmail.com> on 2014/07/31 04:47:40 UTC, 2 replies.
- Spark on Yarn - posted by "li.ching.090" <li...@gmail.com> on 2014/07/31 05:54:50 UTC, 0 replies.
- Spark partition - posted by Sameer Tilak <ss...@live.com> on 2014/07/31 07:07:58 UTC, 1 replies.
- java.util.concurrent.TimeoutException: Futures timed out after [30 seconds] - posted by Bin <wu...@126.com> on 2014/07/31 09:54:08 UTC, 1 replies.
- Ports required for running spark - posted by Konstantin Kudryavtsev <ku...@gmail.com> on 2014/07/31 12:04:08 UTC, 6 replies.
- understanding use of "filter" function in Spark - posted by Greg <gr...@zooniverse.org> on 2014/07/31 13:00:15 UTC, 1 replies.
- set spark.local.dir on driver program doesn't take effect - posted by redocpot <ju...@gmail.com> on 2014/07/31 13:58:11 UTC, 0 replies.
- configuration needed to run twitter(25GB) dataset - posted by Jiaxin Shi <sh...@gmail.com> on 2014/07/31 14:28:40 UTC, 1 replies.
- SQLCtx cacheTable - posted by Gurvinder Singh <gu...@uninett.no> on 2014/07/31 15:16:56 UTC, 0 replies.
- How to share a NonSerializable variable among tasks in the same worker node? - posted by Fengyun RAO <ra...@gmail.com> on 2014/07/31 15:47:51 UTC, 0 replies.
- SparkStreaming -- Suppored directory structure - posted by Yana Kadiyska <ya...@gmail.com> on 2014/07/31 16:07:02 UTC, 0 replies.
- Hbase - posted by Madabhattula Rajesh Kumar <mr...@gmail.com> on 2014/07/31 18:49:16 UTC, 0 replies.
- Shark/Spark running on EC2 can read from S3 bucket but cannot write to it - "Wrong FS" - posted by William Cox <wi...@distilnetworks.com> on 2014/07/31 18:50:47 UTC, 1 replies.
- java.lang.OutOfMemoryError: Java heap space - posted by Sameer Tilak <ss...@live.com> on 2014/07/31 18:58:13 UTC, 0 replies.
- Inconsistent Spark SQL behavior when column names contain dots - posted by "Budde, Adam" <bu...@amazon.com> on 2014/07/31 20:16:11 UTC, 3 replies.
- store spark streaming dstream in hdfs or cassandra - posted by salemi <al...@udo.edu> on 2014/07/31 20:46:57 UTC, 2 replies.
- SchemaRDD select expression - posted by buntu <bu...@gmail.com> on 2014/07/31 21:27:24 UTC, 6 replies.
- RDD operation examples with data? - posted by Chris Curtin <cu...@gmail.com> on 2014/07/31 22:00:06 UTC, 1 replies.
- Installing Spark 0.9.1 on EMR Cluster - posted by Rahul Bhojwani <ra...@gmail.com> on 2014/07/31 23:41:26 UTC, 1 replies.