You are viewing a plain text version of this content. The canonical link for it is here.
- [ANN]: Scala By The Bay Conference ( aka Silicon Valley Scala Symposium) - posted by Chester Chen <ch...@yahoo.com> on 2014/05/01 00:01:34 UTC, 0 replies.
- My talk on "Spark: The Next Top (Compute) Model" - posted by Dean Wampler <de...@gmail.com> on 2014/05/01 00:25:19 UTC, 6 replies.
- Re: Any advice for using big spark.cleaner.delay value in Spark Streaming? - posted by buremba <em...@gmail.com> on 2014/05/01 00:45:24 UTC, 1 replies.
- update of RDDs - posted by narayanabhatla NarasimhaMurthy <NN...@cmcltd.com> on 2014/05/01 01:12:01 UTC, 3 replies.
- Re: Strange lookup behavior. Possible bug? - posted by Yadid Ayzenberg <ya...@media.mit.edu> on 2014/05/01 01:29:40 UTC, 0 replies.
- CDH 5.0 and Spark 0.9.0 - posted by Paul Schooss <pa...@gmail.com> on 2014/05/01 01:44:46 UTC, 2 replies.
- same partition id means same location? - posted by wxhsdp <wx...@gmail.com> on 2014/05/01 03:25:57 UTC, 1 replies.
- Re: Reading multiple S3 objects, transforming, writing back one - posted by Patrick Wendell <pw...@gmail.com> on 2014/05/01 03:33:21 UTC, 7 replies.
- Re: something about memory usage - posted by wxhsdp <wx...@gmail.com> on 2014/05/01 03:53:12 UTC, 0 replies.
- How to handle this situation: Huge File Shared by All maps and Each Computer Has one copy? - posted by PengWeiPRC <pe...@gmx.com> on 2014/05/01 06:37:35 UTC, 3 replies.
- GraphX. How to remove vertex or edge? - posted by Николай Кинаш <pe...@gmail.com> on 2014/05/01 08:52:21 UTC, 1 replies.
- Broadcst RDD Lookup - posted by "vivek.ys" <vi...@gmail.com> on 2014/05/01 10:36:52 UTC, 2 replies.
- Multiple Streams with Spark Streaming - posted by Laeeq Ahmed <la...@yahoo.com> on 2014/05/01 11:54:39 UTC, 2 replies.
- "sbt/sbt run" command returns a JVM problem - posted by Carter <gy...@hotmail.com> on 2014/05/01 15:47:04 UTC, 12 replies.
- RE: What is Seq[V] in updateStateByKey? - posted by Adrian Mocanu <am...@verticalscope.com> on 2014/05/01 16:29:23 UTC, 1 replies.
- Spark Training - posted by Nicholas Chammas <ni...@gmail.com> on 2014/05/01 17:12:13 UTC, 4 replies.
- Spark profiler - posted by Punya Biswal <pb...@palantir.com> on 2014/05/01 17:14:05 UTC, 1 replies.
- Equally weighted partitions in Spark - posted by "deenar.toraskar" <de...@db.com> on 2014/05/01 17:30:37 UTC, 8 replies.
- Re: Efficient Aggregation over DB data - posted by Andrea Esposito <an...@gmail.com> on 2014/05/01 18:47:58 UTC, 0 replies.
- Spark "streaming" - posted by Mohit Singh <mo...@gmail.com> on 2014/05/01 19:45:58 UTC, 1 replies.
- permition problem - posted by "Livni, Dana" <da...@intel.com> on 2014/05/01 20:00:16 UTC, 1 replies.
- ClassNotFoundException - posted by Joe L <se...@yahoo.com> on 2014/05/01 21:03:03 UTC, 2 replies.
- Can't be built on MAC - posted by Zhige Xin <xi...@gmail.com> on 2014/05/01 21:32:13 UTC, 2 replies.
- updateStateByKey example not using correct input data? - posted by Adrian Mocanu <am...@verticalscope.com> on 2014/05/01 22:12:37 UTC, 0 replies.
- Running Spark jobs via oozie - posted by Shivani Rao <ra...@gmail.com> on 2014/05/01 22:13:10 UTC, 0 replies.
- Setting the Scala version in the EC2 script? - posted by Ian Ferreira <ia...@hotmail.com> on 2014/05/01 23:14:30 UTC, 1 replies.
- range partitioner with updateStateByKey - posted by Adrian Mocanu <am...@verticalscope.com> on 2014/05/01 23:35:59 UTC, 2 replies.
- java.lang.ClassNotFoundException - posted by İbrahim Rıza HALLAÇ <ib...@live.com> on 2014/05/02 00:37:10 UTC, 3 replies.
- Task not serializable: collect, take - posted by SK <sk...@gmail.com> on 2014/05/02 00:47:03 UTC, 2 replies.
- Re: Opinions stratosphere - posted by Christopher Nguyen <ct...@adatao.com> on 2014/05/02 02:02:00 UTC, 2 replies.
- Question regarding doing aggregation over custom partitions - posted by Arun Swami <ar...@caspida.com> on 2014/05/02 02:03:50 UTC, 2 replies.
- configure spark history server for running on Yarn - posted by Jenny Zhao <li...@gmail.com> on 2014/05/02 02:09:20 UTC, 1 replies.
- Re: Spark: issues with running a sbt fat jar due to akka dependencies - posted by Shivani Rao <ra...@gmail.com> on 2014/05/02 02:45:33 UTC, 3 replies.
- YARN issues with resourcemanager.scheduler.address - posted by zsterone <zs...@hotmail.com> on 2014/05/02 05:05:26 UTC, 1 replies.
- sbt package, NoClassDefFoundError - posted by SK <sk...@gmail.com> on 2014/05/02 05:21:03 UTC, 0 replies.
- Getting the following error using EC2 deployment - posted by Ian Ferreira <ia...@hotmail.com> on 2014/05/02 07:04:45 UTC, 0 replies.
- Fwd: New Spark Meetup Group in London, UK. First meeting 28th May - posted by Martin Goodson <ma...@gmail.com> on 2014/05/02 11:56:21 UTC, 0 replies.
- Apache Spark is not building in Mac/Java 8 - posted by "N.Venkata Naga Ravi" <nv...@hotmail.com> on 2014/05/02 12:26:27 UTC, 4 replies.
- Re: Incredible slow iterative computation - posted by Andrea Esposito <an...@gmail.com> on 2014/05/02 12:29:33 UTC, 6 replies.
- Re: help me - posted by Mayur Rustagi <ma...@gmail.com> on 2014/05/02 15:07:00 UTC, 1 replies.
- when to use broadcast variables - posted by Diana Carroll <dc...@cloudera.com> on 2014/05/02 15:12:01 UTC, 2 replies.
- Re: Scala Spark / Shark: How to access existing Hive tables in Hortonworks? - posted by Mayur Rustagi <ma...@gmail.com> on 2014/05/02 15:37:11 UTC, 0 replies.
- Re: getting an error - posted by Mayur Rustagi <ma...@gmail.com> on 2014/05/02 15:48:30 UTC, 0 replies.
- performance improvement on second operation...without caching? - posted by Diana Carroll <dc...@cloudera.com> on 2014/05/02 16:04:30 UTC, 7 replies.
- Re: Spark's behavior - posted by Eduardo Costa Alfaia <e....@unibs.it> on 2014/05/02 17:24:23 UTC, 7 replies.
- another updateStateByKey question - posted by Adrian Mocanu <am...@verticalscope.com> on 2014/05/02 18:49:05 UTC, 2 replies.
- spark-shell driver interacting with Workers in YARN mode - firewall blocking communication - posted by Andrew Lee <al...@hotmail.com> on 2014/05/02 19:52:40 UTC, 7 replies.
- GraphX vertices and connected edges - posted by Kyle Ellrott <ke...@soe.ucsc.edu> on 2014/05/02 20:34:53 UTC, 1 replies.
- Re: is it possible to initiate Spark jobs from Oozie? - posted by Shivani Rao <ra...@gmail.com> on 2014/05/02 20:55:48 UTC, 0 replies.
- Re: java.lang.ClassNotFoundException - spark on mesos - posted by "bouke@shopify.com" <bo...@shopify.com> on 2014/05/02 21:48:08 UTC, 0 replies.
- Invoke spark-shell without attempting to start the http server - posted by Stephen Boesch <ja...@gmail.com> on 2014/05/02 22:51:24 UTC, 0 replies.
- docker image build issue for spark 0.9.1 - posted by Weide Zhang <we...@gmail.com> on 2014/05/02 23:17:05 UTC, 2 replies.
- Seattle Spark Meetup Slides - posted by Denny Lee <de...@gmail.com> on 2014/05/02 23:48:45 UTC, 0 replies.
- spark 0.9.1: ClassNotFoundException - posted by SK <sk...@gmail.com> on 2014/05/02 23:58:12 UTC, 1 replies.
- Crazy Kryo Exception - posted by Soren Macbeth <so...@yieldbot.com> on 2014/05/03 00:35:28 UTC, 2 replies.
- string to int conversion - posted by SK <sk...@gmail.com> on 2014/05/03 03:00:59 UTC, 2 replies.
- Reading and processing binary format files using spark - posted by Chengi Liu <ch...@gmail.com> on 2014/05/03 06:42:30 UTC, 1 replies.
- Re: Checkpoint Vs Cache - posted by Chris Fregly <ch...@fregly.com> on 2014/05/03 08:19:49 UTC, 0 replies.
- what's local[n] - posted by Weide Zhang <we...@gmail.com> on 2014/05/03 16:00:05 UTC, 1 replies.
- Lease Exception hadoop 2.4 - posted by Andre Kuhnen <an...@gmail.com> on 2014/05/03 18:09:23 UTC, 5 replies.
- Spark-1.0.0-rc3 compiled against Hadoop 2.3.0 cannot read HDFS 2.3.0? - posted by Nan Zhu <zh...@gmail.com> on 2014/05/03 19:04:25 UTC, 2 replies.
- spark run issue - posted by Weide Zhang <we...@gmail.com> on 2014/05/04 03:16:46 UTC, 4 replies.
- cache not work as expected for iteration? - posted by Earthson <Ea...@gmail.com> on 2014/05/04 05:16:49 UTC, 3 replies.
- Re: the spark configuage - posted by Sophia <sl...@163.com> on 2014/05/04 09:52:03 UTC, 0 replies.
- different in spark on yarn mode and standalone mode - posted by Sophia <sl...@163.com> on 2014/05/04 09:56:49 UTC, 6 replies.
- using kryo for spark.closure.serializer with a registrator doesn't work - posted by Soren Macbeth <so...@yieldbot.com> on 2014/05/04 10:37:11 UTC, 0 replies.
- sbt run with spark.ContextCleaner ERROR - posted by wxhsdp <wx...@gmail.com> on 2014/05/04 11:40:58 UTC, 6 replies.
- Re: SparkException: env SPARK_YARN_APP_JAR is not set - posted by phoenix bai <mi...@gmail.com> on 2014/05/04 11:58:37 UTC, 0 replies.
- NoSuchMethodError: breeze.linalg.DenseMatrix - posted by wxhsdp <wx...@gmail.com> on 2014/05/04 13:07:37 UTC, 9 replies.
- unsubscribe - posted by Nabeel Memon <nm...@gmail.com> on 2014/05/04 13:08:23 UTC, 15 replies.
- works disconnected with master but still keep alive - posted by Cheney Sun <su...@gmail.com> on 2014/05/04 17:21:50 UTC, 1 replies.
- Initial job has not accepted any resources - posted by pedro <sk...@gmail.com> on 2014/05/04 22:59:45 UTC, 3 replies.
- spark ec2 error - posted by Jeremy Freeman <fr...@gmail.com> on 2014/05/04 23:00:00 UTC, 3 replies.
- spark streaming question - posted by Weide Zhang <we...@gmail.com> on 2014/05/04 23:10:04 UTC, 2 replies.
- Re: compile spark 0.9.1 in hadoop 2.2 above exception - posted by arsingh <ar...@yahoo.com> on 2014/05/04 23:19:43 UTC, 0 replies.
- Error starting EC2 cluster - posted by Aliaksei Litouka <al...@gmail.com> on 2014/05/05 02:51:53 UTC, 2 replies.
- Re: KafkaInputDStream mapping of partitions to tasks - posted by Aries <ar...@gmail.com> on 2014/05/05 04:03:09 UTC, 0 replies.
- Re: master attempted to re-register the worker and then took all workers as unregistered - posted by Cheney Sun <su...@gmail.com> on 2014/05/05 04:15:39 UTC, 4 replies.
- what`s the meaning of primitive in "gradient descent primitive"? - posted by phoenix bai <mi...@gmail.com> on 2014/05/05 04:49:43 UTC, 1 replies.
- Re: pySpark memory usage - posted by Aaron Davidson <il...@gmail.com> on 2014/05/05 05:02:56 UTC, 5 replies.
- Cache issue for iteration with broadcast - posted by Earthson <Ea...@gmail.com> on 2014/05/05 05:15:15 UTC, 8 replies.
- Is any idea on architecture based on Spark + Spray + Akka - posted by ZhangYi <yi...@thoughtworks.com> on 2014/05/05 05:37:10 UTC, 3 replies.
- spark streaming kafka output - posted by Weide Zhang <we...@gmail.com> on 2014/05/05 08:45:56 UTC, 1 replies.
- Check your cluster UI to ensure that workers are registered and have sufficient memory - posted by Sai Prasanna <an...@gmail.com> on 2014/05/05 09:51:05 UTC, 0 replies.
- unsibscribe - posted by Konstantin Kudryavtsev <ku...@gmail.com> on 2014/05/05 10:42:56 UTC, 1 replies.
- java.io.FileNotFoundException: /test/spark-0.9.1/work/app-20140505053550-0000/2/stdout (No such file or directory) - posted by "Francis.Hu" <fr...@reachjunction.com> on 2014/05/05 12:06:10 UTC, 1 replies.
- Spark Streaming and JMS - posted by Patrick McGloin <mc...@gmail.com> on 2014/05/05 14:31:25 UTC, 2 replies.
- Re: Shark on cloudera CDH5 error - posted by manas Kar <Ma...@exactearth.com> on 2014/05/05 15:53:51 UTC, 0 replies.
- Spark GCE Script - posted by Akhil Das <ak...@sigmoidanalytics.com> on 2014/05/05 16:18:32 UTC, 7 replies.
- Re: Using google cloud storage for spark big data - posted by Akhil Das <ak...@sigmoidanalytics.com> on 2014/05/05 16:25:22 UTC, 0 replies.
- Caused by: java.lang.OutOfMemoryError: unable to create new native thread - posted by Soumya Simanta <so...@gmail.com> on 2014/05/05 17:32:28 UTC, 1 replies.
- RE: another updateStateByKey question - updated w possible Spark bug - posted by Adrian Mocanu <am...@verticalscope.com> on 2014/05/05 18:06:14 UTC, 1 replies.
- Comprehensive Port Configuration reference? - posted by Scott Clasen <sc...@gmail.com> on 2014/05/05 18:38:21 UTC, 8 replies.
- Problem with sharing class across worker nodes using spark-shell on Spark 1.0.0 - posted by Soumya Simanta <so...@gmail.com> on 2014/05/05 21:14:57 UTC, 0 replies.
- Local Dev Env with Mesos + Spark Streaming on Docker: Can't submit jobs. - posted by Gerard Maas <ge...@gmail.com> on 2014/05/05 22:11:59 UTC, 8 replies.
- Increase Stack Size Workers - posted by Andrea Esposito <an...@gmail.com> on 2014/05/05 23:20:55 UTC, 1 replies.
- Spark 0.9.1 - saveAsSequenceFile and large RDD - posted by Allen Lee <al...@mediacrossing.com> on 2014/05/05 23:47:33 UTC, 0 replies.
- How can adding a random count() change the behavior of my program? - posted by Nicholas Chammas <ni...@gmail.com> on 2014/05/06 02:52:09 UTC, 2 replies.
- 答复: java.io.FileNotFoundException: /test/spark-0.9.1/work/app-20140505053550-0000/2/stdout (No such file or directory) - posted by "Francis.Hu" <fr...@reachjunction.com> on 2014/05/06 04:13:03 UTC, 1 replies.
- How to use spark-submit - posted by Stephen Boesch <ja...@gmail.com> on 2014/05/06 04:24:36 UTC, 11 replies.
- details about event log - posted by wxhsdp <wx...@gmail.com> on 2014/05/06 04:27:48 UTC, 3 replies.
- 答复: 答复: java.io.FileNotFoundException: /test/spark-0.9.1/work/app-20140505053550-0000/2/stdout (No such file or directory) - posted by "Francis.Hu" <fr...@reachjunction.com> on 2014/05/06 04:31:15 UTC, 5 replies.
- Can I share RDD between a pyspark and spark API - posted by manas Kar <Ma...@exactearth.com> on 2014/05/06 05:02:34 UTC, 0 replies.
- about broadcast - posted by randylu <ra...@gmail.com> on 2014/05/06 06:06:21 UTC, 2 replies.
- Better option to use Querying in Spark - posted by prabeesh k <pr...@gmail.com> on 2014/05/06 07:52:50 UTC, 2 replies.
- run spark0.9.1 on yarn with hadoop CDH4 - posted by Sophia <sl...@163.com> on 2014/05/06 09:23:02 UTC, 3 replies.
- How can I run sbt? - posted by Sophia <sl...@163.com> on 2014/05/06 10:23:41 UTC, 1 replies.
- Re: Storage information about an RDD from the API - posted by Andras Nemeth <an...@lynxanalytics.com> on 2014/05/06 10:36:22 UTC, 0 replies.
- KryoSerializer Exception - posted by Andrea Esposito <an...@gmail.com> on 2014/05/06 12:05:37 UTC, 4 replies.
- If it due to my file has been breakdown? - posted by Sophia <sl...@163.com> on 2014/05/06 14:23:24 UTC, 2 replies.
- Spark and Java 8 - posted by Kristoffer Sjögren <st...@gmail.com> on 2014/05/06 15:16:50 UTC, 5 replies.
- No space left on device error when pulling data from s3 - posted by Han JU <ju...@gmail.com> on 2014/05/06 18:05:43 UTC, 4 replies.
- Re: is Mesos falling out of favor? - posted by deric <ba...@gmail.com> on 2014/05/06 18:42:58 UTC, 9 replies.
- logging in pyspark - posted by Diana Carroll <dc...@cloudera.com> on 2014/05/06 21:31:50 UTC, 4 replies.
- Spark Summit 2014 (Hotel suggestions) - posted by Jerry Lam <ch...@gmail.com> on 2014/05/06 21:32:22 UTC, 4 replies.
- maven for building scala simple program - posted by Laeeq Ahmed <la...@yahoo.com> on 2014/05/07 01:10:25 UTC, 2 replies.
- How to read a multipart s3 file? - posted by kamatsuoka <ke...@gmail.com> on 2014/05/07 02:19:07 UTC, 12 replies.
- Easy one - posted by Ian Ferreira <ia...@hotmail.com> on 2014/05/07 02:29:18 UTC, 1 replies.
- Re: Easy one - posted by Aaron Davidson <il...@gmail.com> on 2014/05/07 02:32:13 UTC, 2 replies.
- customized comparator in groupByKey - posted by Ameet Kini <am...@gmail.com> on 2014/05/07 02:46:21 UTC, 0 replies.
- Unable to load native-hadoop library problem - posted by Sophia <sl...@163.com> on 2014/05/07 03:25:09 UTC, 2 replies.
- Re: log4j question - posted by Sophia <sl...@163.com> on 2014/05/07 05:09:26 UTC, 2 replies.
- spark+mesos: configure mesos 'callback' port? - posted by Scott Clasen <sc...@gmail.com> on 2014/05/07 06:39:34 UTC, 2 replies.
- No configuration setting found for key 'akka.zeromq' - posted by "Francis.Hu" <fr...@reachjunction.com> on 2014/05/07 07:02:40 UTC, 0 replies.
- Is there anything that I need to modify? - posted by Sophia <sl...@163.com> on 2014/05/07 09:20:50 UTC, 1 replies.
- spark-env.sh do not take effect. - posted by lihu <li...@gmail.com> on 2014/05/07 10:40:03 UTC, 0 replies.
- os buffer cache does not cache shuffle output file - posted by wxhsdp <wx...@gmail.com> on 2014/05/07 11:40:11 UTC, 3 replies.
- Preferred RDD Size - posted by Sai Prasanna <an...@gmail.com> on 2014/05/07 12:52:32 UTC, 1 replies.
- problem about broadcast variable in iteration - posted by randylu <ra...@gmail.com> on 2014/05/07 15:39:20 UTC, 6 replies.
- Average of each RDD in Stream - posted by Laeeq Ahmed <la...@yahoo.com> on 2014/05/07 16:59:46 UTC, 4 replies.
- Caching in graphX - posted by Franco Avi <fr...@gmail.com> on 2014/05/07 18:15:23 UTC, 1 replies.
- Spark's Behavior 2 - posted by Eduardo Costa Alfaia <e....@unibs.it> on 2014/05/07 18:33:59 UTC, 0 replies.
- Turn BLAS on MacOSX - posted by Debasish Das <de...@gmail.com> on 2014/05/07 19:52:13 UTC, 6 replies.
- Doubts regarding Shark - posted by vinay Bajaj <vb...@gmail.com> on 2014/05/07 21:12:41 UTC, 3 replies.
- cant get tests to pass anymore on master master - posted by Koert Kuipers <ko...@tresata.com> on 2014/05/07 23:01:23 UTC, 5 replies.
- Real world - posted by Ian Ferreira <ia...@hotmail.com> on 2014/05/08 02:05:35 UTC, 1 replies.
- Re: 0.9 wont start cluster on ec2, SSH connection refused? - posted by wxhsdp <wx...@gmail.com> on 2014/05/08 04:21:10 UTC, 0 replies.
- Is there a way to load a large file from HDFS faster into Spark - posted by Soumya Simanta <so...@gmail.com> on 2014/05/08 04:27:24 UTC, 3 replies.
- Reading from .bz2 files with Spark - posted by Andrew Ash <an...@andrewash.com> on 2014/05/08 06:19:58 UTC, 9 replies.
- Schema view of HadoopRDD - posted by Debasish Das <de...@gmail.com> on 2014/05/08 07:09:52 UTC, 6 replies.
- ERROR: Unknown Spark version - posted by wxhsdp <wx...@gmail.com> on 2014/05/08 10:49:07 UTC, 2 replies.
- NotSerializableException in Spark Streaming - posted by Diana Carroll <dc...@cloudera.com> on 2014/05/08 12:37:09 UTC, 0 replies.
- Spark to utilize HDFS's mmap caching - posted by Chanwit Kaewkasi <ch...@gmail.com> on 2014/05/08 13:43:17 UTC, 6 replies.
- File present but file not found exception - posted by Sai Prasanna <an...@gmail.com> on 2014/05/08 20:48:53 UTC, 3 replies.
- pyspark python exceptions / py4j exceptions - posted by Patrick Donovan <pa...@jadedpixel.com> on 2014/05/08 21:52:05 UTC, 0 replies.
- same log4j slf4j error in spark 9.1 - posted by Adrian Mocanu <am...@verticalscope.com> on 2014/05/08 21:56:01 UTC, 4 replies.
- Re: Task not serializable? - posted by pedro <sk...@gmail.com> on 2014/05/08 23:49:38 UTC, 0 replies.
- Creating time-sequential pairs - posted by Nicholas Pritchard <ni...@falkonry.com> on 2014/05/09 00:04:42 UTC, 1 replies.
- Re: Spark temp dir (spark.local.dir) - posted by Scott Clasen <sc...@gmail.com> on 2014/05/09 01:49:20 UTC, 0 replies.
- Is there any problem on the spark mailing list? - posted by Cheney Sun <su...@gmail.com> on 2014/05/09 03:35:19 UTC, 7 replies.
- [Suggestion]Strange behavior for broadcast cleaning with spark 0.9 - posted by Earthson <Ea...@gmail.com> on 2014/05/09 04:18:00 UTC, 0 replies.
- Variables outside of mapPartitions scope - posted by pedro <sk...@gmail.com> on 2014/05/09 10:06:16 UTC, 3 replies.
- spark 0.9.1 textFile hdfs unknown host exception - posted by Eugen Cepoi <ce...@gmail.com> on 2014/05/09 10:51:38 UTC, 1 replies.
- Taking value out from Dstream for each RDD - posted by Laeeq Ahmed <la...@yahoo.com> on 2014/05/09 11:53:39 UTC, 0 replies.
- Not getting mails from user group - posted by Laeeq Ahmed <la...@yahoo.com> on 2014/05/09 12:18:05 UTC, 0 replies.
- spark on yarn-standalone, throws StackOverflowError and fails somtimes and succeed for the rest - posted by phoenix bai <mi...@gmail.com> on 2014/05/09 12:45:58 UTC, 3 replies.
- Using String Dataset for Logistic Regression - posted by praveshjain1991 <pr...@gmail.com> on 2014/05/09 14:31:56 UTC, 4 replies.
- Re: slf4j and log4j loop - posted by amoc <am...@verticalscope.com> on 2014/05/09 16:13:10 UTC, 1 replies.
- writing my own RDD - posted by Koert Kuipers <ko...@tresata.com> on 2014/05/09 20:11:28 UTC, 3 replies.
- executor processes are still there even I killed the app and the workers - posted by Nan Zhu <zh...@gmail.com> on 2014/05/09 21:51:05 UTC, 0 replies.
- A new resource for getting examples of Spark RDD API calls - posted by zhen <z....@latrobe.edu.au> on 2014/05/09 23:54:53 UTC, 4 replies.
- Bug when zip with longs and too many partitions? - posted by Michael Malak <mi...@yahoo.com> on 2014/05/10 02:51:49 UTC, 1 replies.
- application detail ui can not open on ec2 - posted by wxhsdp <wx...@gmail.com> on 2014/05/10 05:11:40 UTC, 0 replies.
- time exhausted in BlockFetcher - posted by wxhsdp <wx...@gmail.com> on 2014/05/11 02:52:13 UTC, 0 replies.
- Spark LIBLINEAR - posted by Chieh-Yen <r0...@csie.ntu.edu.tw> on 2014/05/11 10:49:55 UTC, 7 replies.
- java.lang.NoSuchMethodError on Java API - posted by Alessandro De Carli <de...@gmail.com> on 2014/05/11 11:07:23 UTC, 5 replies.
- Re: Test - posted by Azuryy <az...@gmail.com> on 2014/05/11 17:09:03 UTC, 1 replies.
- Re: why is Spark 0.9.1 (context creation?) so slow on my OSX laptop? - posted by Madhu <ma...@madhu.com> on 2014/05/11 19:10:26 UTC, 2 replies.
- streaming on hdfs can detected all new file, but the sum of all the rdd.count() not equals which had detected - posted by zzzzzqf12345 <zz...@gmail.com> on 2014/05/12 04:30:43 UTC, 3 replies.
- Driver process succeed exiting but web UI shows FAILED - posted by Cheney Sun <su...@gmail.com> on 2014/05/12 05:05:11 UTC, 0 replies.
- build shark(hadoop CDH5) on hadoop2.0.0 CDH4 - posted by Sophia <sl...@163.com> on 2014/05/12 05:29:23 UTC, 2 replies.
- about spark interactive shell - posted by fengshen <lw...@gmail.com> on 2014/05/12 05:43:43 UTC, 1 replies.
- Client cannot authenticate via:[TOKEN] - posted by innowireless TaeYun Kim <ta...@innowireless.co.kr> on 2014/05/12 10:31:23 UTC, 0 replies.
- How to run shark? - posted by Sophia <sl...@163.com> on 2014/05/12 10:37:39 UTC, 3 replies.
- Job failed: java.io.NotSerializableException: org.apache.spark.SparkContext - posted by yh18190 <yh...@gmail.com> on 2014/05/12 11:27:26 UTC, 2 replies.
- Proper way to stop Spark stream processing - posted by Tobias Pfeiffer <tg...@preferred.jp> on 2014/05/12 11:49:27 UTC, 1 replies.
- Spark on Yarn - A small issue ! - posted by Sai Prasanna <an...@gmail.com> on 2014/05/12 12:37:47 UTC, 1 replies.
- missing method in my slf4j after excluding Spark ZK log4j - posted by Adrian Mocanu <am...@verticalscope.com> on 2014/05/12 16:51:39 UTC, 4 replies.
- Packaging a spark job using maven - posted by Laurent Thoulon <la...@ldmobile.net> on 2014/05/12 17:41:53 UTC, 7 replies.
- Forcing spark to send exactly one element to each worker node - posted by NevinLi158 <Ne...@gmail.com> on 2014/05/12 19:29:22 UTC, 5 replies.
- Distribute jar dependencies via sc.AddJar(fileName) - posted by DB Tsai <db...@stanford.edu> on 2014/05/12 20:14:25 UTC, 9 replies.
- Dead lock running multiple Spark Jobs on Mesos - posted by Martin Weindel <ma...@gmail.com> on 2014/05/12 21:29:23 UTC, 1 replies.
- Dead lock running multiple Spark jobs on Mesos - posted by Martin Weindel <ma...@gmail.com> on 2014/05/12 21:37:02 UTC, 5 replies.
- Is their a way to Create SparkContext object? - posted by yh18190 <yh...@gmail.com> on 2014/05/12 21:37:41 UTC, 3 replies.
- Accuracy in mllib BinaryClassificationMetrics - posted by Debasish Das <de...@gmail.com> on 2014/05/12 22:26:12 UTC, 1 replies.
- java.lang.StackOverflowError when calling count() - posted by Guanhua Yan <gh...@lanl.gov> on 2014/05/13 00:42:42 UTC, 9 replies.
- Unexpected results when caching data - posted by paul <pa...@datalogix.com> on 2014/05/13 00:46:25 UTC, 0 replies.
- Serializable different behavior Spark Shell vs. Scala Shell - posted by Michael Malak <mi...@yahoo.com> on 2014/05/13 02:02:35 UTC, 0 replies.
- something about pipeline - posted by wxhsdp <wx...@gmail.com> on 2014/05/13 03:42:29 UTC, 0 replies.
- - posted by "Herman, Matt (CORP)" <Ma...@ADP.com> on 2014/05/13 15:32:20 UTC, 0 replies.
- filling missing values in a sequence - posted by Mohit Jaggi <mo...@gmail.com> on 2014/05/13 17:56:59 UTC, 5 replies.
- 1.0.0 Release Date? - posted by bhusted <br...@gmail.com> on 2014/05/13 18:40:39 UTC, 3 replies.
- accessing partition i+1 from mapper of partition i - posted by Mohit Jaggi <mo...@gmail.com> on 2014/05/14 06:31:00 UTC, 4 replies.
- how to speed the count operation - posted by lihu <li...@gmail.com> on 2014/05/14 07:25:00 UTC, 3 replies.
- How to use Mahout VectorWritable in Spark. - posted by Stuti Awasthi <st...@hcl.com> on 2014/05/14 07:37:45 UTC, 8 replies.
- EndpointWriter: AssociationError - posted by Laurent Thoulon <la...@ldmobile.net> on 2014/05/14 10:02:44 UTC, 1 replies.
- Proper way to create standalone app with custom Spark version - posted by Andrei <fa...@gmail.com> on 2014/05/14 10:30:04 UTC, 2 replies.
- saveAsTextFile with replication factor in HDFS - posted by Sai Prasanna <an...@gmail.com> on 2014/05/14 11:37:20 UTC, 0 replies.
- Spark unit testing best practices - posted by Andras Nemeth <an...@lynxanalytics.com> on 2014/05/14 12:34:55 UTC, 5 replies.
- Express VMs - good idea? - posted by Marco Shaw <ma...@gmail.com> on 2014/05/14 14:00:50 UTC, 4 replies.
- Understanding epsilon in KMeans - posted by Stuti Awasthi <st...@hcl.com> on 2014/05/14 14:50:11 UTC, 6 replies.
- problem with hdfs access in spark job - posted by Marcin Cylke <ma...@ext.allegro.pl> on 2014/05/14 16:22:07 UTC, 2 replies.
- little confused about SPARK_JAVA_OPTS alternatives - posted by Koert Kuipers <ko...@tresata.com> on 2014/05/14 18:09:46 UTC, 2 replies.
- Worker re-spawn and dynamic node joining - posted by Han JU <ju...@gmail.com> on 2014/05/14 18:22:37 UTC, 4 replies.
- Re: Class not found in Kafka-Stream due to multi-thread without correct ClassLoader? - posted by n0rb3rt <jn...@gmail.com> on 2014/05/14 18:47:41 UTC, 0 replies.
- Stable Hadoop version supported ? - posted by Soumya Simanta <so...@gmail.com> on 2014/05/14 21:17:51 UTC, 2 replies.
- How to run the SVM and LogisticRegression - posted by yxzhao <yx...@ualr.edu> on 2014/05/14 22:36:57 UTC, 8 replies.
- Hadoop 2.3 Centralized Cache vs RDD - posted by William Kang <we...@gmail.com> on 2014/05/14 23:30:08 UTC, 2 replies.
- Re: instalation de spark - posted by Madhu <ma...@madhu.com> on 2014/05/15 00:04:11 UTC, 0 replies.
- Re: Hadoop Writable and Spark serialization - posted by Madhu <ma...@madhu.com> on 2014/05/15 03:02:02 UTC, 1 replies.
- Equivalent of collect() on DStream - posted by Stephen Boesch <ja...@gmail.com> on 2014/05/15 06:25:34 UTC, 4 replies.
- Re: SparkContext startup time out - posted by Sophia <sl...@163.com> on 2014/05/15 09:46:39 UTC, 2 replies.
- Spark workers keep getting disconnected(Keep dying) from the cluster. - posted by Ravi Hemnani <ra...@gmail.com> on 2014/05/15 10:04:03 UTC, 1 replies.
- Efficient implementation of getting top 10 hashtags in last 5 mins window - posted by nilmish <ni...@gmail.com> on 2014/05/15 10:33:18 UTC, 1 replies.
- Advanced log processing - posted by Laurent T <la...@ldmobile.net> on 2014/05/15 11:26:58 UTC, 3 replies.
- Standalone client failing with docker deployed cluster - posted by Bharath Ravi Kumar <re...@gmail.com> on 2014/05/15 13:38:47 UTC, 1 replies.
- Historical Data as Stream - posted by Laeeq Ahmed <la...@yahoo.com> on 2014/05/15 15:52:59 UTC, 4 replies.
- Workers unable to find class, even when in the SparkConf JAR list - posted by Robert James <sr...@gmail.com> on 2014/05/15 17:34:23 UTC, 0 replies.
- java serialization errors with spark.files.userClassPathFirst=true - posted by Koert Kuipers <ko...@tresata.com> on 2014/05/15 21:03:17 UTC, 5 replies.
- count()-ing gz files gives java.io.IOException: incorrect header check - posted by Nick Chammas <ni...@gmail.com> on 2014/05/15 22:15:43 UTC, 10 replies.
- Problem when sorting big file - posted by Gustavo Enrique Salazar Torres <gs...@ime.usp.br> on 2014/05/15 23:55:19 UTC, 1 replies.
- What does Spark cache() actually do? - posted by PengWeiPRC <pe...@gmx.com> on 2014/05/16 01:09:40 UTC, 0 replies.
- How to pass config variables to workers - posted by srobertjames <sr...@gmail.com> on 2014/05/16 02:06:48 UTC, 3 replies.
- increase the akka.frameSize lead to Lost Executor - posted by lihu <li...@gmail.com> on 2014/05/16 05:12:17 UTC, 1 replies.
- help me: Out of memory when spark streaming - posted by "Francis.Hu" <fr...@reachjunction.com> on 2014/05/16 10:41:22 UTC, 0 replies.
- Calling external classes added by sc.addJar needs to be through reflection - posted by DB Tsai <db...@stanford.edu> on 2014/05/16 10:46:34 UTC, 0 replies.
- Benchmarking Spark with YCSB - posted by bhusted <br...@gmail.com> on 2014/05/16 12:37:37 UTC, 3 replies.
- JavaNetworkWordCount - posted by Eduardo Costa Alfaia <e....@unibs.it> on 2014/05/16 14:41:44 UTC, 1 replies.
- Spark with Drill - posted by "N.Venkata Naga Ravi" <nv...@hotmail.com> on 2014/05/16 17:08:39 UTC, 0 replies.
- Counting things only once - posted by Daniel Siegmann <da...@velos.io> on 2014/05/16 18:05:26 UTC, 2 replies.
- advice on maintaining a production spark cluster? - posted by Josh Marcus <jm...@meetup.com> on 2014/05/16 18:53:56 UTC, 13 replies.
- Error while launching ec2 spark cluster with HVM (r3.large) - posted by Usman Ghani <us...@platfora.com> on 2014/05/16 21:22:36 UTC, 3 replies.
- spark-submit / S3 - posted by Nick Pentreath <ni...@gmail.com> on 2014/05/16 21:53:05 UTC, 0 replies.
- Passing runtime config to workers? - posted by Robert James <sr...@gmail.com> on 2014/05/16 22:59:07 UTC, 3 replies.
- What is the difference between a Spark Worker and a Spark Slave? - posted by Robert James <sr...@gmail.com> on 2014/05/16 23:02:45 UTC, 1 replies.
- Nested method in a class: Task not serializable? - posted by Pierre B <pi...@realimpactanalytics.com> on 2014/05/16 23:15:55 UTC, 0 replies.
- Debugging Spark AWS S3 - posted by Robert James <sr...@gmail.com> on 2014/05/16 23:37:47 UTC, 1 replies.
- breeze DGEMM slow in spark - posted by wxhsdp <wx...@gmail.com> on 2014/05/17 08:57:01 UTC, 10 replies.
- Using mongo with PySpark - posted by Samarth Mailinglist <ma...@gmail.com> on 2014/05/17 10:37:34 UTC, 4 replies.
- Apache Spark Throws java.lang.IllegalStateException: unread block data - posted by sam <sa...@gmail.com> on 2014/05/17 12:14:58 UTC, 0 replies.
- Spark and Solr indexing - posted by Flavio Pompermaier <po...@okkam.it> on 2014/05/17 21:42:07 UTC, 0 replies.
- Configuring Spark for reduceByKey on on massive data sets - posted by Daniel Mahler <dm...@gmail.com> on 2014/05/17 23:52:26 UTC, 4 replies.
- Benchmarking Graphx - posted by Hari <ha...@yahoo.com> on 2014/05/17 23:59:15 UTC, 1 replies.
- Text file and shuffle - posted by Puneet Lakhina <pu...@gmail.com> on 2014/05/18 04:41:59 UTC, 1 replies.
- java.lang.NoClassDefFoundError: org/apache/spark/deploy/worker/Worker - posted by Hao Wang <wh...@gmail.com> on 2014/05/18 07:52:10 UTC, 1 replies.
- Re: File list read into single RDD - posted by Pat Ferrel <pa...@gmail.com> on 2014/05/18 18:13:43 UTC, 2 replies.
- IllegelAccessError when writing to HBase? - posted by Nan Zhu <zh...@gmail.com> on 2014/05/18 22:18:49 UTC, 1 replies.
- unsubscribe - posted by Terje Berg-Hansen <te...@axenna.com> on 2014/05/18 22:38:35 UTC, 0 replies.
- First sample with Spark Streaming and three Time's? - posted by Jacek Laskowski <ja...@japila.pl> on 2014/05/18 23:29:57 UTC, 0 replies.
- Spark Shell stuck on standalone mode - posted by Sidharth Kashyap <si...@outlook.com> on 2014/05/19 01:02:06 UTC, 4 replies.
- making spark/conf/spark-defaults.conf changes take effect - posted by Daniel Mahler <dm...@gmail.com> on 2014/05/19 01:56:30 UTC, 1 replies.
- sync master with slaves with bittorrent? - posted by Daniel Mahler <dm...@gmail.com> on 2014/05/19 06:22:55 UTC, 11 replies.
- For performance, Spark prefers OracleJDK or OpenJDK? - posted by Hao Wang <wh...@gmail.com> on 2014/05/19 08:50:57 UTC, 2 replies.
- persist @ disk-only failing - posted by Sai Prasanna <an...@gmail.com> on 2014/05/19 09:41:26 UTC, 4 replies.
- How to compile the examples directory? - posted by Hao Wang <wh...@gmail.com> on 2014/05/19 13:53:55 UTC, 1 replies.
- specifying worker nodes when using the repl? - posted by Eric Friedman <er...@spottedsnake.net> on 2014/05/19 17:08:52 UTC, 2 replies.
- Re: Yarn configuration file doesn't work when run with yarn-client mode - posted by Arun Ahuja <aa...@gmail.com> on 2014/05/19 18:07:05 UTC, 6 replies.
- OutofMemory: Failed on spark/examples/bagel/WikipediaPageRank.scala - posted by Hao Wang <wh...@gmail.com> on 2014/05/19 18:59:36 UTC, 0 replies.
- Spark Streaming and Shark | Streaming Taking All CPUs - posted by "anishsneh@yahoo.co.in" <an...@yahoo.co.in> on 2014/05/19 19:43:36 UTC, 2 replies.
- spark ec2 commandline tool error "VPC security groups may not be used for a non-VPC launch" - posted by Matt Work Coarr <ma...@gmail.com> on 2014/05/19 19:58:43 UTC, 0 replies.
- Which component(s) of Spark do not support IPv6? - posted by queniesun <qu...@tekcomms.com> on 2014/05/19 20:58:48 UTC, 0 replies.
- Checkpoint serialization - posted by Vadim Chekan <ko...@gmail.com> on 2014/05/20 01:36:04 UTC, 0 replies.
- combinebykey throw classcastexception - posted by xiemeilong <xi...@gmail.com> on 2014/05/20 03:05:45 UTC, 2 replies.
- Spark stalling during shuffle (maybe a memory issue) - posted by "jonathan.keebler" <jk...@gmail.com> on 2014/05/20 05:39:02 UTC, 6 replies.
- life if an executor - posted by Koert Kuipers <ko...@tresata.com> on 2014/05/20 05:44:03 UTC, 7 replies.
- facebook data mining with Spark - posted by Joe L <se...@yahoo.com> on 2014/05/20 06:07:26 UTC, 2 replies.
- Setting queue for spark job on yarn - posted by Ron Gonzalez <zl...@yahoo.com> on 2014/05/20 06:29:47 UTC, 3 replies.
- Status stays at ACCEPTED - posted by Jan Holmberg <ja...@perigeum.fi> on 2014/05/20 08:43:57 UTC, 3 replies.
- rdd.map() can't pass parameters - posted by zzzzzqf12345 <zz...@gmail.com> on 2014/05/20 09:52:59 UTC, 0 replies.
- Problem with loading files: Loss was due to java.io.EOFException java.io.EOFException - posted by hakanilter <ha...@gmail.com> on 2014/05/20 10:11:22 UTC, 1 replies.
- question about the license of akka and Spark - posted by YouPeng Yang <yy...@gmail.com> on 2014/05/20 11:16:55 UTC, 3 replies.
- Ignoring S3 0 files exception - posted by Laurent T <la...@ldmobile.net> on 2014/05/20 15:51:26 UTC, 3 replies.
- issue with Scala, Spark and Akka - posted by Greg <gr...@zooniverse.org> on 2014/05/20 17:01:53 UTC, 1 replies.
- reading large XML files - posted by Nathan Kronenfeld <nk...@oculusinfo.com> on 2014/05/20 17:25:00 UTC, 4 replies.
- Evaluating Spark just for Cluster Computing - posted by pcutil <pu...@gmail.com> on 2014/05/20 19:26:10 UTC, 1 replies.
- Spark and Hadoop - posted by pcutil <pu...@gmail.com> on 2014/05/20 19:43:10 UTC, 2 replies.
- Spark Streaming using Flume body size limitation - posted by lemieud <da...@radialpoint.com> on 2014/05/20 21:39:19 UTC, 3 replies.
- java.lang.NoClassDefFoundError: org/apache/hadoop/io/Writable - posted by pcutil <pu...@gmail.com> on 2014/05/20 21:54:16 UTC, 0 replies.
- Imports that need to be specified in a Spark application jar? - posted by Shivani Rao <ra...@gmail.com> on 2014/05/20 22:18:38 UTC, 0 replies.
- Spark Performace Comparison Spark on YARN vs Spark Standalone - posted by "anishsneh@yahoo.co.in" <an...@yahoo.co.in> on 2014/05/20 23:46:35 UTC, 1 replies.
- Python, Spark and HBase - posted by twizansk <tw...@gmail.com> on 2014/05/21 01:21:29 UTC, 8 replies.
- Using Spark to analyze complex JSON - posted by Nick Chammas <ni...@gmail.com> on 2014/05/21 03:46:26 UTC, 11 replies.
- Unsubscribe - posted by "A.Khanolkar" <an...@gmail.com> on 2014/05/21 03:56:26 UTC, 2 replies.
- How to Unsubscribe from the Spark user list - posted by Nick Chammas <ni...@gmail.com> on 2014/05/21 04:11:45 UTC, 2 replies.
- IllegalStateException when creating Job from shell - posted by Alex Holmes <gr...@gmail.com> on 2014/05/21 04:37:37 UTC, 0 replies.
- any way to control memory usage when streaming input's speed is faster than the speed of handled by spark streaming ? - posted by "Francis.Hu" <fr...@reachjunction.com> on 2014/05/21 05:31:05 UTC, 2 replies.
- MLlib ALS-- Errors communicating with MapOutputTracker - posted by Sue Cai <ca...@hotmail.co.uk> on 2014/05/21 09:32:40 UTC, 0 replies.
- ClassNotFoundException with Spark/Mesos (spark-shell works fine) - posted by Tobias Pfeiffer <tg...@preferred.jp> on 2014/05/21 11:51:41 UTC, 8 replies.
- RDD union of a window in Dstream - posted by Laeeq Ahmed <la...@yahoo.com> on 2014/05/21 14:42:13 UTC, 3 replies.
- Log analysis - posted by Shubhabrata <ma...@gmail.com> on 2014/05/21 14:57:08 UTC, 0 replies.
- pyspark.rdd.ResultIterable? - posted by "T.J. Alumbaugh" <tj...@continuum.io> on 2014/05/21 17:15:31 UTC, 0 replies.
- Is spark 1.0.0 "spark-shell --master=yarn" running in yarn-cluster mode or yarn-client mode? - posted by Andrew Lee <al...@hotmail.com> on 2014/05/21 19:57:52 UTC, 2 replies.
- Job Processing Large Data Set Got Stuck - posted by yxzhao <yx...@ualr.edu> on 2014/05/21 20:23:09 UTC, 3 replies.
- ExternalAppendOnlyMap: Spilling in-memory map - posted by Mohit Jaggi <mo...@gmail.com> on 2014/05/21 20:35:14 UTC, 3 replies.
- tests that run locally fail when run through bamboo - posted by Adrian Mocanu <am...@verticalscope.com> on 2014/05/21 22:58:10 UTC, 2 replies.
- Inconsistent RDD Sample size - posted by glxc <r....@gmail.com> on 2014/05/21 23:05:08 UTC, 1 replies.
- I want to filter a stream by a subclass. - posted by Ian Holsman <ia...@holsman.com.au> on 2014/05/22 00:28:30 UTC, 3 replies.
- Run Apache Spark on Mini Cluster - posted by Upender Nimbekar <up...@gmail.com> on 2014/05/22 02:14:58 UTC, 2 replies.
- yarn-client mode question - posted by Sophia <sl...@163.com> on 2014/05/22 03:17:58 UTC, 4 replies.
- Re: Failed RC-10 yarn-cluster job for FS closed error when cleaning up staging directory - posted by Tom Graves <tg...@yahoo.com> on 2014/05/22 05:35:21 UTC, 1 replies.
- Best way to deploy a jar to spark cluster? - posted by Min Li <li...@gmail.com> on 2014/05/22 07:51:09 UTC, 0 replies.
- java.io.IOException: Failed to save output of task - posted by Grega Kešpret <gr...@celtra.com> on 2014/05/22 08:18:57 UTC, 1 replies.
- Spark Streaming on Mesos, various questions - posted by Tobias Pfeiffer <tg...@preferred.jp> on 2014/05/22 09:10:38 UTC, 0 replies.
- Spark on HBase vs. Spark on HDFS - posted by "Limbeck, Philip" <Ph...@automic.com> on 2014/05/22 09:33:26 UTC, 2 replies.
- Spark Streaming Error: SparkException: Error sending message to BlockManagerMaster - posted by Sourav Chandra <so...@livestream.com> on 2014/05/22 10:07:52 UTC, 0 replies.
- SparkContext#stop - posted by Piotr Kołaczkowski <pk...@datastax.com> on 2014/05/22 10:32:01 UTC, 2 replies.
- Workers disconnected from master sometimes and never reconnect back - posted by Piotr Kołaczkowski <pk...@datastax.com> on 2014/05/22 10:39:11 UTC, 0 replies.
- how to set task number? - posted by qingyang li <li...@gmail.com> on 2014/05/22 11:50:06 UTC, 12 replies.
- GraphX partition problem - posted by "Zhicharevich, Alex" <az...@ebay.com> on 2014/05/22 13:53:00 UTC, 8 replies.
- Drop shark Cache - posted by vinay Bajaj <vb...@gmail.com> on 2014/05/22 14:19:49 UTC, 0 replies.
- reading task failed 4 times, for unknown reason - posted by "Kostakis, Orestis" <or...@f-secure.com> on 2014/05/22 14:47:06 UTC, 0 replies.
- spark setting maximum available memory - posted by İbrahim Rıza HALLAÇ <ib...@live.com> on 2014/05/22 16:23:26 UTC, 2 replies.
- Computing cosine similiarity using pyspark - posted by jamal sasha <ja...@gmail.com> on 2014/05/22 16:49:08 UTC, 3 replies.
- ETL and workflow management on Spark - posted by William Kang <we...@gmail.com> on 2014/05/22 16:49:34 UTC, 2 replies.
- Use SparkListener to get overall progress of an action - posted by Pierre B <pi...@realimpactanalytics.com> on 2014/05/22 16:51:29 UTC, 11 replies.
- Shark resilience to unusable slaves - posted by Yana Kadiyska <ya...@gmail.com> on 2014/05/22 17:34:34 UTC, 1 replies.
- controlling the time in spark-streaming - posted by Ian Holsman <ia...@holsman.com.au> on 2014/05/22 17:38:37 UTC, 1 replies.
- Akka disassociation on Java SE Embedded - posted by Chanwit Kaewkasi <ch...@gmail.com> on 2014/05/22 18:19:37 UTC, 3 replies.
- How to turn off MetadataCleaner? - posted by Adrian Mocanu <am...@verticalscope.com> on 2014/05/22 18:24:32 UTC, 2 replies.
- Fwd: Spark Streaming: Flume stream not found - posted by Andy Konwinski <an...@gmail.com> on 2014/05/22 18:46:04 UTC, 0 replies.
- Spark Streaming with Kafka | Check if DStream is Empty | HDFS Write - posted by Anish Sneh <an...@yahoo.co.in> on 2014/05/22 19:13:41 UTC, 0 replies.
- Spark Job Server first steps - posted by Gerard Maas <ge...@gmail.com> on 2014/05/22 19:25:35 UTC, 2 replies.
- Spark / YARN classpath issues - posted by Jon Bender <jo...@gmail.com> on 2014/05/22 20:38:10 UTC, 3 replies.
- How to Run Machine Learning Examples - posted by yxzhao <yx...@ualr.edu> on 2014/05/22 21:48:17 UTC, 5 replies.
- Unable to run a Standalone job - posted by Shrikar archak <sh...@gmail.com> on 2014/05/22 23:27:59 UTC, 8 replies.
- Broadcast Variables - posted by Puneet Lakhina <pu...@gmail.com> on 2014/05/23 03:58:07 UTC, 1 replies.
- Unsubscribe - posted by Donna-M Fernandez <do...@metistream.com> on 2014/05/23 04:01:20 UTC, 1 replies.
- java.lang.OutOfMemoryError while running Shark on Mesos - posted by prabeesh k <pr...@gmail.com> on 2014/05/23 06:52:14 UTC, 1 replies.
- Shuffle file consolidation - posted by Nathan Kronenfeld <nk...@oculusinfo.com> on 2014/05/23 16:00:10 UTC, 3 replies.
- credential in UserGroupInformation - posted by Hollyen Edison <12...@gmail.com> on 2014/05/23 17:59:07 UTC, 0 replies.
- RDD values and defensive copying - posted by Allen Chang <al...@yahoo.com> on 2014/05/23 23:55:53 UTC, 0 replies.
- Setting spark.akka.frameSize - posted by MattSills <ma...@gmail.com> on 2014/05/24 00:08:33 UTC, 1 replies.
- Invalid Class Exception - posted by Suman Somasundar <su...@oracle.com> on 2014/05/24 00:20:01 UTC, 4 replies.
- Trying to run Spark on Yarn - posted by zsterone <zs...@hotmail.com> on 2014/05/24 04:18:54 UTC, 0 replies.
- Sources for kafka-0.7.2-spark - posted by Stephen Boesch <ja...@gmail.com> on 2014/05/24 05:19:12 UTC, 0 replies.
- Seattle Spark Meetup: xPatterns Slides and @pacoid session next week! - posted by Denny Lee <de...@gmail.com> on 2014/05/24 05:58:21 UTC, 0 replies.
- Working with Avro Generic Records in the interactive scala shell - posted by Jeremy Lewi <je...@lewi.us> on 2014/05/24 06:15:42 UTC, 6 replies.
- can communication and computation be overlapped in spark? - posted by wxhsdp <wx...@gmail.com> on 2014/05/24 09:43:09 UTC, 1 replies.
- Issue with the parallelize method in SparkContext - posted by Wisc Forum <wi...@gmail.com> on 2014/05/24 16:18:45 UTC, 1 replies.
- Custom Accumulator: Type Mismatch Error - posted by "Muttineni, Vinay" <vm...@ebay.com> on 2014/05/25 02:33:12 UTC, 0 replies.
- com.google.protobuf out of memory - posted by Zuhair Khayyat <zu...@gmail.com> on 2014/05/25 10:15:54 UTC, 1 replies.
- PySpark & Mesos random crashes - posted by Perttu Ranta-aho <pe...@gmail.com> on 2014/05/25 21:10:30 UTC, 2 replies.
- counting degrees graphx - posted by dizzy5112 <da...@gmail.com> on 2014/05/26 03:34:07 UTC, 9 replies.
- Fails: Spark sbt/sbt publish local - posted by ABHISHEK <ab...@gmail.com> on 2014/05/26 04:51:48 UTC, 3 replies.
- Re: Running a spark-submit compatible app in spark-shell - posted by Perttu Ranta-aho <ra...@iki.fi> on 2014/05/26 10:35:49 UTC, 2 replies.
- Re: Using PySpark for Streaming - posted by "gaurav.dasgupta" <ga...@gmail.com> on 2014/05/26 11:20:18 UTC, 0 replies.
- maprfs and spark libraries - posted by nelson <ne...@ysance.com> on 2014/05/26 11:48:10 UTC, 8 replies.
- K-nearest neighbors search in Spark - posted by Carter <gy...@hotmail.com> on 2014/05/26 14:06:13 UTC, 5 replies.
- Sorting data large data- "too many open files" exception - posted by Matt Kielo <mk...@oculusinfo.com> on 2014/05/26 16:18:28 UTC, 1 replies.
- Context switch in spark - posted by Andras Nemeth <an...@lynxanalytics.com> on 2014/05/26 17:01:15 UTC, 0 replies.
- Map failed [dupliacte 1] error - posted by Joe L <se...@yahoo.com> on 2014/05/27 10:47:19 UTC, 0 replies.
- Re: how to control task number? - posted by qingyang li <li...@gmail.com> on 2014/05/27 11:05:48 UTC, 0 replies.
- too many temporary app files left after app finished - posted by Cheney Sun <su...@gmail.com> on 2014/05/27 11:22:05 UTC, 0 replies.
- Re: KryoDeserialization getting java.io.EOFException - posted by jaranda <jo...@bsc.es> on 2014/05/27 11:49:57 UTC, 0 replies.
- Re: spark table to hive table - posted by John Omernik <jo...@omernik.com> on 2014/05/27 12:28:33 UTC, 1 replies.
- Spark streaming issue - posted by Sourav Chandra <so...@livestream.com> on 2014/05/27 13:37:16 UTC, 0 replies.
- Spark On Mesos - posted by Gileny <gp...@hp.com> on 2014/05/27 15:36:16 UTC, 0 replies.
- Persist and unpersist - posted by Daniel Darabos <da...@lynxanalytics.com> on 2014/05/27 18:28:39 UTC, 4 replies.
- Re: file not found - posted by jaranda <jo...@bsc.es> on 2014/05/27 18:30:02 UTC, 0 replies.
- proximity of events within the next group of events instead of time - posted by "Navarro, John" <JN...@verisk.com> on 2014/05/27 20:06:17 UTC, 0 replies.
- Running Jars on Spark, program just hanging there - posted by Min Li <li...@gmail.com> on 2014/05/27 20:48:50 UTC, 2 replies.
- Profiling tasks - posted by Puneet Lakhina <pu...@gmail.com> on 2014/05/27 23:06:14 UTC, 0 replies.
- Spark 1.0: slf4j version conflicts with pig - posted by Ryan Compton <co...@gmail.com> on 2014/05/27 23:45:17 UTC, 4 replies.
- Java RDD structure for Matrix predict? - posted by Sandeep Parikh <sa...@clusterbeep.org> on 2014/05/28 00:27:26 UTC, 2 replies.
- Spark Memory Bounds - posted by Keith Simmons <ke...@pulse.io> on 2014/05/28 02:33:47 UTC, 4 replies.
- AMPCamp Training materials are broken due to overwritten AMIs? - posted by Toshinari Kureha <to...@flite.com> on 2014/05/28 03:20:28 UTC, 0 replies.
- Re: Akka Connection refused - standalone cluster using spark-0.9.0 - posted by jaranda <jo...@bsc.es> on 2014/05/28 11:35:18 UTC, 1 replies.
- Problem using Spark with Hbase - posted by Vibhor Banga <vi...@gmail.com> on 2014/05/28 14:33:04 UTC, 3 replies.
- Inter and Inra Cluster Density in KMeans - posted by Stuti Awasthi <st...@hcl.com> on 2014/05/28 14:53:43 UTC, 0 replies.
- Writing RDDs from Python Spark progrma (pyspark) to HBase - posted by "gaurav.dasgupta" <ga...@gmail.com> on 2014/05/28 15:00:50 UTC, 1 replies.
- Reading bz2 files that do not end with .bz2 - posted by Laurent T <la...@ldmobile.net> on 2014/05/28 15:52:54 UTC, 1 replies.
- Spark on an HPC setup - posted by Sidharth Kashyap <si...@outlook.com> on 2014/05/28 17:02:39 UTC, 1 replies.
- Re: rdd ordering gets scrambled - posted by Michael Malak <mi...@yahoo.com> on 2014/05/28 17:44:18 UTC, 0 replies.
- Integration issue between Apache Shark-0.9.1 (with in-house hive-0.11) and pre-existing CDH4.6 HIVE-0.10 server - posted by bijoy deb <bi...@gmail.com> on 2014/05/28 17:47:13 UTC, 2 replies.
- K-NN by efficient sparse matrix product - posted by Christian Jauvin <cj...@gmail.com> on 2014/05/28 18:00:46 UTC, 3 replies.
- Re: Spark Streaming RDD to Shark table - posted by Chang Lim <ch...@gmail.com> on 2014/05/28 18:53:18 UTC, 0 replies.
- persistence and fault tolerance in Spark Streaming - posted by Diana Carroll <dc...@cloudera.com> on 2014/05/28 19:45:47 UTC, 0 replies.
- A Standalone App in Scala: Standalone mode issues - posted by jaranda <jo...@bsc.es> on 2014/05/28 21:35:08 UTC, 2 replies.
- Checking spark cache percentage programatically. And how to clear cache. - posted by Sung Hwan Chung <co...@cs.stanford.edu> on 2014/05/29 02:32:08 UTC, 1 replies.
- Spark Stand-alone mode job not starting (akka Connection refused) - posted by "T.J. Alumbaugh" <tj...@continuum.io> on 2014/05/29 06:13:12 UTC, 0 replies.
- Spark SQL JDBC Connectivity - posted by Venkat Subramanian <vs...@gmail.com> on 2014/05/29 08:39:23 UTC, 1 replies.
- Re: Use mvn run Spark program occur problem - posted by jaranda <jo...@bsc.es> on 2014/05/29 10:27:40 UTC, 0 replies.
- Driver OOM while using reduceByKey - posted by "haitao .yao" <ya...@gmail.com> on 2014/05/29 11:03:10 UTC, 2 replies.
- How can I dispose an Accumulator? - posted by innowireless TaeYun Kim <ta...@innowireless.co.kr> on 2014/05/29 11:13:16 UTC, 1 replies.
- Selecting first ten values in a RDD/partition - posted by nilmish <ni...@gmail.com> on 2014/05/29 14:04:58 UTC, 4 replies.
- Is uberjar a recommended way of running Spark/Scala applications? - posted by Andrei <fa...@gmail.com> on 2014/05/29 14:24:02 UTC, 4 replies.
- ClassCastExceptions when using Spark shell - posted by Sebastian Schelter <ss...@apache.org> on 2014/05/29 14:51:57 UTC, 1 replies.
- Spark hook to create external process - posted by ansriniv <an...@gmail.com> on 2014/05/29 18:39:50 UTC, 3 replies.
- [ANN]: Scala By the Bay Developer Conference, CFP now open - posted by Chester Chen <ch...@yahoo.com> on 2014/05/29 20:48:54 UTC, 0 replies.
- Why Scala? - posted by Nick Chammas <ni...@gmail.com> on 2014/05/29 22:55:57 UTC, 8 replies.
- Re: Spark SQL JDBC Connectivity and more - posted by Venkat Subramanian <vs...@gmail.com> on 2014/05/30 00:26:45 UTC, 1 replies.
- pyspark MLlib examples don't work with Spark 1.0.0 - posted by jamborta <ja...@gmail.com> on 2014/05/30 02:22:45 UTC, 4 replies.
- access hdfs file name in map() - posted by "Xu (Simon) Chen" <xc...@gmail.com> on 2014/05/30 04:49:38 UTC, 1 replies.
- getPreferredLocations - posted by ansriniv <an...@gmail.com> on 2014/05/30 05:47:14 UTC, 1 replies.
- Re: spark job stuck when running on mesos fine grained mode - posted by prabeesh <pr...@gmail.com> on 2014/05/30 07:28:54 UTC, 0 replies.
- Create/shutdown objects before/after RDD use (or: Non-serializable classes) - posted by Tobias Pfeiffer <tg...@preferred.jp> on 2014/05/30 08:38:54 UTC, 1 replies.
- Web frontend issue - posted by "Limbeck, Philip" <Ph...@automic.com> on 2014/05/30 10:03:19 UTC, 0 replies.
- Exception failure: java.lang.ClassNotFoundException: org.apache.spark.streaming.kafka.KafkaReceiver - posted by Margusja <ma...@roo.ee> on 2014/05/30 11:06:20 UTC, 0 replies.
- Announcing Spark 1.0.0 - posted by Patrick Wendell <pw...@gmail.com> on 2014/05/30 12:12:39 UTC, 16 replies.
- Yay for 1.0.0! EC2 Still has problems. - posted by Jeremy Lee <un...@gmail.com> on 2014/05/30 14:08:15 UTC, 4 replies.
- Local file being refrenced in mapper function - posted by Rahul Bhojwani <ra...@gmail.com> on 2014/05/30 15:55:19 UTC, 5 replies.
- Monitoring / Instrumenting jobs in 1.0 - posted by Daniel Siegmann <da...@velos.io> on 2014/05/30 16:09:12 UTC, 1 replies.
- Using Spark on Data size larger than Memory size - posted by Vibhor Banga <vi...@gmail.com> on 2014/05/30 16:21:14 UTC, 2 replies.
- Subscribing to news releases - posted by Nick Chammas <ni...@gmail.com> on 2014/05/30 17:07:34 UTC, 0 replies.
- Spark 1.0.0 - Java 8 - posted by Upender Nimbekar <up...@gmail.com> on 2014/05/30 18:20:11 UTC, 2 replies.
- Configuration error while working using HDFS as storage for spark cluster - posted by Salih Kardan <ka...@gmail.com> on 2014/05/30 20:12:44 UTC, 0 replies.
- Trouble with EC2 - posted by PJ$ <p...@chickenandwaffl.es> on 2014/05/30 20:18:27 UTC, 1 replies.
- Spark shell never leaves ACCEPTED state in YARN CDH5 - posted by David Belling <da...@gmail.com> on 2014/05/30 21:38:51 UTC, 2 replies.
- apache whirr for spark - posted by chirag lakhani <ch...@gmail.com> on 2014/05/30 21:59:43 UTC, 1 replies.
- Failed to remove RDD error - posted by Michael Chang <mi...@tellapart.com> on 2014/05/31 03:22:43 UTC, 1 replies.
- possible typos in spark 1.0 documentation - posted by Yadid Ayzenberg <ya...@media.mit.edu> on 2014/05/31 04:28:49 UTC, 1 replies.
- Unable to execute saveAsTextFile on multi node mesos - posted by prabeesh k <pr...@gmail.com> on 2014/05/31 12:22:34 UTC, 1 replies.