- Spark on yarn enviroment var - posted by "Saurabh Malviya (samalviy)" <sa...@cisco.com> on 2016/10/01 00:11:04 UTC, 1 replies.
- Re: get different results when debugging and running scala program - posted by Jakob Odersky <ja...@odersky.com> on 2016/10/01 00:44:32 UTC, 1 replies.
- Deep learning libraries for scala - posted by janardhan shetty <ja...@gmail.com> on 2016/10/01 02:30:30 UTC, 12 replies.
- Re: Spark ML Decision Trees Algorithm - posted by janardhan shetty <ja...@gmail.com> on 2016/10/01 02:34:07 UTC, 1 replies.
- Re: S3 DirectParquetOutputCommitter + PartitionBy + SaveMode.Append - posted by Igor Berman <ig...@gmail.com> on 2016/10/01 10:14:13 UTC, 1 replies.
- Re: get different results when debugging and running scala program - posted by chen yong <cy...@hotmail.com> on 2016/10/01 13:11:34 UTC, 0 replies.
- execution sequence puzzle - posted by chen yong <cy...@hotmail.com> on 2016/10/01 13:35:08 UTC, 0 replies.
- Performance problem with BlockMatrix.add() - posted by Andi <ar...@googlemail.com> on 2016/10/01 13:36:46 UTC, 0 replies.
- Re: Pls assist: Spark 2.0 build failure on Ubuntu 16.06 - posted by Marco Mistroni <mm...@gmail.com> on 2016/10/01 20:24:19 UTC, 2 replies.
- Loading data into Hbase table throws NoClassDefFoundError: org/apache/htrace/Trace error - posted by Mich Talebzadeh <mi...@gmail.com> on 2016/10/01 22:22:09 UTC, 17 replies.
- Re: Broadcast big dataset - posted by Anastasios Zouzias <zo...@gmail.com> on 2016/10/02 01:27:04 UTC, 0 replies.
- Re: Restful WS for Spark - posted by Vadim Semenov <va...@datadoghq.com> on 2016/10/02 02:12:23 UTC, 0 replies.
- Re: DataFrame Sort gives Cannot allocate a page with more than 17179869176 bytes - posted by Babak Alipour <ba...@gmail.com> on 2016/10/02 03:35:45 UTC, 6 replies.
- use CrossValidatorModel for prediction - posted by Pengcheng <pc...@gmail.com> on 2016/10/02 05:04:00 UTC, 4 replies.
- unsubscribe - posted by Nikos Viorres <nv...@gmail.com> on 2016/10/02 07:13:20 UTC, 5 replies.
- Re: Dataframe, Java: How to convert String to Vector ? - posted by "颜发才 (Yan Facai)" <ya...@gmail.com> on 2016/10/02 11:41:55 UTC, 0 replies.
- Partitioned windows in spark streaming - posted by Adrienne Kole <ad...@gmail.com> on 2016/10/02 14:05:36 UTC, 0 replies.
- statistical theory behind estimating the number of total tasks in GroupedSumEvaluator.scala - posted by philipghu <ph...@gmail.com> on 2016/10/02 19:47:40 UTC, 2 replies.
- Spark Streaming: How to load a Pipeline on a Stream? - posted by manueslapera <ma...@hotmail.com> on 2016/10/02 22:01:43 UTC, 0 replies.
- filtering in SparkR - posted by Yogesh Vyas <in...@gmail.com> on 2016/10/03 06:19:58 UTC, 1 replies.
- Filtering in SparkR - posted by Yogesh Vyas <in...@gmail.com> on 2016/10/03 07:08:23 UTC, 1 replies.
- unsubsribe - posted by as...@cybervisiontech.com on 2016/10/03 08:17:49 UTC, 0 replies.
- Killing a running application - posted by Kevin McGhee <km...@mimecast.com> on 2016/10/03 11:58:50 UTC, 0 replies.
- Spark_Jdbc_Hive - posted by Ajay Chander <it...@gmail.com> on 2016/10/03 13:56:29 UTC, 0 replies.
- Document listing spark sql aggregate functions - posted by Ashish Tadose <as...@gmail.com> on 2016/10/03 15:41:30 UTC, 0 replies.
- Pros and cons of using different persistence layers for Spark - posted by Ashok Kumar <as...@yahoo.com.INVALID> on 2016/10/03 16:55:07 UTC, 0 replies.
- Data Format for Running Collaborative Filtering in Spark MLlib - posted by Baktaawar <tr...@gmail.com> on 2016/10/03 18:08:34 UTC, 0 replies.
- MulticlassClassificationEvaluator how weighted precision and weighted recall calculated - posted by Nirav Patel <np...@xactlycorp.com> on 2016/10/03 21:52:57 UTC, 0 replies.
- ML - MulticlassClassificationEvaluator How to get metrics for each class - posted by Nirav Patel <np...@xactlycorp.com> on 2016/10/03 22:31:09 UTC, 1 replies.
- Executor Lost error - posted by Punit Naik <na...@gmail.com> on 2016/10/04 00:07:22 UTC, 3 replies.
- Re: access spark thrift server from another spark session - posted by Takeshi Yamamuro <li...@gmail.com> on 2016/10/04 00:44:50 UTC, 2 replies.
- Re: Prototype Implementation of Hierarchical Clustering on Spark - posted by pcandido <pc...@gmail.com> on 2016/10/04 02:19:06 UTC, 0 replies.
- RE: Spark Hive Rejection - posted by Mostafa Alaa Mohamed <mo...@etisalat.ae> on 2016/10/04 05:27:22 UTC, 0 replies.
- When will the next version of spark be released? - posted by Aseem Bansal <as...@gmail.com> on 2016/10/04 07:03:58 UTC, 1 replies.
- Re: Problems with new experimental Kafka Consumer for 0.10 - posted by Matthias Niehoff <ma...@codecentric.de> on 2016/10/04 07:18:45 UTC, 10 replies.
- NotSerializableException in DStream.transform - posted by Andrew A <an...@gmail.com> on 2016/10/04 10:29:07 UTC, 0 replies.
- DataFrame API: how to partition by a "virtual" column, or by a nested column? - posted by Samy Dindane <sa...@dindane.com> on 2016/10/04 13:10:53 UTC, 1 replies.
- java.net.URISyntaxException - posted by Hafiz Mujadid <ha...@gmail.com> on 2016/10/04 14:29:06 UTC, 2 replies.
- Extracting Row Value for Deserializer Expression - posted by Aleksander Eskilson <al...@gmail.com> on 2016/10/04 15:36:28 UTC, 0 replies.
- Package org.apache.spark.annotation no longer exist in Spark 2.0? - posted by Liren Ding <sk...@gmail.com> on 2016/10/04 17:33:53 UTC, 2 replies.
- [ANNOUNCE] Announcing Spark 2.0.1 - posted by Reynold Xin <rx...@databricks.com> on 2016/10/04 17:39:04 UTC, 5 replies.
- Time-unit of RDD.countApprox timeout parameter - posted by "Sesterhenn, Mike" <ms...@cars.com> on 2016/10/04 18:02:25 UTC, 3 replies.
- Re: Spark metrics when running with YARN? - posted by Vladimir Tretyakov <vl...@sematext.com> on 2016/10/04 18:44:01 UTC, 1 replies.
- Error downloading Spark 2.0.1 - posted by Daniel <da...@gmail.com> on 2016/10/04 18:56:37 UTC, 5 replies.
- [Spark] native snappy library not available: this version of libhadoop was built without snappy support. - posted by Uthayan Suthakar <ut...@gmail.com> on 2016/10/04 19:20:57 UTC, 0 replies.
- building Spark 2.1 vs Java 1.8 on Ubuntu 16/06 - posted by Marco Mistroni <mm...@gmail.com> on 2016/10/04 20:21:15 UTC, 7 replies.
- Parsing XML - posted by Jean Georges Perrin <jg...@jgp.net> on 2016/10/04 21:35:54 UTC, 2 replies.
- Any issues if spark 1.6.1 client connects to spark 1.6.0 external shuffle services - posted by Manoj Samel <ma...@gmail.com> on 2016/10/04 23:43:22 UTC, 0 replies.
- UseCase_Design_Help - posted by Ajay Chander <it...@gmail.com> on 2016/10/04 23:44:22 UTC, 8 replies.
- Problem creating SparkContext to connect to YARN cluster - posted by Alberto Andreotti <al...@whiteprompt.com> on 2016/10/05 00:41:38 UTC, 0 replies.
- Re: MLib : Non Linear Optimization - posted by nsareen <ns...@gmail.com> on 2016/10/05 04:28:01 UTC, 3 replies.
- spark streaming job stopped - posted by Divya Gehlot <di...@gmail.com> on 2016/10/05 04:59:13 UTC, 1 replies.
- Need help to setup eclipse environment - posted by "Md. Mahedi Kaysar" <md...@gmail.com> on 2016/10/05 08:54:04 UTC, 6 replies.
- Multiple-streaming context within a jvm - posted by Hafiz Mujadid <ha...@gmail.com> on 2016/10/05 09:13:52 UTC, 1 replies.
- RE: Spark Streaming-- for each new file in HDFS - posted by "Kappaganthu, Sivaram (ES)" <Si...@ADP.com> on 2016/10/05 09:14:12 UTC, 1 replies.
- Filename with DStreams - posted by "Kappaganthu, Sivaram (ES)" <Si...@ADP.com> on 2016/10/05 09:33:33 UTC, 0 replies.
- Why add --driver-class-path jbdc.jar works and --jars not? (1.6.1) - posted by Chanh Le <gi...@gmail.com> on 2016/10/05 10:32:33 UTC, 3 replies.
- SparkR 2.0 glm prediction confidences - posted by Zsolt Tóth <to...@gmail.com> on 2016/10/05 12:09:17 UTC, 0 replies.
- pyspark: sqlContext.read.text() does not work with a list of paths - posted by Laurent Legrand <ll...@skapane.com> on 2016/10/05 12:12:08 UTC, 2 replies.
- Are Task Closures guaranteed to be accessed by only one Thread? - posted by Matthew Dailey <ma...@gmail.com> on 2016/10/05 16:23:38 UTC, 2 replies.
- mesos in spark 2.0.1 - must call stop() otherwise app hangs - posted by Adrian Bridgett <ad...@opensignal.com> on 2016/10/05 17:05:53 UTC, 6 replies.
- can mllib Logistic Regression package handle 10 million sparse features? - posted by Yang <te...@gmail.com> on 2016/10/05 19:00:07 UTC, 5 replies.
- java.lang.NoClassDefFoundError: org/apache/spark/sql/Dataset - posted by kant kodali <ka...@gmail.com> on 2016/10/05 19:58:35 UTC, 6 replies.
- Reading Phoenix table in Spark throws org.apache.hadoop.hbase.client.HBaseAdmin error - posted by Mich Talebzadeh <mi...@gmail.com> on 2016/10/05 20:05:23 UTC, 0 replies.
- How to stop a running job - posted by Richard Siebeling <rs...@gmail.com> on 2016/10/05 20:55:03 UTC, 4 replies.
- How to implement a scheduling algorithm in Spark? - posted by anilsingh <an...@iitrpr.ac.in> on 2016/10/05 20:58:09 UTC, 3 replies.
- PySpark UDF Performance Exploration w/Jython (Early/rough 2~3X improvement*) [SPARK-15369] - posted by Holden Karau <ho...@pigscanfly.ca> on 2016/10/05 21:05:59 UTC, 1 replies.
- Cannot pass 3rd party jars to mesos executors - posted by SpasticParanoia <st...@asu.edu> on 2016/10/05 23:26:49 UTC, 0 replies.
- codecFactory is not able to return codec for a file, created by spark - posted by shamu <pr...@hotmail.com> on 2016/10/06 01:58:23 UTC, 0 replies.
- Fwd: Dynamic allocation - when DataFrame's unpersist actually happens? - posted by "Jung, Soonoh" <so...@gmail.com> on 2016/10/06 03:45:30 UTC, 0 replies.
- spark 2.0.1 upgrade breaks on WAREHOUSE_PATH - posted by Koert Kuipers <ko...@tresata.com> on 2016/10/06 04:18:24 UTC, 3 replies.
- How to make Mesos Cluster Dispatcher of Spark 1.6.1 load my config files? - posted by Chanh Le <gi...@gmail.com> on 2016/10/06 04:25:47 UTC, 3 replies.
- strange pyspark behavior - posted by Sourav Chakraborty <so...@gmail.com> on 2016/10/06 04:43:16 UTC, 0 replies.
- Solve system of linear equations in Spark - posted by Cooper <ah...@gmail.com> on 2016/10/06 05:49:22 UTC, 1 replies.
- PermGen space error - posted by Saurav Sinha <sa...@gmail.com> on 2016/10/06 09:45:10 UTC, 0 replies.
- Detected yarn-cluster mode, but isn't running on a cluster. Deployment to YARN is not supported directly by SparkContext. Please use spark-submit. - posted by Saurav Sinha <sa...@gmail.com> on 2016/10/06 09:51:03 UTC, 3 replies.
- Best approach for processing all files parallelly - posted by Arun Patel <ar...@gmail.com> on 2016/10/06 11:26:49 UTC, 7 replies.
- Spark REST API YARN client mode is not full? - posted by Vladimir Tretyakov <vl...@sematext.com> on 2016/10/06 12:40:30 UTC, 2 replies.
- Spark SQL query - posted by AJT <at...@currenex.com> on 2016/10/06 13:40:34 UTC, 0 replies.
- Kryo serializer slower than Java serializer for Spark Streaming - posted by Rajkiran Rajkumar <ra...@gmail.com> on 2016/10/06 13:47:23 UTC, 2 replies.
- Support for uniVocity in Spark 2.x - posted by Jean Georges Perrin <jg...@jgp.net> on 2016/10/06 15:00:13 UTC, 2 replies.
- spark stateful streaming error - posted by backtrack5 <so...@live.com> on 2016/10/06 15:26:15 UTC, 0 replies.
- Best Savemode option to write Parquet file - posted by Anubhav Agarwal <an...@gmail.com> on 2016/10/06 15:32:39 UTC, 4 replies.
- spark standalone with multiple workers gives a warning - posted by "Mendelson, Assaf" <As...@rsa.com> on 2016/10/06 15:46:50 UTC, 2 replies.
- Best practice of complicated SQL query in Spark/Hive - posted by Shi Yu <sh...@gmail.com> on 2016/10/06 15:50:58 UTC, 0 replies.
- Submit job with driver options in Mesos Cluster mode - posted by vonnagy <iv...@vadio.com> on 2016/10/06 16:20:47 UTC, 4 replies.
- How to Disable or do minimal Logging for apache spark client Driver program? - posted by kant kodali <ka...@gmail.com> on 2016/10/06 16:27:56 UTC, 5 replies.
- Spark Streaming Advice - posted by Kevin Mellott <ke...@gmail.com> on 2016/10/06 21:22:07 UTC, 4 replies.
- Zombie Driver process (Standalone Cluster) - posted by map reduced <k3...@gmail.com> on 2016/10/06 22:09:32 UTC, 0 replies.
- Re: Spark 2.0.1 has been published? - posted by miliofotou <il...@gmail.com> on 2016/10/06 23:00:53 UTC, 1 replies.
- RESTful Endpoint and Spark - posted by Benjamin Kim <bb...@gmail.com> on 2016/10/06 23:27:19 UTC, 3 replies.
- (no subject) - posted by ayan guha <gu...@gmail.com> on 2016/10/07 01:37:41 UTC, 2 replies.
- [Spark][issue]Writing Hive Partitioned table - posted by ayan guha <gu...@gmail.com> on 2016/10/07 02:46:16 UTC, 2 replies.
- spark 2.0.1, union on non-null and null String dataframes causing ClassCastException UTF8String cannot be cast to java.lang.String - posted by William Kinney <wi...@gmail.com> on 2016/10/07 02:58:40 UTC, 0 replies.
- Executors under utilized - posted by Pradeep Gollakota <pr...@gmail.com> on 2016/10/07 03:49:49 UTC, 0 replies.
- Spark SQL is slower when DataFrame is cache in Memory - posted by Chin Wei Low <lo...@gmail.com> on 2016/10/07 04:03:46 UTC, 7 replies.
- MLlib: word2vec - words vectors into feature vector - posted by kaching <wa...@o2.pl> on 2016/10/07 08:05:02 UTC, 1 replies.
- issue accessing Phoenix table from Spark - posted by Mich Talebzadeh <mi...@gmail.com> on 2016/10/07 08:27:53 UTC, 2 replies.
- How to resubmit the job after it is done? - posted by kant kodali <ka...@gmail.com> on 2016/10/07 09:59:23 UTC, 0 replies.
- SqlContext in below code - posted by Mich Talebzadeh <mi...@gmail.com> on 2016/10/07 11:22:57 UTC, 0 replies.
- When will spark 2.0.1 be available in maven repo? - posted by Sushrut Ikhar <su...@gmail.com> on 2016/10/07 14:39:07 UTC, 1 replies.
- Spark SQL Thriftserver with HBase - posted by Benjamin Kim <bb...@gmail.com> on 2016/10/07 14:56:20 UTC, 33 replies.
- SaveToCassandra - how to handle failed inserts? - posted by Pablo Federigi <pa...@mercadolibre.com> on 2016/10/07 15:30:38 UTC, 0 replies.
- Re: Could we expose log likelihood of EM algorithm in MLLIB? - posted by Yanbo Liang <yb...@gmail.com> on 2016/10/07 15:35:59 UTC, 3 replies.
- Spark 2.0 Encoder().schema() is sorting StructFields - posted by Paul Stewart <pa...@imperva.com> on 2016/10/07 15:42:06 UTC, 1 replies.
- Is there a way to pause spark job - posted by Evgenii Morozov <ev...@gmail.com> on 2016/10/07 16:12:28 UTC, 0 replies.
- Executor errors out connecting to external shuffle service when using dynamic allocation - posted by Manoj Samel <ma...@gmail.com> on 2016/10/07 17:07:06 UTC, 0 replies.
- Writing/Saving RDD to HDFS using saveAsTextFile - posted by Mahendra Kutare <ma...@gmail.com> on 2016/10/07 17:33:32 UTC, 1 replies.
- Map with state keys serialization - posted by Joey Echeverria <jo...@rocana.com> on 2016/10/07 18:39:39 UTC, 9 replies.
- Kafaka 0.8, 0.9 in Structured Streaming - posted by Michael Armbrust <mi...@databricks.com> on 2016/10/07 19:41:19 UTC, 7 replies.
- Fw: Issue with Spark Streaming with checkpointing in Spark 2.0 - posted by Arijit <Ar...@live.com> on 2016/10/08 00:11:14 UTC, 0 replies.
- Re: working with multiple values - posted by Robineast <Ro...@xense.co.uk> on 2016/10/08 13:10:48 UTC, 0 replies.
- Issue while creating spark datasets - posted by "Kappaganthu, Sivaram (ES)" <Si...@ADP.com> on 2016/10/08 14:50:01 UTC, 1 replies.
- Inserting New Primary Keys - posted by Benjamin Kim <bb...@gmail.com> on 2016/10/08 16:42:17 UTC, 4 replies.
- Reason for Kafka topic existence check / "Does the topic exist?" error - posted by Dmitry Goldenberg <dg...@gmail.com> on 2016/10/08 16:44:24 UTC, 4 replies.
- Code review / sqlContext Scope - posted by Ajay Chander <it...@gmail.com> on 2016/10/08 17:17:26 UTC, 1 replies.
- How to include book title at "Books" section on Spark website - posted by "Karim, Md. Rezaul" <re...@insight-centre.org> on 2016/10/08 18:45:07 UTC, 2 replies.
- Apply UDF to SparseVector column in spark 2.0 - posted by abby <as...@doximity.com> on 2016/10/08 21:04:18 UTC, 0 replies.
- SparkConf.setExecutorEnv works differently in Spark 2.0.0 - posted by Dmitry Goldenberg <dg...@gmail.com> on 2016/10/08 22:47:40 UTC, 0 replies.
- Kafka 0.10 integ offset commit - posted by Srikanth <sr...@gmail.com> on 2016/10/09 00:25:04 UTC, 5 replies.
- Scientific Notation and Precision Error - posted by Meeraj Kunnumpurath <me...@servicesymphony.com> on 2016/10/09 01:50:39 UTC, 0 replies.
- "How to change rdd fields for each key combination." - posted by Vikash Kumar <vi...@gmail.com> on 2016/10/09 04:11:14 UTC, 0 replies.
- Convert hive sql to spark sql - posted by Sree Eedupuganti <sr...@inndata.in> on 2016/10/09 08:55:48 UTC, 2 replies.
- java: see logging output in in UI - posted by Miro Karpis <mi...@gmail.com> on 2016/10/09 15:50:19 UTC, 0 replies.
- when does a Row object have a schema - posted by Koert Kuipers <ko...@tresata.com> on 2016/10/09 20:50:31 UTC, 1 replies.
- This Exception has been really hard to trace - posted by kant kodali <ka...@gmail.com> on 2016/10/10 03:13:23 UTC, 4 replies.
- SPARK-17845 - window function frame boundary API - posted by Reynold Xin <rx...@databricks.com> on 2016/10/10 04:50:10 UTC, 1 replies.
- ClassCastException while running a simple wordCount - posted by vaibhav thapliyal <va...@gmail.com> on 2016/10/10 07:05:12 UTC, 7 replies.
- Spark Streaming Custom Receivers - How to use metadata store API during processing - posted by "Manjunath, Kiran" <ki...@akamai.com> on 2016/10/10 09:17:02 UTC, 0 replies.
- What happens when an executor crashes? - posted by Samy Dindane <sa...@dindane.com> on 2016/10/10 09:19:49 UTC, 8 replies.
- spark using two different versions of netty? - posted by Paweł Szulc <pa...@gmail.com> on 2016/10/10 10:56:13 UTC, 2 replies.
- Logistic Regression Standardization in ML - posted by Cesar <ce...@gmail.com> on 2016/10/10 14:15:10 UTC, 2 replies.
- Manually committing offset in Spark 2.0 with Kafka 0.10 and Java - posted by static-max <fl...@googlemail.com> on 2016/10/10 15:34:42 UTC, 3 replies.
- JSON Arrays and Spark - posted by Jean Georges Perrin <jg...@jgp.net> on 2016/10/10 16:57:33 UTC, 7 replies.
- Error: PartitioningCollection requires all of its partitionings have the same numPartitions. - posted by cuevasclemente <cu...@gmail.com> on 2016/10/10 17:24:29 UTC, 0 replies.
- Large variation in spark in Task Deserialization Time - posted by Pulasthi Supun Wickramasinghe <pu...@gmail.com> on 2016/10/10 17:53:05 UTC, 0 replies.
- converting hBaseRDD to DataFrame - posted by Mich Talebzadeh <mi...@gmail.com> on 2016/10/10 18:02:43 UTC, 1 replies.
- Spark S3 - posted by Selvam Raman <se...@gmail.com> on 2016/10/10 21:46:38 UTC, 3 replies.
- [Spark] RDDs are not persisting in memory - posted by diplomatic Guru <di...@gmail.com> on 2016/10/10 22:14:18 UTC, 3 replies.
- Design consideration for a trading System - posted by Mich Talebzadeh <mi...@gmail.com> on 2016/10/10 22:40:53 UTC, 0 replies.
- Kryo on Zeppelin - posted by Fei Hu <hu...@gmail.com> on 2016/10/11 04:11:27 UTC, 0 replies.
- GraphFrame BFS - posted by cashinpj <pc...@unm.edu> on 2016/10/11 04:32:12 UTC, 0 replies.
- Exception while shutting down the Spark master. - posted by Gimantha Bandara <gi...@wso2.com> on 2016/10/11 09:57:20 UTC, 1 replies.
- Can mapWithState state func be called every batchInterval? - posted by DandyDev <de...@gmail.com> on 2016/10/11 11:28:32 UTC, 4 replies.
- mllib model in production web API - posted by Nicolas Long <ni...@gmail.com> on 2016/10/11 15:53:18 UTC, 6 replies.
- Limit Kafka batches size with Spark Streaming - posted by Samy Dindane <sa...@dindane.com> on 2016/10/11 15:57:29 UTC, 8 replies.
- Spark 2.0.0 Error Caused by: java.lang.IllegalArgumentException: requirement failed: Block broadcast_21_piece0 is already present in the MemoryStore - posted by sandesh deshmane <sa...@gmail.com> on 2016/10/11 16:10:21 UTC, 0 replies.
- Recommended way to run spark streaming in production in EMR - posted by pandees waran <pa...@gmail.com> on 2016/10/11 18:09:15 UTC, 0 replies.
- Spark Docker Container - Jars problem when deploying my app - posted by doruchiulan <do...@gmail.com> on 2016/10/11 19:10:39 UTC, 1 replies.
- one executor runs multiple parallel tasks VS multiple excutors each runs one task - posted by Xiaoye Sun <su...@gmail.com> on 2016/10/11 20:49:42 UTC, 1 replies.
- textFileStream dStream to DataFrame issues - posted by Nick <ni...@gmail.com> on 2016/10/11 23:43:41 UTC, 0 replies.
- Anyone attending spark summit? - posted by Andrew James <at...@gmail.com> on 2016/10/12 01:04:19 UTC, 1 replies.
- Spark Shuffle Issue - posted by Ankur Srivastava <an...@gmail.com> on 2016/10/12 06:16:10 UTC, 1 replies.
- Spark ML OOM problem - posted by 陈哲 <cz...@gmail.com> on 2016/10/12 07:24:28 UTC, 1 replies.
- Kafka integration: get existing Kafka messages? - posted by Haopu Wang <HW...@qilinsoft.com> on 2016/10/12 09:15:31 UTC, 5 replies.
- Reading from and writing to different S3 buckets in spark - posted by Aseem Bansal <as...@gmail.com> on 2016/10/12 09:49:18 UTC, 2 replies.
- Spark-Sql 2.0 nullpointerException - posted by Selvam Raman <se...@gmail.com> on 2016/10/12 15:10:08 UTC, 1 replies.
- How to prevent having more than one instance of a specific job running on the cluster - posted by Samy Dindane <sa...@dindane.com> on 2016/10/12 16:36:26 UTC, 0 replies.
- DataFrame/Dataset join not producing correct results in Spark 2.0/Yarn - posted by Stephen Hankinson <st...@affinio.com> on 2016/10/12 16:55:57 UTC, 1 replies.
- UDF on multiple columns - posted by Meeraj Kunnumpurath <me...@servicesymphony.com> on 2016/10/12 17:56:48 UTC, 1 replies.
- Spyder and SPARK combination problem...Please help! - posted by innocent73 <er...@gmail.com> on 2016/10/12 19:32:36 UTC, 3 replies.
- Matrix Operations - posted by Meeraj Kunnumpurath <me...@servicesymphony.com> on 2016/10/12 19:47:18 UTC, 0 replies.
- Python Spark Improvements (forked from Spark Improvement Proposals) - posted by Holden Karau <ho...@pigscanfly.ca> on 2016/10/12 19:49:48 UTC, 6 replies.
- Linear Regression Error - posted by Meeraj Kunnumpurath <me...@servicesymphony.com> on 2016/10/12 19:52:30 UTC, 2 replies.
- Memory leak warnings in Spark 2.0.1 - posted by vonnagy <iv...@vadio.com> on 2016/10/12 20:32:47 UTC, 0 replies.
- Mark DataFrame/Dataset APIs stable - posted by Reynold Xin <rx...@databricks.com> on 2016/10/13 04:26:12 UTC, 0 replies.
- Unsuscribe - posted by "R. Revert" <ra...@gmail.com> on 2016/10/13 04:28:07 UTC, 0 replies.
- download spark 1.2.1 - posted by Irfan Sayyed <ir...@gmail.com> on 2016/10/13 05:33:48 UTC, 0 replies.
- receiving stream data options - posted by vr spark <vr...@gmail.com> on 2016/10/13 06:10:58 UTC, 0 replies.
- Want to test spark-sql-kafka but get unresolved dependency error - posted by JayKay <ju...@gmail.com> on 2016/10/13 08:24:23 UTC, 5 replies.
- OOM when running Spark SQL by PySpark on Java 8 - posted by Shady Xu <sh...@gmail.com> on 2016/10/13 09:00:18 UTC, 4 replies.
- Spark with kerberos - posted by Denis Bolshakov <bo...@gmail.com> on 2016/10/13 09:43:51 UTC, 0 replies.
- spark with kerberos - posted by dbolshak <bo...@gmail.com> on 2016/10/13 09:50:22 UTC, 8 replies.
- [1.6.0] Skipped stages keep increasing and causes OOM finally - posted by Mungeol Heo <mu...@gmail.com> on 2016/10/13 10:17:37 UTC, 0 replies.
- RowMatrix from DenseVector - posted by Meeraj Kunnumpurath <me...@servicesymphony.com> on 2016/10/13 10:27:33 UTC, 1 replies.
- Spark security - posted by "Mendelson, Assaf" <As...@rsa.com> on 2016/10/13 12:40:36 UTC, 2 replies.
- spark on mesos memory sizing with offheap - posted by vincent gromakowski <vi...@gmail.com> on 2016/10/13 14:23:36 UTC, 1 replies.
- pyspark doesn't recognize MMM dateFormat pattern in spark.read.load() for dates like 1989Dec31 and 31Dec1989 - posted by Pietro Pugni <pi...@gmail.com> on 2016/10/13 14:32:00 UTC, 8 replies.
- Spark 2.0.0 TreeAggregate with larger depth will be OOM? - posted by Jy Chen <ch...@gmail.com> on 2016/10/13 15:32:38 UTC, 0 replies.
- No way to set mesos cluster driver memory overhead? - posted by drewrobb <dr...@gmail.com> on 2016/10/13 17:42:04 UTC, 3 replies.
- Re: Re-partitioning mapwithstateDstream - posted by manasdebashiskar <po...@gmail.com> on 2016/10/13 20:59:59 UTC, 0 replies.
- How to spark-submit using python subprocess module? - posted by Vikram Kone <vi...@gmail.com> on 2016/10/13 22:13:29 UTC, 0 replies.
- Java.util.ArrayList is not a valid external type for schema of array - posted by Mohamed Nadjib MAMI <mo...@gmail.com> on 2016/10/13 22:30:45 UTC, 0 replies.
- [Spark 2.0.0] error when unioning to an empty dataset - posted by Efe Selcuk <ef...@gmail.com> on 2016/10/14 03:25:38 UTC, 8 replies.
- detecting last record of partition - posted by Shushant Arora <sh...@gmail.com> on 2016/10/14 03:33:13 UTC, 1 replies.
- SparkR execution hang on when handle a RDD which is converted from DataFrame - posted by Lantao Jin <ji...@gmail.com> on 2016/10/14 03:50:25 UTC, 7 replies.
- does it support by the submission request with client deploy mode to master rest port - posted by Marc Pan <px...@gmail.com> on 2016/10/14 05:53:30 UTC, 0 replies.
- import sql.implicits._ - posted by Jakub Dubovsky <sp...@gmail.com> on 2016/10/14 20:42:50 UTC, 4 replies.
- Why the json file used by sparkSession.read.json must be a valid json object per line - posted by codlife <10...@qq.com> on 2016/10/15 15:09:18 UTC, 5 replies.
- reading files with .list extension - posted by Hafiz Mujadid <ha...@gmail.com> on 2016/10/15 15:35:32 UTC, 0 replies.
- NoClassDefFoundError: org/apache/spark/Logging in SparkSession.getOrCreate - posted by Brad Cox <br...@gmail.com> on 2016/10/15 19:25:30 UTC, 1 replies.
- Spark-submit Problems - posted by Tobi Bosede <an...@gmail.com> on 2016/10/16 00:04:21 UTC, 0 replies.
- Re: Spark-submit Problems - posted by hxfeng <98...@qq.com> on 2016/10/16 00:23:39 UTC, 4 replies.
- Aggregate UDF (UDAF) in Python - posted by Tobi Bosede <an...@gmail.com> on 2016/10/16 02:20:35 UTC, 9 replies.
- Couchbase-Spark 2.0.0 - posted by "Devi P.V" <de...@gmail.com> on 2016/10/16 14:51:35 UTC, 3 replies.
- Accessing Hbase tables through Spark, this seems to work - posted by Mich Talebzadeh <mi...@gmail.com> on 2016/10/16 18:37:16 UTC, 6 replies.
- Is spark a right tool for updating a dataframe repeatedly - posted by Mungeol Heo <mu...@gmail.com> on 2016/10/17 01:50:56 UTC, 5 replies.
- Question about the offiicial binary Spark 2 package - posted by Xi Shen <da...@gmail.com> on 2016/10/17 06:08:28 UTC, 2 replies.
- Resizing Image with Scrimage in Spark - posted by Adline Dsilva <ad...@mimos.my> on 2016/10/17 07:03:57 UTC, 1 replies.
- Possible memory leak after closing spark context in v2.0.1 - posted by lev <ka...@gmail.com> on 2016/10/17 09:02:03 UTC, 2 replies.
- Driver storage memory getting waste - posted by Sushrut Ikhar <su...@gmail.com> on 2016/10/17 10:21:28 UTC, 0 replies.
- Did anybody come across this random-forest issue with spark 2.0.1. - posted by "张建鑫 (市场部)" <zh...@didichuxing.com> on 2016/10/17 11:18:29 UTC, 0 replies.
- rdd and dataframe columns dtype - posted by muhammet pakyürek <mp...@hotmail.com> on 2016/10/17 11:51:48 UTC, 0 replies.
- Re: Did anybody come across this random-forest issue with spark 2.0.1. - posted by Xi Shen <da...@gmail.com> on 2016/10/17 12:00:58 UTC, 4 replies.
- OutputMetrics with data frames (spark-avro) - posted by Tim Moran <ti...@privitar.com> on 2016/10/17 12:37:19 UTC, 0 replies.
- Help in generating unique Id in spark row - posted by Saurav Sinha <sa...@gmail.com> on 2016/10/17 13:57:59 UTC, 2 replies.
- Substitute Certain Rows a data Frame using SparkR - posted by shilp <ts...@Hotmail.com> on 2016/10/17 14:37:51 UTC, 1 replies.
- Indexing w spark joins? - posted by Michael Segel <ms...@hotmail.com> on 2016/10/17 16:49:26 UTC, 0 replies.
- question on the structured DataSet API join - posted by Yang <te...@gmail.com> on 2016/10/17 16:53:11 UTC, 0 replies.
- K-Mean retrieving Cluster Members - posted by Reth RM <re...@gmail.com> on 2016/10/17 17:56:08 UTC, 2 replies.
- Re: Indexing w spark joins? - posted by Mich Talebzadeh <mi...@gmail.com> on 2016/10/17 19:14:14 UTC, 0 replies.
- PostgresSql queries vs spark sql - posted by Selvam Raman <se...@gmail.com> on 2016/10/17 21:00:55 UTC, 1 replies.
- Broadcasting Complex Custom Objects - posted by Pedro Tuero <tu...@gmail.com> on 2016/10/17 21:32:00 UTC, 0 replies.
- Re: Consuming parquet files built with version 1.8.1 - posted by Cheng Lian <li...@gmail.com> on 2016/10/17 22:33:51 UTC, 0 replies.
- previous stage results are not saved? - posted by Yang <te...@gmail.com> on 2016/10/17 23:11:56 UTC, 2 replies.
- Fwd: jdbcRDD for data ingestion from RDBMS - posted by Ninad Shringarpure <ni...@cloudera.com> on 2016/10/18 02:24:47 UTC, 2 replies.
- About Error while reading large JSON file in Spark - posted by Chetan Khatri <ck...@gmail.com> on 2016/10/18 07:43:58 UTC, 4 replies.
- tutorial for access elements of dataframe columns and column values of a specific rows? - posted by muhammet pakyürek <mp...@hotmail.com> on 2016/10/18 07:48:40 UTC, 1 replies.
- Contributing to PySpark - posted by Krishna Kalyan <kr...@gmail.com> on 2016/10/18 09:16:55 UTC, 1 replies.
- Spark Streaming 2 Kafka 0.10 Integration for Aggregating Data - posted by Furkan KAMACI <fu...@gmail.com> on 2016/10/18 13:15:57 UTC, 1 replies.
- Making more features in Logistic Regression - posted by aditya1702 <ad...@gmail.com> on 2016/10/18 17:09:34 UTC, 6 replies.
- How to add all jars in a folder to executor classpath? - posted by nitinkak001 <ni...@gmail.com> on 2016/10/18 17:36:21 UTC, 0 replies.
- Broadcasting Non Serializable Objects - posted by pedroT <tu...@gmail.com> on 2016/10/18 19:27:01 UTC, 1 replies.
- spark streaming client program needs to be restarted after few hours of idle time. how can I fix it? - posted by kant kodali <ka...@gmail.com> on 2016/10/18 20:25:36 UTC, 0 replies.
- Does the delegator map task of SparkLauncher need to stay alive until Spark job finishes ? - posted by Elkhan Dadashov <el...@gmail.com> on 2016/10/18 22:01:06 UTC, 4 replies.
- How does Spark determine in-memory partition count when reading Parquet ~files? - posted by "shea.parkes" <sh...@gmail.com> on 2016/10/19 02:04:22 UTC, 2 replies.
- hive.exec.stagingdir not effect in spark2.0.1 - posted by 谭 成灶 <ta...@live.cn> on 2016/10/19 02:13:56 UTC, 0 replies.
- how to extract arraytype data to file - posted by lk_spark <lk...@163.com> on 2016/10/19 03:35:54 UTC, 4 replies.
- Equivalent Parquet File Repartitioning Benefits for Join/Shuffle? - posted by adam kramer <ad...@gmail.com> on 2016/10/19 05:59:53 UTC, 1 replies.
- question about the new Dataset API - posted by Yang <te...@gmail.com> on 2016/10/19 06:30:35 UTC, 1 replies.
- how to see spark class variable values on variable explorer of spyder for python? - posted by muhammet pakyürek <mp...@hotmail.com> on 2016/10/19 12:18:54 UTC, 0 replies.
- Joins of typed datasets - posted by daunnc <da...@gmail.com> on 2016/10/19 12:28:47 UTC, 1 replies.
- Re: LDA and Maximum Iterations - posted by Richard Garris <rl...@databricks.com> on 2016/10/19 14:46:31 UTC, 0 replies.
- Re: Spark 2.0 with Kafka 0.10 exception - posted by Srikanth <sr...@gmail.com> on 2016/10/19 17:22:15 UTC, 5 replies.
- ApacheCon is now less than a month away! - posted by Rich Bowen <rb...@apache.org> on 2016/10/19 18:20:03 UTC, 0 replies.
- Re: Why the json file used by sparkSession.read.json must be a valid json object per line - posted by Wangjianfei <10...@qq.com> on 2016/10/20 00:44:37 UTC, 0 replies.
- Dataframe schema... - posted by Muthu Jayakumar <ba...@gmail.com> on 2016/10/20 01:07:00 UTC, 9 replies.
- partitionBy produces wrong number of tasks - posted by Daniel Haviv <da...@veracity-group.com> on 2016/10/20 05:27:05 UTC, 0 replies.
- Spark ExternalTable doesn't recognize subdir - posted by lk_spark <lk...@163.com> on 2016/10/20 05:56:18 UTC, 0 replies.
- How to iterate the element of an array in DataFrame? - posted by "颜发才 (Yan Facai)" <ya...@gmail.com> on 2016/10/20 08:34:03 UTC, 6 replies.
- pyspark dataframe codes for lead lag to column - posted by muhammet pakyürek <mp...@hotmail.com> on 2016/10/20 08:35:53 UTC, 2 replies.
- Where condition on columns of Arrays does no longer work in spark 2 - posted by filthysocks <js...@uos.de> on 2016/10/20 08:54:04 UTC, 1 replies.
- Can i display message on console when use spark on yarn? - posted by Jone Zhang <jo...@gmail.com> on 2016/10/20 09:21:18 UTC, 1 replies.
- Spark Random Forest training cost same time on yarn as on standalone - posted by 陈哲 <cz...@gmail.com> on 2016/10/20 10:21:20 UTC, 1 replies.
- Microbatches length - posted by pcandido <pc...@gmail.com> on 2016/10/20 10:38:28 UTC, 4 replies.
- spark pi example fail on yarn - posted by Li Li <fa...@gmail.com> on 2016/10/20 10:51:45 UTC, 11 replies.
- Expression Encoder for Map[Int, String] in a custom Aggregator on a Dataset - posted by Anton Okolnychyi <an...@gmail.com> on 2016/10/20 11:12:33 UTC, 0 replies.
- Ensuring an Avro File is NOT Splitable - posted by Ashan Taha <at...@currenex.com> on 2016/10/20 12:00:14 UTC, 1 replies.
- HashingTF for TF.IDF computation - posted by Ciumac Sergiu <ci...@gmail.com> on 2016/10/20 17:00:55 UTC, 2 replies.
- Re: Mlib RandomForest (Spark 2.0) predict a single vector - posted by jglov <ja...@capsenrobotics.com> on 2016/10/20 17:10:18 UTC, 0 replies.
- Predict a single vector with the new spark.ml API to avoid groupByKey() after a flatMap()? - posted by jglov <ja...@capsenrobotics.com> on 2016/10/20 17:11:49 UTC, 0 replies.
- RDD groupBy() then random sort each group ? - posted by Yang <te...@gmail.com> on 2016/10/20 17:53:38 UTC, 4 replies.
- Spark SQL parallelize - posted by Selvam Raman <se...@gmail.com> on 2016/10/20 18:42:05 UTC, 0 replies.
- [Spark ML] Using GBTClassifier in OneVsRest - posted by ansari <hi...@gmail.com> on 2016/10/21 00:12:20 UTC, 3 replies.
- ALS.trainImplicit block sizes - posted by Nikhil Mishra <ni...@gmail.com> on 2016/10/21 06:12:15 UTC, 4 replies.
- Can we disable parquet logs in Spark? - posted by "Yu, Yucai" <yu...@intel.com> on 2016/10/21 06:49:33 UTC, 1 replies.
- kmeans|| waiting issue - posted by 김태준 <ki...@gmail.com> on 2016/10/21 06:50:10 UTC, 2 replies.
- How to clean the accumulator and broadcast from the driver manually? - posted by Mungeol Heo <mu...@gmail.com> on 2016/10/21 08:07:29 UTC, 0 replies.
- sql.functions partitionby AttributeError: 'NoneType' object has no attribute '_jvm' - posted by muhammet pakyürek <mp...@hotmail.com> on 2016/10/21 08:24:08 UTC, 0 replies.
- Kafka Direct Stream: Offset Managed Manually (Exactly Once) - posted by Erwan ALLAIN <ea...@gmail.com> on 2016/10/21 08:32:24 UTC, 3 replies.
- Drop partition with PURGE fail - posted by bluishpenguin <bl...@gmail.com> on 2016/10/21 10:10:41 UTC, 0 replies.
- Issues with reading gz files with Spark Streaming - posted by Nkechi Achara <nk...@googlemail.com> on 2016/10/21 14:53:50 UTC, 3 replies.
- Plotting decision boundary in non-linear logistic regression - posted by aditya1702 <ad...@gmail.com> on 2016/10/21 15:39:26 UTC, 0 replies.
- About Reading Parquet - failed to read single gz parquet - failed entire transformation - posted by Chetan Khatri <ck...@gmail.com> on 2016/10/21 19:06:40 UTC, 0 replies.
- Writing to Parquet Job turns to wait mode after even completion of job - posted by Chetan Khatri <ck...@gmail.com> on 2016/10/21 20:47:03 UTC, 7 replies.
- RDD to Dataset results in fixed number of partitions - posted by Spark User <sp...@gmail.com> on 2016/10/22 00:07:34 UTC, 0 replies.
- Fwd: Spark optimization problem - posted by Maitray Thaker <ma...@gmail.com> on 2016/10/22 12:58:47 UTC, 0 replies.
- why is that two stages in apache spark are computing same thing? - posted by maitraythaker <ma...@gmail.com> on 2016/10/22 13:07:09 UTC, 1 replies.
- Dataflow of Spark/Hadoop in steps - posted by Or Raz <ra...@post.bgu.ac.il> on 2016/10/23 11:00:01 UTC, 0 replies.
- Spark streaming crashes with high throughput - posted by Jeyhun Karimov <je...@gmail.com> on 2016/10/23 14:28:34 UTC, 0 replies.
- Random forest classifier error : Size exceeds Integer.MAX_VALUE - posted by Kürşat Kurt <ku...@kursatkurt.com> on 2016/10/23 19:38:57 UTC, 0 replies.
- Spark submit running spark-sql-perf and additional jar - posted by Mr rty ff <ya...@yahoo.com.INVALID> on 2016/10/23 20:20:46 UTC, 0 replies.
- How to avoid the delay associated with Hive Metastore when loading parquet? - posted by ankits <an...@gmail.com> on 2016/10/24 05:05:22 UTC, 0 replies.
- Re: LIMIT issue of SparkSQL - posted by Michael Armbrust <mi...@databricks.com> on 2016/10/24 05:48:26 UTC, 6 replies.
- spark streaming with kinesis - posted by Shushant Arora <sh...@gmail.com> on 2016/10/24 07:43:33 UTC, 3 replies.
- Using a Custom Data Store with Spark 2.0 - posted by Sachith Withana <sw...@gmail.com> on 2016/10/24 08:55:51 UTC, 0 replies.
- Spark Sql 2.0 throws null pointer exception - posted by Selvam Raman <se...@gmail.com> on 2016/10/24 09:23:22 UTC, 1 replies.
- JAVA heap space issue - posted by sankarmittapally <sa...@creditvidya.com> on 2016/10/24 11:19:10 UTC, 7 replies.
- Spark 2.0 - DataFrames vs Dataset performance - posted by Antoaneta Marinova <an...@gmail.com> on 2016/10/24 12:50:43 UTC, 2 replies.
- Shortest path with directed and weighted graphs - posted by Brian Wilson <br...@gmail.com> on 2016/10/24 13:11:41 UTC, 1 replies.
- reading info from spark 2.0 application UI - posted by "TheGeorge1918 ." <zh...@gmail.com> on 2016/10/24 14:33:35 UTC, 3 replies.
- Accessing Phoenix table from Spark 2.0., any cure! - posted by Mich Talebzadeh <mi...@gmail.com> on 2016/10/24 15:22:42 UTC, 0 replies.
- Generate random numbers from Normal Distribution with Specific Mean and Variance - posted by Mich Talebzadeh <mi...@gmail.com> on 2016/10/24 16:04:58 UTC, 4 replies.
- Getting the IP address of Spark Driver in yarn-cluster mode - posted by Masood Krohy <ma...@intact.net> on 2016/10/24 18:34:23 UTC, 2 replies.
- Spark Streaming Kafka job stuck in 'processing' stage - posted by map reduced <k3...@gmail.com> on 2016/10/24 19:58:53 UTC, 1 replies.
- Need help with SVM - posted by aditya1702 <ad...@gmail.com> on 2016/10/24 20:43:52 UTC, 4 replies.
- Modifying Metadata in StructType schemas - posted by Everett Anderson <ev...@nuna.com.INVALID> on 2016/10/24 23:27:08 UTC, 0 replies.
- [Spark 2.0.1] Error in generated code, possible regression? - posted by Efe Selcuk <ef...@gmail.com> on 2016/10/25 01:21:59 UTC, 5 replies.
- [Spark 2] BigDecimal and 0 - posted by Efe Selcuk <ef...@gmail.com> on 2016/10/25 02:03:41 UTC, 6 replies.
- Re: Get size of intermediate results - posted by Takeshi Yamamuro <li...@gmail.com> on 2016/10/25 02:54:11 UTC, 0 replies.
- Help regarding reading text file within rdd operations - posted by Rohit Verma <ro...@rokittech.com> on 2016/10/25 06:03:36 UTC, 0 replies.
- Grouping into Arrays - posted by Matt Smith <ma...@gmail.com> on 2016/10/25 06:10:53 UTC, 0 replies.
- Re: java.lang.NoSuchMethodError - GraphX - posted by Brian Wilson <br...@gmail.com> on 2016/10/25 06:47:59 UTC, 1 replies.
- Spark streaming communication with different versions of kafka - posted by Prabhu GS <pr...@thedatateam.in> on 2016/10/25 09:30:23 UTC, 1 replies.
- Spark streaming communication with InfluxDB - posted by Gioacchino <gi...@gmail.com> on 2016/10/25 09:55:40 UTC, 0 replies.
- Passing command line arguments to Spark-shell in Spark 2.0.1 - posted by Mich Talebzadeh <mi...@gmail.com> on 2016/10/25 10:23:29 UTC, 1 replies.
- Spark 1.2 - posted by ayan guha <gu...@gmail.com> on 2016/10/25 12:17:45 UTC, 3 replies.
- Re: Proper saving/loading of MatrixFactorizationModel - posted by eliasah <ab...@gmail.com> on 2016/10/25 12:33:12 UTC, 0 replies.
- i get the error of Py4JJavaError: An error occurred while calling o177.showString while running code below - posted by muhammet pakyürek <mp...@hotmail.com> on 2016/10/25 13:18:03 UTC, 0 replies.
- How can I log the moment an action is called on a DataFrame? - posted by coldhyll <ca...@gmail.com> on 2016/10/25 14:55:49 UTC, 0 replies.
- Spark Sql - "broadcast-exchange-1" java.lang.OutOfMemoryError: Java heap space - posted by Selvam Raman <se...@gmail.com> on 2016/10/25 15:52:24 UTC, 0 replies.
- Getting only results out of Spark Shell - posted by Mich Talebzadeh <mi...@gmail.com> on 2016/10/25 15:59:04 UTC, 0 replies.
- Transforming Spark SQL AST with extraOptimizations - posted by Michael David Pedersen <mi...@googlemail.com> on 2016/10/25 16:01:43 UTC, 0 replies.
- Re: Zero Data Loss in Spark with Kafka - posted by Sunita Arvind <su...@gmail.com> on 2016/10/25 18:09:22 UTC, 7 replies.
- Operator push down through JDBC driver - posted by AnilKumar B <ak...@gmail.com> on 2016/10/25 21:35:39 UTC, 1 replies.
- HiveContext is Serialized? - posted by Ajay Chander <it...@gmail.com> on 2016/10/26 03:28:15 UTC, 13 replies.
- Any Dynamic Compilation of Scala Query - posted by Mahender Sarangam <Ma...@outlook.com> on 2016/10/26 08:35:44 UTC, 2 replies.
- Can application JAR name contain + for dependency resolution to latest version? - posted by Aseem Bansal <as...@gmail.com> on 2016/10/26 09:08:13 UTC, 0 replies.
- Is there length limit for sparksql/hivesql? - posted by Jone Zhang <jo...@gmail.com> on 2016/10/26 09:28:59 UTC, 0 replies.
- What syntax can be used to specify the latest version of JAR found while using spark submit - posted by Aseem Bansal <as...@gmail.com> on 2016/10/26 10:03:45 UTC, 1 replies.
- CSV conversion - posted by Nathan Kronenfeld <nk...@uncharted.software> on 2016/10/26 16:11:00 UTC, 0 replies.
- Resiliency with SparkStreaming - fileStream - posted by Scott W <de...@gmail.com> on 2016/10/26 16:20:48 UTC, 1 replies.
- csv date/timestamp type inference in spark 2.0.1 - posted by Koert Kuipers <ko...@tresata.com> on 2016/10/26 17:15:25 UTC, 2 replies.
- spark infers date to be timestamp type - posted by Koert Kuipers <ko...@tresata.com> on 2016/10/26 17:16:07 UTC, 5 replies.
- Re: Will Spark SQL completely replace Apache Impala or Apache Hive? - posted by neil90 <ne...@icloud.com> on 2016/10/26 18:38:03 UTC, 0 replies.
- Executor shutdown hook and initialization - posted by Walter rakoff <wa...@gmail.com> on 2016/10/26 19:26:44 UTC, 4 replies.
- Spark Metrics monitoring using Graphite - posted by Sreekanth Jella <js...@gmail.com> on 2016/10/26 19:40:08 UTC, 0 replies.
- Reading old tweets from twitter in spark - posted by Cassa L <lc...@gmail.com> on 2016/10/26 21:13:26 UTC, 0 replies.
- No of partitions in a Dataframe - posted by Nipun Parasrampuria <pa...@umn.edu> on 2016/10/26 22:01:22 UTC, 1 replies.
- Cogrouping or joining datasets by rownum - posted by Rohit Verma <ro...@rokittech.com> on 2016/10/27 03:22:02 UTC, 1 replies.
- Question about In-Memory size (cache / cacheTable) - posted by Prithish <pr...@gmail.com> on 2016/10/27 05:19:24 UTC, 0 replies.
- Dynamic Resource Allocation in a standalone - posted by Ofer Eliassaf <of...@gmail.com> on 2016/10/27 08:00:20 UTC, 0 replies.
- Using SparkLauncher in cluster mode, in a Mesos cluster - posted by Nerea Ayestarán <ne...@gmail.com> on 2016/10/27 09:12:58 UTC, 0 replies.
- Using Hive UDTF in SparkSQL - posted by Lokesh Yadav <lo...@gmail.com> on 2016/10/27 10:05:59 UTC, 1 replies.
- Sharing RDDS across applications and users - posted by Mich Talebzadeh <mi...@gmail.com> on 2016/10/27 10:28:00 UTC, 16 replies.
- Reading AVRO from S3 - No parallelism - posted by Prithish <pr...@gmail.com> on 2016/10/27 12:19:29 UTC, 2 replies.
- Run spark-shell inside Docker container against remote YARN cluster - posted by ponkin <al...@ya.ru> on 2016/10/27 13:30:26 UTC, 1 replies.
- Spark 2.0 on HDP - posted by Deenar Toraskar <de...@gmail.com> on 2016/10/27 13:48:47 UTC, 0 replies.
- Many Spark metric names do not include the application name - posted by Amit Sela <am...@gmail.com> on 2016/10/27 14:07:31 UTC, 0 replies.
- Running Hive and Spark together with Dynamic Resource Allocation - posted by rachmaninovquartet <ra...@gmail.com> on 2016/10/27 14:13:21 UTC, 2 replies.
- CSV escaping not working - posted by "Jain, Nishit" <nj...@underarmour.com> on 2016/10/27 15:54:06 UTC, 7 replies.
- TaskMemoryManager: Failed to allocate a page - posted by pietrop <pi...@gmail.com> on 2016/10/27 16:13:59 UTC, 4 replies.
- If you have used spark-sas7bdat package to transform SAS data set to Spark, please be aware - posted by Shi Yu <sh...@gmail.com> on 2016/10/27 17:16:38 UTC, 0 replies.
- large scheduler delay in OnlineLDAOptimizer, (MLlib and LDA) - posted by Xiaoye Sun <su...@gmail.com> on 2016/10/27 18:05:27 UTC, 0 replies.
- Infinite Loop in Spark - posted by Gervásio Santos <ge...@cobli.co> on 2016/10/27 18:46:42 UTC, 1 replies.
- Spark UI error spark 2.0.1 hadoop 2.6 - posted by gpatcham <gp...@gmail.com> on 2016/10/27 20:52:03 UTC, 1 replies.
- importing org.apache.spark.Logging class - posted by Reth RM <re...@gmail.com> on 2016/10/27 21:42:04 UTC, 1 replies.
- Spark Streaming and Kinesis - posted by Benjamin Kim <bb...@gmail.com> on 2016/10/27 21:53:06 UTC, 0 replies.
- Spark 2.0 with Hadoop 3.0? - posted by adam kramer <ad...@gmail.com> on 2016/10/27 22:04:12 UTC, 4 replies.
- Re: Need help Creating a rule using the Streaming API - posted by patrickhuang <re...@gmail.com> on 2016/10/28 01:30:37 UTC, 0 replies.
- [ANNOUNCE] Apache Bahir 2.0.1 - posted by Luciano Resende <lr...@apache.org> on 2016/10/28 01:35:26 UTC, 0 replies.
- convert spark dataframe to numpy (ndarray) - posted by Zakaria Hili <za...@gmail.com> on 2016/10/28 08:26:31 UTC, 0 replies.
- Weekly aggregation - posted by Oshadha Gunawardena <os...@gmail.com> on 2016/10/28 10:44:19 UTC, 0 replies.
- [SPARK 2.0.0] Specifying remote repository when submitting jobs - posted by Aseem Bansal <as...@gmail.com> on 2016/10/28 10:56:13 UTC, 2 replies.
- Can i get callback notification on Spark job completion ? - posted by Elkhan Dadashov <el...@gmail.com> on 2016/10/28 17:23:55 UTC, 3 replies.
- java.lang.OutOfMemoryError: unable to create new native thread - posted by kant kodali <ka...@gmail.com> on 2016/10/28 19:47:20 UTC, 12 replies.
- spark dataframe rolling window for user define operation - posted by "Manjunath, Kiran" <ki...@akamai.com> on 2016/10/29 10:28:01 UTC, 1 replies.
- spark-submit fails after setting userClassPathFirst to true - posted by sudhir patil <sp...@gmail.com> on 2016/10/29 12:04:22 UTC, 1 replies.
- Out Of Memory issue - posted by Kürşat Kurt <ku...@kursatkurt.com> on 2016/10/29 20:51:08 UTC, 2 replies.
- Happy Diwali to those forum members who celebrate this great festival - posted by Mich Talebzadeh <mi...@gmail.com> on 2016/10/30 16:07:46 UTC, 5 replies.
- Performance bug in UDAF? - posted by Spark User <sp...@gmail.com> on 2016/10/30 18:58:16 UTC, 1 replies.
- task not serializable in case of groupByKey() + mapGroups + map? - posted by Yang <te...@gmail.com> on 2016/10/31 07:59:09 UTC, 0 replies.
- Do you use spark 2.0 in work? - posted by Yang Cao <cy...@gmail.com> on 2016/10/31 08:16:41 UTC, 2 replies.
- MapWithState partitioning - posted by Andrii Biletskyi <an...@yahoo.com.INVALID> on 2016/10/31 09:30:51 UTC, 3 replies.
- Efficient filtering on Spark SQL dataframes with ordered keys - posted by Michael David Pedersen <mi...@googlemail.com> on 2016/10/31 10:06:52 UTC, 5 replies.
- why spark driver program is creating so many threads? How can I limit this number? - posted by kant kodali <ka...@gmail.com> on 2016/10/31 10:20:35 UTC, 13 replies.
- Help needed in parsing JSon with nested structures - posted by "Kappaganthu, Sivaram (ES)" <Si...@ADP.com> on 2016/10/31 10:49:40 UTC, 1 replies.
- MapWithState with large state - posted by Abhishek Singh <ab...@tetrationanalytics.com> on 2016/10/31 14:35:38 UTC, 0 replies.