user@spark.apache.org, 2016-09

You are viewing a plain text version of this content. The canonical link for it is here.

- Re: Spark 2.0 - Parquet data with fields containing periods "." - posted by Don Drake <do...@gmail.com> on 2016/09/01 00:05:05 UTC, 0 replies.
- RE: AnalysisException exception while parsing XML - posted by sr...@gmail.com on 2016/09/01 00:54:41 UTC, 1 replies.
- Scala Vs Python - posted by ayan guha <gu...@gmail.com> on 2016/09/01 02:02:54 UTC, 35 replies.
- [Error:]while read s3 buckets in Spark 1.6 in spark -submit - posted by Divya Gehlot <di...@gmail.com> on 2016/09/01 02:45:49 UTC, 2 replies.
- Re: Spark build 1.6.2 error - posted by Divya Gehlot <di...@gmail.com> on 2016/09/01 02:53:13 UTC, 4 replies.
- KeyManager exception in Spark 1.6.2 - posted by Eric Ho <er...@analyticsmd.com> on 2016/09/01 04:11:42 UTC, 0 replies.
- Window Functions with SQLContext - posted by saurabh3d <sa...@oracle.com> on 2016/09/01 05:16:21 UTC, 6 replies.
- how should I compose keyStore and trustStore if Spark needs to talk to Kafka & Cassandra ? - posted by Eric Ho <er...@analyticsmd.com> on 2016/09/01 08:09:58 UTC, 2 replies.
- spark 2.0.0 - code generation inputadapter_value is not rvalue - posted by Aseem Bansal <as...@gmail.com> on 2016/09/01 08:36:04 UTC, 0 replies.
- Spark 2.0.0 - Java vs Scala performance difference - posted by Aseem Bansal <as...@gmail.com> on 2016/09/01 09:06:53 UTC, 4 replies.
- difference between package and jar Option in Spark - posted by Divya Gehlot <di...@gmail.com> on 2016/09/01 09:24:10 UTC, 3 replies.
- java.lang.OutOfMemoryError Spark MLlib ALS matrix factorization - posted by ANDREA SPINA <74...@studenti.unimore.it> on 2016/09/01 09:49:38 UTC, 0 replies.
- Re: Does a driver jvm houses some rdd partitions? - posted by Jakub Dubovsky <sp...@gmail.com> on 2016/09/01 11:47:46 UTC, 0 replies.
- Why there is no top method in dataset api - posted by Jakub Dubovsky <sp...@gmail.com> on 2016/09/01 11:53:48 UTC, 4 replies.
- Spark 2.0.0 - has anyone used spark ML to do predictions under 20ms? - posted by Aseem Bansal <as...@gmail.com> on 2016/09/01 12:37:28 UTC, 8 replies.
- using multiple worker instances in spark standalone - posted by AssafMendelson <as...@rsa.com> on 2016/09/01 12:54:04 UTC, 0 replies.
- Spark scheduling mode - posted by enrico d'urso <E....@live.com> on 2016/09/01 14:10:57 UTC, 8 replies.
- Difference between Data set and Data Frame in Spark 2 - posted by Ashok Kumar <as...@yahoo.com.INVALID> on 2016/09/01 14:17:09 UTC, 7 replies.
- Re: Hi, guys, does anyone use Spark in finance market? - posted by Adam Roberts <AR...@uk.ibm.com> on 2016/09/01 14:25:56 UTC, 3 replies.
- Re: Spark 2.0: SQL runs 5x times slower when adding 29th field to aggregation. - posted by Romanov <ro...@inbox.ru> on 2016/09/01 15:28:49 UTC, 1 replies.
- Error creating dataframe from schema with nested using case class - posted by Corentin Kerisit <co...@gmail.com> on 2016/09/01 15:41:56 UTC, 0 replies.
- Spark 2.0: SQL runs 5x times slower when adding 29th field to aggregation. - posted by Сергей Романов <ro...@inbox.ru.INVALID> on 2016/09/01 15:55:06 UTC, 0 replies.
- [HELP] Force stop a Spark Streaming application running on EMR - posted by Rajkiran Rajkumar <ra...@gmail.com> on 2016/09/01 16:46:16 UTC, 0 replies.
- Dataset Filter performance - trying to understand - posted by Darin McBeath <dd...@yahoo.com.INVALID> on 2016/09/01 17:40:41 UTC, 0 replies.
- What's the best way to detect and remove outliers in a table? - posted by Mobius ReX <ao...@gmail.com> on 2016/09/01 17:47:59 UTC, 0 replies.
- Fwd: Need some help - posted by Aakash Basu <aa...@gmail.com> on 2016/09/01 19:42:35 UTC, 3 replies.
- Possible Code Generation Bug: Can Spark 2.0 Datasets handle Scala Value Classes? - posted by Aris <ar...@gmail.com> on 2016/09/01 20:58:26 UTC, 4 replies.
- Re: Expected benefit of parquet filter pushdown? - posted by Christon DeWan <cd...@apple.com> on 2016/09/01 23:02:08 UTC, 0 replies.
- Spark 2.0.0 - SQL - Running query with outer join from 1.6 fails - posted by Don Drake <do...@gmail.com> on 2016/09/02 00:19:02 UTC, 0 replies.
- MLib : Non Linear Optimization - posted by nsareen <ns...@gmail.com> on 2016/09/02 01:50:45 UTC, 3 replies.
- PySpark: preference for Python 2.7 or Python 3.5? - posted by Ian Stokes Rees <ij...@continuum.io> on 2016/09/02 02:56:58 UTC, 3 replies.
- Is Spark ML model possible continues learning - posted by 김태준 <ki...@gmail.com> on 2016/09/02 03:44:25 UTC, 0 replies.
- Re: Custom return code - posted by Pierre Villard <pi...@gmail.com> on 2016/09/02 06:17:08 UTC, 0 replies.
- Re[2]: Spark 2.0: SQL runs 5x times slower when adding 29th field to aggregation. - posted by Сергей Романов <ro...@inbox.ru.INVALID> on 2016/09/02 11:03:25 UTC, 1 replies.
- Re: Grouping on bucketed and sorted columns - posted by Fridtjof Sander <fr...@googlemail.com> on 2016/09/02 12:25:13 UTC, 0 replies.
- Passing Custom App Id for consumption in History Server - posted by Amit Shanker <am...@gmail.com> on 2016/09/02 12:59:26 UTC, 2 replies.
- BinaryClassificationMetrics - get raw tp/fp/tn/fn stats per threshold? - posted by "Spencer, Alex (Santander)" <Al...@santander.co.uk.INVALID> on 2016/09/02 13:54:38 UTC, 1 replies.
- Dataset encoder for java.time.LocalDate? - posted by Daniel Siegmann <ds...@securityscorecard.io> on 2016/09/02 15:29:33 UTC, 1 replies.
- Pausing spark kafka streaming (direct) or exclude/include some partitions on the fly per batch - posted by "sagarcasual ." <sa...@gmail.com> on 2016/09/02 17:28:04 UTC, 3 replies.
- Spark web UI - Missing information - posted by Nirav Patel <np...@xactlycorp.com> on 2016/09/02 17:41:00 UTC, 0 replies.
- Reset auto.offset.reset in Kafka 0.10 integ - posted by Srikanth <sr...@gmail.com> on 2016/09/02 19:57:33 UTC, 7 replies.
- Is cache() still necessary for Spark DataFrames? - posted by apu <ap...@gmail.com> on 2016/09/02 20:05:31 UTC, 2 replies.
- how to pass trustStore path into pyspark ? - posted by Eric Ho <er...@analyticsmd.com> on 2016/09/02 21:12:29 UTC, 1 replies.
- Spark SQL Tables on top of HBase Tables - posted by Benjamin Kim <bb...@gmail.com> on 2016/09/02 21:46:59 UTC, 8 replies.
- NoClassDefFound exception after setting spark.eventLog.enabled=true - posted by "C. Josephson" <cj...@uhana.io> on 2016/09/03 00:10:11 UTC, 0 replies.
- any idea what this error could be? - posted by kant kodali <ka...@gmail.com> on 2016/09/03 06:49:32 UTC, 6 replies.
- Hive connection issues in spark-shell - posted by Diwakar Dhanuskodi <di...@gmail.com> on 2016/09/03 08:00:28 UTC, 0 replies.
- Need a help in row repetation - posted by Selvam Raman <se...@gmail.com> on 2016/09/03 08:55:23 UTC, 0 replies.
- Importing large file with SparkContext.textFile - posted by Somasundaram Sekar <so...@tigeranalytics.com> on 2016/09/03 10:08:35 UTC, 7 replies.
- Re[4]: Spark 2.0: SQL runs 5x times slower when adding 29th field to aggregation. - posted by Сергей Романов <ro...@inbox.ru.INVALID> on 2016/09/03 11:54:10 UTC, 0 replies.
- Catalog, SessionCatalog and ExternalCatalog in spark 2.0 - posted by Kapil Malik <ka...@snapdeal.com> on 2016/09/03 12:19:51 UTC, 2 replies.
- Re[5]: Spark 2.0: SQL runs 5x times slower when adding 29th field to aggregation. - posted by Сергей Романов <ro...@inbox.ru.INVALID> on 2016/09/03 12:40:28 UTC, 0 replies.
- Re[6]: Spark 2.0: SQL runs 5x times slower when adding 29th field to aggregation. - posted by Сергей Романов <ro...@inbox.ru.INVALID> on 2016/09/03 12:50:13 UTC, 1 replies.
- Re[7]: Spark 2.0: SQL runs 5x times slower when adding 29th field to aggregation. - posted by Сергей Романов <ro...@inbox.ru.INVALID> on 2016/09/03 13:04:09 UTC, 0 replies.
- Pls assist: Spark 2.0 build failure on Ubuntu 16.06 - posted by Marco Mistroni <mm...@gmail.com> on 2016/09/03 13:54:37 UTC, 2 replies.
- Re: Help with Jupyter Notebook Settup on CDH using Anaconda - posted by Marco Mistroni <mm...@gmail.com> on 2016/09/03 19:13:06 UTC, 0 replies.
- Creating RDD using swebhdfs with truststore - posted by Sourav Mazumder <so...@gmail.com> on 2016/09/03 21:31:34 UTC, 1 replies.
- seeing this message repeatedly. - posted by kant kodali <ka...@gmail.com> on 2016/09/04 00:39:35 UTC, 4 replies.
- Creating a UDF/UDAF using code generation - posted by AssafMendelson <as...@rsa.com> on 2016/09/04 11:09:39 UTC, 0 replies.
- How does chaining of Windowed Dstreams work? - posted by Hemalatha A <he...@googlemail.com> on 2016/09/04 11:12:27 UTC, 1 replies.
- Best ID Generator for ID field in parquet ? - posted by Kevin Tran <ke...@gmail.com> on 2016/09/04 11:43:41 UTC, 3 replies.
- spark cassandra issue - posted by Selvam Raman <se...@gmail.com> on 2016/09/04 14:35:21 UTC, 7 replies.
- Generating random Data using Spark and saving it to table, views appreciated - posted by Mich Talebzadeh <mi...@gmail.com> on 2016/09/04 14:50:59 UTC, 0 replies.
- Spark transformations - posted by janardhan shetty <ja...@gmail.com> on 2016/09/04 16:08:27 UTC, 6 replies.
- Is Spark 2.0 master node compatible with Spark 1.5 work node? - posted by Rex X <dn...@gmail.com> on 2016/09/04 16:29:25 UTC, 9 replies.
- Re: S3A + EMR failure when writing Parquet? - posted by Everett Anderson <ev...@nuna.com.INVALID> on 2016/09/04 17:05:21 UTC, 1 replies.
- Resources for learning Spark administration - posted by Somasundaram Sekar <so...@tigeranalytics.com> on 2016/09/04 18:34:00 UTC, 1 replies.
- Reuters Market Data System connection to Spark Streaming - posted by Mich Talebzadeh <mi...@gmail.com> on 2016/09/04 21:19:13 UTC, 0 replies.
- Problem in accessing swebhdfs - posted by Sourav Mazumder <so...@gmail.com> on 2016/09/04 22:25:45 UTC, 1 replies.
- RE: Why does spark take so much time for simple task without calculation? - posted by "Xie, Feng" <FX...@StateStreet.com> on 2016/09/05 02:24:12 UTC, 4 replies.
- Any method to set DataFrame's name on the web UI [ Storage Tab] ? - posted by "Taotao.Li" <ch...@gmail.com> on 2016/09/05 06:00:53 UTC, 1 replies.
- How to detect when a JavaSparkContext gets stopped - posted by "Hough, Stephen C" <St...@sc.com> on 2016/09/05 06:30:03 UTC, 2 replies.
- Re: Is there anyway Spark UI is set to poll and refreshes itself - posted by Mich Talebzadeh <mi...@gmail.com> on 2016/09/05 09:54:09 UTC, 0 replies.
- Unable to get raw probabilities after clearing model threshold - posted by kundan kumar <ii...@gmail.com> on 2016/09/05 10:28:51 UTC, 1 replies.
- [SparkSQL+SparkStreaming]SparkStreaming APP can not load data into SparkSQL table - posted by lu...@sina.com on 2016/09/05 10:55:21 UTC, 0 replies.
- 回复：[SparkSQL+SparkStreaming]SparkStreaming APP can not load data into SparkSQL table - posted by lu...@sina.com on 2016/09/05 11:00:31 UTC, 0 replies.
- Re[8]: Spark 2.0: SQL runs 5x times slower when adding 29th field to aggregation. - posted by Сергей Романов <ro...@inbox.ru.INVALID> on 2016/09/05 11:21:36 UTC, 2 replies.
- SPARK ML- Feature Selection Techniques - posted by Bahubali Jain <ba...@gmail.com> on 2016/09/05 12:31:08 UTC, 1 replies.
- Splitting columns from a text file - posted by Ashok Kumar <as...@yahoo.com.INVALID> on 2016/09/05 12:48:50 UTC, 10 replies.
- Real Time Recommendation Engines with Spark and Scala - posted by Mich Talebzadeh <mi...@gmail.com> on 2016/09/05 13:41:27 UTC, 5 replies.
- Spark 2.0.0 Thrift Server problem with Hive metastore - posted by "Campagnola, Francesco" <Fr...@anritsu.com> on 2016/09/05 15:25:31 UTC, 4 replies.
- Cassandra timestamp to spark Date field - posted by Selvam Raman <se...@gmail.com> on 2016/09/05 19:29:19 UTC, 0 replies.
- Spark ML 2.1.0 new features - posted by janardhan shetty <ja...@gmail.com> on 2016/09/05 20:50:22 UTC, 3 replies.
- Any estimate for a Spark 2.0.1 release date? - posted by mhornbech <mo...@datasolvr.com> on 2016/09/05 22:42:41 UTC, 2 replies.
- Spark Metrics: custom source/sink configurations not getting recognized - posted by map reduced <k3...@gmail.com> on 2016/09/06 03:30:50 UTC, 4 replies.
- Dataframe, Java: How to convert String to Vector ? - posted by "颜发才 (Yan Facai)" <ya...@gmail.com> on 2016/09/06 06:56:04 UTC, 8 replies.
- Consuming parquet files built with version 1.8.1 - posted by Dinesh Narayanan <nd...@gmail.com> on 2016/09/06 07:34:14 UTC, 0 replies.
- How to make the result of sortByKey distributed evenly? - posted by "Zhang, Liyun" <li...@intel.com> on 2016/09/06 08:13:04 UTC, 2 replies.
- How to convert String to Vector ? - posted by "颜发才 (Yan Facai)" <ya...@gmail.com> on 2016/09/06 09:56:46 UTC, 1 replies.
- clear steps for installation of spark, cassandra and cassandra connector to run on spyder 2.3.7 using python 3.5 and anaconda 2.4 ipython 4.0 - posted by muhammet pakyürek <mp...@hotmail.com> on 2016/09/06 10:34:25 UTC, 1 replies.
- [Spark-Submit:]Error while reading from s3n - posted by Divya Gehlot <di...@gmail.com> on 2016/09/06 11:11:42 UTC, 0 replies.
- [Spark submit] getting error when use properties file parameter in spark submit - posted by Divya Gehlot <di...@gmail.com> on 2016/09/06 11:15:26 UTC, 2 replies.
- Spark Checkpoint for JDBC/ODBC - posted by Selvam Raman <se...@gmail.com> on 2016/09/06 11:31:27 UTC, 0 replies.
- LabeledPoint creation - posted by Madabhattula Rajesh Kumar <mr...@gmail.com> on 2016/09/06 12:10:55 UTC, 4 replies.
- Total memory of workers - posted by tan shai <ta...@gmail.com> on 2016/09/06 13:39:41 UTC, 0 replies.
- distribute work (files) - posted by Lydia Ickler <ic...@googlemail.com> on 2016/09/06 14:51:48 UTC, 9 replies.
- spark 1.6.0 web console shows running application in a "waiting" status, but it's acutally running - posted by sarlindo <sa...@hotmail.com> on 2016/09/06 15:15:10 UTC, 6 replies.
- anyone know what the status of spark-ec2 is? - posted by Andy Davidson <An...@SantaCruzIntegration.com> on 2016/09/06 16:15:53 UTC, 0 replies.
- YARN memory overhead settings - posted by Tim Moran <ti...@privitar.com> on 2016/09/06 16:23:24 UTC, 1 replies.
- Spray Client VS PlayWS vs Spring RestTemplate within Spark Job - posted by prosp4300 <pr...@163.com> on 2016/09/06 16:23:39 UTC, 0 replies.
- Spark 1.6.0 web console shows a running application in a "waiting" status, but it's actually running. Is this an existing bug? - posted by sarlindo <sa...@hotmail.com> on 2016/09/06 16:33:36 UTC, 0 replies.
- Re: Spark metrics when running with YARN? - posted by Vladimir Tretyakov <vl...@sematext.com> on 2016/09/06 16:38:05 UTC, 9 replies.
- Datasets and Partitioners - posted by Darin McBeath <dd...@yahoo.com.INVALID> on 2016/09/06 18:13:22 UTC, 0 replies.
- Re: Using spark package XGBoost - posted by janardhan shetty <ja...@gmail.com> on 2016/09/06 18:26:45 UTC, 1 replies.
- Complex RDD operation as DataFrame UDF ? - posted by Thunder Stumpges <th...@gmail.com> on 2016/09/06 18:28:39 UTC, 1 replies.
- I noticed LinearRegression sometimes produces negative R^2 values - posted by evanzamir <za...@gmail.com> on 2016/09/06 19:49:45 UTC, 5 replies.
- Re: Is it possible to submit Spark Application remotely? - posted by neil90 <ne...@icloud.com> on 2016/09/06 20:24:38 UTC, 1 replies.
- Getting figures from spark streaming - posted by Ashok Kumar <as...@yahoo.com.INVALID> on 2016/09/06 20:31:08 UTC, 2 replies.
- Q: Multiple spark streaming app, one kafka topic, same consumer group - posted by Mariano Semelman <ma...@despegar.com> on 2016/09/06 20:51:13 UTC, 1 replies.
- Difference between UDF and Transformer in Spark ML - posted by janardhan shetty <ja...@gmail.com> on 2016/09/06 20:54:58 UTC, 0 replies.
- Getting memory error when starting spark shell but not often - posted by Divya Gehlot <di...@gmail.com> on 2016/09/07 02:51:42 UTC, 1 replies.
- How to write data into CouchBase using Spark & Scala? - posted by "Devi P.V" <de...@gmail.com> on 2016/09/07 06:42:11 UTC, 2 replies.
- SparkStreaming is not working with SparkLauncher - posted by aditya barve <ad...@gmail.com> on 2016/09/07 06:42:58 UTC, 0 replies.
- call() function being called 3 times - posted by Kevin Tran <ke...@gmail.com> on 2016/09/07 07:30:37 UTC, 1 replies.
- Mesos coarse-grained problem with spark.shuffle.service.enabled - posted by Tamas Szuromi <ta...@odigeo.com.INVALID> on 2016/09/07 09:16:31 UTC, 1 replies.
- Re[10]: Spark 2.0: SQL runs 5x times slower when adding 29th field to aggregation. - posted by Сергей Романов <ro...@inbox.ru.INVALID> on 2016/09/07 09:33:17 UTC, 0 replies.
- dstream.foreachRDD iteration - posted by Ashok Kumar <as...@yahoo.com.INVALID> on 2016/09/07 10:39:58 UTC, 3 replies.
- No SparkR on Mesos? - posted by Peter Griessl <gr...@ihs.ac.at> on 2016/09/07 12:02:43 UTC, 5 replies.
- How to find the partitioner for a Dataset - posted by Darin McBeath <dd...@yahoo.com> on 2016/09/07 12:19:13 UTC, 0 replies.
- Failed to open native connection to Cassandra at - posted by muhammet pakyürek <mp...@hotmail.com> on 2016/09/07 13:50:08 UTC, 0 replies.
- Managing Dataset API Partitions - Spark 2.0 - posted by ANDREA SPINA <74...@studenti.unimore.it> on 2016/09/07 15:04:05 UTC, 0 replies.
- Re: Spark Java Heap Error - posted by neil90 <ne...@icloud.com> on 2016/09/07 17:52:07 UTC, 7 replies.
- Split RDD by key and save to different files - posted by Vikash Kumar <vi...@gmail.com> on 2016/09/07 17:58:06 UTC, 1 replies.
- Error while storing datetime read from MySQL back to MySQL - posted by Dhaval Patel <ma...@gmail.com> on 2016/09/07 18:14:53 UTC, 0 replies.
- Re: Spark 2.0 with Kafka 0.10 exception - posted by Srikanth <sr...@gmail.com> on 2016/09/07 19:02:48 UTC, 5 replies.
- Stackoverflow with a huge stack trace in Spark 1.6.1 - posted by N B <nb...@gmail.com> on 2016/09/07 21:25:46 UTC, 0 replies.
- collect_set without nulls (1.6 vs 2.0) - posted by Lee Becker <le...@hapara.com> on 2016/09/07 21:58:59 UTC, 0 replies.
- year out of range - posted by Daniel Lopes <da...@onematch.com.br> on 2016/09/07 23:37:54 UTC, 8 replies.
- Forecasting algorithms in spark ML - posted by Madabhattula Rajesh Kumar <mr...@gmail.com> on 2016/09/08 04:30:27 UTC, 2 replies.
- weightCol doesn't seem to be handled properly in PySpark - posted by evanzamir <za...@gmail.com> on 2016/09/08 05:50:20 UTC, 2 replies.
- How does wholeTextFiles() work in Spark-Hadoop Cluster? - posted by Nisha Menon <ni...@gmail.com> on 2016/09/08 06:58:41 UTC, 4 replies.
- Re: How to convert an ArrayType to DenseVector within DataFrame? - posted by Nick Pentreath <ni...@gmail.com> on 2016/09/08 07:12:20 UTC, 0 replies.
- pyspakr 1.5.0 boradcast join - posted by pseudo oduesp <ps...@gmail.com> on 2016/09/08 07:59:50 UTC, 1 replies.
- Calling udf in Spark - posted by Divya Gehlot <di...@gmail.com> on 2016/09/08 08:55:01 UTC, 1 replies.
- Error while calling udf Spark submit - posted by Divya Gehlot <di...@gmail.com> on 2016/09/08 10:15:57 UTC, 1 replies.
- Spark yarn use IP instead hostname - posted by 李剑 <hu...@gmail.com> on 2016/09/08 10:26:24 UTC, 0 replies.
- Creating HiveContext withing Spark streaming - posted by Mich Talebzadeh <mi...@gmail.com> on 2016/09/08 11:28:21 UTC, 3 replies.
- Will be there any ml.linalg.distributed? - posted by Boris Schminke <sc...@gmail.com> on 2016/09/08 12:10:49 UTC, 0 replies.
- Posting selected rows of Spark streaming data to Hive table - posted by Mich Talebzadeh <mi...@gmail.com> on 2016/09/08 17:27:26 UTC, 0 replies.
- Returning DataFrame as Scala method return type - posted by Ashish Tadose <as...@gmail.com> on 2016/09/08 17:35:24 UTC, 2 replies.
- "Job duration" and "Processing time" don't match - posted by Srikanth <sr...@gmail.com> on 2016/09/08 19:31:44 UTC, 0 replies.
- spark-xml to avro - SchemaParseException: Can't redefine - posted by Arun Patel <ar...@gmail.com> on 2016/09/08 21:31:15 UTC, 2 replies.
- spark streaming kafka connector questions - posted by Cheng Yi <ph...@gmail.com> on 2016/09/08 21:44:08 UTC, 5 replies.
- Graphhopper/routing in Spark - posted by kodonnell <ka...@datamine.com> on 2016/09/08 21:45:05 UTC, 2 replies.
- Access application-jar name within main method. - posted by "sagarcasual ." <sa...@gmail.com> on 2016/09/08 22:27:45 UTC, 0 replies.
- Spark 2 does not recognize CURRENT_TIMESTAMP of Hive 2.0 - posted by Mich Talebzadeh <mi...@gmail.com> on 2016/09/08 23:18:21 UTC, 0 replies.
- Video analytics on SPark - posted by Priya Ch <le...@gmail.com> on 2016/09/09 09:16:54 UTC, 0 replies.
- iterating over DataFrame Partitions sequentially - posted by sujeet jog <su...@gmail.com> on 2016/09/09 10:29:33 UTC, 3 replies.
- Does it run distributed if class not Serializable - posted by Yusuf Can Gürkan <yu...@useinsider.com> on 2016/09/09 10:47:32 UTC, 2 replies.
- pyspark persist MEMORY_ONLY vs MEMORY_AND_DISK - posted by Ben Leslie <be...@benno.id.au> on 2016/09/09 12:01:32 UTC, 2 replies.
- Get spark metrics in code - posted by Han JU <ju...@gmail.com> on 2016/09/09 12:20:04 UTC, 1 replies.
- add jars like spark-csv to ipython notebook with pyspakr - posted by pseudo oduesp <ps...@gmail.com> on 2016/09/09 12:55:03 UTC, 1 replies.
- spark-deployer 3.0.1 released - posted by pishen tsai <pi...@gmail.com> on 2016/09/09 16:07:37 UTC, 1 replies.
- spark nightly builds with Hadoop 2.7 - posted by Joseph Naegele <jn...@grierforensics.com> on 2016/09/09 16:47:47 UTC, 0 replies.
- Spark + Parquet + IBM Block Storage at Bluemix - posted by Daniel Lopes <da...@onematch.com.br> on 2016/09/09 16:56:06 UTC, 6 replies.
- Assign values to existing column in SparkR - posted by xingye <xi...@hotmail.com> on 2016/09/09 17:29:22 UTC, 2 replies.
- SparkR error: reference is ambiguous. - posted by xingye <tr...@gmail.com> on 2016/09/09 17:33:42 UTC, 2 replies.
- questions about using dapply - posted by xingye <tr...@gmail.com> on 2016/09/09 17:35:32 UTC, 2 replies.
- accessing spark packages through proxy - posted by "Ulanov, Alexander" <al...@hpe.com> on 2016/09/09 17:37:31 UTC, 0 replies.
- Using sparkContext.stop() - posted by Bruno Faria <br...@hotmail.com> on 2016/09/09 17:45:20 UTC, 1 replies.
- Approximate Nearest Neighbors (ann) for Scala Spark - posted by "Kim, Min-Seok" <ms...@gmail.com> on 2016/09/09 19:21:16 UTC, 0 replies.
- scalable-deeplearning 1.0.0 released - posted by "Ulanov, Alexander" <al...@hpe.com> on 2016/09/09 19:25:45 UTC, 0 replies.
- Spark with S3 DirectOutputCommitter - posted by Srikanth <sr...@gmail.com> on 2016/09/09 20:54:58 UTC, 3 replies.
- classpath conflict with spark internal libraries and the spark shell. - posted by Colin Kincaid Williams <di...@uw.edu> on 2016/09/09 21:23:33 UTC, 3 replies.
- Spark Memory Allocation Exception - posted by Sunil Tripathy <su...@gmail.com> on 2016/09/09 22:42:04 UTC, 0 replies.
- Streaming Backpressure with Multiple Streams - posted by Jeff Nadler <jn...@srcginc.com> on 2016/09/09 23:41:41 UTC, 3 replies.
- SparkSQL DAG generation , DAG optimization , DAG execution - posted by Rabin Banerjee <de...@gmail.com> on 2016/09/10 05:21:47 UTC, 3 replies.
- SparkR API problem with subsetting distributed data frame - posted by Bene <be...@outlook.com> on 2016/09/10 08:44:31 UTC, 4 replies.
- Spark CSV skip lines - posted by Selvam Raman <se...@gmail.com> on 2016/09/10 09:14:13 UTC, 3 replies.
- Reading a TSV file - posted by Muhammad Asif Abbasi <as...@gmail.com> on 2016/09/10 11:30:37 UTC, 10 replies.
- java.io.IOException: FAILED_TO_UNCOMPRESS(5) - posted by 齐忠 <ce...@gmail.com> on 2016/09/10 14:08:44 UTC, 1 replies.
- Spark CSV output - posted by KhajaAsmath Mohammed <md...@gmail.com> on 2016/09/10 15:04:08 UTC, 2 replies.
- Spark_JDBC_Partitions - posted by Ajay Chander <it...@gmail.com> on 2016/09/10 15:20:16 UTC, 9 replies.
- Problems with Reading CSV Files - Java - Eclipse - posted by Irfan Kabli <ir...@gmail.com> on 2016/09/10 15:50:27 UTC, 1 replies.
- Spark using my last job resources and jar files - posted by nagaraj <na...@gmail.com> on 2016/09/10 16:22:28 UTC, 0 replies.
- Not sure why Filter on DStream doesn't get invoked? - posted by kant kodali <ka...@gmail.com> on 2016/09/10 20:17:04 UTC, 0 replies.
- Why is Spark getting Kafka data out from port 2181 ? - posted by Eric Ho <er...@analyticsmd.com> on 2016/09/10 20:44:45 UTC, 1 replies.
- Selecting the top 100 records per group by? - posted by Kevin Burton <bu...@spinn3r.com> on 2016/09/11 01:04:18 UTC, 8 replies.
- "Too many elements to create a power set" on Elasticsearch - posted by Kevin Burton <bu...@spinn3r.com> on 2016/09/11 17:21:14 UTC, 0 replies.
- Using Zeppelin with Spark FP - posted by Mich Talebzadeh <mi...@gmail.com> on 2016/09/11 21:12:26 UTC, 10 replies.
- GraphX drawing algorithm - posted by agc studio <ag...@gmail.com> on 2016/09/11 23:59:20 UTC, 1 replies.
- Spark Save mode "Overwrite" -Lock wait timeout exceeded; try restarting transaction Error - posted by Subhajit Purkayastha <sp...@p3si.net> on 2016/09/12 04:08:53 UTC, 0 replies.
- Access HDFS within Spark Map Operation - posted by Saliya Ekanayake <es...@gmail.com> on 2016/09/12 04:14:05 UTC, 10 replies.
- Spark word count program , need help on integration - posted by gobi s <go...@gmail.com> on 2016/09/12 09:32:16 UTC, 0 replies.
- Unsubscribe - posted by bi...@gmail.com on 2016/09/12 09:53:40 UTC, 0 replies.
- Small files - posted by ayan guha <gu...@gmail.com> on 2016/09/12 10:39:21 UTC, 3 replies.
- Spark tasks blockes randomly on standalone cluster - posted by bogdanbaraila <bo...@gmail.com> on 2016/09/12 12:31:40 UTC, 2 replies.
- 回复：Re: Selecting the top 100 records per group by? - posted by lu...@sina.com on 2016/09/12 14:37:44 UTC, 0 replies.
- Debugging a spark application in a none lazy mode - posted by Hagai <ha...@akamai.com> on 2016/09/12 15:44:53 UTC, 3 replies.
- Partition n keys into exacly n partitions - posted by sujeet jog <su...@gmail.com> on 2016/09/12 16:44:39 UTC, 2 replies.
- How to know how are the slaves for an application - posted by Xiaoye Sun <su...@gmail.com> on 2016/09/12 17:47:18 UTC, 0 replies.
- LDA spark ML visualization - posted by janardhan shetty <ja...@gmail.com> on 2016/09/12 18:45:03 UTC, 1 replies.
- Strings not converted when calling Scala code from a PySpark app - posted by Alexis Seigneurin <as...@ippon.fr> on 2016/09/12 20:47:40 UTC, 2 replies.
- Check if a nested column exists in DataFrame - posted by Arun Patel <ar...@gmail.com> on 2016/09/12 21:28:55 UTC, 1 replies.
- Zeppelin patterns with the streaming data - posted by Mich Talebzadeh <mi...@gmail.com> on 2016/09/12 21:43:13 UTC, 4 replies.
- unsubscribe - posted by "ChangMingMin (常明敏)" <ch...@founder.com> on 2016/09/13 03:38:06 UTC, 3 replies.
- [Erorr:]vieiwng Web UI on EMR cluster - posted by Divya Gehlot <di...@gmail.com> on 2016/09/13 03:58:11 UTC, 6 replies.
- Ways to check Spark submit running - posted by Divya Gehlot <di...@gmail.com> on 2016/09/13 06:17:38 UTC, 1 replies.
- Any viable DATEDIFF function in Spark/Scala - posted by Mich Talebzadeh <mi...@gmail.com> on 2016/09/13 11:28:22 UTC, 0 replies.
- Spark SQL - Actions and Transformations - posted by brccosta <br...@gmail.com> on 2016/09/13 12:27:38 UTC, 0 replies.
- Unable to compare SparkSQL Date columns - posted by Praseetha <pr...@gmail.com> on 2016/09/13 12:54:48 UTC, 5 replies.
- Character encoding corruption in Spark JDBC connector - posted by Mark Bittmann <mb...@gmail.com> on 2016/09/13 13:18:59 UTC, 2 replies.
- Spark Streaming - dividing DStream into mini batches - posted by DandyDev <de...@gmail.com> on 2016/09/13 13:25:20 UTC, 8 replies.
- Fetching Hive table data from external cluster - posted by Satish Chandra J <js...@gmail.com> on 2016/09/13 13:59:18 UTC, 0 replies.
- Master OOM in "master-rebuild-ui-thread" while running stream app - posted by Mariano Semelman <ma...@despegar.com> on 2016/09/13 14:18:06 UTC, 2 replies.
- Spark SQL - Applying transformation on a struct inside an array - posted by Olivier Girardot <o....@lateral-thoughts.com> on 2016/09/13 15:08:47 UTC, 4 replies.
- What's the best way to find the nearest neighbor in Spark? Any windowing function? - posted by Mobius ReX <ao...@gmail.com> on 2016/09/13 17:18:00 UTC, 6 replies.
- Spark 2.0.0 won't let you create a new SparkContext? - posted by Kevin Burton <bu...@spinn3r.com> on 2016/09/13 17:49:58 UTC, 7 replies.
- Spark SQL Thriftserver - posted by Benjamin Kim <bb...@gmail.com> on 2016/09/13 21:32:54 UTC, 6 replies.
- Spark kafka integration issues - posted by Mukesh Jha <me...@gmail.com> on 2016/09/13 23:46:01 UTC, 3 replies.
- KafkaUtils.createDirectStream() with kafka topic expanded - posted by vinay gupta <vi...@yahoo.com.INVALID> on 2016/09/14 00:13:49 UTC, 1 replies.
- Using Spark SQL to Create JDBC Tables - posted by Benjamin Kim <bb...@gmail.com> on 2016/09/14 01:08:28 UTC, 3 replies.
- Shuffle Spill (Memory) greater than Shuffle Spill (Disk) - posted by prayag chandran <pr...@gmail.com> on 2016/09/14 01:25:40 UTC, 0 replies.
- Can I assign affinity for spark executor processes? - posted by Xiaoye Sun <su...@gmail.com> on 2016/09/14 02:53:37 UTC, 3 replies.
- how to specify cores and executor to run spark jobs simultaneously - posted by Divya Gehlot <di...@gmail.com> on 2016/09/14 06:07:36 UTC, 1 replies.
- Re: Spark stalling during shuffle (maybe a memory issue) - posted by bogdanbaraila <bo...@gmail.com> on 2016/09/14 07:41:18 UTC, 0 replies.
- Spark Interview questions - posted by Ashok Kumar <as...@yahoo.com.INVALID> on 2016/09/14 11:35:13 UTC, 2 replies.
- Efficiently write a Dataframe to Text file(Spark Version 1.6.1) - posted by sanat kumar Patnaik <pa...@gmail.com> on 2016/09/14 11:46:19 UTC, 6 replies.
- Anyone got a good solid example of integrating Spark and Solr - posted by Nkechi Achara <nk...@googlemail.com> on 2016/09/14 11:52:25 UTC, 1 replies.
- Add sqldriver.jar to Spark 1.6.0 executors - posted by Kevin Tran <ke...@gmail.com> on 2016/09/14 12:42:32 UTC, 1 replies.
- Error casting from data frame to case class object - posted by franz_butterbaum <ar...@gmail.com> on 2016/09/14 13:48:29 UTC, 0 replies.
- t it does not stop at breakpoints which is in an anonymous function - posted by chen yong <cy...@hotmail.com> on 2016/09/14 14:26:11 UTC, 0 replies.
- Sqoop on Spark - posted by KhajaAsmath Mohammed <md...@gmail.com> on 2016/09/14 14:31:35 UTC, 1 replies.
- Re: t it does not stop at breakpoints which is in an anonymous function - posted by Dirceu Semighini Filho <di...@gmail.com> on 2016/09/14 14:33:42 UTC, 0 replies.
- 答复: t it does not stop at breakpoints which is in an anonymous function - posted by chen yong <cy...@hotmail.com> on 2016/09/14 14:43:53 UTC, 0 replies.
- 答复: 答复: t it does not stop at breakpoints which is in an anonymous function - posted by chen yong <cy...@hotmail.com> on 2016/09/14 15:20:13 UTC, 0 replies.
- Reading the most recent text files created by Spark streaming - posted by Mich Talebzadeh <mi...@gmail.com> on 2016/09/14 15:28:39 UTC, 2 replies.
- ACID transactions on data added from Spark not working - posted by Jack Wenger <ja...@gmail.com> on 2016/09/14 16:47:55 UTC, 1 replies.
- The coming data on Spark Streaming - posted by pcandido <pc...@gmail.com> on 2016/09/14 17:40:52 UTC, 1 replies.
- Best Practices for Spark-Python App Deployment - posted by RK Aduri <rk...@collectivei.com> on 2016/09/14 18:02:50 UTC, 0 replies.
- Streaming - lookup against reference data - posted by Tom Davis <ma...@gmail.com> on 2016/09/14 18:44:43 UTC, 2 replies.
- CPU Consumption of spark process - posted by شجاع الرحمن بیگ <sh...@gmail.com> on 2016/09/14 19:11:41 UTC, 0 replies.
- LIVY VS Spark Job Server - posted by SamyaMaiti <sa...@gmail.com> on 2016/09/14 19:32:37 UTC, 1 replies.
- RMSE in ALS - posted by Pasquinell Urbani <pa...@exalitica.com> on 2016/09/14 19:33:28 UTC, 5 replies.
- Not all KafkaReceivers processing the data Why? - posted by Rachana Srivastava <Ra...@markmonitor.com> on 2016/09/14 19:33:38 UTC, 3 replies.
- Please assist: migrating RandomForestExample from MLLib to ML - posted by Marco Mistroni <mm...@gmail.com> on 2016/09/14 21:18:59 UTC, 2 replies.
- Job Opportunity - posted by datajunkie <ap...@gmail.com> on 2016/09/14 22:35:11 UTC, 1 replies.
- Re: Write to Cassandra table from pyspark fails with scala reflect error - posted by Trivedi Amit <am...@yahoo.com.INVALID> on 2016/09/15 02:44:41 UTC, 4 replies.
- Spark job failing with Adjusted frame length exceeds 2147483647: 2222367317 - discarded - posted by Trinadh Kaja <kt...@gmail.com> on 2016/09/15 08:18:47 UTC, 0 replies.
- Best way to present data collected by Flume through Spark - posted by Mich Talebzadeh <mi...@gmail.com> on 2016/09/15 08:35:29 UTC, 8 replies.
- Spark processing Multiple Streams from a single stream - posted by Udbhav Agarwal <ud...@syncoms.com> on 2016/09/15 10:11:06 UTC, 6 replies.
- Total Shuffle Read and Write Size of Spark workload - posted by Cristina Rozee <ro...@gmail.com> on 2016/09/15 11:40:47 UTC, 7 replies.
- Re: Spark job within Web application - posted by rahulkumar-aws <ra...@gmail.com> on 2016/09/15 11:41:28 UTC, 0 replies.
- Partition RDD based on K-Means Clusters - posted by Punit Naik <na...@gmail.com> on 2016/09/15 13:57:12 UTC, 0 replies.
- 答复: 答复: 答复: t it does not stop at breakpoints which is in an anonymous function - posted by chen yong <cy...@hotmail.com> on 2016/09/15 14:41:08 UTC, 1 replies.
- Re: Write to Cassandra table from pyspark fails with scala reflect error [RESOLVED] - posted by Trivedi Amit <am...@yahoo.com.INVALID> on 2016/09/15 14:50:42 UTC, 0 replies.
- Spark Streaming-- for each new file in HDFS - posted by "Kappaganthu, Sivaram (ES)" <Si...@ADP.com> on 2016/09/15 17:00:39 UTC, 3 replies.
- countApprox - posted by Stefano Lodi <st...@unibo.it> on 2016/09/15 17:20:34 UTC, 2 replies.
- Missing output partition file in S3 - posted by "Chen, Kevin" <Ke...@neustar.biz> on 2016/09/15 18:37:01 UTC, 6 replies.
- Re: Guide Step by step Stark streaming - posted by rahulkumar-aws <ra...@gmail.com> on 2016/09/15 20:30:38 UTC, 1 replies.
- Issues while running MLlib matrix factorization ALS algorithm - posted by Roshani Nagmote <ro...@gmail.com> on 2016/09/15 21:00:35 UTC, 7 replies.
- 答复: 答复: 答复: 答复: t it does not stop at breakpoints which is in an anonymous function - posted by chen yong <cy...@hotmail.com> on 2016/09/16 01:23:18 UTC, 1 replies.
- Impersonate users using the same SparkContext - posted by gsvigruha <ge...@lynxanalytics.com> on 2016/09/16 03:43:49 UTC, 1 replies.
- Re: very slow parquet file write - posted by "tosaiganesh@gmail.com" <to...@gmail.com> on 2016/09/16 08:11:23 UTC, 0 replies.
- Hive api vs Dataset api - posted by "igor.berman" <ig...@gmail.com> on 2016/09/16 11:27:26 UTC, 0 replies.
- Spark can't connect to secure phoenix - posted by Ashish Gupta <as...@citiustech.com> on 2016/09/16 13:21:29 UTC, 0 replies.
- 答复: it does not stop at breakpoints which is in an anonymous function - posted by chen yong <cy...@hotmail.com> on 2016/09/16 14:18:44 UTC, 1 replies.
- Re: 答复: it does not stop at breakpoints which is in an anonymous function - posted by Dirceu Semighini Filho <di...@gmail.com> on 2016/09/16 14:23:52 UTC, 1 replies.
- Re: App works, but executor state is "killed" - posted by satishl <sa...@gmail.com> on 2016/09/16 16:57:37 UTC, 0 replies.
- JDBC Very Slow - posted by Benjamin Kim <bb...@gmail.com> on 2016/09/16 18:26:02 UTC, 3 replies.
- How PolynomialExpansion works - posted by Nirav Patel <np...@xactlycorp.com> on 2016/09/16 18:43:56 UTC, 1 replies.
- Error trying to connect to Hive from Spark (Yarn-Cluster Mode) - posted by an...@daimler.com on 2016/09/16 18:53:42 UTC, 5 replies.
- Apache Spark 2.0.0 on Microsoft Windows Create Dataframe - posted by Advait Mohan Raut <ad...@essexlg.com> on 2016/09/16 19:47:55 UTC, 1 replies.
- feasibility of ignite and alluxio for interfacing MPI and Spark - posted by AlexG <sw...@gmail.com> on 2016/09/17 01:08:35 UTC, 1 replies.
- Spark output data to S3 is very slow - posted by Qiang Li <ql...@appannie.com> on 2016/09/17 02:34:42 UTC, 2 replies.
- Can not control bucket files number if it was speficed - posted by Qiang Li <ql...@appannie.com> on 2016/09/17 12:59:35 UTC, 5 replies.
- Is there such thing as cache fusion with the underlying tables/files on HDFS - posted by Mich Talebzadeh <mi...@gmail.com> on 2016/09/17 16:53:28 UTC, 9 replies.
- DataFrame defined within conditional IF ELSE statement - posted by Mich Talebzadeh <mi...@gmail.com> on 2016/09/17 20:18:35 UTC, 4 replies.
- take() works on RDD but .write.json() does not work in 2.0.0 - posted by Kevin Burton <bu...@spinn3r.com> on 2016/09/17 21:42:43 UTC, 2 replies.
- NoSuchField Error : INSTANCE specify user defined httpclient jar - posted by "sagarcasual ." <sa...@gmail.com> on 2016/09/18 00:33:35 UTC, 1 replies.
- Re: Recovered state for updateStateByKey and incremental streams processing - posted by manasdebashiskar <po...@gmail.com> on 2016/09/18 02:49:40 UTC, 0 replies.
- How many are there PySpark Windows users? - posted by Hyukjin Kwon <gu...@gmail.com> on 2016/09/18 06:42:34 UTC, 0 replies.
- 答复: 答复: it does not stop at breakpoints which is in an anonymous function - posted by chen yong <cy...@hotmail.com> on 2016/09/18 09:47:14 UTC, 1 replies.
- Re: filling missing values in a sequence - posted by sudhindra <sm...@gmail.com> on 2016/09/18 12:26:44 UTC, 8 replies.
- Lemmatization using StanfordNLP in ML 2.0 - posted by janardhan shetty <ja...@gmail.com> on 2016/09/18 18:01:40 UTC, 10 replies.
- study materials for operators on Dataframe - posted by "颜发才 (Yan Facai)" <ya...@gmail.com> on 2016/09/19 02:41:16 UTC, 1 replies.
- Getting empty values while receiving from kafka Spark streaming - posted by Sateesh Karuturi <sa...@gmail.com> on 2016/09/19 02:56:07 UTC, 2 replies.
- Is RankingMetrics' NDCG implementation correct? - posted by Jong Wook Kim <jo...@nyu.edu> on 2016/09/19 03:42:25 UTC, 5 replies.
- cassandra can not accessed via pyspark or spark-shell but it is accessible using cqlsh. what is the problem. - posted by muhammet pakyürek <mp...@hotmail.com> on 2016/09/19 06:01:43 UTC, 0 replies.
- true conf for sparkconf().set().setMaster() to connect to cassandra - posted by muhammet pakyürek <mp...@hotmail.com> on 2016/09/19 06:56:36 UTC, 0 replies.
- 1TB shuffle failed with executor lost failure - posted by Cyanny LIANG <lg...@gmail.com> on 2016/09/19 06:57:02 UTC, 1 replies.
- cassandra 3.7 is compatible with datastax Spark Cassandra Connector 2.0? - posted by muhammet pakyürek <mp...@hotmail.com> on 2016/09/19 07:37:57 UTC, 0 replies.
- best versions for cassandra spark connection - posted by muhammet pakyürek <mp...@hotmail.com> on 2016/09/19 08:11:30 UTC, 0 replies.
- Finding unique across all columns in dataset - posted by Abhishek Anand <ab...@gmail.com> on 2016/09/19 09:05:44 UTC, 5 replies.
- cassandra.yaml configuration for cassandra spark connection - posted by muhammet pakyürek <mp...@hotmail.com> on 2016/09/19 11:48:16 UTC, 0 replies.
- Spark to HBase Fast Bulk Upload - posted by Punit Naik <na...@gmail.com> on 2016/09/19 11:59:34 UTC, 1 replies.
- spark streaming slow checkpointing when calling Rserve - posted by "Piubelli, Manuel " <ma...@citi.com.INVALID> on 2016/09/19 12:11:01 UTC, 0 replies.
- Fwd: Write.df is failing on NFS and S3 based spark cluster - posted by Sankar Mittapally <sa...@creditvidya.com> on 2016/09/19 13:08:12 UTC, 0 replies.
- driver OOM - need recommended memory for driver - posted by Anand Viswanathan <an...@ymail.com.INVALID> on 2016/09/19 14:32:47 UTC, 4 replies.
- off heap to alluxio/tachyon in Spark 2 - posted by "aka.fe2s" <ak...@gmail.com> on 2016/09/19 14:56:52 UTC, 3 replies.
- Get profile from sbt - posted by "Saurabh Malviya (samalviy)" <sa...@cisco.com> on 2016/09/19 16:28:42 UTC, 1 replies.
- NumberFormatException: For input string: "0.00000" - posted by Mohamed ismail <mi...@yahoo.com.INVALID> on 2016/09/19 17:15:29 UTC, 2 replies.
- Spark Job not failing - posted by "tosaiganesh@gmail.com" <to...@gmail.com> on 2016/09/19 19:19:38 UTC, 3 replies.
- Spark DataFrame Join _ performance issues - posted by Subhajit Purkayastha <sp...@p3si.net> on 2016/09/19 19:28:51 UTC, 0 replies.
- Java Compatibity Problems when we install rJava - posted by "Arif,Mubaraka" <ar...@heb.com> on 2016/09/19 19:29:04 UTC, 2 replies.
- Re: Kinesis Receiver not respecting spark.streaming.receiver.maxRate - posted by "tosaiganesh@gmail.com" <to...@gmail.com> on 2016/09/19 20:06:04 UTC, 1 replies.
- Spark.1.6.1 on Apache Mesos : Log4j2 could not find a logging implementation - posted by "sagarcasual ." <sa...@gmail.com> on 2016/09/19 20:48:11 UTC, 0 replies.
- Similar Items - posted by Kevin Mellott <ke...@gmail.com> on 2016/09/19 20:49:01 UTC, 7 replies.
- very high maxresults setting (no collect()) - posted by Adrian Bridgett <ad...@opensignal.com> on 2016/09/19 21:05:50 UTC, 2 replies.
- Anyone used Zoomdata visual dashboard with Spark - posted by Mich Talebzadeh <mi...@gmail.com> on 2016/09/19 22:39:57 UTC, 0 replies.
- Sending extraJavaOptions for Spark 1.6.1 on mesos 0.28.2 in cluster mode - posted by "sagarcasual ." <sa...@gmail.com> on 2016/09/20 00:07:53 UTC, 1 replies.
- as.Date can't be applied to Spark data frame in SparkR - posted by xingye <xi...@hotmail.com> on 2016/09/20 02:22:17 UTC, 1 replies.
- it does not stop at the breakpoint line within an anonymous function concerning RDD - posted by chen yong <cy...@hotmail.com> on 2016/09/20 02:48:50 UTC, 1 replies.
- LDA and Maximum Iterations - posted by Frank Zhang <da...@yahoo.com.INVALID> on 2016/09/20 04:19:53 UTC, 2 replies.
- SPARK-10835 in 2.0 - posted by janardhan shetty <ja...@gmail.com> on 2016/09/20 05:10:40 UTC, 5 replies.
- write.df is failing on Spark Cluster - posted by sankarmittapally <sa...@creditvidya.com> on 2016/09/20 05:16:11 UTC, 9 replies.
- How to know WHO are the slaves for an application - posted by Xiaoye Sun <su...@gmail.com> on 2016/09/20 05:26:50 UTC, 0 replies.
- is there any bug for the configuration of spark 2.0 cassandra spark connector 2.0 and cassandra 3.0.8 - posted by muhammet pakyürek <mp...@hotmail.com> on 2016/09/20 05:47:08 UTC, 1 replies.
- Configuring Kinesis max records limit in KinesisReceiver - posted by Aravindh <ma...@aravindh.io> on 2016/09/20 05:49:16 UTC, 0 replies.
- cassandra and spark can be built and worked on the same computer? - posted by muhammet pakyürek <mp...@hotmail.com> on 2016/09/20 06:08:40 UTC, 0 replies.
- Task Deserialization Error - posted by "Chawla,Sumit " <su...@gmail.com> on 2016/09/20 06:15:50 UTC, 3 replies.
- java.lang.ClassCastException: optional binary element (UTF8) is not a group - posted by "Rajan, Naveen" <Na...@sony.com> on 2016/09/20 08:01:24 UTC, 1 replies.
- spark sql thrift server: driver OOM - posted by Young <zz...@163.com> on 2016/09/20 09:46:56 UTC, 0 replies.
- Convert RDD to JSON Rdd and append more information - posted by sujeet jog <su...@gmail.com> on 2016/09/20 13:12:21 UTC, 1 replies.
- Re: Continuous warning while consuming using new kafka-spark010 API - posted by Cody Koeninger <co...@koeninger.org> on 2016/09/20 14:57:45 UTC, 0 replies.
- Options for method createExternalTable - posted by CalumAtTheGuardian <ca...@theguardian.com> on 2016/09/20 16:04:49 UTC, 0 replies.
- Spark Tasks Taking Increasingly Longer - posted by Chris Jansen <ja...@gmail.com> on 2016/09/20 16:36:44 UTC, 0 replies.
- Dataset doesn't have partitioner after a repartition on one of the columns - posted by "McBeath, Darin W (ELS-STL)" <D....@elsevier.com> on 2016/09/20 18:22:26 UTC, 0 replies.
- Israel Spark Meetup - posted by Romi Kuntsman <ro...@gmail.com> on 2016/09/21 04:53:35 UTC, 1 replies.
- OutOfMemory while calculating window functions - posted by Jeremy Davis <je...@speakeasy.net> on 2016/09/21 05:26:53 UTC, 1 replies.
- How to write multiple outputs in avro format in spark(java)? - posted by Mahebub Sayyed <ma...@gmail.com> on 2016/09/21 08:01:34 UTC, 0 replies.
- unresolved dependency: datastax#spark-cassandra-connector;2.0.0-s_2.11-M3-20-g75719df: not found - posted by muhammet pakyürek <mp...@hotmail.com> on 2016/09/21 08:38:16 UTC, 1 replies.
- increase spark performance - posted by Trinadh Kaja <kt...@gmail.com> on 2016/09/21 11:33:42 UTC, 0 replies.
- SPARK PERFORMANCE TUNING - posted by Trinadh Kaja <kt...@gmail.com> on 2016/09/21 11:37:23 UTC, 2 replies.
- spark stream based deduplication - posted by backtrack5 <so...@live.com> on 2016/09/21 15:19:33 UTC, 1 replies.
- Apache Spark JavaRDD pipe() need help - posted by "shashikant.kulkarni@gmail.com" <sh...@gmail.com> on 2016/09/21 17:14:41 UTC, 4 replies.
- Bizarre behavior using Datasets/ML on Spark 2.0 - posted by Miles Crawford <mi...@allenai.org> on 2016/09/21 17:23:32 UTC, 0 replies.
- Re: problems with checkpoint and spark sql - posted by Dhimant <dh...@gmail.com> on 2016/09/21 17:36:25 UTC, 0 replies.
- Re: Sqoop vs spark jdbc - posted by Mich Talebzadeh <mi...@gmail.com> on 2016/09/21 18:13:57 UTC, 4 replies.
- Spark writing to elasticsearch asynchronously - posted by Sunita Arvind <su...@gmail.com> on 2016/09/21 19:09:24 UTC, 0 replies.
- How to use a custom filesystem provider? - posted by Jean-Philippe Martin <jp...@google.com.INVALID> on 2016/09/21 19:10:23 UTC, 2 replies.
- Off Heap (Tungsten) Memory Usage / Management ? - posted by Michael Segel <ms...@hotmail.com> on 2016/09/21 20:02:57 UTC, 0 replies.
- Equivalent to --files for driver? - posted by Everett Anderson <ev...@nuna.com.INVALID> on 2016/09/21 20:17:00 UTC, 1 replies.
- Re: Has anyone installed the scala kernel for Jupyter notebook - posted by Jakob Odersky <ja...@odersky.com> on 2016/09/21 21:54:42 UTC, 3 replies.
- Hbase Connection not seraializible in Spark -> foreachrdd - posted by KhajaAsmath Mohammed <md...@gmail.com> on 2016/09/21 22:34:53 UTC, 3 replies.
- Re: Off Heap (Tungsten) Memory Usage / Management ? - posted by Jörn Franke <jo...@gmail.com> on 2016/09/21 22:41:53 UTC, 9 replies.
- Spark Application Log - posted by Divya Gehlot <di...@gmail.com> on 2016/09/22 04:06:47 UTC, 1 replies.
- Using Spark as a Maven dependency but with Hadoop 2.6 - posted by Olivier Girardot <o....@lateral-thoughts.com> on 2016/09/22 06:05:26 UTC, 6 replies.
- Memory usage by Spark jobs - posted by Hemant Bhanawat <he...@gmail.com> on 2016/09/22 06:36:50 UTC, 1 replies.
- Open source Spark based projects - posted by tahirhn <ta...@icloud.com> on 2016/09/22 10:15:07 UTC, 4 replies.
- Is executor computing time affected by network latency? - posted by gusiri <dr...@gmail.com> on 2016/09/22 13:54:09 UTC, 5 replies.
- Spark RDD and Memory - posted by Aditya <ad...@augmentiq.co.in> on 2016/09/22 14:54:24 UTC, 5 replies.
- sqoop Imported and Hbase ImportTsv issue with Fled: No enum constant mapreduce.JobCounter.MB_MILLIS_MAPS - posted by Mich Talebzadeh <mi...@gmail.com> on 2016/09/22 16:34:11 UTC, 1 replies.
- Error while Spark 1.6.1 streaming from Kafka-2.11_0.10.0.1 cluster - posted by "sagarcasual ." <sa...@gmail.com> on 2016/09/22 18:37:23 UTC, 4 replies.
- pyspark ML example not working - posted by jypucca <jy...@gmail.com> on 2016/09/22 21:59:39 UTC, 1 replies.
- Re: spark stream on yarn oom - posted by manasdebashiskar <po...@gmail.com> on 2016/09/23 00:40:44 UTC, 0 replies.
- In Spark-scala, how to fill Vectors.dense in DataFrame from CSV? - posted by Dan Bikle <bi...@gmail.com> on 2016/09/23 01:40:47 UTC, 1 replies.
- Redshift Vs Spark SQL (Thrift) - posted by ayan guha <gu...@gmail.com> on 2016/09/23 04:09:10 UTC, 2 replies.
- How to specify file - posted by Sea <26...@qq.com> on 2016/09/23 06:56:29 UTC, 0 replies.
- Re: How to specify file - posted by Hemant Bhanawat <he...@gmail.com> on 2016/09/23 07:02:08 UTC, 2 replies.
- 回复： How to specify file - posted by Sea <26...@qq.com> on 2016/09/23 07:26:11 UTC, 0 replies.
- Spark Yarn Cluster with Reference File - posted by ABHISHEK <ab...@gmail.com> on 2016/09/23 07:33:18 UTC, 6 replies.
- UDAF collect_list: Hive Query or spark sql expression - posted by Jason Mop <cn...@gmail.com> on 2016/09/23 09:51:11 UTC, 0 replies.
- Tuning Spark memory - posted by tan shai <ta...@gmail.com> on 2016/09/23 12:06:45 UTC, 1 replies.
- ERROR StandaloneSchedulerBackend: Application has been killed. Reason: All masters are unresponsive! Giving up. - posted by muhammet pakyürek <mp...@hotmail.com> on 2016/09/23 12:07:09 UTC, 0 replies.
- Spark MLlib ALS algorithm - posted by Roshani Nagmote <ro...@gmail.com> on 2016/09/23 18:07:30 UTC, 2 replies.
- Can somebody remove this guy? - posted by Dirceu Semighini Filho <di...@gmail.com> on 2016/09/23 19:23:02 UTC, 0 replies.
- Optimal/Expected way to run demo spark-scala scripts? - posted by Dan Bikle <bi...@gmail.com> on 2016/09/23 19:37:55 UTC, 1 replies.
- databricks spark-csv: linking coordinates are what? - posted by Dan Bikle <bi...@gmail.com> on 2016/09/23 20:26:31 UTC, 2 replies.
- Running Spark master/slave instances in non Daemon mode - posted by Jeff Puro <jp...@mustwin.com> on 2016/09/23 22:21:15 UTC, 2 replies.
- With spark DataFrame, how to write to existing folder? - posted by Dan Bikle <bi...@gmail.com> on 2016/09/23 22:45:24 UTC, 1 replies.
- Error in run multiple unit test that extends DataFrameSuiteBase - posted by Jinyuan Zhou <zh...@gmail.com> on 2016/09/24 00:01:31 UTC, 0 replies.
- Re: Spark job fails as soon as it starts. Driver requested a total number of 168510 executor - posted by Yash Sharma <ya...@gmail.com> on 2016/09/24 00:27:55 UTC, 4 replies.
- ideas on de duplication for spark streaming? - posted by kant kodali <ka...@gmail.com> on 2016/09/24 06:49:44 UTC, 2 replies.
- Spark 1.6.2 Concurrent append to a HDFS folder with different partition key - posted by Shing Hing Man <ma...@yahoo.com.INVALID> on 2016/09/24 16:12:55 UTC, 0 replies.
- spark-submit failing but job running from scala ide - posted by vr spark <vr...@gmail.com> on 2016/09/25 06:36:34 UTC, 5 replies.
- Left Join Yields Results And Not Results - posted by Aaron Jackson <aj...@pobox.com> on 2016/09/25 06:46:45 UTC, 0 replies.
- How to use Spark-Scala to download a CSV file from the web? - posted by Dan Bikle <bi...@gmail.com> on 2016/09/25 08:27:25 UTC, 2 replies.
- In Spark-Scala, how to copy Array of Lists into new DataFrame? - posted by Dan Bikle <bi...@gmail.com> on 2016/09/25 11:57:19 UTC, 2 replies.
- udf forces usage of Row for complex types? - posted by Koert Kuipers <ko...@tresata.com> on 2016/09/25 21:41:52 UTC, 6 replies.
- Spark 2.0 Structured Streaming: sc.parallelize in foreach sink cause Task not serializable error - posted by Jianshi <js...@gmail.com> on 2016/09/25 22:20:15 UTC, 1 replies.
- Re: ArrayType support in Spark SQL - posted by Koert Kuipers <ko...@tresata.com> on 2016/09/25 22:27:51 UTC, 0 replies.
- Extract timestamp from Kafka message - posted by Kevin Tran <ke...@gmail.com> on 2016/09/25 22:59:26 UTC, 1 replies.
- Writing Dataframe to CSV yields blank file called "_SUCCESS" - posted by Peter Figliozzi <pe...@gmail.com> on 2016/09/26 02:56:56 UTC, 4 replies.
- MLib Documentation Update Needed - posted by Tobi Bosede <an...@gmail.com> on 2016/09/26 05:22:15 UTC, 2 replies.
- how to find NaN values of each row of spark dataframe to decide whether the rows is dropeed or not - posted by muhammet pakyürek <mp...@hotmail.com> on 2016/09/26 07:30:51 UTC, 3 replies.
- how to decide which part of process use spark dataframe and pandas dataframe? - posted by muhammet pakyürek <mp...@hotmail.com> on 2016/09/26 08:08:39 UTC, 1 replies.
- Can Spark Streaming 2.0 work with Kafka 0.10? - posted by Haopu Wang <HW...@qilinsoft.com> on 2016/09/26 08:40:53 UTC, 1 replies.
- Subscribe - posted by Lakshmi Rajagopalan <la...@indix.com> on 2016/09/26 09:11:54 UTC, 1 replies.
- Please unsubscribe me from this mailing list - posted by "Hogancamp, Aaron" <AA...@leidos.com> on 2016/09/26 14:36:51 UTC, 0 replies.
- SparkLauncher not receiving events - posted by Mariano Semelman <ma...@despegar.com> on 2016/09/26 14:37:48 UTC, 1 replies.
- increase efficiency of working with mongo and mysql database - posted by Yang Cao <cy...@gmail.com> on 2016/09/26 15:22:31 UTC, 0 replies.
- Running jobs against remote cluster from scala eclipse ide - posted by vr spark <vr...@gmail.com> on 2016/09/26 15:34:50 UTC, 1 replies.
- Pyspark ML - Unable to finish cross validation - posted by Simone <si...@gmail.com> on 2016/09/26 17:23:46 UTC, 0 replies.
- Non-linear regression of exponential form in Spark - posted by Cooper <ah...@gmail.com> on 2016/09/26 17:49:26 UTC, 0 replies.
- using SparkILoop.run - posted by Mohit Jaggi <mo...@gmail.com> on 2016/09/26 18:25:58 UTC, 1 replies.
- Native libraries using only one core in standalone spark cluster - posted by guangweiyu <gu...@mail.utoronto.ca> on 2016/09/26 18:27:29 UTC, 0 replies.
- Slow Shuffle Operation on Empty Batch - posted by Erwan ALLAIN <ea...@gmail.com> on 2016/09/26 21:10:24 UTC, 5 replies.
- Tutorial error - zeppelin 0.6.2 built with spark 2.0 and mapr - posted by Nirav Patel <np...@xactlycorp.com> on 2016/09/26 22:45:06 UTC, 1 replies.
- median of groups - posted by Peter Figliozzi <pe...@gmail.com> on 2016/09/27 00:52:50 UTC, 1 replies.
- Access Amazon s3 data - posted by Hitesh Goyal <hi...@nlpcaptcha.com> on 2016/09/27 01:50:33 UTC, 1 replies.
- Large-scale matrix inverse in Spark - posted by Cooper <ah...@gmail.com> on 2016/09/27 02:05:42 UTC, 4 replies.
- Newbie Q: Issue related to connecting Spark Master Standalone through Scala app - posted by Reth RM <re...@gmail.com> on 2016/09/27 05:59:27 UTC, 3 replies.
- why spark ml package doesn't contain svm algorithm - posted by hxw黄祥为 <hu...@Ctrip.com> on 2016/09/27 07:09:55 UTC, 0 replies.
- What is the difference between mini-batch vs real time streaming in practice (not theory)? - posted by kant kodali <ka...@gmail.com> on 2016/09/27 07:12:30 UTC, 4 replies.
- read multiple files - posted by Divya Gehlot <di...@gmail.com> on 2016/09/27 07:52:12 UTC, 2 replies.
- DataFrame Rejection Directory - posted by Mostafa Alaa Mohamed <mo...@etisalat.ae> on 2016/09/27 07:52:31 UTC, 0 replies.
- Re: why spark ml package doesn't contain svm algorithm - posted by Nick Pentreath <ni...@gmail.com> on 2016/09/27 10:59:02 UTC, 0 replies.
- Incremental model update - posted by Debasish Ghosh <gh...@gmail.com> on 2016/09/27 13:08:02 UTC, 1 replies.
- pyspark cluster mode on standalone deployment - posted by Ofer Eliassaf <of...@gmail.com> on 2016/09/27 13:38:16 UTC, 0 replies.
- Problems with new experimental Kafka Consumer for 0.10 - posted by Matthias Niehoff <ma...@codecentric.de> on 2016/09/27 14:37:47 UTC, 3 replies.
- Access S3 buckets in multiple accounts - posted by Daniel Siegmann <ds...@securityscorecard.io> on 2016/09/27 14:53:36 UTC, 5 replies.
- log4j custom properties for spark project - posted by KhajaAsmath Mohammed <md...@gmail.com> on 2016/09/27 15:57:47 UTC, 0 replies.
- Re: Pyspark not working on yarn-cluster mode - posted by ofer <of...@gmail.com> on 2016/09/27 17:09:10 UTC, 0 replies.
- ORC file stripe statistics in Spark - posted by Sudhir Babu Pothineni <sb...@gmail.com> on 2016/09/27 19:43:48 UTC, 0 replies.
- Question about single/multi-pass execution in Spark-2.0 dataset/dataframe - posted by Spark User <sp...@gmail.com> on 2016/09/27 20:02:25 UTC, 0 replies.
- Issue with rogue data in csv file used in Spark application - posted by Mich Talebzadeh <mi...@gmail.com> on 2016/09/27 20:49:16 UTC, 8 replies.
- Parquet compression jars not found - both snappy and lzo - PySpark 2.0.0 - posted by Russell Jurney <ru...@gmail.com> on 2016/09/27 21:47:03 UTC, 0 replies.
- Question about executor memory setting - posted by Dogtail L <sp...@gmail.com> on 2016/09/28 02:27:27 UTC, 4 replies.
- Help required in validating an architecture using Structured Streaming - posted by Aravindh <ma...@aravindh.io> on 2016/09/28 04:09:04 UTC, 0 replies.
- CGroups and Spark - posted by Harut <ha...@gmail.com> on 2016/09/28 04:43:21 UTC, 0 replies.
- Trying to fetch S3 data - posted by Hitesh Goyal <hi...@nlpcaptcha.com> on 2016/09/28 05:28:17 UTC, 1 replies.
- Spark Executor Lost issue - posted by Aditya <ad...@augmentiq.co.in> on 2016/09/28 06:47:29 UTC, 3 replies.
- Treadting NaN fields in Spark - posted by Mich Talebzadeh <mi...@gmail.com> on 2016/09/28 10:56:47 UTC, 8 replies.
- Re: Broadcast big dataset - posted by Takeshi Yamamuro <li...@gmail.com> on 2016/09/28 15:09:40 UTC, 0 replies.
- New to spark. - posted by Anirudh Muhnot <mu...@icloud.com> on 2016/09/28 16:11:06 UTC, 1 replies.
- Spark ML Decision Trees Algorithm - posted by janardhan shetty <ja...@gmail.com> on 2016/09/28 16:52:39 UTC, 5 replies.
- Spark Summit CfP Closes Sunday - posted by Jules Damji <dm...@comcast.net> on 2016/09/28 17:03:35 UTC, 0 replies.
- Re: Dataset doesn't have partitioner after a repartition on one of the columns - posted by Michael Armbrust <mi...@databricks.com> on 2016/09/28 18:26:25 UTC, 1 replies.
- Submit and Monitor standalone cluster application - posted by Mariano Semelman <ma...@despegar.com> on 2016/09/28 22:42:58 UTC, 1 replies.
- spark / mesos and GPU resources - posted by Jackie Tung <ja...@drive.ai> on 2016/09/28 23:48:10 UTC, 1 replies.
- Getting StackOverflowError in spark job some times - posted by ckanth99 <ck...@zoho.com> on 2016/09/29 05:00:15 UTC, 1 replies.
- spark persistence doubt - posted by Shushant Arora <sh...@gmail.com> on 2016/09/29 05:09:22 UTC, 1 replies.
- Spark Hive Rejection - posted by Mostafa Alaa Mohamed <mo...@etisalat.ae> on 2016/09/29 06:25:26 UTC, 1 replies.
- Need help :- org.apache.spark.SparkException :- No such file or directory - posted by Madabhattula Rajesh Kumar <mr...@gmail.com> on 2016/09/29 06:48:58 UTC, 0 replies.
- building runnable distribution from source - posted by AssafMendelson <as...@rsa.com> on 2016/09/29 08:08:38 UTC, 3 replies.
- spark sql on json - posted by Hitesh Goyal <hi...@nlpcaptcha.com> on 2016/09/29 09:58:58 UTC, 1 replies.
- spark listener do not get fail status - posted by Aseem Bansal <as...@gmail.com> on 2016/09/29 11:54:18 UTC, 1 replies.
- Re: S3 DirectParquetOutputCommitter + PartitionBy + SaveMode.Append - posted by "joffe.tal" <jo...@gmail.com> on 2016/09/29 12:28:13 UTC, 1 replies.
- mapWithState() without data checkpointing - posted by Alexey Kharlamov <ah...@gmail.com> on 2016/09/29 12:55:08 UTC, 0 replies.
- Architecture recommendations for a tricky use case - posted by Ali Akhtar <al...@gmail.com> on 2016/09/29 13:54:50 UTC, 29 replies.
- configure spark with openblas, thanks - posted by "TheGeorge1918 ." <zh...@gmail.com> on 2016/09/29 13:57:59 UTC, 0 replies.
- udf of aggregation in pyspark dataframe ? - posted by peng yu <yu...@gmail.com> on 2016/09/29 15:00:59 UTC, 3 replies.
- Fwd: todell@yahoo-inc.com is no longer with Yahoo! (was: Re: Treadting NaN fields in Spark) - posted by Michael Segel <ms...@hotmail.com> on 2016/09/29 16:02:52 UTC, 0 replies.
- spark streaming minimum batch interval - posted by Shushant Arora <sh...@gmail.com> on 2016/09/29 16:36:23 UTC, 0 replies.
- Pyspark - 1.5.0 pickle ML PipelineModel - posted by Simone <si...@gmail.com> on 2016/09/29 17:24:47 UTC, 0 replies.
- Metrics System not recognizing Custom Source/Sink in application jar - posted by map reduced <k3...@gmail.com> on 2016/09/29 18:24:02 UTC, 0 replies.
- Running in local mode as SQL engine - what to optimize? - posted by RodrigoB <ro...@aspect.com> on 2016/09/29 18:50:45 UTC, 0 replies.
- Is there a way to get the AUC metric for CrossValidator? - posted by evanzamir <za...@gmail.com> on 2016/09/29 19:18:40 UTC, 1 replies.
- Re: Questions about DataFrame's filter() - posted by Michael Armbrust <mi...@databricks.com> on 2016/09/29 19:22:38 UTC, 0 replies.
- writing to s3 failing to move parquet files from temporary folder - posted by jamborta <ja...@gmail.com> on 2016/09/29 19:26:37 UTC, 0 replies.
- Setting conf options in jupyter - posted by William Kupersanin <wk...@gmail.com> on 2016/09/29 20:23:53 UTC, 0 replies.
- Spark 2.0 issue - posted by Ashish Shrowty <as...@gmail.com> on 2016/09/29 21:26:48 UTC, 1 replies.
- How to extract bestModel parameters from a CrossValidatorModel - posted by Rich Tarro <ri...@gmail.com> on 2016/09/30 01:12:17 UTC, 0 replies.
- Issues in compiling spark 2.0.0 code using scala-maven-plugin - posted by satyajit vegesna <sa...@gmail.com> on 2016/09/30 02:00:16 UTC, 1 replies.
- FetchFailed exception with Spark 1.6 - posted by Ankur Srivastava <an...@gmail.com> on 2016/09/30 02:31:27 UTC, 0 replies.
- YARN - Pyspark - posted by ayan guha <gu...@gmail.com> on 2016/09/30 06:33:11 UTC, 2 replies.
- Re: compatibility issue with Jersey2 - posted by SimonL <si...@gmail.com> on 2016/09/30 08:21:26 UTC, 0 replies.
- Lots of spark-assembly jars localized to /usercache/username/filecache directory - posted by Lantao Jin <ji...@gmail.com> on 2016/09/30 09:13:02 UTC, 0 replies.
- Dataframe Grouping - Sorting - Mapping - posted by AJT <at...@currenex.com> on 2016/09/30 10:46:36 UTC, 1 replies.
- SPARK CREATING EXTERNAL TABLE - posted by Trinadh Kaja <kt...@gmail.com> on 2016/09/30 11:40:28 UTC, 1 replies.
- Grouped windows in spark streaming - posted by Adrienne Kole <ad...@gmail.com> on 2016/09/30 13:59:34 UTC, 0 replies.
- Stopping spark steaming context on encountering certain type of message on Kafka - posted by vatsal <va...@live.com> on 2016/09/30 14:05:42 UTC, 0 replies.
- Replying same post with proper formatting. - sorry for extra mail - posted by vatsal <va...@live.com> on 2016/09/30 14:07:53 UTC, 0 replies.
- get different results when debugging and running scala program - posted by chen yong <cy...@hotmail.com> on 2016/09/30 15:25:36 UTC, 0 replies.
- Design considerations for batch and speed layers - posted by Mich Talebzadeh <mi...@gmail.com> on 2016/09/30 16:17:37 UTC, 2 replies.
- DataFrame Sort gives Cannot allocate a page with more than 17179869176 bytes - posted by Babak Alipour <ba...@gmail.com> on 2016/09/30 16:57:24 UTC, 4 replies.
- Restful WS for Spark - posted by ABHISHEK <ab...@gmail.com> on 2016/09/30 18:07:01 UTC, 3 replies.
- Having parallelized job inside getPartitions method causes job hanging - posted by "Zhang, Yanyan" <ya...@amazon.com> on 2016/09/30 19:37:40 UTC, 0 replies.