user@spark.apache.org, 2017-07

You are viewing a plain text version of this content. The canonical link for it is here.

- json in Cassandra to RDDs - posted by Conconscious <co...@gmail.com> on 2017/07/01 10:54:00 UTC, 1 replies.
- Re: about broadcast join of base table in spark sql - posted by Paley Louie <pa...@gmail.com> on 2017/07/02 04:33:18 UTC, 4 replies.
- Re: How to reduce the amount of data that is getting written to the checkpoint from Spark Streaming - posted by "Yuval.Itzchakov" <yu...@gmail.com> on 2017/07/03 05:22:06 UTC, 1 replies.
- Structured Streaming UI similar to Spark Streaming - posted by "Yuval.Itzchakov" <yu...@gmail.com> on 2017/07/03 05:28:29 UTC, 0 replies.
- Analysis Exception after join - posted by Bernard Jesop <be...@gmail.com> on 2017/07/03 09:55:54 UTC, 3 replies.
- Re: What's the simplest way to Read Avro records from Kafka to Spark DataSet/DataFrame? - posted by kant kodali <ka...@gmail.com> on 2017/07/03 11:20:11 UTC, 0 replies.
- [PySpark] - running processes - posted by Sidney Feiner <si...@startapp.com> on 2017/07/03 11:53:03 UTC, 0 replies.
- [Spark SQL] JDBC connection from UDF - posted by Patrik Medvedev <pa...@gmail.com> on 2017/07/03 15:32:45 UTC, 0 replies.
- spark submit with logs and kerberos - posted by Juan Rods <ju...@gmail.com> on 2017/07/03 16:19:48 UTC, 0 replies.
- Re: spark-jdbc impala with kerberos using yarn-client - posted by morfious902002 <an...@gmail.com> on 2017/07/03 17:03:41 UTC, 0 replies.
- sparkJob - GenericUDTF - HS2 - error - posted by Sudha KS <Su...@fuzzylogix.com> on 2017/07/04 03:28:42 UTC, 0 replies.
- test mail - posted by Sudha KS <Su...@fuzzylogix.com> on 2017/07/04 06:40:13 UTC, 1 replies.
- SparkSession via HS2 - is it supported? - posted by Sudha KS <Su...@fuzzylogix.com> on 2017/07/04 08:07:34 UTC, 0 replies.
- RE: [PySpark] - running processes and computing time - posted by Sidney Feiner <si...@startapp.com> on 2017/07/04 12:15:44 UTC, 0 replies.
- Kafka 0.10 with PySpark - posted by Daniel van der Ende <da...@gmail.com> on 2017/07/04 14:53:57 UTC, 1 replies.
- Window function / streaming - posted by Julien CHAMP <jc...@tellmeplus.com> on 2017/07/04 15:45:54 UTC, 0 replies.
- SparkSession via HS2 - Error -spark.yarn.jars not read - posted by Sudha KS <Su...@fuzzylogix.com> on 2017/07/05 08:21:19 UTC, 2 replies.
- Spark | Window Function | - posted by Julien CHAMP <jc...@tellmeplus.com> on 2017/07/05 08:41:28 UTC, 4 replies.
- Reading csv.gz files - posted by Sea aj <sa...@gmail.com> on 2017/07/05 08:52:53 UTC, 0 replies.
- Re: Spark querying parquet data partitioned in S3 - posted by Steve Loughran <st...@hortonworks.com> on 2017/07/05 11:36:07 UTC, 0 replies.
- RE: SparkSession via HS2 - Error: Yarn application has already ended - posted by Sudha KS <Su...@fuzzylogix.com> on 2017/07/05 13:15:10 UTC, 1 replies.
- Re: Spark, S3A, and 503 SlowDown / rate limit issues - posted by Vadim Semenov <va...@datadoghq.com> on 2017/07/05 13:40:10 UTC, 3 replies.
- Load multiple CSV from different paths - posted by Didac Gil <di...@gmail.com> on 2017/07/05 14:08:14 UTC, 2 replies.
- Collecting matrix's entries raises an error only when run inside a test - posted by Simone Robutti <si...@gmail.com> on 2017/07/05 14:52:13 UTC, 1 replies.
- Re: PySpark working with Generators - posted by Saatvik Shah <sa...@gmail.com> on 2017/07/05 19:23:00 UTC, 0 replies.
- Exception: JDK-8154035 using Whole text files api - posted by Reth RM <re...@gmail.com> on 2017/07/05 19:46:38 UTC, 0 replies.
- UDAFs for sketching Dataset columns with T-Digests - posted by Erik Erlandson <ee...@redhat.com> on 2017/07/06 00:33:28 UTC, 1 replies.
- Re: Do we anything for Deep Learning in Spark? - posted by Gaurav1809 <ga...@gmail.com> on 2017/07/06 04:03:21 UTC, 3 replies.
- Re: custom column types for JDBC datasource writer - posted by Takeshi Yamamuro <li...@gmail.com> on 2017/07/06 04:44:36 UTC, 1 replies.
- Partitions cached by updatStateByKey does not seem to be getting evicted forever - posted by SRK <sw...@gmail.com> on 2017/07/06 16:59:46 UTC, 0 replies.
- Unsubscribe - posted by Kun Liu <li...@gmail.com> on 2017/07/06 18:35:54 UTC, 3 replies.
- Is there "EXCEPT ALL" in Spark SQL? - posted by jeff saremi <je...@hotmail.com> on 2017/07/06 19:22:23 UTC, 3 replies.
- Structured Streaming: consumerGroupId - posted by aravias <as...@homeaway.com> on 2017/07/06 19:41:52 UTC, 0 replies.
- Logging in lSpark streaming application - posted by anna stax <an...@gmail.com> on 2017/07/06 20:14:13 UTC, 0 replies.
- Spark 2.0.2 - JdbcRelationProvider does not allow create table as select - posted by Kanagha Kumar <kp...@salesforce.com> on 2017/07/06 23:00:09 UTC, 1 replies.
- GraphQL to Spark SQL - posted by kant kodali <ka...@gmail.com> on 2017/07/07 00:00:45 UTC, 0 replies.
- If I pass raw SQL string to dataframe do I still get the Spark SQL optimizations? - posted by kant kodali <ka...@gmail.com> on 2017/07/07 00:28:31 UTC, 2 replies.
- VS: Using Spark as a simulator - posted by Esa Heikkinen <es...@student.tut.fi> on 2017/07/07 06:46:32 UTC, 3 replies.
- Integrating Kafka 0.10 or higher with Spark 2.1.1 -- required jars - posted by mahendra singh meena <ma...@gmail.com> on 2017/07/07 12:29:36 UTC, 2 replies.
- Union of 2 streaming data frames - posted by "Lalwani, Jayesh" <Ja...@capitalone.com> on 2017/07/07 18:27:34 UTC, 6 replies.
- Iterate over grouped df to create new rows/df - posted by Junaid Nasir <jn...@an10.io> on 2017/07/07 21:06:10 UTC, 1 replies.
- Event time aggregation is possible in Spark Streaming ? - posted by Swapnil Chougule <th...@gmail.com> on 2017/07/08 11:18:06 UTC, 2 replies.
- Glue-like Functionality - posted by Benjamin Kim <bb...@gmail.com> on 2017/07/08 17:49:35 UTC, 1 replies.
- Re: SparkSQL to read XML Blob data to create multiple rows - posted by Amol Talap <am...@gmail.com> on 2017/07/08 21:23:45 UTC, 0 replies.
- Anyone used Kamanja for real time decision making - posted by Mich Talebzadeh <mi...@gmail.com> on 2017/07/09 07:45:07 UTC, 1 replies.
- PySpark saving custom pipelines - posted by Riccardo Ferrari <fe...@gmail.com> on 2017/07/09 21:58:57 UTC, 0 replies.
- Spark streaming, Storage tab questions - posted by anna stax <an...@gmail.com> on 2017/07/09 23:33:08 UTC, 2 replies.
- Spark streaming giving me a bunch of WARNINGS, please help me understand them - posted by shyla deshpande <de...@gmail.com> on 2017/07/10 00:17:10 UTC, 0 replies.
- Re: How do I find the time taken by each step in a stage in a Spark Job - posted by swetha kasireddy <sw...@gmail.com> on 2017/07/10 02:33:02 UTC, 0 replies.
- spark-submit via cluster mode - setting dependencies classpath! - posted by Kanagha <er...@gmail.com> on 2017/07/10 04:13:46 UTC, 0 replies.
- UI for spark machine learning. - posted by Mahesh Sawaiker <ma...@persistent.com> on 2017/07/10 04:35:30 UTC, 1 replies.
- Re: Spark streaming giving me a bunch of WARNINGS, please help meunderstand them - posted by 萝卜丝炒饭 <14...@qq.com> on 2017/07/10 07:39:19 UTC, 2 replies.
- Timeline for stable release for Spark Structured Streaming - posted by Dhrubajyoti Hati <dh...@gmail.com> on 2017/07/10 10:33:11 UTC, 1 replies.
- SparkException: Invalid master URL - posted by Mina Aslani <as...@gmail.com> on 2017/07/10 14:20:25 UTC, 0 replies.
- Databricks Spark XML parsing exception while iterating - posted by Amol Talap <am...@gmail.com> on 2017/07/10 18:14:48 UTC, 0 replies.
- error in running StructuredStreaming-Kafka integration code (Spark 2.x & Kafka 10) - posted by karan alang <ka...@gmail.com> on 2017/07/10 18:36:34 UTC, 2 replies.
- Spark streaming application is failing after running for few hours - posted by shyla deshpande <de...@gmail.com> on 2017/07/10 19:04:28 UTC, 0 replies.
- Runtime exception with AccumulatorV2 on Spark 2.2/2.1.1 - posted by B Li <on...@gmail.com> on 2017/07/10 19:56:18 UTC, 0 replies.
- spark-graphframes - posted by Dennis Grinwald <dg...@web.de> on 2017/07/10 23:50:41 UTC, 0 replies.
- Re: Spark 2.1.1 Graphx graph loader GC overhead error - posted by Aritra Mandal <ar...@gmail.com> on 2017/07/11 00:56:39 UTC, 1 replies.
- how to get the summary count of words by filestream please? - posted by 萝卜丝炒饭 <14...@qq.com> on 2017/07/11 02:43:38 UTC, 0 replies.
- Multiple Streaming Apps running on the Spark Cluster - posted by winter fresh <wi...@gmail.com> on 2017/07/11 05:20:59 UTC, 0 replies.
- Testing another Dataset after ML training - posted by mckunkel <m....@fz-juelich.de> on 2017/07/11 11:42:31 UTC, 9 replies.
- Query via Spark Thrift Server return wrong result. - posted by Valentin Ursu <va...@gmail.com> on 2017/07/11 16:58:55 UTC, 1 replies.
- [Spark Streaming] - ERROR Error cleaning broadcast Exception - posted by Nipun Arora <ni...@gmail.com> on 2017/07/11 20:22:06 UTC, 0 replies.
- DataFrame --- join / groupBy-agg question... - posted by Muthu Jayakumar <ba...@gmail.com> on 2017/07/11 21:05:03 UTC, 5 replies.
- [ANNOUNCE] Announcing Apache Spark 2.2.0 - posted by Michael Armbrust <mi...@databricks.com> on 2017/07/11 22:48:15 UTC, 3 replies.
- Spark streaming does not seem to clear MapPartitionsRDD and ShuffledRDD that are persisted after the use of updateStateByKey and reduceByKeyAndWindow with inverse functions even after checkpointing the data - posted by SRK <sw...@gmail.com> on 2017/07/12 00:10:55 UTC, 0 replies.
- java IllegalStateException: unread block data Exception - setBlockDataMode - posted by Kanagha <er...@gmail.com> on 2017/07/12 00:44:23 UTC, 0 replies.
- Limit the number of tasks submitted：spark.submit.tasks.threshold.enabled & spark.submit.tasks.threshold - posted by 李斌松 <li...@gmail.com> on 2017/07/12 03:45:22 UTC, 0 replies.
- CVE-2017-7678 Apache Spark XSS web UI MHTML vulnerability - posted by Sean Owen <sr...@apache.org> on 2017/07/12 10:30:43 UTC, 0 replies.
- [ML] Performance issues with GBTRegressor - posted by OBones <ob...@free.fr> on 2017/07/12 15:52:10 UTC, 0 replies.
- With 2.2.0 PySpark is now available for pip install from PyPI :) - posted by Holden Karau <ho...@pigscanfly.ca> on 2017/07/12 19:26:00 UTC, 5 replies.
- DataFrameReader read from S3 org.apache.spark.sql.AnalysisException: Path does not exist - posted by Sumona Routh <su...@gmail.com> on 2017/07/12 20:36:35 UTC, 2 replies.
- Implementing Dynamic Sampling in a Spark Streaming Application - posted by N B <nb...@gmail.com> on 2017/07/12 22:36:21 UTC, 0 replies.
- UnpicklingError while using spark streaming - posted by lovemoon <zt...@163.com> on 2017/07/13 07:20:47 UTC, 0 replies.
- how to identify the alive master spark via Zookeeper ? - posted by ma...@orange.com on 2017/07/13 08:43:22 UTC, 3 replies.
- Spark 2.1.1: A bug in org.apache.spark.ml.linalg.* when using VectorAssembler.scala - posted by xi...@birdsh.com on 2017/07/13 09:15:56 UTC, 2 replies.
- [SQL] Syntax "case when" doesn't be supported in JOIN - posted by 王双 <30...@qq.com> on 2017/07/13 10:18:30 UTC, 0 replies.
- underlying checkpoint - posted by Bernard Jesop <be...@gmail.com> on 2017/07/13 15:35:46 UTC, 3 replies.
- Does mapWithState need checkpointing to be specified in Spark Streaming? - posted by SRK <sw...@gmail.com> on 2017/07/13 20:01:22 UTC, 3 replies.
- Fwd: None.get on Redact in DataSourceScanExec - posted by Russell Spitzer <ru...@gmail.com> on 2017/07/14 00:06:25 UTC, 1 replies.
- Re: calculate diff of value and median in a group - posted by roni <ro...@gmail.com> on 2017/07/14 21:53:14 UTC, 0 replies.
- Memory consumption and checkpointed data seems to increase incrementally when reduceByKeyAndWIndow with inverse function is used with mapWithState in Stateful streaming - posted by SRK <sw...@gmail.com> on 2017/07/15 00:04:48 UTC, 0 replies.
- Querying on Deeply Nested JSON Structures - posted by Patrick <ti...@gmail.com> on 2017/07/15 17:41:08 UTC, 2 replies.
- splitting columns into new columns - posted by nayan sharma <na...@gmail.com> on 2017/07/16 18:25:17 UTC, 5 replies.
- to_json not working with selectExpr - posted by Matthew cao <cy...@gmail.com> on 2017/07/17 01:24:05 UTC, 7 replies.
- Spark 2.1.1 Error:java.lang.NoSuchMethodError: org.apache.spark.network.client.TransportClient.getChannel()Lio/netty/channel/Channel; - posted by zzcclp <44...@qq.com> on 2017/07/17 09:49:57 UTC, 1 replies.
- Reading Hive tables Parallel in Spark - posted by FN <nu...@gmail.com> on 2017/07/17 12:12:18 UTC, 7 replies.
- 回复： Spark 2.1.1 Error:java.lang.NoSuchMethodError: org.apache.spark.network.client.TransportClient.getChannel()Lio/netty/channel/Channel; - posted by 恩爸 <44...@qq.com> on 2017/07/17 14:35:28 UTC, 0 replies.
- running spark job with fat jar file - posted by Mich Talebzadeh <mi...@gmail.com> on 2017/07/17 15:41:50 UTC, 9 replies.
- [ANNOUNCE] Apache Bahir 2.1.1 Released - posted by Luciano Resende <lu...@gmail.com> on 2017/07/17 19:59:05 UTC, 0 replies.
- Running Spark und YARN on AWS EMR - posted by Pascal Stammer <st...@deichbrise.de> on 2017/07/17 20:18:36 UTC, 4 replies.
- Re: Slowness of Spark Thrift Server - posted by Maciej Bryński <ma...@brynski.pl> on 2017/07/17 20:30:35 UTC, 0 replies.
- Spark Streaming handling Kafka exceptions - posted by Jean-Francois Gosselin <jf...@gmail.com> on 2017/07/17 21:05:42 UTC, 0 replies.
- Spark UI crashes on Large Workloads - posted by saatvikshah1994 <sa...@gmail.com> on 2017/07/18 00:49:44 UTC, 4 replies.
- Flatten JSON to multiple columns in Spark - posted by Chetan Khatri <ch...@gmail.com> on 2017/07/18 06:05:55 UTC, 13 replies.
- Spark history server running on Mongo - posted by Ivan Sadikov <iv...@gmail.com> on 2017/07/18 08:01:06 UTC, 5 replies.
- Solutions.Hamburg conference - posted by Myrle Krantz <my...@apache.org> on 2017/07/18 10:25:43 UTC, 3 replies.
- Requesting feedback on Fluo+Spark - posted by Christopher <ct...@apache.org> on 2017/07/18 16:34:54 UTC, 0 replies.
- [Spark Core] unhashable type: 'dict' during shuffle step - posted by Josh Holbrook <jo...@fusion.net> on 2017/07/18 19:17:56 UTC, 1 replies.
- [Spark Streaming] How to make this code work? - posted by Noppanit Charassinvichai <no...@gmail.com> on 2017/07/19 00:54:51 UTC, 0 replies.
- Azure key vault - posted by ayan guha <gu...@gmail.com> on 2017/07/19 02:15:12 UTC, 0 replies.
- Structured Streaming: Row differences, e.g., with Window and lag() - posted by Karamba <ph...@web.de> on 2017/07/19 06:31:11 UTC, 0 replies.
- Feature Generation for Large datasets composed of many time series - posted by ju...@free.fr on 2017/07/19 11:30:27 UTC, 0 replies.
- Slow responce on Solr Cloud with Spark - posted by Imran Rajjad <ra...@gmail.com> on 2017/07/19 12:49:26 UTC, 1 replies.
- about aggregateByKey of pairrdd. - posted by qihuagao <qi...@icloud.com> on 2017/07/19 12:50:21 UTC, 0 replies.
- Regarding Logistic Regression changes in Spark 2.2.0 - posted by Aseem Bansal <as...@gmail.com> on 2017/07/19 14:09:29 UTC, 1 replies.
- MPEG files optimisation with Spark - posted by Mich Talebzadeh <mi...@gmail.com> on 2017/07/19 14:39:44 UTC, 0 replies.
- Question regarding Sparks new Internal authentication mechanism - posted by Udit Mehrotra <ud...@gmail.com> on 2017/07/19 18:19:15 UTC, 12 replies.
- how does spark handle compressed files - posted by Ashok Kumar <as...@yahoo.com.INVALID> on 2017/07/19 20:22:07 UTC, 1 replies.
- ClassNotFoundException for Workers - posted by Noppanit Charassinvichai <no...@gmail.com> on 2017/07/19 22:12:44 UTC, 2 replies.
- How to insert a dataframe as a static partition to a partitioned table - posted by ctang <ct...@gmail.com> on 2017/07/19 22:13:16 UTC, 3 replies.
- Spark 2.0 and Oracle 12.1 error - posted by Cassa L <lc...@gmail.com> on 2017/07/20 06:10:20 UTC, 6 replies.
- Spark-2.0 and Oracle 12.1 error: Unsupported type -101 - posted by Cassa L <lc...@gmail.com> on 2017/07/20 06:21:13 UTC, 0 replies.
- Setting initial weights of ml.classification.LogisticRegression similar to mllib.classification.LogisticRegressionWithLBFGS - posted by Aseem Bansal <as...@gmail.com> on 2017/07/20 06:36:32 UTC, 3 replies.
- Issue: Hive Table Stored as col(array) instead of Columns with Spark - posted by Chetan Khatri <ch...@gmail.com> on 2017/07/20 08:38:04 UTC, 1 replies.
- solr data source not working - posted by Imran Rajjad <ra...@gmail.com> on 2017/07/20 08:51:30 UTC, 0 replies.
- What does spark.python.worker.memory affect? - posted by Cyanny LIANG <lg...@gmail.com> on 2017/07/20 09:06:03 UTC, 0 replies.
- Failed to find Spark jars directory - posted by Kaushal Shriyan <ka...@gmail.com> on 2017/07/20 12:34:28 UTC, 4 replies.
- Spark sc.textFile() files with more partitions Vs files with less partitions - posted by Gokula Krishnan D <em...@gmail.com> on 2017/07/20 12:46:36 UTC, 0 replies.
- Spark Streaming: Blocks and Partitions - posted by "Kalim, Faria" <ka...@illinois.edu> on 2017/07/20 19:52:42 UTC, 0 replies.
- How to use Update statement or call stored procedure of Oracle from Spark - posted by Cassa L <lc...@gmail.com> on 2017/07/20 20:19:23 UTC, 1 replies.
- Spark on Cloudera Configuration (Scheduler Mode = FAIR) - posted by Gokula Krishnan D <em...@gmail.com> on 2017/07/20 20:45:08 UTC, 6 replies.
- Re: Spark (SQL / Structured Streaming) Cassandra - PreparedStatement - posted by Russell Spitzer <ru...@gmail.com> on 2017/07/21 13:33:40 UTC, 0 replies.
- Supporting columns with heterogenous data - posted by "Lalwani, Jayesh" <Ja...@capitalone.com> on 2017/07/21 15:51:27 UTC, 0 replies.
- Spark Data Frame Writer - Range Partiotioning - posted by "Jain, Nishit" <nj...@underarmour.com> on 2017/07/21 16:46:47 UTC, 0 replies.
- Get full RDD lineage for a spark job - posted by Ron Gonzalez <zl...@yahoo.com.INVALID> on 2017/07/21 18:24:02 UTC, 3 replies.
- [ Spark SQL ] Conversion from Spark SQL to Avro decimal - posted by Ernesto Valentino <er...@gmail.com> on 2017/07/21 19:21:02 UTC, 0 replies.
- Re: Spark Data Frame Writer - Range Partiotioning - posted by ayan guha <gu...@gmail.com> on 2017/07/21 20:25:59 UTC, 1 replies.
- unsuscribe - posted by Cornelio Iñigo <co...@gmail.com> on 2017/07/21 20:49:44 UTC, 0 replies.
- Spark Structured Streaming - Spark Consumer does not display messages - posted by Cassa L <lc...@gmail.com> on 2017/07/21 20:58:29 UTC, 1 replies.
- [Spark] Working with JavaPairRDD from Scala - posted by Lukasz Tracewski <lu...@outlook.com> on 2017/07/21 22:18:28 UTC, 2 replies.
- Spark Job crash due to File Not found when shuffle intermittently - posted by Martin Peng <we...@gmail.com> on 2017/07/22 01:58:56 UTC, 5 replies.
- Is there a way to run Spark SQL through REST? - posted by kant kodali <ka...@gmail.com> on 2017/07/22 08:01:56 UTC, 3 replies.
- unsubscribe - posted by Vasilis Hadjipanos <ha...@gmail.com> on 2017/07/22 08:12:25 UTC, 3 replies.
- custom joins on dataframe - posted by Stephen Fletcher <st...@gmail.com> on 2017/07/22 15:39:11 UTC, 3 replies.
- Informing Spark about specific Partitioning scheme to avoid shuffles - posted by saatvikshah1994 <sa...@gmail.com> on 2017/07/22 17:23:08 UTC, 0 replies.
- Querying Drill with Spark DataFrame - posted by Luqman Ghani <lg...@gmail.com> on 2017/07/22 20:42:51 UTC, 4 replies.
- java.lang.NoClassDefFoundError: scala/runtime/AbstractPartialFunction$mcJL$sp - posted by Kaushal Shriyan <ka...@gmail.com> on 2017/07/23 17:48:59 UTC, 0 replies.
- how to convert the binary from kafak to srring pleaae - posted by 萝卜丝炒饭 <14...@qq.com> on 2017/07/24 02:44:35 UTC, 4 replies.
- Re: Question on Spark code - posted by Reynold Xin <rx...@databricks.com> on 2017/07/24 03:19:39 UTC, 1 replies.
- How to configure spark with java - posted by amit kumar singh <am...@gmail.com> on 2017/07/24 03:22:26 UTC, 1 replies.
- using Kudu with Spark - posted by Mich Talebzadeh <mi...@gmail.com> on 2017/07/24 07:25:34 UTC, 6 replies.
- Union large number of DataFrames - posted by ju...@free.fr on 2017/07/24 08:35:23 UTC, 0 replies.
- Complext JSON Handling in Spark 2.1 - posted by Patrick <ti...@gmail.com> on 2017/07/24 10:34:35 UTC, 1 replies.
- Is there a difference between these aggregations - posted by Aseem Bansal <as...@gmail.com> on 2017/07/24 11:34:23 UTC, 2 replies.
- NullPointer when collecting a dataset grouped a column - posted by Aseem Bansal <as...@gmail.com> on 2017/07/24 13:29:55 UTC, 0 replies.
- Conflict resolution for data in spark streaming - posted by Biplob Biswas <re...@gmail.com> on 2017/07/24 13:30:23 UTC, 0 replies.
- Parquet error while saving in HDFS - posted by unk1102 <um...@gmail.com> on 2017/07/24 18:51:14 UTC, 0 replies.
- real world spark code - posted by Adaryl Wakefield <ad...@hotmail.com> on 2017/07/24 23:08:45 UTC, 6 replies.
- how to set the assignee in JIRA please? - posted by 萝卜丝炒饭 <14...@qq.com> on 2017/07/25 00:57:59 UTC, 10 replies.
- How to list only erros for a stage - posted by jeff saremi <je...@hotmail.com> on 2017/07/25 02:02:19 UTC, 2 replies.
- What are some disadvantages of issuing a raw sql query to spark? - posted by kant kodali <ka...@gmail.com> on 2017/07/25 07:50:32 UTC, 4 replies.
- some Ideas on expressing Spark SQL using JSON - posted by kant kodali <ka...@gmail.com> on 2017/07/25 09:46:14 UTC, 4 replies.
- [Spark-Core] sc.textFile() explicit minPartitions did not work - posted by Gokula Krishnan D <em...@gmail.com> on 2017/07/25 12:15:06 UTC, 5 replies.
- Re: Nested JSON Handling in Spark 2.1 - posted by Patrick <ti...@gmail.com> on 2017/07/25 14:28:43 UTC, 0 replies.
- [SPARK STRUCTURED STREAMING]: Alternatives to using Foreach sink in pyspark - posted by Priyank Shrivastava <pr...@asperasoft.com> on 2017/07/26 01:05:58 UTC, 9 replies.
- Need some help around a Spark Error - posted by Debabrata Ghosh <ma...@gmail.com> on 2017/07/26 01:13:36 UTC, 1 replies.
- Cache problem ? Too much Stages ? Need help :( - posted by Julien CHAMP <jc...@tellmeplus.com> on 2017/07/26 08:33:40 UTC, 1 replies.
- [Spark SQL] [pyspark.sql]: Potential bug in toDF using nested structures - posted by msachdev <ma...@gmail.com> on 2017/07/26 11:33:31 UTC, 0 replies.
- [Spark streaming-Mesos-cluster mode] java.lang.RuntimeException: Stream jar not found - posted by RCinna <ca...@gmail.com> on 2017/07/26 17:13:18 UTC, 0 replies.
- DStream Spark 2.1.1 Streaming on EMR at scale - long running job fails after two hours - posted by "Mikhailau, Alex" <Al...@mlb.com> on 2017/07/26 18:47:20 UTC, 0 replies.
- Can i move TFS and TSFT out of spark package - posted by Jone Zhang <jo...@gmail.com> on 2017/07/27 03:32:39 UTC, 0 replies.
- Create static Map Type column - posted by ayan guha <gu...@gmail.com> on 2017/07/27 04:30:32 UTC, 1 replies.
- Please unsubscribe - posted by sowmya ramesh <kr...@gmail.com> on 2017/07/27 05:12:55 UTC, 1 replies.
- running spark application compiled with 1.6 on spark 2.1 cluster - posted by satishl <sa...@gmail.com> on 2017/07/27 05:45:53 UTC, 2 replies.
- Re: Complex types projection handling with Spark 2 SQL and Parquet - posted by Patrick <ti...@gmail.com> on 2017/07/27 07:55:16 UTC, 0 replies.
- A tool to generate simulation data - posted by lu...@sina.com on 2017/07/27 09:23:01 UTC, 1 replies.
- How does Spark handle timestamps during Pandas dataframe conversion - posted by saatvikshah1994 <sa...@gmail.com> on 2017/07/27 14:44:32 UTC, 0 replies.
- guava not compatible to hadoop version 2.6.5 - posted by Ma...@materna.de on 2017/07/27 14:46:53 UTC, 0 replies.
- Spark2.1 installation issue - posted by Vikash Kumar <vi...@oneconvergence.com> on 2017/07/27 17:54:39 UTC, 2 replies.
- Persisting RDD: Low Percentage with a lot of memory available - posted by Pedro Tuero <tu...@gmail.com> on 2017/07/27 18:11:10 UTC, 2 replies.
- SPARK Storagelevel issues - posted by Gourav Sengupta <go...@gmail.com> on 2017/07/27 19:04:07 UTC, 4 replies.
- 回复：Re: A tool to generate simulation data - posted by lu...@sina.com on 2017/07/28 02:19:05 UTC, 0 replies.
- How to configure spark on Yarn cluster - posted by jeff saremi <je...@hotmail.com> on 2017/07/28 06:03:21 UTC, 5 replies.
- Support Dynamic Partition Inserts params with SET command in Spark 2.0.1 - posted by Chetan Khatri <ch...@gmail.com> on 2017/07/28 10:19:12 UTC, 4 replies.
- subscribe - posted by ajit roshen <aj...@gmail.com> on 2017/07/28 13:48:58 UTC, 0 replies.
- Spark Streaming as a Service - posted by ajit roshen <aj...@gmail.com> on 2017/07/28 15:12:38 UTC, 0 replies.
- Re: Spark Streaming with long batch / window duration - posted by emceemouli <ch...@calgary.ca> on 2017/07/28 16:03:52 UTC, 0 replies.
- Job keeps aborting because of org.apache.spark.shuffle.FetchFailedException: Failed to connect to server/ip:39232 - posted by jeff saremi <je...@hotmail.com> on 2017/07/28 16:57:17 UTC, 4 replies.
- RE: changing directories in Spark Streming - posted by Siddhartha Singh Sandhu <sa...@gmail.com> on 2017/07/28 19:25:22 UTC, 0 replies.
- can I do spark-submit --jars [s3://bucket/folder/jar_file]? or --jars - posted by Richard Xin <ri...@yahoo.com.INVALID> on 2017/07/28 20:52:43 UTC, 1 replies.
- Logging in RDD mapToPair of Java Spark application - posted by johnzengspark <jo...@hotmail.com> on 2017/07/29 18:09:21 UTC, 5 replies.
- ALSModel.load not working on pyspark 2.1.0 - posted by Cristian Garcia <cg...@gmail.com> on 2017/07/29 19:57:13 UTC, 3 replies.
- OrderedDict to DF - posted by ayan guha <gu...@gmail.com> on 2017/07/30 13:34:04 UTC, 0 replies.
- Spark parquet file read problem ! - posted by serkan taş <se...@hotmail.com> on 2017/07/30 15:11:58 UTC, 6 replies.
- SPARK Issue in Standalone cluster - posted by Gourav Sengupta <go...@gmail.com> on 2017/07/31 00:14:21 UTC, 2 replies.
- how to get the key in Map with SQL - posted by 萝卜丝炒饭 <14...@qq.com> on 2017/07/31 02:32:29 UTC, 0 replies.
- Running several spark actions in parallel - posted by Guy Harmach <Gu...@Amdocs.com> on 2017/07/31 06:48:39 UTC, 0 replies.
- transactional data in sparksql - posted by lu...@sina.com on 2017/07/31 11:59:40 UTC, 0 replies.