- RE: Recursive Queries or Recursive UDF? - posted by "Bode, Meikel, NMA-CFD" <Me...@Bertelsmann.de> on 2021/05/01 13:17:40 UTC, 0 replies.
- Re: Delivery Status Notification (Failure) - posted by Mich Talebzadeh <mi...@gmail.com> on 2021/05/02 09:45:42 UTC, 1 reply.
- Re: Spark JDBC errors out - posted by Farhan Misarwala <fa...@gmail.com> on 2021/05/02 11:46:08 UTC, 0 replies.
- How to handle auto-restart in Kubernetes Spark application - posted by Sachit Murarka <co...@gmail.com> on 2021/05/02 16:08:14 UTC, 1 reply.
- Broadcast Variable - posted by "Bode, Meikel, NMA-CFD" <Me...@Bertelsmann.de> on 2021/05/03 12:54:02 UTC, 1 reply.
- Stream which needs to be “joined” with another Stream of “Reference” data. - posted by Eric Beabes <ma...@gmail.com> on 2021/05/03 16:36:18 UTC, 9 replies.
- - posted by Tianchen Zhang <du...@gmail.com> on 2021/05/03 18:37:19 UTC, 0 replies.
- [Spark Catalog API] Support for metadata Backup/Restore - posted by Tianchen Zhang <du...@gmail.com> on 2021/05/03 18:38:52 UTC, 6 replies.
- [PySpark][apache-spark]: Pass Pyspark SparkSession instance to Scala - posted by Scott Gerard <sc...@gerard.guru> on 2021/05/04 17:02:44 UTC, 0 replies.
- Windows Jupyter Notebook Cannot Connect to Kubernetes Master - posted by Taylor Schneider <ts...@live.com> on 2021/05/04 19:23:00 UTC, 0 replies.
- Does Pyspark script support Sonarqube - posted by Priyanka Kakkar <pr...@gmail.com> on 2021/05/05 08:08:28 UTC, 0 replies.
- How to read multiple HDFS directories - posted by Kapil Garg <ka...@flipkart.com.INVALID> on 2021/05/05 14:45:05 UTC, 12 replies.
- Fwd: Graceful shutdown SPARK Structured Streaming - posted by Gourav Sengupta <go...@gmail.com> on 2021/05/05 16:29:53 UTC, 4 replies.
- Performance Improvement: Collect in spark taking huge time - posted by Chetan Khatri <ch...@gmail.com> on 2021/05/06 02:15:18 UTC, 1 reply.
- (no subject) - posted by Tang Jinxin <xi...@gmail.com> on 2021/05/06 09:32:06 UTC, 0 replies.
- Re: Updating spark-env.sh per application - posted by Renu Yadav <yr...@gmail.com> on 2021/05/07 10:33:50 UTC, 4 replies.
- Issue while calling foreach in Pyspark - posted by rajat kumar <ku...@gmail.com> on 2021/05/07 15:06:49 UTC, 8 replies.
- How to have map_from_arrays() in Spark 2.3 - posted by Sebastian Schere <ss...@gmail.com> on 2021/05/08 22:01:39 UTC, 0 replies.
- Calculate average from Spark stream - posted by Giuseppe Ricci <pe...@gmail.com> on 2021/05/10 14:47:23 UTC, 13 replies.
- Re: compile spark 3.1.1 error - posted by jason_xu <xu...@gmail.com> on 2021/05/10 23:46:55 UTC, 0 replies.
- Installation Error - Please Help! - posted by Talha Javed <im...@gmail.com> on 2021/05/11 20:21:22 UTC, 1 reply.
- Merge two dataframes - posted by kushagra deep <ku...@gmail.com> on 2021/05/12 12:50:00 UTC, 16 replies.
- Spark with External Shuffle Service - using saved shuffle files in the event of executor failure - posted by Chris Thomas <ch...@heath-studios.com> on 2021/05/12 14:56:52 UTC, 1 reply.
- Understanding what happens when a job is submitted to a cluster - posted by "abhilash.kr" <ab...@gmail.com> on 2021/05/13 14:13:53 UTC, 4 replies.
- beeline spark thrift server issue - posted by Suryansh Agnihotri <sa...@gmail.com> on 2021/05/13 15:22:49 UTC, 1 reply.
- Thrift2 Server on Kubernetes? - posted by "Bode, Meikel, NMA-CFD" <Me...@Bertelsmann.de> on 2021/05/14 08:43:40 UTC, 2 replies.
- Multiple destination single source - posted by "abhilash.kr" <ab...@gmail.com> on 2021/05/14 17:15:54 UTC, 0 replies.
- Urgent Help - Py Spark submit error - posted by KhajaAsmath Mohammed <md...@gmail.com> on 2021/05/14 21:49:34 UTC, 0 replies.
- Re: [EXTERNAL] Urgent Help - Py Spark submit error - posted by KhajaAsmath Mohammed <md...@gmail.com> on 2021/05/14 22:03:19 UTC, 8 replies.
- Unable to create direct stream with SSL enabled Kafka cluster - posted by dwgw <dw...@gmail.com> on 2021/05/15 14:49:12 UTC, 0 replies.
- Spark History Server to S3 doesn't show up incomplete jobs - posted by Tianbin Jiang <ji...@gmail.com> on 2021/05/17 17:48:53 UTC, 0 replies.
- RE: Why is Spark 3.0.x faster than Spark 3.1.x - posted by "Rao, Abhishek (Nokia - IN/Bangalore)" <ab...@nokia.com> on 2021/05/18 05:42:09 UTC, 1 reply.
- S3 Access Issues - Spark - posted by KhajaAsmath Mohammed <md...@gmail.com> on 2021/05/18 23:11:31 UTC, 0 replies.
- Does spark3.1.1 support parquet nested column predicate pushdown for array type and map type column - posted by 石鹏磊 <sh...@163.com> on 2021/05/19 06:43:14 UTC, 0 replies.
- Spark Executor dies in K8 cluster - posted by Philipp Kraus <ph...@gmail.com> on 2021/05/19 09:18:18 UTC, 0 replies.
- unresolved dependency: graphframes#graphframes;0.8.1-spark2.4-s_2.11: not found - posted by Wensheng Deng <we...@yahoo.com.INVALID> on 2021/05/19 17:41:46 UTC, 2 replies.
- PySpark Write File Container exited with a non-zero exit code 143 - posted by Clay McDonald <st...@bateswhite.com> on 2021/05/19 19:09:01 UTC, 4 replies.
- spark 3.1.1 history server fails to boot with scala/MatchError - posted by Bulldog20630405 <bu...@gmail.com> on 2021/05/20 17:34:49 UTC, 0 replies.
- Question on spark on Kubernetes - posted by Mithalee Mohapatra <mi...@gmail.com> on 2021/05/20 20:25:37 UTC, 1 reply.
- Memory issues in 3.0.2 but works well on 2.4.4 - posted by Praneeth Shishtla <pr...@gmail.com> on 2021/05/21 10:57:04 UTC, 0 replies.
- DF blank value fill - posted by "Bode, Meikel, NMA-CFD" <Me...@Bertelsmann.de> on 2021/05/21 11:27:51 UTC, 1 reply.
- Re: [External Sender] Memory issues in 3.0.2 but works well on 2.4.4 - posted by Femi Anthony <ol...@capitalone.com.INVALID> on 2021/05/21 11:54:21 UTC, 0 replies.
- multiple query with structured streaming in spark does not work - posted by ji...@xtronica.no on 2021/05/21 19:08:57 UTC, 2 replies.
- RE: multiple query with structured streaming in spark does not work - posted by ji...@xtronica.no on 2021/05/22 00:10:24 UTC, 1 reply.
- Spark query performance of cached data affected by RDD lineage - posted by Fred Yeadon <fw...@qad.com> on 2021/05/22 22:43:22 UTC, 3 replies.
- Spark Prometheus Metrics for Executors Not Working - posted by paulp <pp...@outbrain.com.INVALID> on 2021/05/24 15:09:03 UTC, 1 reply.
- Re: About Spark executs sqlscript - posted by Wenchen Fan <cl...@gmail.com> on 2021/05/24 17:40:22 UTC, 2 replies.
- NullPointerException in SparkSession while reading Parquet files on S3 - posted by Eric Beabes <ma...@gmail.com> on 2021/05/25 15:30:49 UTC, 1 reply.
- Reading parquet files in parallel on the cluster - posted by Eric Beabes <ma...@gmail.com> on 2021/05/25 17:23:50 UTC, 8 replies.
- Reading Large File in Pyspark - posted by Sukanya Sarma <su...@gmail.com> on 2021/05/27 03:20:49 UTC, 1 reply.
- [apache spark] Does Spark 2.4.8 have issues with ServletContextHandler - posted by Kanchan Kauthale <ka...@gmail.com> on 2021/05/27 11:46:58 UTC, 2 replies.
- Accumulators and other important metrics for your job - posted by Hamish Whittal <ha...@cloud-fundis.co.za> on 2021/05/27 17:03:36 UTC, 0 replies.
- can not find module of mqtt under pyspark.streaming - posted by ji...@xtronica.no on 2021/05/28 00:46:37 UTC, 0 replies.
- mqtt module - posted by ji...@xtronica.no on 2021/05/28 00:49:53 UTC, 0 replies.
- Load Share point list(file) data to impala table using pyspark - posted by Rao Bandaru <ra...@outlook.com> on 2021/05/28 08:04:28 UTC, 1 reply.
- Profiling options for PandasUDF (2.4.7 on yarn) - posted by Patrick McCarthy <pm...@dstillery.com.INVALID> on 2021/05/28 13:51:04 UTC, 0 replies.
- spark sql StackOverflowError - posted by Deemo <th...@foxmail.com> on 2021/05/29 09:43:37 UTC, 1 reply.
- Missing module spark-hadoop-cloud in Maven central - posted by Erik Torres <et...@gmail.com> on 2021/05/31 10:36:39 UTC, 1 reply.
- Spark Structured Streaming - posted by S <sh...@gmail.com> on 2021/05/31 18:31:06 UTC, 2 replies.