You are viewing a plain text version of this content. The canonical link for it is here.
- Re: Spark on K8s , some applications ended ungracefully - posted by Martin Grigorov <mg...@apache.org> on 2022/04/01 06:52:33 UTC, 0 replies.
- Re: loop of spark jobs leads to increase in memory on worker nodes and eventually faillure - posted by Joris Billen <jo...@bigindustries.be> on 2022/04/01 07:49:38 UTC, 15 replies.
- data type missing - posted by ca...@free.fr on 2022/04/01 11:48:24 UTC, 0 replies.
- how to change data type for columns of dataframe - posted by ca...@free.fr on 2022/04/02 01:42:57 UTC, 2 replies.
- - posted by Sungwoo Park <gl...@gmail.com> on 2022/04/02 11:16:55 UTC, 2 replies.
- Spark History Server in GCP - posted by Gnana Kumar <gn...@gmail.com> on 2022/04/04 07:44:23 UTC, 0 replies.
- protobuf data as input to spark streaming - posted by Kiran Biswal <bi...@gmail.com> on 2022/04/05 21:43:19 UTC, 3 replies.
- Writing Custom Spark Readers and Writers - posted by Dyanesh Varun <dy...@gmail.com> on 2022/04/06 06:56:45 UTC, 4 replies.
- [Spark] spark client for Hadoop 2.x - posted by Amin Borjian <bo...@outlook.com> on 2022/04/06 08:31:19 UTC, 1 replies.
- query time comparison to several SQL engines - posted by Wes Peng <we...@freenetMail.de> on 2022/04/07 11:05:23 UTC, 3 replies.
- Spark 3.0.1 and spark 3.2 compatibility - posted by Pralabh Kumar <pr...@gmail.com> on 2022/04/07 17:27:42 UTC, 3 replies.
- Executorlost failure - posted by rajat kumar <ku...@gmail.com> on 2022/04/07 18:13:20 UTC, 5 replies.
- negative time duration in event log accumulables - posted by "wangcheng (AK)" <wa...@huawei.com.INVALID> on 2022/04/08 01:45:43 UTC, 0 replies.
- Aggregate over a column: the proper way to do - posted by sam smith <qu...@gmail.com> on 2022/04/08 03:30:34 UTC, 6 replies.
- Spark Write BinaryType Column as continues file to S3 - posted by Philipp Kraus <ph...@gmail.com> on 2022/04/08 15:13:52 UTC, 7 replies.
- Grabbing the current MemoryManager in a plugin - posted by Andrew Melo <an...@gmail.com> on 2022/04/08 21:04:05 UTC, 1 replies.
- binaryFile write - posted by Philipp Kraus <ph...@gmail.com> on 2022/04/09 18:48:11 UTC, 0 replies.
- Re: A simple comparison for three SQL engines - posted by Wes Peng <we...@freenetMail.de> on 2022/04/10 03:15:21 UTC, 0 replies.
- Question about bucketing and custom partitioners - posted by David Diebold <da...@gmail.com> on 2022/04/11 15:46:49 UTC, 3 replies.
- cannot access class sun.nio.ch.DirectBuffer - posted by Arunachalam Sibisakkaravarthi <ar...@mcruncher.com> on 2022/04/12 09:42:11 UTC, 8 replies.
- How to overwrite PySpark DataFrame schema without data scan? - posted by Rafał Wojdyła <ra...@gmail.com> on 2022/04/12 16:55:44 UTC, 0 replies.
- [Spark Streaming]: Why planInputPartitions is called multiple times for each micro-batch in Spark 3? - posted by "Hussain, Saghir" <sa...@akamai.com.INVALID> on 2022/04/13 07:05:46 UTC, 0 replies.
- Streaming partition-by data locality for state lookupon executor - posted by Sandip Khanzode <sa...@nutanix.com> on 2022/04/13 09:40:21 UTC, 0 replies.
- Problems with DataFrameReader in Structured Streaming - posted by Artemis User <ar...@dtechspace.com> on 2022/04/13 19:52:19 UTC, 0 replies.
- Spark sql slowness in Spark 3.0.1 - posted by Anil Dasari <ad...@guidewire.com> on 2022/04/14 16:13:17 UTC, 2 replies.
- Monitoring with elastic search in spark job - posted by Xinyu Luan <xi...@moesif.com.INVALID> on 2022/04/14 21:11:20 UTC, 1 replies.
- Custom metrics in py-spark 3 - posted by Harut Martirosyan <ha...@gmail.com> on 2022/04/15 01:48:34 UTC, 0 replies.
- Re: {EXT} Re: Spark sql slowness in Spark 3.0.1 - posted by Anil Dasari <ad...@guidewire.com> on 2022/04/15 18:14:04 UTC, 0 replies.
- Please Review My Code - posted by marc nicole <mk...@gmail.com> on 2022/04/16 13:03:11 UTC, 0 replies.
- [Spark Web UI] Integrating Keycloak SSO - posted by "Solomon, Brad" <br...@deloitte.com.INVALID> on 2022/04/18 12:58:11 UTC, 0 replies.
- [Spark Streaming] [Debug] Memory error when using NER model in Python - posted by Xavier Gervilla <xa...@datapta.com> on 2022/04/18 13:17:38 UTC, 7 replies.
- How is union() implemented? Need to implement column bind - posted by Andrew Davidson <ae...@ucsc.edu.INVALID> on 2022/04/19 01:20:42 UTC, 13 replies.
- RDD memory use question - posted by wilson <in...@bigcount.xyz> on 2022/04/19 10:33:42 UTC, 2 replies.
- Grouping and counting occurences of specific column rows - posted by marc nicole <mk...@gmail.com> on 2022/04/19 11:27:45 UTC, 3 replies.
- When should we cache / persist ? After or Before Actions? - posted by Sid <fl...@gmail.com> on 2022/04/21 07:25:33 UTC, 4 replies.
- [Spark Core]: Unexpectedly exiting executor while gracefully decommissioning - posted by Yeachan Park <ye...@gmail.com> on 2022/04/21 09:01:01 UTC, 2 replies.
- Spark error with jupyter - posted by Wassim Yaich <wa...@limonade.tn> on 2022/04/21 09:06:25 UTC, 0 replies.
- Why is spark running multiple stages with the same code line? - posted by Joe <jo...@net2020.org> on 2022/04/21 14:09:47 UTC, 4 replies.
- Coalesce, parallelism, time, idle cores, spills... - posted by Pedro Tuero <tu...@gmail.com> on 2022/04/21 18:43:29 UTC, 1 replies.
- [ANNOUNCE] Apache Kyuubi (Incubating) released 1.5.1-incubating - posted by Fu Chen <cf...@gmail.com> on 2022/04/22 02:21:15 UTC, 0 replies.
- Spark3.2 on K8s with proxy-user - posted by Pralabh Kumar <pr...@gmail.com> on 2022/04/22 04:51:59 UTC, 1 replies.
- Streaming write to orc problem - posted by "hsy541@gmail.com" <hs...@gmail.com> on 2022/04/22 14:56:59 UTC, 2 replies.
- Spark job failing and not giving error to do diagnosis - posted by rajat kumar <ku...@gmail.com> on 2022/04/23 19:18:00 UTC, 0 replies.
- Log4j vulnerability fix | CVE-2021-44228 - posted by "Shankar, Prakash" <Pr...@comcast.com.INVALID> on 2022/04/25 05:47:11 UTC, 1 replies.
- Vulnerabilities in htrace-core4-4.1.0-incubating.jar jar used in spark. - posted by HARSH TAKKAR <ta...@gmail.com> on 2022/04/26 10:47:36 UTC, 2 replies.
- Dealing with large number of small files - posted by Sid <fl...@gmail.com> on 2022/04/26 14:43:14 UTC, 9 replies.
- [window aggregate][debug] Rows not dropping with watermark and window - posted by Xavier Gervilla <xa...@datapta.com> on 2022/04/27 14:44:51 UTC, 0 replies.
- unsubscribe - posted by Deepak Gajare <de...@yahoo.com.INVALID> on 2022/04/29 02:05:16 UTC, 0 replies.
- Reg: CVE-2020-9480 - posted by Sundar Sabapathi Meenakshi <su...@mcruncher.com> on 2022/04/29 02:51:03 UTC, 1 replies.
- Unsubscribe - posted by Ajay Thompson <aj...@gmail.com> on 2022/04/29 03:29:49 UTC, 2 replies.
- structured streaming- checkpoint metadata growing indefinetely - posted by Wojciech Indyk <wo...@gmail.com> on 2022/04/29 05:00:33 UTC, 3 replies.