You are viewing a plain text version of this content. The canonical link for it is here.
- spark null values calculation - posted by wilson <wi...@4shield.net> on 2022/05/01 03:01:08 UTC, 1 replies.
- how spark handle the abnormal values - posted by wilson <wi...@4shield.net> on 2022/05/01 08:14:43 UTC, 4 replies.
- Idea for improving performance when reading from hive-like partition folders and specifying a filter [Spark 3.2] - posted by Martin <bo...@gmx.de> on 2022/05/01 12:08:04 UTC, 0 replies.
- Re: Vulnerabilities in htrace-core4-4.1.0-incubating.jar jar used in spark. - posted by HARSH TAKKAR <ta...@gmail.com> on 2022/05/02 05:46:29 UTC, 1 replies.
- Unsubscribe - posted by Sahil Bali <da...@gmail.com> on 2022/05/02 08:41:39 UTC, 1 replies.
- unsubscribe - posted by Ray Qiu <ra...@gmail.com> on 2022/05/02 19:31:47 UTC, 1 replies.
- Parse Execution Plan from PySpark - posted by Pablo Alcain <pa...@gmail.com> on 2022/05/03 03:06:58 UTC, 0 replies.
- RE: [EXTERNAL] Parse Execution Plan from PySpark - posted by Shay Elbaz <sh...@gm.com> on 2022/05/03 08:17:48 UTC, 2 replies.
- REMINDER - Travel Assistance available for ApacheCon NA New Orleans 2022 - posted by Gavin McDonald <gm...@apache.org> on 2022/05/03 11:05:04 UTC, 0 replies.
- Re: Spark error with jupyter - posted by Bjørn Jørgensen <bj...@gmail.com> on 2022/05/03 18:35:45 UTC, 1 replies.
- trouble using spark in kubernetes - posted by Andreas Klos <an...@fernuni-hagen.de> on 2022/05/03 22:29:34 UTC, 0 replies.
- Re: structured streaming- checkpoint metadata growing indefinetely - posted by Wojciech Indyk <wo...@gmail.com> on 2022/05/05 05:18:24 UTC, 0 replies.
- Disable/Remove datasources in Spark - posted by Aditya <ad...@gmail.com> on 2022/05/05 06:39:13 UTC, 4 replies.
- Something about Spark which has bothered me for a very long time, which I've never understood - posted by Denarian Kislata <de...@gmail.com> on 2022/05/05 08:07:03 UTC, 1 replies.
- Kafka Spark Structure Streaming Error - posted by nayan sharma <na...@gmail.com> on 2022/05/05 11:45:25 UTC, 0 replies.
- groupby question - posted by Irene Markelic <ir...@markelic.de> on 2022/05/05 17:30:25 UTC, 1 replies.
- Count() action leading to errors | Pyspark - posted by Sid <fl...@gmail.com> on 2022/05/06 08:25:22 UTC, 1 replies.
- Need help on migrating Spark on Hortonworks to Kubernetes Cluster - posted by Chetan Khatri <ch...@gmail.com> on 2022/05/08 16:36:48 UTC, 0 replies.
- How do I read parquet with python object - posted by be...@datalab.run on 2022/05/09 13:43:29 UTC, 1 replies.
- Spark on K8s - repeating annoying exception - posted by Shay Elbaz <sh...@gm.com> on 2022/05/09 14:55:19 UTC, 1 replies.
- Structured streaming help on releasing memory - posted by Xavi Gervilla <xa...@datapta.com> on 2022/05/09 23:02:51 UTC, 0 replies.
- RE: [EXTERNAL] Re: Spark on K8s - repeating annoying exception - posted by Shay Elbaz <sh...@gm.com> on 2022/05/15 16:18:29 UTC, 0 replies.
- [Spark SQL]: Does Spark SQL support WAITFOR? - posted by "K. N. Ramachandran" <kn...@gmail.com> on 2022/05/16 00:16:36 UTC, 5 replies.
- [Spark SQL]: Configuring/Using Spark + Catalyst optimally for read-heavy transactional workloads in JDBC sources? - posted by Gavin Ray <ra...@gmail.com> on 2022/05/16 16:55:38 UTC, 2 replies.
- Reverse proxy for Spark UI on Kubernetes - posted by bo yang <bo...@gmail.com> on 2022/05/17 05:46:30 UTC, 6 replies.
- A scene with unstable Spark performance - posted by Bowen Song <bo...@kyligence.io> on 2022/05/17 14:32:34 UTC, 4 replies.
- Stopping streaming after the write commit and before the read commit? - posted by kineret M <ki...@gmail.com> on 2022/05/18 11:52:03 UTC, 0 replies.
- What does Apache Spark do? - posted by Turritopsis Dohrnii Teo En Ming <ce...@gmail.com> on 2022/05/18 13:09:08 UTC, 1 replies.
- Spark 3 migration question - posted by Jason Xu <ja...@gmail.com> on 2022/05/18 15:00:00 UTC, 0 replies.
- [SQL] Why does a small two-source JDBC query take ~150-200ms with all optimizations (AQE, CBO, pushdown, Kryo, unsafe) enabled? (v3.4.0-SNAPSHOT) - posted by Gavin Ray <ra...@gmail.com> on 2022/05/19 01:21:22 UTC, 0 replies.
- Final reminder: ApacheCon North America call for presentations closing soon - posted by Rich Bowen <rb...@apache.org> on 2022/05/19 09:44:23 UTC, 0 replies.
- Problem with implementing the Datasource V2 API for Salesforce - posted by Rohit Pant <rp...@gmail.com> on 2022/05/21 14:45:12 UTC, 1 replies.
- Spark Push-Based Shuffle causing multiple stage failures - posted by Han Altae-Tran <al...@mit.edu> on 2022/05/23 06:04:08 UTC, 4 replies.
- how to add a column for percent - posted by wilson <wi...@4shield.net> on 2022/05/23 06:04:08 UTC, 1 replies.
- Job migrated from EMR to Dataproc takes 20 hours instead of 90 minutes - posted by Ori Popowski <or...@gmail.com> on 2022/05/24 09:38:50 UTC, 7 replies.
- GCP Dataproc - adding multiple packages(kafka, mongodb) while submitting spark jobs not working - posted by karan alang <ka...@gmail.com> on 2022/05/24 21:30:52 UTC, 0 replies.
- [SPARK SQL] Spark Thrift server, It is not releasing memory. - posted by Ramakrishna Chilaka <ra...@nference.net.INVALID> on 2022/05/25 06:52:33 UTC, 0 replies.
- Complexity with the data - posted by Sid <fl...@gmail.com> on 2022/05/25 20:06:51 UTC, 16 replies.
- java.lang.NoSuchMethodError: org.apache.hadoop.hive.common.FileUtils.mkdir --> Spark to Hive - posted by Prasanth M Sasidharan <pr...@gmail.com> on 2022/05/26 16:01:46 UTC, 1 replies.
- Issues getting Apache Spark - posted by "Martin, Michael" <mi...@cgi.com.INVALID> on 2022/05/26 19:19:31 UTC, 1 replies.
- k-anonymity with Spark in Java - posted by marc nicole <mk...@gmail.com> on 2022/05/28 10:13:44 UTC, 0 replies.
- Unable to convert double values - posted by Sid <fl...@gmail.com> on 2022/05/29 17:24:03 UTC, 3 replies.
- Unable to format timestamp values in pyspark - posted by Sid <fl...@gmail.com> on 2022/05/30 08:04:22 UTC, 2 replies.
- Re: protobuf data as input to spark streaming - posted by Kiran Biswal <bi...@gmail.com> on 2022/05/30 17:47:46 UTC, 0 replies.
- GCP Cloud Logging Cost increasing with Dataproc img version 2.0.39-ubuntu18 - posted by karan alang <ka...@gmail.com> on 2022/05/30 23:14:57 UTC, 0 replies.
- Kotlin API for Apache Spark feedback - posted by finkel <pa...@gmail.com> on 2022/05/31 16:29:24 UTC, 0 replies.