You are viewing a plain text version of this content. The canonical link for it is here.
- Error while getting RDD partitions for a parquet dataframe in Spark 3 - posted by Albert Butterscotch <ri...@gmail.com> on 2020/09/01 13:39:05 UTC, 0 replies.
- value col is not a member of org.apache.spark.rdd.RDD - posted by dwgw <dw...@gmail.com> on 2020/09/02 04:39:56 UTC, 0 replies.
- Adding isolation level when reading from DB2 with spark.read - posted by Filipa Sousa <Fi...@criticaltechworks.com> on 2020/09/02 14:34:01 UTC, 3 replies.
- Submitting Spark Job thru REST API? - posted by Eric Beabes <ma...@gmail.com> on 2020/09/02 20:58:12 UTC, 4 replies.
- 回复:Submitting Spark Job thru REST API? - posted by tianlangstudio <ti...@aliyun.com.INVALID> on 2020/09/03 07:06:45 UTC, 0 replies.
- Spark Streaming Checkpointing - posted by András Kolbert <ko...@gmail.com> on 2020/09/03 09:41:05 UTC, 2 replies.
- Re: Merging Parquet Files - posted by Michael Segel <ms...@hotmail.com> on 2020/09/03 18:52:33 UTC, 0 replies.
- Iterating all columns in a pyspark dataframe - posted by "Devi P.V" <de...@gmail.com> on 2020/09/04 07:11:22 UTC, 1 replies.
- Keeping track of how long something has been in a queue - posted by Hamish Whittal <ha...@cloud-fundis.co.za> on 2020/09/04 14:02:15 UTC, 2 replies.
- Spark Application REST API, looking for a way to kill specific task or executor - posted by Ivan Petrov <ca...@gmail.com> on 2020/09/05 14:41:27 UTC, 2 replies.
- Query about Spark - posted by Ankur Das <da...@gmail.com> on 2020/09/06 13:30:17 UTC, 6 replies.
- Elastic Search sink showing -1 for numOutputRows - posted by jainshasha <ja...@gmail.com> on 2020/09/07 07:20:02 UTC, 3 replies.
- arbitrary state handling in python API - posted by "Georg Heiler (TU Vienna)" <ge...@tuwien.ac.at> on 2020/09/08 11:21:29 UTC, 0 replies.
- [Spark Core] makeRDD() preferredLocations do not appear to be considered - posted by Tom Scott <th...@gmail.com> on 2020/09/08 21:11:34 UTC, 1 replies.
- subscribe user@spark.apache.org - posted by Joan <jo...@foxmail.com> on 2020/09/09 08:22:02 UTC, 0 replies.
- Missing / Duplicate Data when Spark retries - posted by Ruijing Li <li...@gmail.com> on 2020/09/10 05:03:02 UTC, 2 replies.
- RE: Spark 3.0 using S3 taking long time for some set of TPC DS Queries - posted by "Rao, Abhishek (Nokia - IN/Bangalore)" <ab...@nokia.com> on 2020/09/10 07:26:17 UTC, 0 replies.
- [ANNOUNCE] Announcing Apache Spark 3.0.1 - posted by 郑瑞峰 <ru...@foxmail.com> on 2020/09/11 08:52:23 UTC, 4 replies.
- Re: [DISCUSS] Spark cannot identify the problem executor - posted by Sean Owen <sr...@gmail.com> on 2020/09/11 12:42:59 UTC, 3 replies.
- Re: LiveListenerBus is occupying most of the Driver Memory and frequent GC is degrading the performance - posted by Teja <sa...@gmail.com> on 2020/09/11 17:18:33 UTC, 1 replies.
- Query /Bug Spark Streaming / Context Cleaner/ GC question - posted by Tarun Rajput <Ta...@microsoft.com.INVALID> on 2020/09/15 21:05:01 UTC, 0 replies.
- Is there any good Docker container / compose with spark 2.4+ and YARN 2.8.2+ - posted by Ivan Petrov <ca...@gmail.com> on 2020/09/16 10:49:30 UTC, 1 replies.
- Structured Streaming Checkpoint Error - posted by German Schiavon <gs...@gmail.com> on 2020/09/16 14:11:18 UTC, 2 replies.
- Re: Spark Kafka Streaming With Transactional Messages - posted by jianyangusa <ji...@gmail.com> on 2020/09/16 18:39:22 UTC, 0 replies.
- [pyspark 2.4] broadcasting DataFrame throws error - posted by Rishi Shah <ri...@gmail.com> on 2020/09/17 04:13:09 UTC, 4 replies.
- Re: Spark structured streaming: periodically refresh static data frame - posted by Harsh <ta...@gmail.com> on 2020/09/17 09:46:16 UTC, 0 replies.
- unsubscribe - posted by Kaden Cho <ka...@gmail.com> on 2020/09/17 10:22:49 UTC, 0 replies.
- Pre query execution hook for custom datasources - posted by Shubham Chaurasia <sh...@gmail.com> on 2020/09/18 08:17:31 UTC, 0 replies.
- Spark streaming job not able to launch more number of executors - posted by "Vibhor Banga ( Engineering - VS)" <vi...@flipkart.com.INVALID> on 2020/09/18 12:19:13 UTC, 0 replies.
- Spark : Very simple query failing [Needed help please] - posted by Debabrata Ghosh <ma...@gmail.com> on 2020/09/18 13:10:32 UTC, 1 replies.
- how to integrate hbase and hive in spark3.0.1? - posted by 李继先 <69...@qq.com> on 2020/09/19 03:40:10 UTC, 0 replies.
- UnknownHostException is thrown when spark job whose jar files will be uploaded to s3 object storage via https is submitted to kubernetes - posted by mykidong <my...@gmail.com> on 2020/09/20 05:18:02 UTC, 1 replies.
- Apache Spark Error. - posted by Ömer Ölmez <om...@gmail.com> on 2020/09/20 17:33:14 UTC, 0 replies.
- Re: UnknownHostException is thrown when spark job whose jar files will be uploaded to s3 object storage via https is submitted to kubernetes - posted by Hitesh Tiwari <hi...@gmail.com> on 2020/09/20 19:43:53 UTC, 0 replies.
- Exporting spark custom metrics via prometheus jmx exporter - posted by adilerman <ad...@startapp.com> on 2020/09/21 08:26:50 UTC, 0 replies.
- 【Spark ML】How to get access of the MLlib's LogisticRegressionWithSGD after 3.0.0? - posted by Lyx <11...@qq.com> on 2020/09/22 05:53:18 UTC, 0 replies.
- Re: 【Spark ML】How to get access of the MLlib's LogisticRegressionWithSGD after 3.0.0? - posted by Sean Owen <sr...@gmail.com> on 2020/09/22 12:10:43 UTC, 0 replies.
- Spark Submit processes hanging & leaking memory - posted by ER <el...@gmail.com> on 2020/09/22 19:15:58 UTC, 0 replies.
- Is RDD.persist honoured if multiple actions are executed in parallel - posted by Arya Ketan <ke...@gmail.com> on 2020/09/23 07:44:09 UTC, 3 replies.
- Bloom Filter to filter huge dataframes with PySpark - posted by Breno Arosa <br...@edumobi.com.br> on 2020/09/23 14:58:14 UTC, 0 replies.
- Spark watermarked aggregation query and append output mode - posted by Sergey Oboguev <ob...@gmail.com> on 2020/09/23 20:47:22 UTC, 2 replies.
- Edge AI with Spark - posted by Marco Sassarini <Ma...@overit.it> on 2020/09/24 07:19:11 UTC, 3 replies.
- Distribute entire columns to executors - posted by Pedro Cardoso <pe...@feedzai.com> on 2020/09/24 09:51:36 UTC, 2 replies.
- Let multiple jobs share one rdd? - posted by Gang Li <lg...@gmail.com> on 2020/09/24 13:52:20 UTC, 1 replies.
- [Pyspark 3 Debug] Date values reset to Unix epoch - posted by Andrew Mullins <an...@cascadedatalabs.com> on 2020/09/24 18:02:19 UTC, 2 replies.
- A simple example that demonstrates that a Spark distributed cluster is faster than Spark Local Standalone - posted by javaguy Java <ja...@gmail.com> on 2020/09/24 18:43:06 UTC, 5 replies.
- https://issues.apache.org/jira/browse/SPARK-18381 - posted by ayan guha <gu...@gmail.com> on 2020/09/25 06:42:43 UTC, 0 replies.
- Query around Spark Checkpoints - posted by Debabrata Ghosh <ma...@gmail.com> on 2020/09/27 10:39:02 UTC, 5 replies.
- WARN ProcfsMetricsGetter: Exception when trying to compute pagesize, as a result reporting of ProcessTree metrics is stopped - posted by xorz57 <xo...@gmail.com> on 2020/09/27 13:56:31 UTC, 0 replies.
- To monitor the executors of a Spark application - posted by Dhiman <se...@gmail.com> on 2020/09/27 15:00:06 UTC, 0 replies.
- 回复:To monitor the executors of a Spark application - posted by tianlangstudio <ti...@aliyun.com.INVALID> on 2020/09/28 03:09:13 UTC, 0 replies.
- [Spark Prometheus Metrics] How to add my own metrics in spark streaming job? - posted by Christine Gong <ch...@gmail.com> on 2020/09/28 07:21:50 UTC, 2 replies.
- [SQL] How to get an encoder for string array in java? - posted by tanelk <ta...@gmail.com> on 2020/09/29 08:03:45 UTC, 1 replies.
- Should SHOW TABLES statement return a hive-compatible output? - posted by Ricardo Martinelli de Oliveira <rm...@redhat.com> on 2020/09/29 13:38:06 UTC, 0 replies.
- Offset Management in Spark - posted by Siva Samraj <sa...@gmail.com> on 2020/09/30 07:44:33 UTC, 1 replies.
- [Spark SQL] does pyspark udf support spark.sql inside def - posted by Lakshmi Nivedita <kl...@gmail.com> on 2020/09/30 11:58:11 UTC, 2 replies.
- Apache Spark Bogotá Meetup - posted by Miguel Angel Díaz Rodríguez <ma...@gmail.com> on 2020/09/30 12:03:01 UTC, 2 replies.
- Spark and Twistlock - posted by Khurram Qureshi <kh...@fermatsoftware.com> on 2020/09/30 12:32:45 UTC, 0 replies.
- Custom Metrics Source -> Sink routing - posted by Dávid Szakállas <da...@gmail.com> on 2020/09/30 14:22:39 UTC, 0 replies.
- Spark JDBC- OAUTH example - posted by KhajaAsmath Mohammed <md...@gmail.com> on 2020/09/30 17:54:36 UTC, 2 replies.