- Re: [Structured Streaming] multiple queries in one application - posted by Abhisheks <sm...@gmail.com> on 2020/05/01 02:32:35 UTC, 1 replies.
- Re: Spark job stuck at s3a-file-system metrics system started - posted by Abhisheks <sm...@gmail.com> on 2020/05/01 02:56:46 UTC, 1 replies.
- Subscribe - posted by Nelson Mandela <tr...@mail.com> on 2020/05/01 06:27:16 UTC, 0 replies.
- TRUMP: clean hindutwa with an injection of DETTOL then grabbed the pussy in the locker room - posted by Nelson Mandela <tr...@mail.com> on 2020/05/01 13:29:36 UTC, 0 replies.
- Have you paid your bug bounty or did you log him off without paying - posted by Nelson Mandela <tr...@mail.com> on 2020/05/01 13:33:00 UTC, 0 replies.
- Hey crazy natzi Sean Owen do your job you incompetent useless pratt. You wrote you "subscribed " for this - posted by Nelson Mandela <tr...@mail.com> on 2020/05/01 13:38:24 UTC, 0 replies.
- Would Nelson Mandela work and make money while his people suffered from apartheid. You all do it. - posted by Nelson Mandela <tr...@mail.com> on 2020/05/01 13:42:32 UTC, 0 replies.
- You shook hands with butchers of Gujarat now you are locked same as kashmir - posted by Nelson Mandela <tr...@mail.com> on 2020/05/01 13:45:37 UTC, 1 replies.
- The new sock-puppet account sending the last few emails has been banned - posted by Sean Owen <sr...@apache.org> on 2020/05/01 13:47:33 UTC, 0 replies.
- [spark streaming] checkpoint location feature for batch processing - posted by Rishi Shah <ri...@gmail.com> on 2020/05/01 21:55:02 UTC, 6 replies.
- Path style access fs.s3a.path.style.access property is not working in spark code - posted by Aniruddha P Tekade <at...@binghamton.edu> on 2020/05/02 00:08:04 UTC, 2 replies.
- Modularising Spark/Scala program - posted by Mich Talebzadeh <mi...@gmail.com> on 2020/05/02 13:00:00 UTC, 2 replies.
- Re: Spark structured streaming - performance tuning - posted by Srinivas V <sr...@gmail.com> on 2020/05/02 18:37:01 UTC, 1 replies.
- Unsubscribe - posted by Bibudh Lahiri <bi...@gmail.com> on 2020/05/03 06:54:14 UTC, 4 replies.
- Good idea to do multi-threading in spark job? - posted by Ruijing Li <li...@gmail.com> on 2020/05/03 16:31:41 UTC, 2 replies.
- Watch "Airbus makes more of the sky with Spark - Jesse Anderson & Hassene Ben Salem" on YouTube - posted by Fuo Bol <gf...@mail.com> on 2020/05/03 16:44:55 UTC, 2 replies.
- Alternative for spark-redshift on scala 2.12 - posted by Jun Zhu <ju...@vungle.com.INVALID> on 2020/05/05 08:22:48 UTC, 0 replies.
- Any impact on Driver out memory with long batch queue. - posted by Hrishikesh Mishra <sd...@gmail.com> on 2020/05/05 14:03:25 UTC, 0 replies.
- Exception handling in Spark - posted by Mich Talebzadeh <mi...@gmail.com> on 2020/05/05 15:25:18 UTC, 14 replies.
- Pyspark and snowflake Column Mapping - posted by anbutech <an...@outlook.com> on 2020/05/05 17:19:33 UTC, 0 replies.
- PyArrow Exception in Pandas UDF GROUPEDAGG() - posted by Gautham Acharya <ga...@alleninstitute.org> on 2020/05/06 00:07:58 UTC, 2 replies.
- URL what is ? SecureByDesign & Use of LOGIN form not pop box. - posted by Secure Bydesign <gy...@mail.com> on 2020/05/06 05:17:15 UTC, 0 replies.
- Re: Spark hangs while reading from jdbc - does nothing Removing Guess work from trouble shooting - posted by Ruijing Li <li...@gmail.com> on 2020/05/06 07:18:24 UTC, 0 replies.
- Pyspark Kafka Structured Stream not working. - posted by Vijayant Kumar <Vi...@mavenir.com.INVALID> on 2020/05/06 08:36:00 UTC, 1 replies.
- Query on Spark Dataframe Aggregations - posted by Subash Prabakar <su...@gmail.com> on 2020/05/06 14:18:50 UTC, 2 replies.
- How to unsubscribe - posted by Fred Liu <fr...@synnex.com> on 2020/05/06 17:12:33 UTC, 1 replies.
- Which SQL flavor does Spark SQL follow? - posted by Aakash Basu <aa...@gmail.com> on 2020/05/06 20:34:56 UTC, 2 replies.
- Abstract of child object from Parent Object - posted by JeffEvans <Je...@protonmail.com.INVALID> on 2020/05/07 00:17:51 UTC, 0 replies.
- cyber bullying by sowen@apache.org - posted by JeffEvans1112 <Je...@protonmail.com.INVALID> on 2020/05/07 04:28:05 UTC, 0 replies.
- Cyber bullying for reporting bugs - posted by JeffEvans1112 <Je...@protonmail.com.INVALID> on 2020/05/07 04:33:01 UTC, 0 replies.
- How to populate all possible combination values in columns using Spark SQL - posted by Aakash Basu <aa...@gmail.com> on 2020/05/07 04:55:50 UTC, 5 replies.
- RE: [E] Re: Pyspark Kafka Structured Stream not working. - posted by Vijayant Kumar <Vi...@mavenir.com.INVALID> on 2020/05/07 06:55:51 UTC, 1 replies.
- java.lang.OutOfMemoryError Spark Worker - posted by Hrishikesh Mishra <sd...@gmail.com> on 2020/05/07 11:12:37 UTC, 9 replies.
- [Spark SQL][Beginner] Spark throw Catalyst error while writing the dataframe in ORC format - posted by Deepak Garg <ro...@gmail.com> on 2020/05/07 13:55:58 UTC, 2 replies.
- No. of active states? - posted by Something Something <ma...@gmail.com> on 2020/05/07 18:26:10 UTC, 4 replies.
- Dynamically changing maxOffsetsPerTrigger - posted by Something Something <ma...@gmail.com> on 2020/05/07 18:42:56 UTC, 0 replies.
- Spark Window Documentation - posted by neeraj bhadani <bh...@gmail.com> on 2020/05/08 09:33:31 UTC, 3 replies.
- How to deal Schema Evolution with Dataset API - posted by Jorge Machado <jo...@me.com.INVALID> on 2020/05/09 11:28:07 UTC, 0 replies.
- Re: How to deal Schema Evolution with Dataset API - posted by Jorge Machado <jo...@me.com.INVALID> on 2020/05/09 14:50:27 UTC, 1 replies.
- dynamic executor scaling spark on kubernetes client mode - posted by Pradeepta Choudhury <pr...@gmail.com> on 2020/05/09 16:34:20 UTC, 4 replies.
- AnalysisException - Infer schema for the Parquet path - posted by Chetan Khatri <ch...@gmail.com> on 2020/05/09 21:50:43 UTC, 3 replies.
- Spark wrote to Hive table. file content format and fileformat in metadata doesn't match - posted by 马阳阳 <ma...@163.com> on 2020/05/11 09:37:29 UTC, 0 replies.
- unsubscribe - posted by Nikita Goyal <go...@gmail.com> on 2020/05/11 10:49:45 UTC, 6 replies.
- Regarding anomaly detection in real time streaming data - posted by Hemant Garg <ga...@gmail.com> on 2020/05/11 12:25:10 UTC, 0 replies.
- GroupState limits - posted by tleilaxu <tl...@gmail.com> on 2020/05/11 21:18:27 UTC, 1 replies.
- XPATH_INT behavior - XML - Function in Spark - posted by Chetan Khatri <ch...@gmail.com> on 2020/05/11 21:29:15 UTC, 4 replies.
- [PySpark] Tagging descriptions - posted by Rishi Shah <ri...@gmail.com> on 2020/05/11 22:40:49 UTC, 7 replies.
- [Spark SQL][reopen SPARK-16951]:Alternative implementation of NOT IN to Anti-join - posted by "Shuang, Linna1" <li...@intel.com> on 2020/05/12 02:31:24 UTC, 2 replies.
- Dependency management using https in spark on kubernetes - posted by Pradeepta Choudhury <pr...@gmail.com> on 2020/05/12 20:09:22 UTC, 0 replies.
- to_avro/from_avro inserts extra values from Kafka - posted by Alex Nastetsky <al...@verve.com> on 2020/05/12 21:28:39 UTC, 0 replies.
- Huge difference in speed between pyspark and scalaspark - posted by Steven Van Ingelgem <st...@kbc.be.INVALID> on 2020/05/13 11:29:02 UTC, 10 replies.
- Calling HTTP Rest APIs from Spark Job - posted by Chetan Khatri <ch...@gmail.com> on 2020/05/14 21:03:07 UTC, 9 replies.
- Using Spark Accumulators with Structured Streaming - posted by Something Something <ma...@gmail.com> on 2020/05/14 21:36:45 UTC, 18 replies.
- spark on k8s - can driver and executor have separate checkpoint location? - posted by wzhan <we...@nokia.com> on 2020/05/16 04:05:58 UTC, 1 replies.
- How to change Dataframe schema - posted by Manjunath Shetty H <ma...@live.com> on 2020/05/16 14:50:40 UTC, 1 replies.
- Spark Streaming Memory - posted by András Kolbert <ko...@gmail.com> on 2020/05/17 17:35:54 UTC, 1 replies.
- FeigenBaumConstant - posted by FeigenB aum <Fe...@mail.com> on 2020/05/18 03:25:52 UTC, 0 replies.
- Boosting executorMemory vs. executorMemoryOverhead - posted by Shelby Vanhooser <sv...@palantir.com.INVALID> on 2020/05/18 15:52:17 UTC, 0 replies.
- How to split a dataframe into two dataframes based on count - posted by Mohit Durgapal <du...@gmail.com> on 2020/05/18 16:57:18 UTC, 1 replies.
- CSV data source : Garbled Japanese text and handling multilines - posted by Ashika Umagiliya <as...@gmail.com> on 2020/05/19 00:24:24 UTC, 1 replies.
- array_sort function behaviour - posted by neeraj bhadani <bh...@gmail.com> on 2020/05/19 11:09:12 UTC, 0 replies.
- Re: array_sort function behaviour - posted by Liu Genie <ge...@outlook.com> on 2020/05/19 12:39:42 UTC, 0 replies.
- Unit testing Spark/Scala code with Mockito - posted by Mich Talebzadeh <mi...@gmail.com> on 2020/05/20 10:58:48 UTC, 2 replies.
- BOOK review of Spark: WARNING to spark users - posted by emma davis <em...@aol.com.INVALID> on 2020/05/20 20:43:23 UTC, 0 replies.
- Re: BOOK review of Spark: WARNING to spark users - posted by Jacek Laskowski <ja...@japila.pl> on 2020/05/21 09:11:43 UTC, 0 replies.
- ETL Using Spark - posted by Avadhut Narayan Joshi <AJ...@slb.com.INVALID> on 2020/05/21 14:10:54 UTC, 0 replies.
- Re: ETL Using Spark - posted by Mich Talebzadeh <mi...@gmail.com> on 2020/05/21 15:25:13 UTC, 1 replies.
- Re: Spark Kafka Streaming with Offset Gaps - posted by "nimmi.cv" <ni...@gmail.com> on 2020/05/21 21:09:58 UTC, 0 replies.
- Spark Kafka Streaming With Transactional Messages - posted by "nimmi.cv" <ni...@gmail.com> on 2020/05/21 21:15:20 UTC, 0 replies.
- [apache-spark]-spark-shuffle - posted by Vijay Kumar <vi...@gmail.com> on 2020/05/22 08:00:22 UTC, 1 replies.
- [structured streaming] [stateful] Null value appeared in non-nullable field - posted by Srinivas V <sr...@gmail.com> on 2020/05/23 11:13:55 UTC, 1 replies.
- spark kafka option properties - posted by Gunjan Kumar <gu...@gmail.com> on 2020/05/24 09:23:30 UTC, 0 replies.
- Cleanup hook for temporary files produced as part of a spark job - posted by jelmer <jk...@gmail.com> on 2020/05/24 13:42:15 UTC, 0 replies.
- - posted by Vijaya Phanindra Sarma B <bv...@gmail.com> on 2020/05/24 15:04:50 UTC, 0 replies.
- Parallelising JDBC reads in spark - posted by Manjunath Shetty H <ma...@live.com> on 2020/05/25 02:50:41 UTC, 0 replies.
- Re: Parallelising JDBC reads in spark - posted by Mike Artz <mi...@gmail.com> on 2020/05/25 05:20:25 UTC, 6 replies.
- Arrow RecordBatches/Pandas Dataframes to (Arrow enabled) Spark Dataframe conversion in streaming fashion - posted by Tanveer Ahmad - EWI <T....@tudelft.nl> on 2020/05/25 11:53:17 UTC, 1 replies.
- Fwd: Spark API and immutability - posted by Chris Thomas <he...@gmail.com> on 2020/05/25 17:55:57 UTC, 1 replies.
- RecordTooLargeException in Spark *Structured* Streaming - posted by Something Something <ma...@gmail.com> on 2020/05/25 21:42:34 UTC, 2 replies.
- PySpark .collect() output to Scala Array[Row] - posted by Nick Ruest <ru...@gmail.com> on 2020/05/26 01:04:20 UTC, 1 replies.
- How to enable hive support on an existing Spark session? - posted by "Kun Huang (COSMOS)" <ku...@microsoft.com.INVALID> on 2020/05/26 16:20:50 UTC, 1 replies.
- Spark on kubernetes memory spike and spark.kubernetes.memoryOverheadFactor not working - posted by "Maiti, Mousam" <mm...@informatica.com.INVALID> on 2020/05/27 08:27:29 UTC, 0 replies.
- Regarding Spark 3.0 GA - posted by ARNAV NEGI SOFTWARE ARCHITECT <ne...@gmail.com> on 2020/05/27 08:52:02 UTC, 4 replies.
- Spark dataframe hdfs vs s3 - posted by Dark Crusader <re...@gmail.com> on 2020/05/27 16:16:28 UTC, 11 replies.
- Different execution results with wholestage codegen on and off - posted by Pasha Finkelshteyn <pa...@gmail.com> on 2020/05/27 20:19:23 UTC, 2 replies.
- CSV parsing issue - posted by elango vaidyanathan <el...@gmail.com> on 2020/05/28 15:20:55 UTC, 4 replies.
- External hive metastore (remote) managed tables - posted by Debajyoti Roy <ne...@gmail.com> on 2020/05/28 20:25:12 UTC, 0 replies.
- [Apache Spark][Streaming Job][Checkpoint]Spark job failed on Checkpoint recovery with Batch not found error - posted by taylorwu <wy...@hotmail.com> on 2020/05/29 01:04:18 UTC, 0 replies.
- Spark Security - posted by wi...@gmail.com on 2020/05/29 14:09:47 UTC, 3 replies.
- [pyspark 2.3+] Dedupe records - posted by Rishi Shah <ri...@gmail.com> on 2020/05/30 02:47:21 UTC, 3 replies.
- Dataframe to nested json document - posted by Chidananda Unchi <ch...@gmail.com> on 2020/05/30 10:50:05 UTC, 3 replies.
- [bug] Scala reflection "assertion failed: class Byte" in Dataset.toJSON - posted by Brandon Vincent <br...@gmail.com> on 2020/05/30 19:48:49 UTC, 0 replies.
- Apache Spark Machine Learning Unleashed Book Review author: Jillur Quddus - posted by patrice molinchaeux <pa...@engineer.com> on 2020/05/31 00:51:22 UTC, 0 replies.
- Using existing distribution for join when subset of keys - posted by Patrick Woody <pa...@gmail.com> on 2020/05/31 14:43:15 UTC, 3 replies.