You are viewing a plain text version of this content. The canonical link for it is here.
- [build system] DOWNTIME jenkins unreachable overnight - posted by shane knapp <sk...@berkeley.edu> on 2018/08/01 00:14:41 UTC, 1 replies.
- Re: Review notification bot - posted by Hyukjin Kwon <gu...@gmail.com> on 2018/08/01 01:21:02 UTC, 0 replies.
- Re: code freeze and branch cut for Apache Spark 2.4 - posted by Imran Rashid <im...@therashids.com> on 2018/08/01 04:21:44 UTC, 33 replies.
- Re: Writing file - posted by mattbuttow <ma...@yandex.com> on 2018/08/01 11:20:20 UTC, 0 replies.
- Re: [DISCUSS] Multiple catalog support - posted by Wenchen Fan <cl...@gmail.com> on 2018/08/01 14:07:36 UTC, 0 replies.
- Migrating from kafka08 client to kafka010 - posted by sandeep_katta <sa...@gmail.com> on 2018/08/02 07:02:58 UTC, 1 replies.
- [Proposal] New feature: reconfigurable number of partitions on stateful operators in Structured Streaming - posted by Jungtaek Lim <ka...@gmail.com> on 2018/08/03 06:45:42 UTC, 9 replies.
- Spark kafka streaming failure recovery scenario - posted by sujith71955 <su...@gmail.com> on 2018/08/03 10:38:39 UTC, 0 replies.
- Spark sql syntax checker - posted by Alessandro Liparoti <al...@gmail.com> on 2018/08/03 10:39:47 UTC, 0 replies.
- SPIP: Executor Plugin (SPARK-24918) - posted by Imran Rashid <ir...@cloudera.com.INVALID> on 2018/08/03 16:59:46 UTC, 7 replies.
- Re: Spark model serving - posted by Saikat Kanjilal <sx...@hotmail.com> on 2018/08/03 18:49:48 UTC, 0 replies.
- re: should we dump a warning if we drop batches due to window move? - posted by Peter Liu <pe...@gmail.com> on 2018/08/03 20:10:12 UTC, 0 replies.
- Am I crazy, or does the binary distro not have Kafka integration? - posted by Sean Owen <sr...@gmail.com> on 2018/08/04 15:48:18 UTC, 5 replies.
- Set up Scala 2.12 test build in Jenkins - posted by Sean Owen <sr...@gmail.com> on 2018/08/05 14:10:23 UTC, 10 replies.
- Why is SQLImplicits an abstract class rather than a trait? - posted by "assaf.mendelson" <as...@rsa.com> on 2018/08/05 15:34:57 UTC, 2 replies.
- Re: [DISCUSS][SQL] Control the number of output files - posted by Koert Kuipers <ko...@tresata.com> on 2018/08/06 01:06:54 UTC, 6 replies.
- Handle BlockMissingException in pyspark - posted by Divay Jindal <di...@gmail.com> on 2018/08/06 09:20:51 UTC, 2 replies.
- Re: [VOTE] SPARK 2.3.2 (RC3) - posted by Yuval Itzchakov <yu...@gmail.com> on 2018/08/06 10:10:16 UTC, 1 replies.
- [Performance] Spark DataFrame is slow with wide data. Polynomial complexity on the number of columns is observed. Why? - posted by makatun <d....@gmail.com> on 2018/08/06 12:44:41 UTC, 12 replies.
- [build system] bumped pull request builder job timeout to 400mins - posted by shane knapp <sk...@berkeley.edu> on 2018/08/07 17:05:30 UTC, 1 replies.
- [build system] jenkins/github commit access exploit - posted by shane knapp <sk...@berkeley.edu> on 2018/08/07 20:44:02 UTC, 0 replies.
- SparkContext singleton get w/o create? - posted by Andrew Melo <an...@gmail.com> on 2018/08/07 22:11:38 UTC, 9 replies.
- [build system] IMPORTANT: taking centos workers offline for pyarrow upgrade - posted by shane knapp <sk...@berkeley.edu> on 2018/08/08 17:31:16 UTC, 3 replies.
- unsubscribe - posted by Al Pivonka <al...@gmail.com> on 2018/08/08 18:57:49 UTC, 1 replies.
- [pyspark][SPARK-25079]: preparing to enter the brave new world of python3.5! - posted by shane knapp <sk...@berkeley.edu> on 2018/08/09 17:41:42 UTC, 1 replies.
- [R] discuss: removing lint-r checks for old branches - posted by shane knapp <sk...@berkeley.edu> on 2018/08/10 20:39:11 UTC, 8 replies.
- [Structured Streaming SPARK-23966] Why non-atomic rename is problem in State Store ? - posted by chandan prakash <ch...@gmail.com> on 2018/08/11 16:33:35 UTC, 0 replies.
- [DISCUSS] Handling correctness/data loss jiras - posted by Tom Graves <tg...@yahoo.com.INVALID> on 2018/08/13 13:45:39 UTC, 7 replies.
- CVE-2018-11770: Apache Spark standalone master, Mesos REST APIs not controlled by authentication - posted by Sean Owen <sr...@apache.org> on 2018/08/13 14:24:04 UTC, 0 replies.
- Re: Cleaning Spark releases from mirrors, and the flakiness of HiveExternalCatalogVersionsSuite - posted by Marcelo Vanzin <va...@cloudera.com.INVALID> on 2018/08/13 18:49:17 UTC, 0 replies.
- [discuss][minor] impending python 3.x jenkins upgrade... 3.5.x? 3.6.x? - posted by shane knapp <sk...@berkeley.edu> on 2018/08/13 20:14:59 UTC, 5 replies.
- Re: [DISCUSS] SPIP: APIs for Table Metadata Operations - posted by Ryan Blue <rb...@netflix.com.INVALID> on 2018/08/13 20:58:55 UTC, 3 replies.
- [VOTE] SPARK 2.3.2 (RC5) - posted by Saisai Shao <sa...@gmail.com> on 2018/08/14 08:04:29 UTC, 7 replies.
- [DISCUSS][SPARK-22674][PYTHON] Disabled _hack_namedtuple for picklable namedtuples - posted by Sergei Lebedev <se...@gmail.com> on 2018/08/14 14:25:13 UTC, 0 replies.
- Same code in DataFrameWriter.runCommand and Dataset.withAction? - posted by Jacek Laskowski <ja...@japila.pl> on 2018/08/14 15:05:16 UTC, 0 replies.
- sql compile failing with Zinc? - posted by Steve Loughran <st...@hortonworks.com> on 2018/08/14 19:56:28 UTC, 3 replies.
- [SPARK-24771] Upgrade AVRO version from 1.7.7 to 1.8 - posted by Wenchen Fan <cl...@gmail.com> on 2018/08/15 02:29:33 UTC, 0 replies.
- Naming policy for packages - posted by Simon Dirmeier <si...@web.de> on 2018/08/15 11:12:59 UTC, 11 replies.
- Proposing an 18-month maintenance period for feature branches - posted by Sean Owen <sr...@apache.org> on 2018/08/15 18:50:19 UTC, 0 replies.
- [DISCUSS] SparkR support on k8s back-end for Spark 2.4 - posted by Erik Erlandson <ee...@redhat.com> on 2018/08/15 19:33:46 UTC, 12 replies.
- Spark Kafka adapter questions - posted by Basil Hariri <Ba...@microsoft.com.INVALID> on 2018/08/17 22:48:18 UTC, 3 replies.
- best way to run one python test? - posted by Imran Rashid <ir...@cloudera.com.INVALID> on 2018/08/20 03:07:40 UTC, 3 replies.
- [DISCUSS] USING syntax for Datasource V2 - posted by Hyukjin Kwon <gu...@gmail.com> on 2018/08/20 07:19:34 UTC, 4 replies.
- Unsubscribe - posted by Michael Styles <mi...@shopify.com.INVALID> on 2018/08/20 11:54:00 UTC, 1 replies.
- Apache Airflow (incubator) PMC binding vote needed - posted by t4 <re...@hotmail.com> on 2018/08/20 12:38:38 UTC, 0 replies.
- Why repartitionAndSortWithinPartitions slower than MapReducer - posted by 周浥尘 <zh...@gmail.com> on 2018/08/20 12:52:57 UTC, 1 replies.
- Persisting driver logs in yarn client mode (SPARK-25118) - posted by Ankur Gupta <an...@cloudera.com.INVALID> on 2018/08/21 21:19:10 UTC, 6 replies.
- Spark DataFrame UNPIVOT feature - posted by Ivan Gozali <iv...@lecida.com> on 2018/08/21 22:05:50 UTC, 3 replies.
- [MLlib][Test] Smoke and Metamorphic Testing of MLlib - posted by Steffen Herbold <he...@cs.uni-goettingen.de> on 2018/08/22 11:12:54 UTC, 5 replies.
- Spark github sync works now - posted by Xiao Li <ga...@gmail.com> on 2018/08/22 16:08:47 UTC, 0 replies.
- Spark data quality bug when reading parquet files from hive metastore - posted by "Long, Andrew" <lo...@amazon.com.INVALID> on 2018/08/22 17:16:08 UTC, 2 replies.
- Porting or explicitly linking project style in Apache Spark based on https://github.com/databricks/scala-style-guide - posted by Hyukjin Kwon <gu...@gmail.com> on 2018/08/24 01:14:17 UTC, 5 replies.
- Off Heap Memory - posted by Jack Kolokasis <ko...@ics.forth.gr> on 2018/08/24 08:53:33 UTC, 0 replies.
- python tests: any reason for a huge tests.py? - posted by Imran Rashid <ir...@cloudera.com.INVALID> on 2018/08/24 16:53:26 UTC, 1 replies.
- Handling Very Large volume(500TB) data using spark - posted by Great Info <gu...@gmail.com> on 2018/08/25 02:54:13 UTC, 0 replies.
- multiple group by action - posted by 崔苗 <cu...@danale.com> on 2018/08/25 02:54:31 UTC, 0 replies.
- Reading 20 GB of log files from Directory - Out of Memory Error - posted by Chetan Khatri <ch...@gmail.com> on 2018/08/25 10:08:24 UTC, 1 replies.
- Why is View logical operator not a UnaryNode explicitly? - posted by Jacek Laskowski <ja...@japila.pl> on 2018/08/27 10:10:35 UTC, 0 replies.
- no logging in pyspark code? - posted by Imran Rashid <ir...@cloudera.com.INVALID> on 2018/08/27 17:29:05 UTC, 2 replies.
- [VOTE] SPIP: Executor Plugin (SPARK-24918) - posted by Imran Rashid <ir...@cloudera.com.INVALID> on 2018/08/28 13:50:09 UTC, 6 replies.
- Joining DataFrames derived from the same source yields confusing/incorrect results - posted by Nicholas Chammas <ni...@gmail.com> on 2018/08/29 16:44:16 UTC, 1 replies.
- [DISCUSS] move away from python doctests - posted by Imran Rashid <ir...@cloudera.com.INVALID> on 2018/08/29 18:35:28 UTC, 6 replies.
- mllib + SQL - posted by Hemant Bhanawat <he...@gmail.com> on 2018/08/30 06:45:28 UTC, 4 replies.
- Update to Kryo 4 for Spark 2.4? - posted by Sean Owen <sr...@apache.org> on 2018/08/30 16:36:12 UTC, 0 replies.
- Spark Streaming : Multiple sources found for csv : Error - posted by Srabasti Banerjee <sr...@ymail.com.INVALID> on 2018/08/31 03:52:11 UTC, 4 replies.
- data source api v2 refactoring - posted by Reynold Xin <rx...@databricks.com> on 2018/08/31 05:59:58 UTC, 1 replies.
- TimSort bug - posted by Reynold Xin <rx...@databricks.com> on 2018/08/31 07:37:04 UTC, 3 replies.
- Upgrade SBT to the latest - posted by Darcy Shen <sa...@zoho.com> on 2018/08/31 13:16:38 UTC, 2 replies.
- [discuss] replacing SPIP template with Heilmeier's Catechism? - posted by Reynold Xin <rx...@databricks.com> on 2018/08/31 18:23:09 UTC, 7 replies.
- Re: Nightly Builds in the docs (in spark-nightly/spark-master-bin/latest? Can't seem to find it) - posted by Cody Koeninger <co...@koeninger.org> on 2018/08/31 20:14:55 UTC, 3 replies.