user@spark.apache.org, 2017-04

- Re: spark kafka consumer with kerberos - posted by Saisai Shao <sa...@gmail.com> on 2017/04/01 00:58:30 UTC, 0 replies.
- [Spark Core]: flatMap/reduceByKey seems to be quite slow with Long keys on some distributions - posted by Richard Tsai <ri...@gmail.com> on 2017/04/01 07:29:21 UTC, 0 replies.
- Convert Dataframe to Dataset in pyspark - posted by Selvam Raman <se...@gmail.com> on 2017/04/01 12:36:25 UTC, 1 replies.
- Cuesheet - spark deployment - posted by Deepu Raj <de...@outlook.com> on 2017/04/01 12:55:01 UTC, 0 replies.
- getting error while storing data in Hbase - posted by Chintan Bhatt <ch...@charusat.ac.in> on 2017/04/01 16:47:10 UTC, 0 replies.
- pyspark bug with PYTHONHASHSEED - posted by Paul Tremblay <pa...@gmail.com> on 2017/04/01 19:43:29 UTC, 0 replies.
- bug with PYTHONHASHSEED - posted by Paul Tremblay <pa...@gmail.com> on 2017/04/01 19:54:17 UTC, 5 replies.
- strange behavior of spark 2.1.0 - posted by Jiang Jacky <ji...@gmail.com> on 2017/04/01 20:14:30 UTC, 2 replies.
- read binary file in PySpark - posted by Yogesh Vyas <in...@gmail.com> on 2017/04/02 06:46:11 UTC, 0 replies.
- Partitioning strategy - posted by ja...@accenture.com on 2017/04/02 10:32:13 UTC, 1 replies.
- Does Apache Spark use any Dependency Injection framework? - posted by kant kodali <ka...@gmail.com> on 2017/04/02 13:28:17 UTC, 1 replies.
- Update DF record with delta data in spark - posted by Selvam Raman <se...@gmail.com> on 2017/04/02 13:57:35 UTC, 1 replies.
- Represent documents as a sequence of wordID & frequency and perform PCA - posted by Old-School <gi...@outlook.com> on 2017/04/02 14:51:47 UTC, 0 replies.
- Re: Spark SQL 2.1 Complex SQL - Query Planning Issue - posted by Sathish Kumaran Vairavelu <vs...@gmail.com> on 2017/04/02 15:54:58 UTC, 0 replies.
- Graph Analytics on HBase with HGraphDB and Spark GraphFrames - posted by Robert Yokota <ra...@gmail.com> on 2017/04/02 16:40:07 UTC, 3 replies.
- Re: Looking at EMR Logs - posted by Paul Tremblay <pa...@gmail.com> on 2017/04/02 20:05:33 UTC, 0 replies.
- org.apache.spark.sql.AnalysisException: resolved attribute(s) code#906 missing from code#1992, - posted by grjohnson35 <gj...@artemishealth.com> on 2017/04/03 02:30:52 UTC, 0 replies.
- What is the difference between forEachAsync vs forEachPartitionAsync? - posted by kant kodali <ka...@gmail.com> on 2017/04/03 03:36:01 UTC, 1 replies.
- Benchmarking streaming frameworks - posted by gvdongen <gi...@ugent.be> on 2017/04/03 07:34:19 UTC, 2 replies.
- Do we support excluding the current row in PARTITION BY windowing functions? - posted by mathewwicks <ma...@gmail.com> on 2017/04/03 08:52:26 UTC, 2 replies.
- Read file and represent rows as Vectors - posted by Old-School <gi...@outlook.com> on 2017/04/03 12:05:18 UTC, 2 replies.
- Executor unable to pick postgres driver in Spark standalone cluster - posted by Rishikesh Teke <ri...@gmail.com> on 2017/04/03 13:43:51 UTC, 1 replies.
- Pyspark - pickle.PicklingError: Can't pickle - posted by Selvam Raman <se...@gmail.com> on 2017/04/03 18:34:37 UTC, 0 replies.
- _SUCCESS file validation on read - posted by drewrobb <dr...@gmail.com> on 2017/04/03 20:58:21 UTC, 0 replies.
- Re: Alternatives for dataframe collectAsList() - posted by Paul Tremblay <pa...@gmail.com> on 2017/04/04 00:44:49 UTC, 3 replies.
- Do we support excluding the CURRENT ROW in PARTITION BY windowing functions? - posted by mathewwicks <ma...@gmail.com> on 2017/04/04 02:21:34 UTC, 0 replies.
- map transform on array in spark sql - posted by Koert Kuipers <ko...@tresata.com> on 2017/04/04 03:18:17 UTC, 1 replies.
- is there a way to persist the lineages generated by spark? - posted by kant kodali <ka...@gmail.com> on 2017/04/04 03:19:40 UTC, 4 replies.
- how do i force unit test to do whole stage codegen - posted by Koert Kuipers <ko...@tresata.com> on 2017/04/04 20:10:55 UTC, 5 replies.
- Why do we ever run out of memory in Spark Structured Streaming? - posted by kant kodali <ka...@gmail.com> on 2017/04/05 00:17:54 UTC, 4 replies.
- With Twitter4j API, why am I not able to pull tweets with certain keywords? - posted by Gaurav1809 <ga...@gmail.com> on 2017/04/05 04:02:52 UTC, 1 replies.
- spark stages UI page has 'gc time' column Empty - posted by satishl <sa...@gmail.com> on 2017/04/05 04:24:06 UTC, 0 replies.
- reading binary file in spark-kafka streaming - posted by Yogesh Vyas <in...@gmail.com> on 2017/04/05 06:11:23 UTC, 0 replies.
- Market Basket Analysis by deploying FP Growth algorithm - posted by asethia <se...@gmail.com> on 2017/04/05 09:29:05 UTC, 1 replies.
- convert JavaRDD> to JavaRDD