You are viewing a plain text version of this content. The canonical link for it is here.
- Task not Serializable Exception - posted by khyati <kh...@guavus.com> on 2017/01/01 04:06:17 UTC, 1 replies.
- Skip Corrupted Parquet blocks / footer. - posted by khyati <kh...@guavus.com> on 2017/01/01 04:11:17 UTC, 10 replies.
- Re: context.runJob() was suspended in getPreferredLocations() function - posted by Liang-Chi Hsieh <vi...@gmail.com> on 2017/01/01 12:34:41 UTC, 0 replies.
- Re: Kafka Spark structured streaming latency benchmark. - posted by Prashant Sharma <sc...@gmail.com> on 2017/01/02 11:19:05 UTC, 0 replies.
- Cannot pass broker list parameter from Scala to Kafka: Property bootstrap.servers is not valid - posted by Dino <di...@spam4.me> on 2017/01/02 11:49:27 UTC, 0 replies.
- Re: Spark Improvement Proposals - posted by Cody Koeninger <co...@koeninger.org> on 2017/01/02 15:45:55 UTC, 5 replies.
- Re: What is mainly different from a UDT and a spark internal type that ExpressionEncoder recognized? - posted by Shuai Lin <li...@gmail.com> on 2017/01/02 17:30:27 UTC, 4 replies.
- Re: mllib metrics vs ml evaluators and how to improve apis for users - posted by Joseph Bradley <jo...@databricks.com> on 2017/01/02 20:28:03 UTC, 0 replies.
- StateStoreSaveExec / StateStoreRestoreExec - posted by Jeremy Smith <je...@acorns.com> on 2017/01/02 22:05:02 UTC, 2 replies.
- Why ShuffleMapTask has transient locs and preferredLocs?! - posted by Jacek Laskowski <ja...@japila.pl> on 2017/01/03 11:27:00 UTC, 2 replies.
- Re: Apache Hive with Spark Configuration - posted by Ryan Blue <rb...@netflix.com.INVALID> on 2017/01/03 20:32:37 UTC, 1 replies.
- DataFrame Distinct Sample Bug? - posted by dstuck <da...@gmail.com> on 2017/01/03 23:15:16 UTC, 1 replies.
- Re: Why is spark.shuffle.sort.bypassMergeThreshold 200? - posted by Kay Ousterhout <ke...@eecs.berkeley.edu> on 2017/01/04 00:15:25 UTC, 0 replies.
- Tests failing with GC limit exceeded - posted by Kay Ousterhout <ke...@eecs.berkeley.edu> on 2017/01/04 00:35:38 UTC, 12 replies.
- Re: ml word2vec finSynonyms return type - posted by Asher Krim <ak...@hubspot.com> on 2017/01/04 07:58:22 UTC, 3 replies.
- Re: Dependency Injection and Microservice development with Spark - posted by Chetan Khatri <ch...@gmail.com> on 2017/01/04 11:34:48 UTC, 2 replies.
- Re: Approach: Incremental data load from HBASE - posted by Chetan Khatri <ch...@gmail.com> on 2017/01/04 11:37:45 UTC, 4 replies.
- Quick request: prolific PR openers, review your open PRs - posted by Sean Owen <so...@cloudera.com> on 2017/01/04 12:35:48 UTC, 5 replies.
- Clarification about typesafe aggregations - posted by geoHeil <ge...@gmail.com> on 2017/01/04 15:19:20 UTC, 2 replies.
- Converting an InternalRow to a Row - posted by Andy Dang <na...@gmail.com> on 2017/01/04 19:27:15 UTC, 7 replies.
- unsubscribe - posted by Nikola Z <gr...@gmail.com> on 2017/01/05 10:57:12 UTC, 3 replies.
- Re: Spark SQL - Applying transformation on a struct inside an array - posted by Olivier Girardot <o....@lateral-thoughts.com> on 2017/01/05 20:01:45 UTC, 0 replies.
- Re: Spark GraphFrame ConnectedComponents - posted by Ankur Srivastava <an...@gmail.com> on 2017/01/05 23:45:59 UTC, 2 replies.
- Unsubscribe - posted by "write2sivakumar@gmail" <wr...@gmail.com> on 2017/01/06 04:01:18 UTC, 3 replies.
- handling of empty partitions - posted by geoHeil <ge...@gmail.com> on 2017/01/06 19:30:16 UTC, 7 replies.
- Parquet patch release - posted by Ryan Blue <rb...@netflix.com.INVALID> on 2017/01/06 23:46:03 UTC, 5 replies.
- Spark checkpointing - posted by Felix Cheung <fe...@hotmail.com> on 2017/01/07 08:29:01 UTC, 1 replies.
- [SQL][PYTHON] UDF improvements. - posted by Maciej Szymkiewicz <ms...@gmail.com> on 2017/01/07 20:39:21 UTC, 2 replies.
- protected val mapStatuses is ConcurrentHashMap in both MapOutputTrackerMaster and MapOutputTrackerWorker? - posted by Jacek Laskowski <ja...@japila.pl> on 2017/01/08 19:48:07 UTC, 0 replies.
- A note about MLlib's StandardScaler - posted by Gilad Barkan <gi...@gmail.com> on 2017/01/08 20:06:55 UTC, 3 replies.
- scala.MatchError: scala.collection.immutable.Range.Inclusive from catalyst.ScalaReflection.serializerFor? - posted by Jacek Laskowski <ja...@japila.pl> on 2017/01/09 08:16:59 UTC, 2 replies.
- How to hint Spark to use HashAggregate() for UDAF - posted by Andy Dang <na...@gmail.com> on 2017/01/09 13:52:42 UTC, 4 replies.
- Spark performance tests - posted by Prasun Ratn <pr...@gmail.com> on 2017/01/10 03:50:31 UTC, 3 replies.
- [SQL][CodeGen] Is there a way to set break point and debug the generated code? - posted by dragonly <li...@gmail.com> on 2017/01/10 11:21:59 UTC, 1 replies.
- [PYSPARK] Python tests organization - posted by Maciej Szymkiewicz <ms...@gmail.com> on 2017/01/11 12:18:55 UTC, 11 replies.
- [Streaming] ConcurrentModificationExceptions when Windowing - posted by Kalvin Chau <ka...@gmail.com> on 2017/01/11 22:38:43 UTC, 10 replies.
- FOSDEM 2017 Open Source Conference - Brussels - posted by Sharan F <sh...@apache.org> on 2017/01/12 12:12:10 UTC, 0 replies.
- Limit Query Performance Suggestion - posted by sujith chacko <su...@gmail.com> on 2017/01/13 07:19:24 UTC, 3 replies.
- Both Spark AM and Client are trying to delete Staging Directory - posted by Rostyslav Sotnychenko <r....@gmail.com> on 2017/01/13 10:44:04 UTC, 8 replies.
- Can anyone edit JIRAs SPARK-19191 to SPARK-19202? - posted by Sean Owen <so...@cloudera.com> on 2017/01/13 13:27:41 UTC, 8 replies.
- Why are ml models repartition(1)'d in save methods? - posted by Asher Krim <ak...@hubspot.com> on 2017/01/13 17:23:04 UTC, 7 replies.
- What about removing TaskContext#getPartitionId? - posted by Jacek Laskowski <ja...@japila.pl> on 2017/01/14 10:02:30 UTC, 5 replies.
- Equally split a RDD partition into two partition at the same node - posted by Fei Hu <hu...@gmail.com> on 2017/01/14 23:58:54 UTC, 13 replies.
- Re: Error at starting Phoenix shell with HBase - posted by Chetan Khatri <ch...@gmail.com> on 2017/01/16 05:31:56 UTC, 0 replies.
- spark support on windows - posted by "assaf.mendelson" <as...@rsa.com> on 2017/01/16 10:35:18 UTC, 3 replies.
- About saving DataFrame to Hive 1.2.1 with Spark 2.0.1 - posted by Chetan Khatri <ch...@gmail.com> on 2017/01/16 19:18:23 UTC, 1 replies.
- Weird experience Hive with Spark Transformations - posted by Chetan Khatri <ch...@gmail.com> on 2017/01/17 05:36:05 UTC, 2 replies.
- Re: Spark sql query plan contains all the partitions from hive table even though filtering of partitions is provided - posted by Raju Bairishetti <ra...@apache.org> on 2017/01/17 08:01:59 UTC, 12 replies.
- GraphX-related "open" issues - posted by Takeshi Yamamuro <li...@gmail.com> on 2017/01/17 16:11:13 UTC, 10 replies.
- spark main thread quit, but the Jvm of driver don't crash - posted by John Fang <xi...@alibaba-inc.com> on 2017/01/17 16:32:49 UTC, 0 replies.
- spark main thread quit, but the driver don't crash at standalone cluster - posted by John Fang <xi...@alibaba-inc.com> on 2017/01/17 16:58:14 UTC, 1 replies.
- Feedback on MLlib roadmap process proposal - posted by Joseph Bradley <jo...@databricks.com> on 2017/01/17 23:38:06 UTC, 10 replies.
- RpcEnv(Factory) is no longer pluggable? spark.rpc is gone, isn't it? - posted by Jacek Laskowski <ja...@japila.pl> on 2017/01/18 07:57:29 UTC, 1 replies.
- [SQL][SPARK-14160] Maximum interval for o.a.s.sql.functions.window - posted by Maciej Szymkiewicz <ms...@gmail.com> on 2017/01/18 08:18:59 UTC, 3 replies.
- 答复: Limit Query Performance Suggestion - posted by "wangzhenhua (G)" <wa...@huawei.com> on 2017/01/18 08:53:52 UTC, 1 replies.
- clientMode in RpcEnv.create in Spark on YARN vs general case (driver vs executors)? - posted by Jacek Laskowski <ja...@japila.pl> on 2017/01/18 09:29:41 UTC, 1 replies.
- can someone review my PR? - posted by Steve Loughran <st...@hortonworks.com> on 2017/01/18 11:10:16 UTC, 3 replies.
- [Spark SQL] Making InferSchema and JacksonParser public - posted by Brian Hong <su...@devsisters.com> on 2017/01/18 13:51:43 UTC, 3 replies.
- GC limit exceed - posted by marco rocchi <ro...@studenti.uniroma1.it> on 2017/01/18 14:22:51 UTC, 1 replies.
- ApacheCon CFP closing soon (11 February) - posted by Rich Bowen <rb...@apache.org> on 2017/01/18 16:45:41 UTC, 0 replies.
- Possible bug - Java iterator/iterable inconsistency - posted by Asher Krim <ak...@hubspot.com> on 2017/01/18 20:50:21 UTC, 3 replies.
- Spark Source Code Configuration - posted by Deepu Raj <de...@outlook.com> on 2017/01/19 22:49:53 UTC, 2 replies.
- - posted by Keith Chapman <ke...@gmail.com> on 2017/01/20 02:57:42 UTC, 0 replies.
- Is it possible to get a job end kind of notification on the executor (slave) - posted by Keith Chapman <ke...@gmail.com> on 2017/01/20 05:31:38 UTC, 1 replies.
- Executors exceed maximum memory defined with `--executor-memory` in Spark 2.1.0 - posted by StanZhai <ma...@zhaishidan.cn> on 2017/01/22 08:57:52 UTC, 4 replies.
- A question about creating persistent table when in-memory catalog is used - posted by Shuai Lin <li...@gmail.com> on 2017/01/22 12:51:47 UTC, 7 replies.
- Spark 1.6.3 Driver OOM on createDataFrame - posted by Asher Krim <ak...@hubspot.com> on 2017/01/22 17:35:11 UTC, 0 replies.
- Re: [VOTE] Release Apache Parquet 1.8.2 RC1 - posted by Julien Le Dem <ju...@dremio.com> on 2017/01/23 19:43:40 UTC, 4 replies.
- MLlib mission and goals - posted by Joseph Bradley <jo...@databricks.com> on 2017/01/24 01:03:41 UTC, 8 replies.
- [SPARK-16046] PR Review - posted by Anton Okolnychyi <an...@gmail.com> on 2017/01/24 15:30:29 UTC, 0 replies.
- [YARN] $ and $$ in prepareCommand to resolve environment in ExecutorRunnable? - posted by Jacek Laskowski <ja...@japila.pl> on 2017/01/24 15:52:03 UTC, 0 replies.
- welcoming Burak and Holden as committers - posted by Reynold Xin <rx...@databricks.com> on 2017/01/24 18:13:16 UTC, 34 replies.
- HBaseContext with Spark - posted by Chetan Khatri <ch...@gmail.com> on 2017/01/25 11:32:37 UTC, 7 replies.
- Spark Summit East in Boston ‒ 20% off Code - posted by Scott walent <sc...@gmail.com> on 2017/01/25 19:06:04 UTC, 0 replies.
- Why two makeOffers in CoarseGrainedSchedulerBackend? Duplication? - posted by Jacek Laskowski <ja...@japila.pl> on 2017/01/26 11:48:57 UTC, 4 replies.
- Re: Issue creating row with java.util.Map type - posted by Richard Xin <ri...@yahoo.com.INVALID> on 2017/01/27 20:15:19 UTC, 0 replies.
- CFP for Spark Summit San Francisco closes on Feb. 6 - posted by Scott walent <sc...@gmail.com> on 2017/01/28 00:05:40 UTC, 0 replies.
- Typo on spark.apache.org? "cyclic data flow" - posted by Nicholas Chammas <ni...@gmail.com> on 2017/01/28 19:18:09 UTC, 3 replies.
- Maximum limit for akka.frame.size be greater than 500 MB ? - posted by aravasai <ar...@gmail.com> on 2017/01/29 23:44:29 UTC, 2 replies.
- Re: Error Saving Dataframe to Hive with Spark 2.0.0 - posted by Chetan Khatri <ch...@gmail.com> on 2017/01/30 05:52:30 UTC, 1 replies.
- Spark SQL Dataframe resulting from an except( ) is unusable - posted by Vinayak Joshi5 <vi...@in.ibm.com> on 2017/01/31 10:25:36 UTC, 0 replies.
- [SQL][ML] Pipeline performance regression between 1.6 and 2.x - posted by Maciej Szymkiewicz <ms...@gmail.com> on 2017/01/31 15:06:25 UTC, 0 replies.
- Unique Partition Id per partition - posted by "Chawla,Sumit " <su...@gmail.com> on 2017/01/31 17:08:22 UTC, 1 replies.
- Call for abstracts open for Dataworks & Hadoop Summit San Jose - posted by Alan Gates <al...@gmail.com> on 2017/01/31 19:28:02 UTC, 0 replies.
- Structured Streaming Source error - posted by Sam Elamin <hu...@gmail.com> on 2017/01/31 21:39:58 UTC, 2 replies.