You are viewing a plain text version of this content. The canonical link for it is here.
- Re: Should spark-ec2 get its own repo? - posted by Patrick Wendell <pw...@gmail.com> on 2015/08/01 03:50:31 UTC, 5 replies.
- Re: [ANNOUNCE] Nightly maven and package builds for Spark - posted by Patrick Wendell <pw...@gmail.com> on 2015/08/01 23:47:36 UTC, 4 replies.
- Re: FrequentItems in spark-sql-execution-stat - posted by Burak Yavuz <br...@gmail.com> on 2015/08/02 06:11:15 UTC, 0 replies.
- What are 'Buckets' referred in Spark Core code - posted by Haseeb <11...@seecs.edu.pk> on 2015/08/02 22:55:52 UTC, 2 replies.
- [SparkScore]Performance portal for Apache Spark - WW31 - posted by "Huang, Jie" <ji...@intel.com> on 2015/08/03 03:21:32 UTC, 0 replies.
- Re: Came across Spark SQL hang/Error issue with Spark 1.5 Tungsten feature - posted by james <yi...@gmail.com> on 2015/08/03 03:33:19 UTC, 1 replies.
- Master JIRA ticket for tracking Spark 1.5.0 configuration renames, defaults changes, and configuration deprecation - posted by Josh Rosen <jo...@databricks.com> on 2015/08/03 07:07:15 UTC, 1 replies.
- Moving spark-ec2 to amplab github organization - posted by Shivaram Venkataraman <sh...@eecs.berkeley.edu> on 2015/08/03 19:18:55 UTC, 0 replies.
- Re: Package Release Annoucement: Spark SQL on HBase "Astro" - posted by Ted Yu <yu...@gmail.com> on 2015/08/03 19:32:54 UTC, 2 replies.
- Unsubscribe - posted by Trevor Grant <tr...@gmail.com> on 2015/08/03 19:52:59 UTC, 1 replies.
- PSA: Maven 3.3.3 now required to build - posted by Sean Owen <so...@cloudera.com> on 2015/08/03 20:01:49 UTC, 6 replies.
- [ANNOUNCE] Spark branch-1.5 - posted by Reynold Xin <rx...@databricks.com> on 2015/08/03 20:11:52 UTC, 3 replies.
- Make ML Developer APIs public (post-1.4) - posted by Eron Wright <ew...@live.com> on 2015/08/04 01:51:04 UTC, 1 replies.
- Consistent recommendation for submitting spark apps to YARN, -master yarn --deploy-mode x vs -master yarn-x' - posted by Guru Medasani <gd...@gmail.com> on 2015/08/04 05:20:05 UTC, 1 replies.
- How to help for 1.5 release? - posted by Meihua Wu <ro...@gmail.com> on 2015/08/04 08:32:54 UTC, 2 replies.
- Re: Have Friedman's glmnet algo running in Spark - posted by Patrick <pe...@gmail.com> on 2015/08/04 09:50:13 UTC, 1 replies.
- Fwd: Writing streaming data to cassandra creates duplicates - posted by Priya Ch <le...@gmail.com> on 2015/08/04 13:03:04 UTC, 0 replies.
- shane will be OOO 8-5-15 through 8-18-15 - posted by shane knapp <sk...@berkeley.edu> on 2015/08/04 20:14:04 UTC, 0 replies.
- Re: New Feature Request - posted by Sandeep Giri <sa...@knowbigdata.com> on 2015/08/05 11:34:14 UTC, 1 replies.
- (Unknown) - posted by Sandeep Giri <sa...@knowbigdata.com> on 2015/08/05 17:49:53 UTC, 0 replies.
- Re: - posted by Sean Owen <so...@cloudera.com> on 2015/08/05 17:51:22 UTC, 5 replies.
- Avoiding unnecessary build changes until tests are in better shape - posted by Patrick Wendell <pw...@gmail.com> on 2015/08/05 20:24:56 UTC, 1 replies.
- Why SparkR didn't reuse PythonRDD - posted by Daniel Li <da...@gmail.com> on 2015/08/06 10:27:47 UTC, 1 replies.
- Re: Is there any way to support multiple users executing SQL on thrift server? - posted by Ted Yu <yu...@gmail.com> on 2015/08/06 11:20:47 UTC, 0 replies.
- Bucket mappings of map stage output - posted by cheez <11...@seecs.edu.pk> on 2015/08/06 23:47:09 UTC, 0 replies.
- Re: PySpark on PyPi - posted by Davies Liu <da...@databricks.com> on 2015/08/07 00:14:51 UTC, 10 replies.
- Workflow manager tool for scheduling spark jobs on cassandra - posted by Vikram Kone <vi...@gmail.com> on 2015/08/07 02:53:54 UTC, 0 replies.
- Re: Fixed number of partitions in RangePartitioner - posted by Reynold Xin <rx...@databricks.com> on 2015/08/07 04:47:29 UTC, 0 replies.
- SparkR driver side JNI - posted by Renyi Xiong <re...@gmail.com> on 2015/08/07 05:33:34 UTC, 1 replies.
- [SPARK-9720] spark.ml Identifiable types should have UID in toString methods - posted by Bertrand Dechoux <de...@gmail.com> on 2015/08/07 22:20:55 UTC, 0 replies.
- possible issues with listing objects in the HadoopFSrelation - posted by Gil Vernik <GI...@il.ibm.com> on 2015/08/10 13:55:36 UTC, 0 replies.
- Pushing Spark to 10Gb/s - posted by "Starch, Michael D (398M)" <Mi...@jpl.nasa.gov> on 2015/08/10 20:54:46 UTC, 1 replies.
- 答复: Package Release Annoucement: Spark SQL on HBase "Astro" - posted by "Yan Zhou.sc" <Ya...@huawei.com> on 2015/08/11 08:11:59 UTC, 1 replies.
- 答复: 答复: Package Release Annoucement: Spark SQL on HBase "Astro" - posted by "Yan Zhou.sc" <Ya...@huawei.com> on 2015/08/11 10:07:38 UTC, 8 replies.
- Re: [discuss] Removing individual commit messages from the squash commit message - posted by Reynold Xin <rx...@databricks.com> on 2015/08/11 10:10:05 UTC, 0 replies.
- Re: Inquery about contributing codes - posted by Akhil Das <ak...@sigmoidanalytics.com> on 2015/08/11 11:13:50 UTC, 0 replies.
- Is OutputCommitCoordinator necessary for all the stages ? - posted by Jeff Zhang <zj...@gmail.com> on 2015/08/11 11:25:24 UTC, 2 replies.
- Spark runs into an Infinite loop even if the tasks are completed successfully - posted by Akhil Das <ak...@sigmoidanalytics.com> on 2015/08/11 15:59:07 UTC, 5 replies.
- Potential bug broadcastNestedLoopJoin or default value of spark.sql.autoBroadcastJoinThreshold - posted by gen tang <ge...@gmail.com> on 2015/08/11 16:11:35 UTC, 2 replies.
- Sources/pom for org.spark-project.hive - posted by Pala M Muthaia <mc...@rocketfuelinc.com> on 2015/08/11 21:25:25 UTC, 3 replies.
- 答复: 答复: 答复: Package Release Annoucement: Spark SQL on HBase "Astro" - posted by "Yan Zhou.sc" <Ya...@huawei.com> on 2015/08/12 06:18:31 UTC, 1 replies.
- Re: possible issues with listing objects in the HadoopFSrelation - posted by Cheng Lian <li...@gmail.com> on 2015/08/12 09:51:05 UTC, 1 replies.
- Does Spark optimization might miss to run transformation? - posted by Eugene Morozov <fa...@list.ru> on 2015/08/12 16:06:42 UTC, 0 replies.
- Spark 1.2.2 build problem with Hive 0.12, bringing in wrong version of avro-mapred - posted by java8964 <ja...@hotmail.com> on 2015/08/12 16:36:35 UTC, 0 replies.
- Re: Intermittent timeout failure org/apache/spark/sql/hive/thriftserver/CliSuite.scala - posted by Reynold Xin <rx...@databricks.com> on 2015/08/12 19:14:24 UTC, 0 replies.
- Switch from Sort based to Hash based shuffle - posted by cheez <11...@seecs.edu.pk> on 2015/08/13 11:26:28 UTC, 3 replies.
- please help with ClassNotFoundException - posted by 周千昊 <qh...@apache.org> on 2015/08/13 12:04:08 UTC, 3 replies.
- 回复:please help with ClassNotFoundException - posted by Sea <26...@qq.com> on 2015/08/13 12:43:28 UTC, 0 replies.
- Graphx - how to add vertices to a HashSet of vertices ? - posted by Ranjana Rajendran <ra...@gmail.com> on 2015/08/13 16:37:51 UTC, 0 replies.
- possible bug: user SparkConf properties not copied to worker process - posted by rfarrjr <rf...@gmail.com> on 2015/08/13 17:16:54 UTC, 4 replies.
- What does NativeMethodAccessorImpl.java do? - posted by freedafeng <fr...@yahoo.com> on 2015/08/13 18:13:19 UTC, 2 replies.
- Re: please help with ClassNotFoundException - posted by Sea <26...@qq.com> on 2015/08/13 18:36:14 UTC, 1 replies.
- subscribe - posted by Naga Vij <nv...@gmail.com> on 2015/08/13 18:44:55 UTC, 1 replies.
- Fwd: - Spark 1.4.1 - run-example SparkPi - Failure ... - posted by Naga Vij <nv...@gmail.com> on 2015/08/13 18:47:29 UTC, 3 replies.
- Fwd: [ANNOUNCE] Spark 1.5.0-preview package - posted by Reynold Xin <rx...@databricks.com> on 2015/08/13 21:05:03 UTC, 5 replies.
- Developer API & plugins for Hive & Hadoop ? - posted by Thomas Dudziak <to...@gmail.com> on 2015/08/13 23:01:18 UTC, 3 replies.
- Re: Automatically deleting pull request comments left by AmplabJenkins - posted by Josh Rosen <ro...@gmail.com> on 2015/08/14 04:21:27 UTC, 7 replies.
- Introduce a sbt plugin to deploy and submit jobs to a spark cluster on ec2 - posted by pishen tsai <pi...@gmail.com> on 2015/08/14 09:56:03 UTC, 5 replies.
- avoid creating small objects - posted by 周千昊 <qh...@apache.org> on 2015/08/14 09:59:59 UTC, 2 replies.
- Re: Writing to multiple outputs in Spark - posted by Silas Davis <si...@silasdavis.net> on 2015/08/14 16:56:03 UTC, 5 replies.
- SparkR DataFrame fail to return data of Decimal type - posted by "Shkurenko, Alex" <as...@enova.com> on 2015/08/14 19:30:00 UTC, 2 replies.
- Reliance on java.math.BigInteger implementation - posted by Pete Robbins <ro...@gmail.com> on 2015/08/14 20:27:33 UTC, 2 replies.
- Setting up Spark/flume/? to Ingest 10TB from FTP - posted by "Varadhan, Jawahar" <va...@yahoo.com.INVALID> on 2015/08/14 22:15:43 UTC, 2 replies.
- SPARK-10000 + now - posted by Reynold Xin <rx...@databricks.com> on 2015/08/14 23:58:01 UTC, 0 replies.
- Jenkins having issues? - posted by Cheolsoo Park <pi...@gmail.com> on 2015/08/15 01:11:56 UTC, 2 replies.
- [spark-csv] how to build with Hadoop 2.6.0? - posted by Gil Vernik <GI...@il.ibm.com> on 2015/08/16 18:05:16 UTC, 3 replies.
- Subscribe - posted by Rishitesh Mishra <ri...@gmail.com> on 2015/08/17 08:23:54 UTC, 0 replies.
- Spark Job Hangs on our production cluster - posted by java8964 <ja...@hotmail.com> on 2015/08/17 16:55:34 UTC, 0 replies.
- [survey] [spark-ec2] What do you like/dislike about spark-ec2? - posted by Nicholas Chammas <ni...@gmail.com> on 2015/08/17 17:09:58 UTC, 4 replies.
- What's the best practice for developing new features for spark ? - posted by canan chen <cc...@gmail.com> on 2015/08/19 10:44:09 UTC, 3 replies.
- Unable to run the spark application in standalone cluster mode - posted by Ratika Prasad <rp...@couponsinc.com> on 2015/08/19 17:52:14 UTC, 3 replies.
- Creating RDD with key and Subkey - posted by Ratika Prasad <rp...@couponsinc.com> on 2015/08/19 18:14:35 UTC, 3 replies.
- Dataframe aggregation with Tungsten unsafe - posted by "Ulanov, Alexander" <al...@hp.com> on 2015/08/20 23:57:49 UTC, 16 replies.
- [VOTE] Release Apache Spark 1.5.0 (RC1) - posted by Reynold Xin <rx...@databricks.com> on 2015/08/21 06:37:49 UTC, 16 replies.
- DataFrame. SparkPlan / Project serialization issue: ArrayIndexOutOfBounds. - posted by Eugene Morozov <ev...@gmail.com> on 2015/08/21 12:37:28 UTC, 1 replies.
- Tungsten and sun.misc.Unsafe - posted by Marek Kolodziej <mk...@gmail.com> on 2015/08/21 14:29:13 UTC, 3 replies.
- Fwd: [jira] [Commented] (INFRA-10191) git pushing for Spark fails - posted by Reynold Xin <rx...@databricks.com> on 2015/08/24 20:58:41 UTC, 1 replies.
- ExternalSorter: Thread *** spilling in-memory map of 352.6 MB to disk (38 times so far) - posted by "dan@lumity.com" <da...@lumity.com> on 2015/08/25 03:59:02 UTC, 0 replies.
- Spark builds: allow user override of project version at buildtime - posted by an...@thomsonreuters.com on 2015/08/25 11:17:57 UTC, 3 replies.
- Spark (1.2.0) submit fails with exception saying log directory already exists - posted by "Varadhan, Jawahar" <va...@yahoo.com.INVALID> on 2015/08/25 18:37:35 UTC, 1 replies.
- Paring down / tagging tests (or some other way to avoid timeouts)? - posted by Marcelo Vanzin <va...@cloudera.com> on 2015/08/25 22:33:41 UTC, 3 replies.
- [VOTE] Release Apache Spark 1.5.0 (RC2) - posted by Reynold Xin <rx...@databricks.com> on 2015/08/26 06:28:14 UTC, 22 replies.
- Spark Cannot Connect to HBaseClusterSingleton - posted by Furkan KAMACI <fu...@gmail.com> on 2015/08/26 10:50:30 UTC, 13 replies.
- ClassCastException using DataFrame only when num-executors > 2 ... - posted by Olivier Girardot <ss...@gmail.com> on 2015/08/26 11:47:55 UTC, 1 replies.
- SQLContext.read.json("path") throws java.io.IOException - posted by gsvic <vi...@gmail.com> on 2015/08/26 15:47:12 UTC, 4 replies.
- Maven issues with 1.5-RC - posted by Chris Freeman <cf...@alteryx.com> on 2015/08/26 17:08:08 UTC, 3 replies.
- Building with sbt "impossible to get artifacts when data has not been loaded" - posted by Holden Karau <ho...@pigscanfly.ca> on 2015/08/26 23:23:34 UTC, 3 replies.
- Differing performance in self joins - posted by David Smith <da...@gmail.com> on 2015/08/27 03:10:43 UTC, 0 replies.
- A TPCH benchmark for Spark - posted by Feng Tian <ft...@vitessedata.com> on 2015/08/27 06:40:50 UTC, 0 replies.
- FW: High Availability of Spark Driver - posted by Ashish Rawat <As...@guavus.com> on 2015/08/27 09:42:02 UTC, 4 replies.
- Opening up metrics interfaces - posted by Atsu Kakitani <at...@groupon.com> on 2015/08/27 21:21:36 UTC, 2 replies.
- Re: Feedback: Feature request - posted by Manish Amde <ma...@gmail.com> on 2015/08/28 07:03:35 UTC, 2 replies.
- IOError on createDataFrame - posted by fsacerdoti <fs...@jumptrading.com> on 2015/08/28 21:43:49 UTC, 2 replies.
- Research of Spark scalability / performance issues - posted by Сергей Лихоман <se...@gmail.com> on 2015/08/29 19:52:16 UTC, 3 replies.
- Tungsten off heap memory access for C++ libraries - posted by Paul Weiss <pa...@gmail.com> on 2015/08/29 21:17:00 UTC, 6 replies.
- [ANNOUNCE] New testing capabilities for pull requests - posted by Patrick Wendell <pw...@gmail.com> on 2015/08/31 06:48:29 UTC, 0 replies.
- KryoSerializer for closureSerializer in DAGScheduler - posted by yash datta <sa...@gmail.com> on 2015/08/31 12:44:51 UTC, 2 replies.