You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@kudu.apache.org by jd...@apache.org on 2016/04/18 20:12:03 UTC
incubator-kudu git commit: Add 4/18 weekly update
Repository: incubator-kudu
Updated Branches:
refs/heads/gh-pages 86ae98bc4 -> 65ff8b29a
Add 4/18 weekly update
Change-Id: I31a5a5274ec30014566cd35405c087de5441e14e
Reviewed-on: http://gerrit.cloudera.org:8080/2805
Reviewed-by: Jean-Daniel Cryans
Tested-by: Todd Lipcon <to...@apache.org>
Project: http://git-wip-us.apache.org/repos/asf/incubator-kudu/repo
Commit: http://git-wip-us.apache.org/repos/asf/incubator-kudu/commit/65ff8b29
Tree: http://git-wip-us.apache.org/repos/asf/incubator-kudu/tree/65ff8b29
Diff: http://git-wip-us.apache.org/repos/asf/incubator-kudu/diff/65ff8b29
Branch: refs/heads/gh-pages
Commit: 65ff8b29ad633ab8a17590eda8413f1e0fa98415
Parents: 86ae98b
Author: Todd Lipcon <to...@cloudera.com>
Authored: Mon Apr 18 09:26:47 2016 -0700
Committer: Todd Lipcon <to...@apache.org>
Committed: Mon Apr 18 16:44:06 2016 +0000
----------------------------------------------------------------------
_posts/2016-04-18-weekly-update.md | 80 +++++++++++++++++++++++++++++++++
1 file changed, 80 insertions(+)
----------------------------------------------------------------------
http://git-wip-us.apache.org/repos/asf/incubator-kudu/blob/65ff8b29/_posts/2016-04-18-weekly-update.md
----------------------------------------------------------------------
diff --git a/_posts/2016-04-18-weekly-update.md b/_posts/2016-04-18-weekly-update.md
new file mode 100644
index 0000000..ac524c8
--- /dev/null
+++ b/_posts/2016-04-18-weekly-update.md
@@ -0,0 +1,80 @@
+---
+layout: post
+title: Apache Kudu (incubating) Weekly Update April 18, 2016
+author: Todd Lipcon
+---
+Welcome to the fifth edition of the Kudu Weekly Update. This weekly blog post
+covers ongoing development and news in the Apache Kudu (incubating) project.
+
+<!--more-->
+
+If you find this post useful, please let us know by emailing the
+[kudu-user mailing list](mailto:user@kudu.incubator.apache.org) or
+tweeting at [@ApacheKudu](https://twitter.com/ApacheKudu). Similarly, if you're
+aware of some Kudu news we missed, let us know so we can cover it in
+a future post.
+
+## Project news
+
+* Cloudera announced that it has posted [binary packages](http://markmail.org/thread/tghwcux5k4qvcsep)
+ for the recent 0.8.0 release. These are not official packages from
+ the Apache Kudu (incubating) project, but users who prefer not to
+ build from source may find them convenient.
+
+* Jean-Daniel Cryans has volunteered to continue to act as release manager for
+ the 0.x release line, and has start a [discussion](http://mail-archives.apache.org/mod_mbox/incubator-kudu-dev/201604.mbox/%3CCAGpTDNcfTOcp%2Beb39h5j%3DoxttZNhOBZ7v%2B%2B6hxRtWCh3t_psbQ%40mail.gmail.com%3E)
+ detailing what features and improvements he expects will be ready
+ for an 0.9 release in June.
+
+## Development discussions and code in progress
+
+* Chris George posted a [work in progress patch](http://gerrit.cloudera.org:8080/#/c/2754/)
+ for a native Kudu RDD implementation for Spark. Kudu already ships an RDD
+ based on the generic HadoopRDD and Kudu's MapReduce integration, but Chris's
+ new version paves the way for new features like pushing down predicates.
+
+
+* Todd Lipcon has been working on [KUDU-1410](https://issues.apache.org/jira/browse/KUDU-1410),
+ a small project which makes it easier to diagnose performance issues on a Kudu
+ cluster.
+
+ The first feature proposed by this JIRA is the idea of collecting
+ "exemplar" traces: for each type of RPC (e.g. _Write_, _Scan_, etc.)
+ the RPC system will collect a few _exemplar_ RPCs in different
+ latency buckets and retain their traces. This makes it easier for
+ an operator to see what might have caused a slow response from a
+ server even after the request has been finished for some time.
+
+ The second new feature is the collection of per-RPC-request metrics
+ such as lock acquisition time, time spent waiting on disk, and other
+ metrics specific to each type of RPC. In combination with the
+ exemplar trace feature above, this should make it easy to root-cause
+ whether a request is slow due to underlying hardware issues,
+ Kudu-specific issues, or a particular workload characteristic.
+
+ Todd posted a work-in-progress implementation of these features on gerrit
+ in a five-part patch series:
+ [(1)](http://gerrit.cloudera.org:8080/#/c/2794/)
+ [(2)](http://gerrit.cloudera.org:8080/#/c/2795/)
+ [(3)](http://gerrit.cloudera.org:8080/#/c/2796/)
+ [(4)](http://gerrit.cloudera.org:8080/#/c/2797/)
+ [(5)](http://gerrit.cloudera.org:8080/#/c/2798/)
+
+* Dan Burkert continued working on the [Java implementation of the Scan Token API](http://gerrit.cloudera.org:8080/#/c/2592/)
+ described in previous weekly updates, with reviews this week from Jean-Daniel
+ Cryans and Adar Dembo. He also posted a patch for the [C++ implementation](http://gerrit.cloudera.org:8080/#/c/2757/)
+ which has seen some review action as well.
+
+* Dan also posted a [design document for non-covering range partitioning](http://gerrit.cloudera.org:8080/#/c/2772/).
+ This new feature will allow Kudu operators to add or drop tablets to
+ an existing range-partitioned table. This is very important for time
+ series use cases where new partitions may need to be added daily,
+ and old partitions potentially dropped in order to achieve a
+ "sliding window" table. Read the design document for more details on
+ use cases and the expected semantics.
+
+## On the Kudu blog
+
+* Pat Patterson wrote a post about [Ingesting JSON Data into Apache Kudu with StreamSets
+ Data Collector](http://getkudu.io/2016/04/14/ingesting-json-apache-kudu-streamsets-data-collector.html).
+