You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@kudu.apache.org by jt...@apache.org on 2018/08/08 14:39:03 UTC

kudu git commit: Blog: Getting Started with Kudu

Repository: kudu
Updated Branches:
  refs/heads/gh-pages e35030439 -> 9a2e15032


Blog: Getting Started with Kudu

Change-Id: I9503dca5e6e565be902c4090bf2996bd1960d763
Reviewed-on: http://gerrit.cloudera.org:8080/11136
Reviewed-by: Jordan Birdsell <jt...@apache.org>
Tested-by: Jordan Birdsell <jt...@apache.org>


Project: http://git-wip-us.apache.org/repos/asf/kudu/repo
Commit: http://git-wip-us.apache.org/repos/asf/kudu/commit/9a2e1503
Tree: http://git-wip-us.apache.org/repos/asf/kudu/tree/9a2e1503
Diff: http://git-wip-us.apache.org/repos/asf/kudu/diff/9a2e1503

Branch: refs/heads/gh-pages
Commit: 9a2e150324b8f9ee93649d996b83c9067e4c8e47
Parents: e350304
Author: Brock Noland <br...@phdata.io>
Authored: Mon Aug 6 21:50:20 2018 -0500
Committer: Jordan Birdsell <jt...@apache.org>
Committed: Wed Aug 8 14:27:55 2018 +0000

----------------------------------------------------------------------
 ...etting-started-with-kudu-an-oreilly-title.md |  60 +++++++++++++++++++
 ...tting-started-with-kudu-an-oreilly-title.gif | Bin 0 -> 17397 bytes
 2 files changed, 60 insertions(+)
----------------------------------------------------------------------


http://git-wip-us.apache.org/repos/asf/kudu/blob/9a2e1503/_posts/2018-08-06-getting-started-with-kudu-an-oreilly-title.md
----------------------------------------------------------------------
diff --git a/_posts/2018-08-06-getting-started-with-kudu-an-oreilly-title.md b/_posts/2018-08-06-getting-started-with-kudu-an-oreilly-title.md
new file mode 100644
index 0000000..a896e05
--- /dev/null
+++ b/_posts/2018-08-06-getting-started-with-kudu-an-oreilly-title.md
@@ -0,0 +1,60 @@
+---
+layout: post
+title: Getting Started with Kudu an O'Reilly Title
+author: Brock Noland
+---
+
+The following article by Brock Noland was reposted from the
+[phData](https://www.phdata.io/getting-started-with-kudu/)
+blog with their permission.
+
+Five years ago, enabling Data Science and Advanced Analytics on the
+Hadoop platform was hard. Organizations required strong Software Engineering
+capabilities to successfully implement complex Lambda architectures or even
+simply implement continuous ingest. Updating or deleting data, were simplify
+nightmare. General Data Protection Regulation (GDPR) would have been an extreme
+challenge at that time.
+
+<!-- more -->
+
+In that context, on October 11th 2012 Todd Lipcon perform Apache Kudu's initial
+commit. The commit message was:
+
+    Code for writing cfiles seems to basically work
+    Need to write code for reading cfiles, still
+
+And Kudu development was off and running. Around this same time Todd, on his
+internal Wiki page, started listing out the papers he was reading to develop
+the theoretical background for creating Kudu. I followed along, reading as many
+as I could, understanding little, because I knew Todd was up to something
+important. About a year after that initial commit, I got my
+[Kudu first commit](https://github.com/apache/kudu/commit/1d7e6864b4a31d3fe6897e4cb484dfcda6608d43),
+documenting the upper bound of a library. This is a small contribution of which I am still
+proud.
+
+In the meantime, I was lucky enough to be a founder of a Hadoop Managed Services
+and Consulting company known as [phData](http://phdata.io/). We found that a majority
+of our customers had use cases which Kudu vastly simplified. Whether it's Change Data
+Capture (CDC) from thousands of source tables to Internet of Things (IoT) ingest, Kudu
+makes life much easier as both an operator of a Hadoop cluster and a developer providing
+business value on the platform.
+
+Through this work, I was lucky enough to be a co-author of
+[Getting Started with Kudu(http://shop.oreilly.com/product/0636920065739.do).
+The book is a summation of mine and our co-authors, Jean-Marc Spaggiari, Mladen
+Kovacevic, and Ryan Bosshart,  learnings while cutting our teeth on early versions
+of Kudu. Specifically you will learn:
+
+* Theoretical understanding of Kudu concepts in simple plain spoken words and simple diagrams
+* Why, for many use cases, using Kudu is so much easier than other ecosystem storage technologies
+* How Kudu enables Hybrid Transactional/Analytical Processing (HTAP) use cases
+* How to design IoT, Predictive Modeling, and Mixed Platform Solutions using Kudu
+* How to design Kudu Schemas
+
+![Getting Started with Kudu Cover]({{ site.github.url }}/img/2018-08-06-getting-started-with-kudu-an-oreilly-title.gif){: .img-responsive}
+
+Looking forward, I am excited to see Kudu gain additional features and adoption
+and eventually the second revision of this title. In the meantime, if you have
+feedback or questions, please reach out on the `#getting-started-kudu` channel of
+the [Kudu Slack](https://getkudu-slack.herokuapp.com/) or if you prefer non-real-time
+communication, please use the user@ mailing list!

http://git-wip-us.apache.org/repos/asf/kudu/blob/9a2e1503/img/2018-08-06-getting-started-with-kudu-an-oreilly-title.gif
----------------------------------------------------------------------
diff --git a/img/2018-08-06-getting-started-with-kudu-an-oreilly-title.gif b/img/2018-08-06-getting-started-with-kudu-an-oreilly-title.gif
new file mode 100644
index 0000000..2f533d5
Binary files /dev/null and b/img/2018-08-06-getting-started-with-kudu-an-oreilly-title.gif differ