You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@cassandra.apache.org by bl...@apache.org on 2021/11/17 09:37:13 UTC

[cassandra-website] branch trunk updated: CASSANDRA-17157: Added blog post in November 2021 titled "Inside Cassandra: an interview with Marcel Birkner at Instana"

This is an automated email from the ASF dual-hosted git repository.

blerer pushed a commit to branch trunk
in repository https://gitbox.apache.org/repos/asf/cassandra-website.git


The following commit(s) were added to refs/heads/trunk by this push:
     new 7324a6f  CASSANDRA-17157: Added blog post in November 2021 titled "Inside Cassandra: an interview with Marcel Birkner at Instana"
7324a6f is described below

commit 7324a6f2623dd39d140bf9ad6394bb9208cfb1d6
Author: Diogenese Topper <di...@gmail.com>
AuthorDate: Tue Nov 16 16:18:00 2021 -0800

    CASSANDRA-17157: Added blog post in November 2021 titled "Inside Cassandra: an interview with Marcel Birkner at Instana"
    
    Blog post titled "Inside Cassandra: an interview with Marcel Birkner at Instana"
    
    Modified blog index page
    
    Images added for blog
---
 .../inside-Cassandra-Marcel-Birkner/image1.png     | Bin 0 -> 226775 bytes
 .../inside-Cassandra-Marcel-Birkner/image2.png     | Bin 0 -> 190105 bytes
 .../inside-Cassandra-Marcel-Birkner/image3.png     | Bin 0 -> 232206 bytes
 .../inside-Cassandra-Marcel-Birkner/image4.png     | Bin 0 -> 106451 bytes
 .../inside-Cassandra-Marcel-Birkner/image5.png     | Bin 0 -> 255327 bytes
 site-content/source/modules/ROOT/pages/blog.adoc   |  25 ++++++++++
 ...n-interview-with-Marcel-Birkner-at-Instana.adoc |  52 +++++++++++++++++++++
 7 files changed, 77 insertions(+)

diff --git a/site-content/source/modules/ROOT/images/blog/inside-Cassandra-Marcel-Birkner/image1.png b/site-content/source/modules/ROOT/images/blog/inside-Cassandra-Marcel-Birkner/image1.png
new file mode 100644
index 0000000..eda2e6b
Binary files /dev/null and b/site-content/source/modules/ROOT/images/blog/inside-Cassandra-Marcel-Birkner/image1.png differ
diff --git a/site-content/source/modules/ROOT/images/blog/inside-Cassandra-Marcel-Birkner/image2.png b/site-content/source/modules/ROOT/images/blog/inside-Cassandra-Marcel-Birkner/image2.png
new file mode 100644
index 0000000..3848cb5
Binary files /dev/null and b/site-content/source/modules/ROOT/images/blog/inside-Cassandra-Marcel-Birkner/image2.png differ
diff --git a/site-content/source/modules/ROOT/images/blog/inside-Cassandra-Marcel-Birkner/image3.png b/site-content/source/modules/ROOT/images/blog/inside-Cassandra-Marcel-Birkner/image3.png
new file mode 100644
index 0000000..a17f6d9
Binary files /dev/null and b/site-content/source/modules/ROOT/images/blog/inside-Cassandra-Marcel-Birkner/image3.png differ
diff --git a/site-content/source/modules/ROOT/images/blog/inside-Cassandra-Marcel-Birkner/image4.png b/site-content/source/modules/ROOT/images/blog/inside-Cassandra-Marcel-Birkner/image4.png
new file mode 100644
index 0000000..f9dea75
Binary files /dev/null and b/site-content/source/modules/ROOT/images/blog/inside-Cassandra-Marcel-Birkner/image4.png differ
diff --git a/site-content/source/modules/ROOT/images/blog/inside-Cassandra-Marcel-Birkner/image5.png b/site-content/source/modules/ROOT/images/blog/inside-Cassandra-Marcel-Birkner/image5.png
new file mode 100644
index 0000000..94238a3
Binary files /dev/null and b/site-content/source/modules/ROOT/images/blog/inside-Cassandra-Marcel-Birkner/image5.png differ
diff --git a/site-content/source/modules/ROOT/pages/blog.adoc b/site-content/source/modules/ROOT/pages/blog.adoc
index c41dabc..108fb33 100644
--- a/site-content/source/modules/ROOT/pages/blog.adoc
+++ b/site-content/source/modules/ROOT/pages/blog.adoc
@@ -14,6 +14,31 @@ NOTES FOR CONTENT CREATORS
 [openblock,card-header]
 ------
 [discrete]
+=== Inside Cassandra: an interview with Marcel Birkner at Instana
+[discrete]
+==== November 17, 2021
+------
+[openblock,card-content]
+------
+We interview Marcel Birkner, Site Reliability Engineer at Instana, how they use Apache Cassandra to store and process the metric data at scale and benefit from Cassandra’s fault tolerance, and have learned the importance of dog-fooding.
+
+[openblock,card-btn card-btn--blog]
+--------
+
+[.btn.btn--alt]
+xref:blog/Inside-Cassandra-an-interview-with-Marcel-Birkner-at-Instana.adoc[Read More]
+--------
+
+------
+----
+//end card
+
+//start card
+[openblock,card shadow relative test]
+----
+[openblock,card-header]
+------
+[discrete]
 === What the Future Holds for Apache Cassandra
 [discrete]
 ==== October 26, 2021
diff --git a/site-content/source/modules/ROOT/pages/blog/Inside-Cassandra-an-interview-with-Marcel-Birkner-at-Instana.adoc b/site-content/source/modules/ROOT/pages/blog/Inside-Cassandra-an-interview-with-Marcel-Birkner-at-Instana.adoc
new file mode 100644
index 0000000..f2716dd
--- /dev/null
+++ b/site-content/source/modules/ROOT/pages/blog/Inside-Cassandra-an-interview-with-Marcel-Birkner-at-Instana.adoc
@@ -0,0 +1,52 @@
+= Inside Cassandra: an interview with Marcel Birkner at Instana
+:page-layout: single-post
+:page-role: blog-post
+:page-post-date: November 17, 2021
+:page-post-author: The Apache Cassandra Community
+:description: The Apache Cassandra Community
+:keywords: 
+
+Patrick McFadin, Apache Cassandra community member, sat down with Marcel Birkner, Site Reliability Engineer (SRE) at Instana, an IBM Company. They discussed the unique challenge faced by the company as it strives to scale highly useful telemetry with a responsive query system. This article uses extracts from that conversation, which took place before IBM acquired the business.
+
+One of the constants of life as an SRE is that https://www.instana.com/blog/life-of-an-sre-at-instana-things-break-all-the-time-in-distributed-systems-part-1-clickhouse/[things break all the time,window=_blank]. If developers are expanding what the product can do, and operations are making sure everything keeps running smoothly, SREs often fill that annoying gap between theory and practice. They fix things as they break, set up alerts to enable them to know the next time it breaks, and k [...]
+
+Instana, which IBM acquired in 2020, delivers application performance management software for modern cloud-native environments and is designed to create closed-loop DevOps automation. Instana has developed a high-performance, high-scale architecture which can capture 100% of transactions in one-second intervals across an extensive distributed application in a hybrid cloud. It collects billions of metrics per day in real-time and uses Apache Cassandra to store and process the metric data  [...]
+
+== Finding the Sweet Spot for Scaling
+
+Instana writes for some of its clusters are “250,000 to 1,200,000 metrics per second,” says Marcel, who has been an SRE at Instana for almost four years. By any standard, that’s a huge amount of data ingestion for a distributed workload. But instead of just throwing additional nodes at each problem, Marcel and his team have tried to find the scaling for each node that would perform the best. “The big challenge there is finding the right balance between resources and performance for each  [...]
+
+image::blog/inside-Cassandra-Marcel-Birkner/image2.png[sweet spot]
+Stats for one Metrics Cassandra cluster: 380k writes/sec, 9k reads/sec, overall great latency.
+
+The challenge of knowing when to scale horizontally, when to scale vertically, and when to scale down to save money is something every operations or reliability engineer has faced in the real world. “There’s no single answer for all use cases,” says Marcel, but he sees close monitoring of your system as part of the solution: If you don’t know the effect that resource allocation is having on your service, there’s no way to optimize.
+
+== Monitoring a Monitoring Service
+
+In many ways, monitoring, or its buzzword cousin observability, is the lifeblood of any SRE’s work. At a company whose product is monitoring, the act of monitoring your own services is ‘dog-fooding’ at its finest.
+
+“We are a monitoring company, and we monitor the things that we use ourselves the best,” says Marcel. And really, it makes sense: any time a monitoring company is surprised by resource overruns or over-provisioning that has budgetary impacts, the jokes kind of write themselves.
+
+Marcel monitors a stack where most components are running as part of a cluster, and, as such, there is fault-tolerance built-in thanks, in part, to Cassandra: “a single failing node does not impact our customers. Our system is built to handle these kinds of things. Nevertheless, it is critical to resolve these problems so that they do not escalate.”
+
+When you begin on your journey to a well-monitored system, the first step is always the monitoring and alerts that come built-in. For Marcel and the team, Cassandra’s built-in tools were part of the picture from the outset. “Since we use Apache Cassandra heavily ourselves, there's a lot of built-in alerts,” he told Patrick. A variety of metrics are crucial to understanding service health, such as: “the queuing mutation requests or whether there's high CPU load now on nodes, or if there a [...]
+
+image::blog/inside-Cassandra-Marcel-Birkner/image3.png[single metrics node 1]
+
+image::blog/inside-Cassandra-Marcel-Birkner/image5.png[single metrics node 2]
+
+image::blog/inside-Cassandra-Marcel-Birkner/image4.png[single metrics node 3]
+Detailed statistics for a single metrics Cassandra node.
+
+When scaling, experimentation is vital. Monitoring is a crucial component here since operators need to answer whether modifying provisioning helped at all. “We’re able to monitor very closely what’s happening on the host while we tweak everything, and that lets us compare before and after the tweaks,” says Marcel.
+
+image::blog/inside-Cassandra-Marcel-Birkner/image1.png[1 second metrics resolution]
+1-second metrics resolution helps us identify even small spikes in metrics when making infrastructure changes.
+
+== Using Monitoring needs to Grow the Product
+
+The most important benefit of ‘dog-fooding,’ or using your own product the way your customers do, is the way it highlights how the service needs to expand. For Marcel, that meant finding additional monitoring needs. “So the thing is, we monitor all aspects like network disk I/O on the host JVM [...] if we are missing metrics, we just write the sensor for it, to get additional metrics.”
+
+“If we have problems as an SRE team, and we don't have the insights, we just tell our engineering team,” says Marcel. “Then they just enhance the product, either by new data routes or by new data that they're collecting from the different technologies we monitor. And that makes our lives a lot easier.”
+
+Head here for more information on https://cassandra.apache.org/doc/latest/cassandra/operating/metrics.html[monitoring and metrics within Apache Cassandra], and for a list of third-party projects, tools, and products, head to the project’s https://cassandra.apache.org/_/ecosystem.html[Ecosystem page].
\ No newline at end of file

---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscribe@cassandra.apache.org
For additional commands, e-mail: commits-help@cassandra.apache.org