You are viewing a plain text version of this content. The canonical link for it is here.
Posted to pr@cassandra.apache.org by GitBox <gi...@apache.org> on 2022/04/14 02:48:31 UTC

[GitHub] [cassandra-website] dtopdontstop opened a new pull request, #124: CASSANDRA-17555 April 2022 blog "Inside Cassandra: An Interview with Project Contributor, Aleksandr Sorokoumov"

dtopdontstop opened a new pull request, #124:
URL: https://github.com/apache/cassandra-website/pull/124

   patch by Chris Thornett, Diogenese Topper; reviewed by -- for CASSANDRA-17555
   
   Co-authored by: Chris Thornett <ch...@constantia.io>
   Co-authored by: Diogenese Topper <di...@gmail.com>


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: pr-unsubscribe@cassandra.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: pr-unsubscribe@cassandra.apache.org
For additional commands, e-mail: pr-help@cassandra.apache.org


[GitHub] [cassandra-website] ErickRamirezAU commented on a diff in pull request #124: CASSANDRA-17555 April 2022 blog "Inside Cassandra: An Interview with Project Contributor, Aleksandr Sorokoumov"

Posted by GitBox <gi...@apache.org>.
ErickRamirezAU commented on code in PR #124:
URL: https://github.com/apache/cassandra-website/pull/124#discussion_r854659665


##########
site-content/source/modules/ROOT/pages/blog/Inside-Cassandra-an-interview-with-Project-Contributor-Aleksandr-Sorokoumov.adoc:
##########
@@ -0,0 +1,59 @@
+= Inside Cassandra: An Interview with Project Contributor, Aleksandr Sorokoumov
+:page-layout: single-post
+:page-role: blog-post
+:page-post-date: April 14, 2022

Review Comment:
   ```suggestion
   :page-post-date: April 21, 2022
   ```



##########
site-content/source/modules/ROOT/pages/blog/Inside-Cassandra-an-interview-with-Project-Contributor-Aleksandr-Sorokoumov.adoc:
##########
@@ -0,0 +1,59 @@
+= Inside Cassandra: An Interview with Project Contributor, Aleksandr Sorokoumov
+:page-layout: single-post
+:page-role: blog-post
+:page-post-date: April 14, 2022
+:page-post-author: The Apache Cassandra Community
+:description: The Apache Cassandra Community

Review Comment:
   Add a proper description:
   ```suggestion
   :description: Interview with Aleksandr Sorokoumov
   ```



##########
site-content/source/modules/ROOT/pages/blog/Inside-Cassandra-an-interview-with-Project-Contributor-Aleksandr-Sorokoumov.adoc:
##########
@@ -0,0 +1,59 @@
+= Inside Cassandra: An Interview with Project Contributor, Aleksandr Sorokoumov
+:page-layout: single-post
+:page-role: blog-post
+:page-post-date: April 14, 2022
+:page-post-author: The Apache Cassandra Community
+:description: The Apache Cassandra Community
+:keywords: 
+
+If communities are the heart of a project, contributors are the lifeblood that keeps open source development pumping away. This time we introduce Aleksandr Sorokoumov. If his volunteer work on the Apache Cassandra project inspires you to get involved, try reading our guide xref:development/index.adoc[‘Contributing to Cassandra’].
+ 
+image::blog/inside-Cassandra-Aleksandr-Sorokoumov.png[Aleksandr Sorokoumov]
+
+=== About Our Contributor
+
+Based in Germany, Aleksandr Sorokoumov is a software engineer at Confluent, where he works on ksqlDB, a database purpose-built for stream processing applications. Previously, he was employed at DataStax working on DSE storage engine and AstraDB, a cloud-native database built on top of Apache Cassandra. As well as contributing code to the project, he’s also begun contributing technical write-ups to share some of his hard-fought-for experience with users and contributors.
+
+=== Contributor Questions
+
+*Question:* What are you currently working on, or have worked on, in the past for the Apache Cassandra project?
+
+*Answer:* I am currently working on an article about the internals of CommitLog. This is a component in Apache Cassandra responsible for durability. I worked on several projects at my previous job that required an in-depth understanding of how CommitLog works. It took me some time to find out the details as I had to dig into old JIRA issues and ask experts clarifying questions. I believe this information will benefit DBAs and new contributors; therefore, I felt it would be good to share it.
+
+Previously, I worked on improving serialization efficiency (https://issues.apache.org/jira/browse/CASSANDRA-15215[CASSANDRA-15215^]). https://developers.google.com/protocol-buffers/docs/encoding#varints[VInt^] (or Varints) is an encoding used to serialize integers, so that smaller numbers occupy less space. With my patch, the serialization throughput increased by up to 30%.
+
+*Question:* What’s been the most rewarding aspect of being part of the Apache Cassandra community?
+
+*Answer:* There are two important aspects I would like to mention. First of all, it is interaction with the community members. People working on Apache Cassandra are among the most supportive and experienced professionals I have had a chance to work with. Collaborating with them creates a ton of learning opportunities. The second aspect is the impact of the work you do. Apache Cassandra is a database running at a massive scale that powers the infrastructure of many well-known companies. It just feels great to know that my work is deployed at that scale.
+
+*Question:* For someone reading this and can help, what types of contributions or support do you think the Cassandra community needs, or is there anything they can do to help you?
+
+*Answer:* In my opinion, one of the areas the Cassandra community will always appreciate help is work on quality. Cassandra 4.0 was a great breakthrough for the project in paying back the technical debt. 
+
+With many upcoming exciting features in-flight, it takes constant effort from everyone to maintain the high bar. In my opinion, fixing a failing test is one of the best ways to get started. Such issues are often well-scoped, and the community always welcomes patches in this area.
+
+_*Note:* If you’d like to try fixing some failing tests, you can find unassigned tickets on https://issues.apache.org/jira/secure/RapidBoard.jspa?rapidView=496&quickFilter=2252[this kanban board^]._
+
+*Question:* What areas of interest and fields are you passionate about in your career?
+
+*Answer:* I am passionate about distributed databases and streaming systems. One of the aspects I love about these fields is the bi-directional feedback loop between industry and academia. On the one hand, concepts behind systems like Apache Cassandra were first described in research publications. For example, https://cassandra.apache.org/doc/latest/cassandra/new/transientreplication.html[Transient Replication] implements an algorithm published in http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.146.3429&rep=rep1&type=pdf[a paper in 1986^]. How cool is that? On the other hand, the communities working on production systems contribute back new theoretical approaches. A recent example is https://cwiki.apache.org/confluence/x/FQRACw[Accord^], which is a leaderless protocol for distributed transactions.
+
+Successful projects in these areas often live for a long time and have a massive impact on the entire industry. Apache Cassandra is 13 years old, an impressive age for a piece of software, yet we have an energetic community and so many exciting upcoming features.
+
+*Question:* How were you introduced to open source software development, and what was your first contribution to an open source project?
+
+*Answer:* I was fortunate to participate in https://summerofcode.withgoogle.com/[Google Summer of Code in 2014^] during my Master's studies. I worked on https://github.com/incanter/incanter[Incanter^], a data-science library for Clojure and its integration with https://github.com/mikera/core.matrix[core.matrix^] - a pluggable backend for array programming.
+
+*Question:* What advice would you give to someone getting started with open source projects?
+
+*Answer:* Don't be shy to reach out to the community. I know from personal experience that it might feel scary at first to ask questions in public mediums, such as mailing lists or Slack. However, there are many fantastic open source communities, such as Apache Cassandra, where people will be happy to help you find a starting issue, answer questions, and review patches.
+
+*Question:* What do you like to do with your spare time when you're not volunteering on the project?
+
+*Answer:* I like to learn new skills in my free time. I am currently learning how to play an electric guitar.
+
+=== How to contribute
+
+As well as the diving into failing tests, as mentioned above, a good starting point for anyone wanting to help with code is our unassigned “Starter Tickets” for versions 4.0.x and 4.1.x, which you’ll find listed on https://issues.apache.org/jira/secure/RapidBoard.jspa?rapidView=484&quickFilter=2162&quickFilter=2160[this kanban board^]. Just remember to assign yourself to the ticket and acknowledge the status, such as ‘Work in Progress’ and ‘Needs Committer/Patch Available’ when you submit your patch.
+
+You can also introduce yourself to active developers on the https://the-asf.slack.com/[ASF Slack^] in the #cassandra-dev Slack channel. If you use @cassandra_mentors, that will put you in touch with our Cassandra mentors.

Review Comment:
   ```suggestion
   You can also introduce yourself to active developers on the https://the-asf.slack.com/[ASF Slack^] in the #cassandra-dev Slack channel. If you use `@cassandra_mentors`, that will put you in touch with our Cassandra mentors.
   ```



##########
site-content/source/modules/ROOT/pages/blog/Inside-Cassandra-an-interview-with-Project-Contributor-Aleksandr-Sorokoumov.adoc:
##########
@@ -0,0 +1,59 @@
+= Inside Cassandra: An Interview with Project Contributor, Aleksandr Sorokoumov
+:page-layout: single-post
+:page-role: blog-post
+:page-post-date: April 14, 2022
+:page-post-author: The Apache Cassandra Community
+:description: The Apache Cassandra Community
+:keywords: 
+
+If communities are the heart of a project, contributors are the lifeblood that keeps open source development pumping away. This time we introduce Aleksandr Sorokoumov. If his volunteer work on the Apache Cassandra project inspires you to get involved, try reading our guide xref:development/index.adoc[‘Contributing to Cassandra’].
+ 
+image::blog/inside-Cassandra-Aleksandr-Sorokoumov.png[Aleksandr Sorokoumov]
+
+=== About Our Contributor
+
+Based in Germany, Aleksandr Sorokoumov is a software engineer at Confluent, where he works on ksqlDB, a database purpose-built for stream processing applications. Previously, he was employed at DataStax working on DSE storage engine and AstraDB, a cloud-native database built on top of Apache Cassandra. As well as contributing code to the project, he’s also begun contributing technical write-ups to share some of his hard-fought-for experience with users and contributors.
+
+=== Contributor Questions
+
+*Question:* What are you currently working on, or have worked on, in the past for the Apache Cassandra project?
+
+*Answer:* I am currently working on an article about the internals of CommitLog. This is a component in Apache Cassandra responsible for durability. I worked on several projects at my previous job that required an in-depth understanding of how CommitLog works. It took me some time to find out the details as I had to dig into old JIRA issues and ask experts clarifying questions. I believe this information will benefit DBAs and new contributors; therefore, I felt it would be good to share it.
+
+Previously, I worked on improving serialization efficiency (https://issues.apache.org/jira/browse/CASSANDRA-15215[CASSANDRA-15215^]). https://developers.google.com/protocol-buffers/docs/encoding#varints[VInt^] (or Varints) is an encoding used to serialize integers, so that smaller numbers occupy less space. With my patch, the serialization throughput increased by up to 30%.
+
+*Question:* What’s been the most rewarding aspect of being part of the Apache Cassandra community?
+
+*Answer:* There are two important aspects I would like to mention. First of all, it is interaction with the community members. People working on Apache Cassandra are among the most supportive and experienced professionals I have had a chance to work with. Collaborating with them creates a ton of learning opportunities. The second aspect is the impact of the work you do. Apache Cassandra is a database running at a massive scale that powers the infrastructure of many well-known companies. It just feels great to know that my work is deployed at that scale.
+
+*Question:* For someone reading this and can help, what types of contributions or support do you think the Cassandra community needs, or is there anything they can do to help you?
+
+*Answer:* In my opinion, one of the areas the Cassandra community will always appreciate help is work on quality. Cassandra 4.0 was a great breakthrough for the project in paying back the technical debt. 
+
+With many upcoming exciting features in-flight, it takes constant effort from everyone to maintain the high bar. In my opinion, fixing a failing test is one of the best ways to get started. Such issues are often well-scoped, and the community always welcomes patches in this area.
+
+_*Note:* If you’d like to try fixing some failing tests, you can find unassigned tickets on https://issues.apache.org/jira/secure/RapidBoard.jspa?rapidView=496&quickFilter=2252[this kanban board^]._
+
+*Question:* What areas of interest and fields are you passionate about in your career?
+
+*Answer:* I am passionate about distributed databases and streaming systems. One of the aspects I love about these fields is the bi-directional feedback loop between industry and academia. On the one hand, concepts behind systems like Apache Cassandra were first described in research publications. For example, https://cassandra.apache.org/doc/latest/cassandra/new/transientreplication.html[Transient Replication] implements an algorithm published in http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.146.3429&rep=rep1&type=pdf[a paper in 1986^]. How cool is that? On the other hand, the communities working on production systems contribute back new theoretical approaches. A recent example is https://cwiki.apache.org/confluence/x/FQRACw[Accord^], which is a leaderless protocol for distributed transactions.

Review Comment:
   ```suggestion
   *Answer:* I am passionate about distributed databases and streaming systems. One of the aspects I love about these fields is the bi-directional feedback loop between industry and academia. On the one hand, concepts behind systems like Apache Cassandra were first described in research publications. For example, link:/doc/latest/cassandra/new/transientreplication.html[Transient Replication] implements an algorithm published in http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.146.3429&rep=rep1&type=pdf[a paper in 1986^]. How cool is that? On the other hand, the communities working on production systems contribute back new theoretical approaches. A recent example is https://cwiki.apache.org/confluence/x/FQRACw[Accord^], which is a leaderless protocol for distributed transactions.
   ```



##########
site-content/source/modules/ROOT/pages/blog.adoc:
##########
@@ -8,6 +8,31 @@ NOTES FOR CONTENT CREATORS
 - Replace post tile, date, description and link to you post.
 ////
 
+//start card
+[openblock,card shadow relative test]
+----
+[openblock,card-header]
+------
+[discrete]
+=== Inside Cassandra: An Interview with Project Contributor, Aleksandr Sorokoumov
+[discrete]
+==== April 14, 2022

Review Comment:
   ```suggestion
   ==== April 21, 2022
   ```



##########
site-content/source/modules/ROOT/pages/blog.adoc:
##########
@@ -8,6 +8,31 @@ NOTES FOR CONTENT CREATORS
 - Replace post tile, date, description and link to you post.
 ////
 
+//start card
+[openblock,card shadow relative test]
+----
+[openblock,card-header]
+------
+[discrete]
+=== Inside Cassandra: An Interview with Project Contributor, Aleksandr Sorokoumov
+[discrete]
+==== April 14, 2022
+------
+[openblock,card-content]
+------
+We continue our Inside Cassandra series with a Q&A with Aleksandr Sorokoumov. Like Lorina Poland and others, he recently accepted the committer position in recognition of his contributions, including his work on improving serialization efficiency.

Review Comment:
   We need to limit the content so it fits in the card:
   ```suggestion
   We continue our Inside Cassandra series with a Q&A with Aleksandr Sorokoumov who recently accepted the committer position in recognition of his contributions.
   ```



##########
site-content/source/modules/ROOT/pages/blog/Inside-Cassandra-an-interview-with-Project-Contributor-Aleksandr-Sorokoumov.adoc:
##########
@@ -0,0 +1,59 @@
+= Inside Cassandra: An Interview with Project Contributor, Aleksandr Sorokoumov
+:page-layout: single-post
+:page-role: blog-post
+:page-post-date: April 14, 2022
+:page-post-author: The Apache Cassandra Community
+:description: The Apache Cassandra Community
+:keywords: 
+
+If communities are the heart of a project, contributors are the lifeblood that keeps open source development pumping away. This time we introduce Aleksandr Sorokoumov. If his volunteer work on the Apache Cassandra project inspires you to get involved, try reading our guide xref:development/index.adoc[‘Contributing to Cassandra’].
+ 
+image::blog/inside-Cassandra-Aleksandr-Sorokoumov.png[Aleksandr Sorokoumov]
+
+=== About Our Contributor
+
+Based in Germany, Aleksandr Sorokoumov is a software engineer at Confluent, where he works on ksqlDB, a database purpose-built for stream processing applications. Previously, he was employed at DataStax working on DSE storage engine and AstraDB, a cloud-native database built on top of Apache Cassandra. As well as contributing code to the project, he’s also begun contributing technical write-ups to share some of his hard-fought-for experience with users and contributors.
+
+=== Contributor Questions
+
+*Question:* What are you currently working on, or have worked on, in the past for the Apache Cassandra project?
+
+*Answer:* I am currently working on an article about the internals of CommitLog. This is a component in Apache Cassandra responsible for durability. I worked on several projects at my previous job that required an in-depth understanding of how CommitLog works. It took me some time to find out the details as I had to dig into old JIRA issues and ask experts clarifying questions. I believe this information will benefit DBAs and new contributors; therefore, I felt it would be good to share it.
+
+Previously, I worked on improving serialization efficiency (https://issues.apache.org/jira/browse/CASSANDRA-15215[CASSANDRA-15215^]). https://developers.google.com/protocol-buffers/docs/encoding#varints[VInt^] (or Varints) is an encoding used to serialize integers, so that smaller numbers occupy less space. With my patch, the serialization throughput increased by up to 30%.
+
+*Question:* What’s been the most rewarding aspect of being part of the Apache Cassandra community?
+
+*Answer:* There are two important aspects I would like to mention. First of all, it is interaction with the community members. People working on Apache Cassandra are among the most supportive and experienced professionals I have had a chance to work with. Collaborating with them creates a ton of learning opportunities. The second aspect is the impact of the work you do. Apache Cassandra is a database running at a massive scale that powers the infrastructure of many well-known companies. It just feels great to know that my work is deployed at that scale.
+
+*Question:* For someone reading this and can help, what types of contributions or support do you think the Cassandra community needs, or is there anything they can do to help you?
+
+*Answer:* In my opinion, one of the areas the Cassandra community will always appreciate help is work on quality. Cassandra 4.0 was a great breakthrough for the project in paying back the technical debt. 
+
+With many upcoming exciting features in-flight, it takes constant effort from everyone to maintain the high bar. In my opinion, fixing a failing test is one of the best ways to get started. Such issues are often well-scoped, and the community always welcomes patches in this area.
+
+_*Note:* If you’d like to try fixing some failing tests, you can find unassigned tickets on https://issues.apache.org/jira/secure/RapidBoard.jspa?rapidView=496&quickFilter=2252[this kanban board^]._

Review Comment:
   ```suggestion
   *Note:* _If you’d like to try fixing some failing tests, you can find unassigned tickets on https://issues.apache.org/jira/secure/RapidBoard.jspa?rapidView=496&quickFilter=2252[this kanban board^]._
   ```



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: pr-unsubscribe@cassandra.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: pr-unsubscribe@cassandra.apache.org
For additional commands, e-mail: pr-help@cassandra.apache.org


[GitHub] [cassandra-website] ErickRamirezAU merged pull request #124: CASSANDRA-17555 April 2022 blog "Inside Cassandra: An Interview with Project Contributor, Aleksandr Sorokoumov"

Posted by GitBox <gi...@apache.org>.
ErickRamirezAU merged PR #124:
URL: https://github.com/apache/cassandra-website/pull/124


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: pr-unsubscribe@cassandra.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: pr-unsubscribe@cassandra.apache.org
For additional commands, e-mail: pr-help@cassandra.apache.org