You are viewing a plain text version of this content. The canonical link for it is here.
Posted to users@kafka.apache.org by Mich Talebzadeh <mi...@gmail.com> on 2017/03/07 19:35:12 UTC

What is viable hardware for Zookeeper and Kafka brokers?

Hi,

I have read in the note below

https://www.quora.com/What-is-viable-hardware-for-Zookeeper-and-Kafka-brokers

Which states

"Take your expected message size * expected messages/second, and multiply
that by how many seconds you would like to keep your messages available in
your kafka cluster for. Then multiply that size by two or more depending on
what you choose your replication factor (ie: redundancy) to be, and divide
that by how many brokers you'll have in your cluster. This will give you
how much hard drive space per broker that you need.

Kafka is a sequential message queue - there aren't any random reads and
writes. This means that it can take heavy advantage of disk read-ahead,
disk buffering, and linux kernel/filesystem optimizations. A typical hard
disk doing sequential reads/writes is easily able to saturate a gigabit
link, and so is Kafka with any decent CPU made in the last 5 years. This
means that you can get by with big, cheap, slow disks.

Please remember that the disk buffer cache lives in your RAM. That means
that you need sufficient RAM in order to store a certain time period of
messages in cache. If you're not doing anything crazy and are always
expecting your consumers to be up to date, then you don't have to worry too
much. Just keep this in mind should your consumers fall behind far enough
that they need to retrieve data outside of buffer cache. I personally would
use the money saved on disk to buy more RAM."


This pretty old (May 2015) and I am wondering if it refers to Kafka < 0.8).
However, the calculation's seems to be plausible. Any comments on it please?

Thanks


Dr Mich Talebzadeh



LinkedIn * https://www.linkedin.com/profile/view?id=AAEAAAAWh2gBxianrbJd6zP6AcPCCdOABUrV8Pw
<https://www.linkedin.com/profile/view?id=AAEAAAAWh2gBxianrbJd6zP6AcPCCdOABUrV8Pw>*



http://talebzadehmich.wordpress.com


*Disclaimer:* Use it at your own risk. Any and all responsibility for any
loss, damage or destruction of data or any other property which may arise
from relying on this email's technical content is explicitly disclaimed.
The author will in no case be liable for any monetary damages arising from
such loss, damage or destruction.