You are viewing a plain text version of this content. The canonical link for it is here.
Posted to users@kafka.apache.org by Pariksheet Barapatre <pa...@gmail.com> on 2016/02/22 12:24:27 UTC

Kafka basic doubts

Hi All,

Greetings..!!! This is my first email to Kafka Community.

I have just started exploring Kafka on CDH5.5 cluster which ships with
Kafka 0.8.2.1.

I am able to run sample programs for producer as well as consumer (both
high level and low level).

Now I am trying to load messages from Kafka to HDFS in batch i.e. every
hour.

Managing an offsets at partition level, I guess will do a trick but I am
confused about offset itself. Is it a line number or byte offset.

I tried using Kangaroo project but no luck. It assumes offset as number of
bytes whereas I am getting line number as offset.

Also, Kafka Connect service is introduced in Kafka 0.9, does anybody tried
loading data from Kafka to HDFS using it.

Many Thanks in Advance.

Regards
Pari