You are viewing a plain text version of this content. The canonical link for it is here.
Posted to users@kafka.apache.org by "guoliang.cao@ipinyou.com" <gu...@ipinyou.com> on 2016/07/21 07:44:28 UTC

"Powered By" - Kafka in iPinYou

iPinYou (www.ipinyou.com.cn/?defaultLocale=en) is the largest DSP in China which has its HQ in Beijing and offices in Shanghai, Guangzhou, Silicon Valley and Seattle.

Kafka clusters are the central data hub in iPinYou. All kinds of Internet display advertising data, such as bid/no-bid, impression, click, advertiser, conversion and etc., are collected as primary data streams into Kafka brokers in real time, by LogAggregator (a substitute for Apache Flume, which is implemented in C/C++ by iPinYou, has customized functionality, better performance, lower resource-consuming). And quite lots of data in Kafka brokers is loaded into HDFS in near real time by Kafka2HDFS (a distributed pipeline system which is implemented in C/C++ by iPinYou and offers flexible tuning between latency and throughput). And quite lots of data in Kafka brokers is consumered by streaming applications on Apache Storm and Apache Spark Streaming.