You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@flume.apache.org by Sushruth Puttaswamy <su...@nor1.com> on 2012/01/04 19:20:11 UTC

flume + hive options?

Guys,

Whats the best known usage of flume with hive? Just curious to see what everyone is using. My requirements are standard..


 *   Currently writing logs onto HDFS from different production servers.
 *   Need to pre process the logs before writing onto hive.
 *   Need a way to merge the files generated by flume.

I see that there is a flume+hive sink plugin, but did not find much usage data on that. I could write a custom sink or a custom decorator to do the pre processing & then run every hour cron jobs to write data from HDFS to hive.

Any suggestions?

Sushruth