Posted to common-user@hadoop.apache.org by steph <st...@conviva.com> on 2008/09/24 19:48:40 UTC
HDFS ingest rate
Hi,
Are there any performance numbers on how fast HDFS can ingest data?
I am assuming a case where multiple processes outside Hadoop write into
Hadoop in parallel. I understand that the rate probably depends on various
hardware constraints, but any existing numbers would be interesting. In
particular:
Is the ingest rate tied to the number of nodes in the cluster? Is it
somehow linear?
What is the impact of ingesting data at a high rate on the map/reduce
jobs running at the same time?
Thanks,
S.