You are viewing a plain text version of this content. The canonical link for it is here.
Posted to common-user@hadoop.apache.org by jagaran das <ja...@yahoo.co.in> on 2011/08/13 10:00:21 UTC
MR job to copy to hadoop
Hi,
What is the best and fast way to achieve parallel copy to hadoop from an NFS mount?
We have a mount with huge number of files and we need to copy it into hdfs.
Some options:
1. Run copyFromLocal in a multithreaded way
2. Use distcp in an isolated way.
3. Can i write a map only job to do copy?
Regards,
JD