You are viewing a plain text version of this content. The canonical link for it is here.

Posted to mapreduce-user@hadoop.apache.org by arun k <ar...@gmail.com> on 2011/11/15 15:22:02 UTC

How is data of each job assigned to nodes in Mumak ?

Hi guys !

Q> How can i assign data of each job in mumak nodes and what else i need to
do ?
In general how can i use the pluggable block-placement for HDFS in Mumak ?
Meaning in my context i am using 19-jobs-trace json file and modified
topology json file consisting of say 4 nodes. Since the number of tasks(map
& reduce) are fixed for these jobs i want to assign the input data( with
their replication) of each job to some particular nodes, so that i can use
this info in my scheduler . Does the code for "ObtainLocalMapTask" /
"ObtainNonlocalMaptask" in Scheduler need to be changed if i have to use
this data placement info ?

Q>If i have to add some fields to jobs in job trace file , how do i add it
to job trace and access in my scheduler code for scheduling in Mumak.
Which classes do i need to modify for this?


Thanks,
Arun