You are viewing a plain text version of this content. The canonical link for it is here.
Posted to common-user@hadoop.apache.org by Fabrizio detto Mario <jf...@gmail.com> on 2008/05/15 13:39:23 UTC
Master Node in Hadoop Map/Reduce Implementation.
Hello Hadoop community,
I read about Hadoop framework (
http://hadoop.apache.org/core/docs/r0.16.3/mapred_tutorial.html) this
phrase:
"The Map-Reduce framework consists of a single master JobTracker and one
slave TaskTracker per cluster-node..."
Is The Job Tracker (Master Node) unique and static for my cluster? Is The
Job Tracker the same for each map/reduce run?
If the response is positive, hadoop is different from map/reduce Google
implementation. In Google Implementation the Master node is "one copies of
program" (See MapReduce: Simplified Data Processing on Large Clusters).
Thanks.
--
Fabrizio
Re: Master Node in Hadoop Map/Reduce Implementation.
Posted by Amar Kamat <am...@yahoo-inc.com>.
Amar Kamat wrote:
> Fabrizio detto Mario wrote:
>> Hello Hadoop community,
>> I read about Hadoop framework (
>> http://hadoop.apache.org/core/docs/r0.16.3/mapred_tutorial.html) this
>> phrase:
>> "The Map-Reduce framework consists of a single master JobTracker and one
>> slave TaskTracker per cluster-node..."
>>
>> Is The Job Tracker (Master Node) unique and static for my cluster? Is
>> The
>> Job Tracker the same for each map/reduce run?
>>
>>
> The job-tracker process controls certain task-trackers processes.
> These task-trackers can be on same machine or different machines. One
> task-tracker is controlled by a single job-tracker. You could have
> multiple map-reduce clusters (each with different job-trackers) on a
> same physical cluster. So here are the answers
> 1) The job-tracker is unique for a map-reduce cluster (logical)
> consisting of a set of trackers. There can be multiple such
> job-trackers each controlling different set of trackers, possibly in
> the same physical cluster.
> 2) The job-tracker is same for all jobs that run on the same set of
> trackers.
> It is possible to have multiple task-trackers running simultaneously
> on a same machine (differentiated by ports) and hence each tracker can
> be under different job-trackers and hence one machine
"and hence each tracker can be under different job-trackers .."
should be
"and hence each node can be under different job-trackers .."
Amar
> can be under multiple logical clusters.
> Amar
>> If the response is positive, hadoop is different from map/reduce Google
>> implementation. In Google Implementation the Master node is "one
>> copies of
>> program" (See MapReduce: Simplified Data Processing on Large Clusters).
>>
>> Thanks.
>>
>>
>
Re: Master Node in Hadoop Map/Reduce Implementation.
Posted by Amar Kamat <am...@yahoo-inc.com>.
Fabrizio detto Mario wrote:
> Hello Hadoop community,
> I read about Hadoop framework (
> http://hadoop.apache.org/core/docs/r0.16.3/mapred_tutorial.html) this
> phrase:
> "The Map-Reduce framework consists of a single master JobTracker and one
> slave TaskTracker per cluster-node..."
>
> Is The Job Tracker (Master Node) unique and static for my cluster? Is The
> Job Tracker the same for each map/reduce run?
>
>
The job-tracker process controls certain task-trackers processes. These
task-trackers can be on same machine or different machines. One
task-tracker is controlled by a single job-tracker. You could have
multiple map-reduce clusters (each with different job-trackers) on a
same physical cluster. So here are the answers
1) The job-tracker is unique for a map-reduce cluster (logical)
consisting of a set of trackers. There can be multiple such job-trackers
each controlling different set of trackers, possibly in the same
physical cluster.
2) The job-tracker is same for all jobs that run on the same set of
trackers.
It is possible to have multiple task-trackers running simultaneously on
a same machine (differentiated by ports) and hence each tracker can be
under different job-trackers and hence one machine can be under multiple
logical clusters.
Amar
> If the response is positive, hadoop is different from map/reduce Google
> implementation. In Google Implementation the Master node is "one copies of
> program" (See MapReduce: Simplified Data Processing on Large Clusters).
>
> Thanks.
>
>