Posted to common-user@hadoop.apache.org by Xianqing Yu <xy...@ncsu.edu> on 2011/10/12 05:37:55 UTC

About Tasktracker and DataNode

Hi people,

I have a question about how to set up a Hadoop cluster. Can I run the TaskTracker and the DataNode on different machines? That is, one machine would run only the TaskTracker daemon, and another machine would run only the DataNode daemon.

Thanks,

Xianqing

Re: About Tasktracker and DataNode

Posted by patrick sang <si...@gmail.com>.
Just want to add a little on top.
I believe that you have to start
the set of datanodes (start-dfs.sh)
before
the set of tasktrackers (start-mapred.sh)
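
A rough sketch of that ordering, in case it helps. It assumes HADOOP_HOME points at the install directory (/opt/hadoop is only a placeholder), and the run and start_cluster helpers and the DRY_RUN switch are made up for illustration. By default it only prints what it would run:

```shell
#!/usr/bin/env bash
# Sketch: bring up HDFS before MapReduce.
# HADOOP_HOME is an assumed install path, not something from this thread.
HADOOP_HOME="${HADOOP_HOME:-/opt/hadoop}"
# DRY_RUN=1 (the default here) prints commands instead of executing them.
DRY_RUN="${DRY_RUN:-1}"

# Hypothetical helper: execute a command, or just print it in dry-run mode.
run() {
  if [ "$DRY_RUN" = "1" ]; then
    echo "would run: $*"
  else
    "$@"
  fi
}

# Hypothetical helper: start the HDFS daemons first, then MapReduce.
start_cluster() {
  run "$HADOOP_HOME/bin/start-dfs.sh" || return 1
  run "$HADOOP_HOME/bin/start-mapred.sh"
}

start_cluster
```

Set DRY_RUN=0 to actually launch the scripts. As far as I know both scripts read the same slaves file by default, so with separate machine sets you would probably start the daemons per host with hadoop-daemon.sh instead.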

hth
p



On Tue, Oct 11, 2011 at 9:10 PM, Harsh J <ha...@cloudera.com> wrote:

> Yes, you can do this - the services are not coupled with one another.
>
> Just start tasktrackers on one set of machines, and datanodes on
> another set of machines (via bin/hadoop-daemon.sh start
> {tasktracker,datanode} or so, individually.)
>
> You will lose out on complete data locality during processing, however.
>
> On Wed, Oct 12, 2011 at 9:07 AM, Xianqing Yu <xy...@ncsu.edu> wrote:
> > Hi people,
> >
> > I have a question about how to setup hadoop cluster. Could I set
> > TaskTracker and DataNode running on the different machines? Which means one
> > machine with Tasktracker only, and one machine has DataNode daemon only.
> >
> > Thanks,
> >
> > Xianqing
> >
>
>
>
> --
> Harsh J
>

Re: About Tasktracker and DataNode

Posted by Harsh J <ha...@cloudera.com>.
Yes, you can do this - the services are not coupled with one another.

Just start tasktrackers on one set of machines, and datanodes on
another set of machines (via bin/hadoop-daemon.sh start
{tasktracker,datanode} or so, individually.)

You will lose out on complete data locality during processing, however.
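
To make the above concrete, here is a minimal sketch that prints the per-host start commands for two disjoint sets of machines. The host names (dn1, dn2, tt1, tt2), the /opt/hadoop default for HADOOP_HOME, and the start_cmds helper are all hypothetical; it also assumes passwordless ssh to each host. It only echoes the commands, so nothing is started until you run them yourself:

```shell
#!/usr/bin/env bash
# Sketch: print the commands that would start DataNodes and TaskTrackers
# on two separate sets of machines.
# HADOOP_HOME is an assumed install path; the host names are placeholders.
HADOOP_HOME="${HADOOP_HOME:-/opt/hadoop}"

DATANODE_HOSTS="dn1 dn2"      # hypothetical storage-only machines
TASKTRACKER_HOSTS="tt1 tt2"   # hypothetical compute-only machines

# Hypothetical helper: emit one ssh command per host for the given daemon.
start_cmds() {
  local daemon="$1"; shift
  local host
  for host in "$@"; do
    echo "ssh $host $HADOOP_HOME/bin/hadoop-daemon.sh start $daemon"
  done
}

# Word splitting of the host lists is intentional here.
start_cmds datanode $DATANODE_HOSTS
start_cmds tasktracker $TASKTRACKER_HOSTS
```

Since no TaskTracker shares a machine with a DataNode in this layout, every map task reads its input block over the network, which is the locality cost mentioned above.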

On Wed, Oct 12, 2011 at 9:07 AM, Xianqing Yu <xy...@ncsu.edu> wrote:
> Hi people,
>
> I have a question about how to setup hadoop cluster. Could I set TaskTracker and DataNode running on the different machines? Which means one machine with Tasktracker only, and one machine has DataNode daemon only.
>
> Thanks,
>
> Xianqing
>



-- 
Harsh J