You are viewing a plain text version of this content. The canonical link for it is here.
Posted to common-user@hadoop.apache.org by Xianqing Yu <xy...@ncsu.edu> on 2011/10/12 05:37:55 UTC
About Tasktracker and DataNode
Hi people,
I have a question about how to setup hadoop cluster. Could I set TaskTracker and DataNode running on the different machines? Which means one machine with Tasktracker only, and one machine has DataNode daemon only.
Thanks,
Xianqing
Re: About Tasktracker and DataNode
Posted by patrick sang <si...@gmail.com>.
just want to add little on top.
I believe that you have to start
set of datanode (start-dfs.sh)
before
set of trasktracker (start-mapred.sh)
hth
p
On Tue, Oct 11, 2011 at 9:10 PM, Harsh J <ha...@cloudera.com> wrote:
> Yes, you can do this - the services are not coupled with one another.
>
> Just start tasktrackers on one set of machines, and datanodes on
> another set of machines (via bin/hadoop-daemon.sh start
> {tasktracker,datanode} or so, individually.)
>
> You will lose out on complete data locality during processing, however.
>
> On Wed, Oct 12, 2011 at 9:07 AM, Xianqing Yu <xy...@ncsu.edu> wrote:
> > Hi people,
> >
> > I have a question about how to setup hadoop cluster. Could I set
> TaskTracker and DataNode running on the different machines? Which means one
> machine with Tasktracker only, and one machine has DataNode daemon only.
> >
> > Thanks,
> >
> > Xianqing
> >
>
>
>
> --
> Harsh J
>
Re: About Tasktracker and DataNode
Posted by Harsh J <ha...@cloudera.com>.
Yes, you can do this - the services are not coupled with one another.
Just start tasktrackers on one set of machines, and datanodes on
another set of machines (via bin/hadoop-daemon.sh start
{tasktracker,datanode} or so, individually.)
You will lose out on complete data locality during processing, however.
On Wed, Oct 12, 2011 at 9:07 AM, Xianqing Yu <xy...@ncsu.edu> wrote:
> Hi people,
>
> I have a question about how to setup hadoop cluster. Could I set TaskTracker and DataNode running on the different machines? Which means one machine with Tasktracker only, and one machine has DataNode daemon only.
>
> Thanks,
>
> Xianqing
>
--
Harsh J