Posted to common-user@hadoop.apache.org by Neil Blue <Ne...@biowisdom.com> on 2009/11/03 12:23:25 UTC

Slave data node failing to connect?

Hello


I am trying to start up my first two-node Hadoop cluster. I followed this
guide:

http://www.michael-noll.com/wiki/Running_Hadoop_On_Ubuntu_Linux_%28Multi-Node_Cluster%29

I got two machines running as single-node instances and then moved on to
connect them into a multi-node cluster.

I have two Ubuntu instances running in VirtualBox with a bridged network
adapter.

I have configured the xml files slaves and master to point to the correct
machines, along with the ssh key.

When I start up the services I get all these starting on the master:

JobTracker
DataNode
SecondaryNameNode
TaskTracker
NameNode

The web interface shows the system is up and running with one node. 

On the slave these are running:
TaskTracker
DataNode

The output logs on the slave show:

hadoop-hadoop-datanode-slave.log
2009-11-03 11:15:52,055 INFO org.apache.hadoop.ipc.Client: Retrying connect to server: master/172.18.11.95:4310. Already tried 9 time(s).
2009-11-03 11:15:52,057 INFO org.apache.hadoop.ipc.RPC: Server at master/172.18.11.95:4310 not available yet, Zzzzz...
2009-11-03 11:15:54,063 INFO org.apache.hadoop.ipc.Client: Retrying connect to server: master/172.18.11.95:4310. Already tried 0 time(s).
2009-11-03 11:15:55,064 INFO org.apache.hadoop.ipc.Client: Retrying connect to server: master/172.18.11.95:4310. Already tried 1 time(s).
2009-11-03 11:15:56,068 INFO org.apache.hadoop.ipc.Client: Retrying connect to server: master/172.18.11.95:4310. Already tried 2 time(s).
2009-11-03 11:15:57,073 INFO org.apache.hadoop.ipc.Client: Retrying connect to server: master/172.18.11.95:4310. Already tried 3 time(s).

hadoop-hadoop-tasktracker-slave.log
2009-11-03 11:18:01,002 INFO org.apache.hadoop.ipc.Client: Retrying connect to server: master/172.18.11.95:9001. Already tried 9 time(s).
2009-11-03 11:18:01,004 INFO org.apache.hadoop.ipc.RPC: Server at master/172.18.11.95:9001 not available yet, Zzzzz...
2009-11-03 11:18:03,007 INFO org.apache.hadoop.ipc.Client: Retrying connect to server: master/172.18.11.95:9001. Already tried 0 time(s).
2009-11-03 11:18:04,009 INFO org.apache.hadoop.ipc.Client: Retrying connect to server: master/172.18.11.95:9001. Already tried 1 time(s).
2009-11-03 11:18:05,011 INFO org.apache.hadoop.ipc.Client: Retrying connect to server: master/172.18.11.95:9001. Already tried 2 time(s).

Tcpdump shows that packets are being sent between the machines, and ssh
works, so there do not seem to be any network problems. Also, from the slave,
the master's http://master:50070/dfshealth.jsp page is reachable.

I have also tried changing the port numbers used by the master, but no luck.

Any suggestions, please?

Thanks
Neil



Re: Slave data node failing to connect?

Posted by Jason Venner <ja...@gmail.com>.
Either your master NameNode/JobTracker daemons are not actually starting, or
they are not listening on those particular ports, or there is a networking
issue.
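
A minimal sketch of how the last two possibilities could be checked from the
slave, assuming Python 3 is available there. The function names are
hypothetical helpers, not part of Hadoop. The loopback check rests on an
assumption that is not confirmed in this thread: if the name "master" resolves
to a 127.x.x.x address on the master itself, the daemons may bind to loopback
and be unreachable from other machines.

```python
import socket


def can_connect(host, port, timeout=2.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts and name-resolution failures.
        return False


def resolves_to_loopback(host):
    """Return True if host resolves to a 127.x.x.x address (IPv4 only)."""
    try:
        return socket.gethostbyname(host).startswith("127.")
    except OSError:
        return False


def check_ports(host, ports, timeout=2.0):
    """Map each port to whether a TCP connection to it could be opened."""
    return {port: can_connect(host, port, timeout) for port in ports}
```

Running check_ports("master", [4310, 9001, 50070]) on the slave would show
whether the RPC ports from the logs are reachable while the web UI port is.
Running resolves_to_loopback("master") on the master would show whether the
hostname maps to a loopback address there.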





Hadoop thread in my blog

Posted by Mikio Uzawa <m_...@amber.plala.or.jp>.
Hi all.

I have started a Hadoop thread on my blog:
http://jclouds.wordpress.com/category/hadoop/

There are only two posts so far, but there will be
many more Hadoop topics, because Hadoop Conference
Japan will take place next week.

Please look forward to it!

/mikio uzawa