Posted to user@hbase.apache.org by "Ananth T. Sarathy" <an...@gmail.com> on 2010/01/13 18:43:55 UTC

Problem With Zookeeper

I have hbase-env.sh set so that HBase manages ZooKeeper. When I try to start
HBase, the ZooKeeper .out file says

java.io.IOException: Could not find my address: localhost in list of ZooKeeper quorum servers
        at org.apache.hadoop.hbase.zookeeper.HQuorumPeer.writeMyID(HQuorumPeer.java:128)
        at org.apache.hadoop.hbase.zookeeper.HQuorumPeer.main(HQuorumPeer.java:67)

in my hbase-site.xml

 <property>
   <name>hbase.zookeeper.quorum</name>
   <value>gs2,gs3,gs4</value>
   <description>Comma separated list of servers in the ZooKeeper Quorum.
   For example, "host1.mydomain.com,host2.mydomain.com,host3.mydomain.com".
   By default this is set to localhost for local and pseudo-distributed modes
   of operation. For a fully-distributed setup, this should be set to a full
   list of ZooKeeper quorum servers. If HBASE_MANAGES_ZK is set in hbase-env.sh
   this is the list of servers which we will start/stop ZooKeeper on.
   </description>
 </property>

in my /etc/hosts

# hostname gs2 added to /etc/hosts by anaconda
127.0.0.1   localhost localhost.localdomain localhost4 localhost4.localdomain4 gs2
::1         localhost localhost.localdomain localhost6 localhost6.localdomain6 gs2

192.168.20.101 gs1
192.168.20.102 gs2
192.168.20.103 gs3
192.168.20.104 gs4
192.168.20.105 gs5
192.168.20.106 gs6
192.168.20.107 gs7
192.168.20.108 gs8
192.168.20.110 gs10
192.168.20.111 gs11
192.168.20.112 gs12
192.168.20.113 gs13
192.168.20.114 gs14
192.168.20.115 gs15
192.168.20.116 gs16
192.168.20.117 gs17

Am I missing something here? Why does it insist on localhost being in the
quorum list? What do I need to do to unconfuse it?


Ananth T Sarathy
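
For readers hitting the same error: according to the reply quoted further down in this thread, HBase 0.20.2 and earlier compared a single locally resolved name against hbase.zookeeper.quorum. The Python sketch below shows roughly what such a check looks like from the outside. It is not HBase's code; the function names local_names and quorum_match are made up for illustration, and the quorum string is the one from the hbase-site.xml above.

```python
import socket


def local_names():
    """Return the names this machine resolves itself to (hostname, FQDN, aliases)."""
    hostname = socket.gethostname()
    names = {hostname, socket.getfqdn()}
    try:
        canonical, aliases, _addrs = socket.gethostbyname_ex(hostname)
        names.add(canonical)
        names.update(aliases)
    except OSError:
        pass  # the hostname does not resolve at all; keep what we have
    return names


def quorum_match(quorum_value, names):
    """Return the quorum entries that match one of the given local names."""
    quorum = [h.strip() for h in quorum_value.split(",") if h.strip()]
    return [h for h in quorum if h in names]


if __name__ == "__main__":
    names = local_names()
    print("local names:", sorted(names))
    print("matches in quorum:", quorum_match("gs2,gs3,gs4", names))
```

If the only name the machine reports for itself is localhost, the match list comes back empty, which is presumably the situation the "Could not find my address: localhost" message describes.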

Re: Problem With Zookeeper

Posted by "Ananth T. Sarathy" <an...@gmail.com>.
Yeah, it looks like after I fixed the servers to work with ZooKeeper, HDFS
got hosed! Restarting it fixed everything!

Thanks.
Ananth T Sarathy


On Wed, Jan 13, 2010 at 3:34 PM, Jean-Daniel Cryans <jd...@apache.org> wrote:

> Oh, I see something: it seems that the master is waiting on the file
> system in the main thread. Is HDFS running? Can you create a file?
>
> J-D

Re: Problem With Zookeeper

Posted by Jean-Daniel Cryans <jd...@apache.org>.
Oh, I see something: it seems that the master is waiting on the file
system in the main thread. Is HDFS running? Can you create a file?

J-D
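
One way to answer the "can you create a file?" question is to create and remove an empty file through the hadoop command-line client. The Python sketch below assumes the hadoop script is on the PATH; the probe path /tmp/hbase-probe and the function names are hypothetical. The timeout is there because a client talking to a stuck file system may itself hang.

```python
import subprocess


def hdfs_probe_commands(path="/tmp/hbase-probe"):
    """Build the two CLI calls: create an empty file, then remove it."""
    return [
        ["hadoop", "fs", "-touchz", path],
        ["hadoop", "fs", "-rm", path],
    ]


def hdfs_is_writable(path="/tmp/hbase-probe", timeout=60):
    """Return True if every probe command exits with status 0 in time."""
    try:
        for cmd in hdfs_probe_commands(path):
            subprocess.run(cmd, check=True, timeout=timeout)
    except (OSError, subprocess.SubprocessError):
        # Missing hadoop binary, non-zero exit status, or timeout.
        return False
    return True


if __name__ == "__main__":
    print("HDFS writable:", hdfs_is_writable())
```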

On Wed, Jan 13, 2010 at 12:27 PM, Ananth T. Sarathy
<an...@gmail.com> wrote:
> Here's what I get:
>
> http://pastebin.com/m60c1864b
>
>
> Ananth T Sarathy
>
>
> On Wed, Jan 13, 2010 at 2:57 PM, Jean-Daniel Cryans <jd...@apache.org>wrote:
>
>> Do "jps" then "jstack pid" with the master's pid given by jps.
>>
>> J-D
>>
>> On Wed, Jan 13, 2010 at 11:41 AM, Ananth T. Sarathy
>> <an...@gmail.com> wrote:
>> > Well, when I do a ps -ef|grep hbase I have 3 processes running. I have
>> > killed them all, reinstalled HBase, formatted my namenode, and still the
>> > master.log is the same when I restart. What could be causing it to hang?
>> >
>> >
>> > Ananth T Sarathy
>> >
>> >
>> > On Wed, Jan 13, 2010 at 2:26 PM, Jean-Daniel Cryans <jdcryans@apache.org> wrote:
>> >
>> >> Well it's just weird that your master would just "refuse" to start. Is
>> >> the process still there? If you jstack it, is there any thread
>> >> running?
>> >>
>> >> You could also clean up everything and retry, but that's just the easy
>> >> way out :P
>> >>
>> >> J-D
>> >>
>> >> On Wed, Jan 13, 2010 at 11:23 AM, Ananth T. Sarathy
>> >> <an...@gmail.com> wrote:
>> >> > master.out is empty... Could something have gotten kludged up from the
>> >> > previous issues? Are there files I should delete, or should I reformat
>> >> > my namenode?
>> >> >
>> >> > I don't have any data in these yet, so I can afford to blow things
>> >> > away, but I cleaned out the tmp dir already, so I am not sure what else
>> >> > I need to do.
>> >> > Ananth T Sarathy
>> >> >
>> >> >
>> >> > On Wed, Jan 13, 2010 at 2:14 PM, Jean-Daniel Cryans <jdcryans@apache.org> wrote:
>> >> >
>> >> >> If that's everything from your master log, then I would suggest you
>> >> >> take a look at the .out file (instead of .log) since it might be a
>> >> >> problem on startup.
>> >> >>
>> >> >> J-D
>> >> >>
>> >> >> On Wed, Jan 13, 2010 at 11:09 AM, Ananth T. Sarathy
>> >> >> <an...@gmail.com> wrote:
>> >> >> > Master log
>> >> >> >
>> >> >> > http://pastebin.com/m469d1b39
>> >> >> >
>> >> >> > Zookeeper log
>> >> >> > http://pastebin.com/m47f0503
>> >> >> >
>> >> >> > region server
>> >> >> >
>> >> >> > http://pastebin.com/m305fab14
>> >> >> >
>> >> >> > Ananth T Sarathy
>> >> >> >
>> >> >> >
>> >> >> > On Wed, Jan 13, 2010 at 2:02 PM, Jean-Daniel Cryans <jdcryans@apache.org> wrote:
>> >> >> >
>> >> >> >> Looks like your master didn't register itself in ZooKeeper; you
>> >> >> >> should look in its log.
>> >> >> >>
>> >> >> >> J-D
>> >> >> >>
>> >> >> >> On Wed, Jan 13, 2010 at 10:59 AM, Ananth T. Sarathy
>> >> >> >> <an...@gmail.com> wrote:
>> >> >> >> > OK, we got that to work and ZooKeeper is coming up, but now I am
>> >> >> >> > getting something else... the region servers aren't connecting
>> >> >> >> > because of
>> >> >> >> >
>> >> >> >> > 2010-01-13 13:57:56,029 WARN org.apache.hadoop.hbase.regionserver.HRegionServer: Unable to read master address from ZooKeeper. Retrying. Error was:
>> >> >> >> > java.io.IOException: org.apache.zookeeper.KeeperException$NoNodeException: KeeperErrorCode = NoNode for /hbase/master
>> >> >> >> >        at org.apache.hadoop.hbase.zookeeper.ZooKeeperWrapper.readAddressOrThrow(ZooKeeperWrapper.java:332)
>> >> >> >> >        at org.apache.hadoop.hbase.zookeeper.ZooKeeperWrapper.readMasterAddressOrThrow(ZooKeeperWrapper.java:240)
>> >> >> >> >        at org.apache.hadoop.hbase.regionserver.HRegionServer.getMaster(HRegionServer.java:1339)
>> >> >> >> >        at org.apache.hadoop.hbase.regionserver.HRegionServer.reportForDuty(HRegionServer.java:1371)
>> >> >> >> >        at org.apache.hadoop.hbase.regionserver.HRegionServer.run(HRegionServer.java:427)
>> >> >> >> >        at java.lang.Thread.run(Thread.java:636)
>> >> >> >> > Caused by: org.apache.zookeeper.KeeperException$NoNodeException: KeeperErrorCode = NoNode for /hbase/master
>> >> >> >> >        at org.apache.zookeeper.KeeperException.create(KeeperException.java:102)
>> >> >> >> >        at org.apache.zookeeper.KeeperException.create(KeeperException.java:42)
>> >> >> >> >        at org.apache.zookeeper.ZooKeeper.getData(ZooKeeper.java:892)
>> >> >> >> >        at org.apache.hadoop.hbase.zookeeper.ZooKeeperWrapper.readAddressOrThrow(ZooKeeperWrapper.java:328)
>> >> >> >> >        ... 5 more
>> >> >> >> >
>> >> >> >> >
>> >> >> >> > any ideas?
>> >> >> >> > Ananth T Sarathy
>> >> >> >> >
>> >> >> >> >
>> >> >> >> > On Wed, Jan 13, 2010 at 12:52 PM, Jean-Daniel Cryans <jdcryans@apache.org> wrote:
>> >> >> >> >
>> >> >> >> >> HBase 0.20.2 and earlier checked only one address against the
>> >> >> >> >> list that is provided: the default one that Java knew of. It
>> >> >> >> >> seems that in your case your /etc/hosts makes this machine
>> >> >> >> >> resolve itself only as localhost. You can:
>> >> >> >> >>
>> >> >> >> >> 1) Try to fix your network configuration to have your machine
>> >> >> >> >> always resolve by its hostname first, or
>> >> >> >> >>
>> >> >> >> >> 2) Use HBase 0.20.3RC1, which contains a fix that tries harder
>> >> >> >> >> to match the address. You can get it here:
>> >> >> >> >> http://people.apache.org/~jdcryans/hbase-0.20.3-candidate-1/
>> >> >> >> >>
>> >> >> >> >> Sorry for that,
>> >> >> >> >>
>> >> >> >> >> J-D
>> >> >> >> >>

Re: Problem With Zookeeper

Posted by "Ananth T. Sarathy" <an...@gmail.com>.
Here's what I get:

http://pastebin.com/m60c1864b


Ananth T Sarathy


On Wed, Jan 13, 2010 at 2:57 PM, Jean-Daniel Cryans <jd...@apache.org>wrote:

> Do "jps" then "jstack pid" with the master's pid given by jps.
>
> J-D

Re: Problem With Zookeeper

Posted by Jean-Daniel Cryans <jd...@apache.org>.
Do "jps" then "jstack pid" with the master's pid given by jps.

J-D
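
The two steps can be scripted. The Python sketch below assumes the JDK tools jps and jstack are on the PATH and that the master shows up in jps output under the class name HMaster; the function names are made up for illustration.

```python
import subprocess


def find_pids(jps_output, class_name="HMaster"):
    """Parse jps output ("<pid> <main class>" per line) and return matching pids."""
    pids = []
    for line in jps_output.splitlines():
        parts = line.split()
        if len(parts) >= 2 and parts[1] == class_name:
            pids.append(int(parts[0]))
    return pids


def dump_master_threads():
    """Run jps, then jstack on each HMaster pid, and return the dumps as text."""
    jps = subprocess.run(["jps"], capture_output=True, text=True, check=True)
    dumps = []
    for pid in find_pids(jps.stdout):
        result = subprocess.run(["jstack", str(pid)], capture_output=True, text=True)
        dumps.append(result.stdout)
    return dumps


if __name__ == "__main__":
    for dump in dump_master_threads():
        print(dump)
```

In the resulting dump, the frames of the "main" thread show what the master is waiting on.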

On Wed, Jan 13, 2010 at 11:41 AM, Ananth T. Sarathy
<an...@gmail.com> wrote:
> Well, when I do a ps -ef|grep hbase I have 3 processes running. I have killed
> them all, reinstalled HBase, formatted my namenode, and still the master.log
> is the same when I restart. What could be causing it to hang?
>
>
> Ananth T Sarathy
>
>
> On Wed, Jan 13, 2010 at 2:26 PM, Jean-Daniel Cryans <jd...@apache.org>wrote:
>
>> Well it's just weird that your master would just "refuse" to start. Is
>> the process still there? If you jstack it, is there any thread
>> running?
>>
>> You could also clean up everything and retry, but that's just the easy
>> way out :P
>>
>> J-D
>>
>> On Wed, Jan 13, 2010 at 11:23 AM, Ananth T. Sarathy
>> <an...@gmail.com> wrote:
>> > master.out is empty... Could something have gotten kludged up from the
>> > previous issues? Are there files I should delete, or should I reformat
>> > my namenode?
>> >
>> > I don't have any data in these yet, so I can afford to blow things away,
>> > but I cleaned out the tmp dir already, so I am not sure what else I need
>> > to do.
>> > Ananth T Sarathy
>> >
>> >
>> > On Wed, Jan 13, 2010 at 2:14 PM, Jean-Daniel Cryans <jdcryans@apache.org> wrote:
>> >
>> >> If that's everything from your master log, then I would suggest you
>> >> take a look at the .out file (instead of .log) since it might be a
>> >> problem on startup.
>> >>
>> >> J-D
>> >>
>> >> On Wed, Jan 13, 2010 at 11:09 AM, Ananth T. Sarathy
>> >> <an...@gmail.com> wrote:
>> >> > Master log
>> >> >
>> >> > http://pastebin.com/m469d1b39
>> >> >
>> >> > Zookeeper log
>> >> > http://pastebin.com/m47f0503
>> >> >
>> >> > region server
>> >> >
>> >> > http://pastebin.com/m305fab14
>> >> >
>> >> > Ananth T Sarathy
>> >> >
>> >> >
>> >> > On Wed, Jan 13, 2010 at 2:02 PM, Jean-Daniel Cryans <
>> jdcryans@apache.org
>> >> >wrote:
>> >> >
>> >> >> Looks like your master didn't register itself in zookeeper, you
>> should
>> >> >> look in its log.
>> >> >>
>> >> >> J-D
>> >> >>
>> >> >> On Wed, Jan 13, 2010 at 10:59 AM, Ananth T. Sarathy
>> >> >> <an...@gmail.com> wrote:
>> >> >> > ok, we got that to work and zookeeper is coming up, but now I am
>> >> getting
>> >> >> > something else... the regionserver are connecting cause  of
>> >> >> >
>> >> >> > 2010-01-13 13:57:56,029 WARN
>> >> >> > org.apache.hadoop.hbase.regionserver.HRegionServer: Unable to read
>> >> master
>> >> >> > address from ZooKeeper. Retrying. Error was:
>> >> >> > java.io.IOException:
>> >> >> org.apache.zookeeper.KeeperException$NoNodeException:
>> >> >> > KeeperErrorCode = NoNode for /hbase/master
>> >> >> >        at
>> >> >> >
>> >> >>
>> >>
>> org.apache.hadoop.hbase.zookeeper.ZooKeeperWrapper.readAddressOrThrow(ZooKeeperWrapper.java:332)
>> >> >> >        at
>> >> >> >
>> >> >>
>> >>
>> org.apache.hadoop.hbase.zookeeper.ZooKeeperWrapper.readMasterAddressOrThrow(ZooKeeperWrapper.java:240)
>> >> >> >        at
>> >> >> >
>> >> >>
>> >>
>> org.apache.hadoop.hbase.regionserver.HRegionServer.getMaster(HRegionServer.java:1339)
>> >> >> >        at
>> >> >> >
>> >> >>
>> >>
>> org.apache.hadoop.hbase.regionserver.HRegionServer.reportForDuty(HRegionServer.java:1371)
>> >> >> >        at
>> >> >> >
>> >> >>
>> >>
>> org.apache.hadoop.hbase.regionserver.HRegionServer.run(HRegionServer.java:427)
>> >> >> >        at java.lang.Thread.run(Thread.java:636)
>> >> >> > Caused by: org.apache.zookeeper.KeeperException$NoNodeException:
>> >> >> > KeeperErrorCode = NoNode for /hbase/master
>> >> >> >        at
>> >> >> >
>> org.apache.zookeeper.KeeperException.create(KeeperException.java:102)
>> >> >> >        at
>> >> >> >
>> org.apache.zookeeper.KeeperException.create(KeeperException.java:42)
>> >> >> >        at
>> org.apache.zookeeper.ZooKeeper.getData(ZooKeeper.java:892)
>> >> >> >        at
>> >> >> >
>> >> >>
>> >>
>> org.apache.hadoop.hbase.zookeeper.ZooKeeperWrapper.readAddressOrThrow(ZooKeeperWrapper.java:328)
>> >> >> >        ... 5 more
>> >> >> >
>> >> >> >
>> >> >> > any ideas?
>> >> >> > Ananth T Sarathy
>> >> >> >
>> >> >> >
>> >> >> > On Wed, Jan 13, 2010 at 12:52 PM, Jean-Daniel Cryans <
>> >> >> jdcryans@apache.org>wrote:
>> >> >> >
>> >> >> >> HBase 0.20.2 and previous only checked one address against the
>> list
>> >> >> >> that is provided, the one returned was the default Java knew of.
>> It
>> >> >> >> seems that in your case your /etc/hosts makes it that this
>> machines
>> >> >> >> resolves itself only as localhost. You can:
>> >> >> >>
>> >> >> >> 1) Try to fix your network configuration to have your machine
>> always
>> >> >> >> resolve by its hostname first, or
>> >> >> >>
>> >> >> >> 2) Use HBase 0.20.3RC1 which contains a fix that tries harder to
>> >> match
>> >> >> >> the address. You can get it here:
>> >> >> >> http://people.apache.org/~jdcryans/hbase-0.20.3-candidate-1/
>> >> >> >>
>> >> >> >> Sorry for that,
>> >> >> >>
>> >> >> >> J-D
>> >> >> >>
>> >> >> >> On Wed, Jan 13, 2010 at 9:43 AM, Ananth T. Sarathy
>> >> >> >> <an...@gmail.com> wrote:
>> >> >> >> > I have Hbase.env set to manage Zookeeper. When I try to start
>> >> hbase,
>> >> >> the
>> >> >> >> > zookeeper out says
>> >> >> >> >
>> >> >> >> > java.io.IOException: Could not find my address: localhost in
>> list
>> >> of
>> >> >> >> > ZooKeeper quorum servers
>> >> >> >> >        at
>> >> >> >> >
>> >> >> >>
>> >> >>
>> >>
>> org.apache.hadoop.hbase.zookeeper.HQuorumPeer.writeMyID(HQuorumPeer.java:128)
>> >> >> >> >        at
>> >> >> >> >
>> >> >>
>> org.apache.hadoop.hbase.zookeeper.HQuorumPeer.main(HQuorumPeer.java:67)
>> >> >> >> > ~
>> >> >> >> >
>> >> >> >> > in my hbase-site.xml
>> >> >> >> >
>> >> >> >> >  <property>
>> >> >> >> >   <name>hbase.zookeeper.quorum</name>
>> >> >> >> >   <value>gs2,gs3,gs4</value>
>> >> >> >> >   <description>Comma separated list of servers in the ZooKeeper
>> >> >> Quorum.
>> >> >> >> >   For example, "host1.mydomain.com,host2.mydomain.com,
>> >> >> host3.mydomain.com
>> >> >> >> ".
>> >> >> >> >   By default this is set to localhost for local and
>> >> pseudo-distributed
>> >> >> >> > modes
>> >> >> >> >   of operation. For a fully-distributed setup, this should be
>> set
>> >> to a
>> >> >> >> full
>> >> >> >> >   list of ZooKeeper quorum servers. If HBASE_MANAGES_ZK is set
>> in
>> >> >> >> > hbase-env.sh
>> >> >> >> >   this is the list of servers which we will start/stop ZooKeeper
>> >> on.
>> >> >> >> >   </description>
>> >> >> >> >  </property>
>> >> >> >> >
>> >> >> >> > in my /etc/hosts
>> >> >> >> >
>> >> >> >> > # hostname gs2 added to /etc/hosts by anaconda
>> >> >> >> > 127.0.0.1   localhost localhost.localdomain localhost4
>> >> >> >> > localhost4.localdomain4 gs2
>> >> >> >> > ::1         localhost localhost.localdomain localhost6
>> >> >> >> > localhost6.localdomain6 gs2
>> >> >> >> >
>> >> >> >> > 192.168.20.101 gs1
>> >> >> >> > 192.168.20.102 gs2
>> >> >> >> > 192.168.20.103 gs3
>> >> >> >> > 192.168.20.104 gs4
>> >> >> >> > 192.168.20.105 gs5
>> >> >> >> > 192.168.20.106 gs6
>> >> >> >> > 192.168.20.107 gs7
>> >> >> >> > 192.168.20.108 gs8
>> >> >> >> > 192.168.20.110 gs10
>> >> >> >> > 192.168.20.111 gs11
>> >> >> >> > 192.168.20.112 gs12
>> >> >> >> > 192.168.20.113 gs13
>> >> >> >> > 192.168.20.114 gs14
>> >> >> >> > 192.168.20.115 gs15
>> >> >> >> > 192.168.20.116 gs16
>> >> >> >> > 192.168.20.117 gs17
>> >> >> >> >
>> >> >> >> > am I missing something here? Why does it insist on localhost in
>> the
>> >> >> >> quorum
>> >> >> >> > list? What do i need to do to unconfuse it?
>> >> >> >> >
>> >> >> >> >
>> >> >> >> > Ananth T Sarathy
>> >> >> >> >
>> >> >> >>
>> >> >> >
>> >> >>
>> >> >
>> >>
>> >
>>
>

Re: Problem With Zookeeper

Posted by "Ananth T. Sarathy" <an...@gmail.com>.
Well, when I do a ps -ef|grep hbase I have 3 processes running. I have killed
them all, reinstalled HBase, formatted my namenode, and still the master.log
is the same when I restart. What could be causing it to hang?


Ananth T Sarathy


On Wed, Jan 13, 2010 at 2:26 PM, Jean-Daniel Cryans <jd...@apache.org> wrote:

> Well it's just weird that your master would just "refuse" to start. Is
> the process still there? If you jstack it, is there any thread
> running?
>
> You could also clean up everything and retry, but that's just the easy
> way out :P
>
> J-D
>
> On Wed, Jan 13, 2010 at 11:23 AM, Ananth T. Sarathy
> <an...@gmail.com> wrote:
> > master. out is empty.... could something have cludged up from the
> previous
> > issues? Are there files I should delete/ reformat my namenode?
> >
> > I don't have any data yet in these, so I can afford to blow things away,
> but
> > I cleaned out the tmp dir already so I am not sure what else i need to
> do.
> > Ananth T Sarathy
> >
> >
> > On Wed, Jan 13, 2010 at 2:14 PM, Jean-Daniel Cryans <jdcryans@apache.org
> >wrote:
> >
> >> If that's everything from your master log, then I would suggest you
> >> take a look at the .out file (instead of .log) since it might be a
> >> problem on startup.
> >>
> >> J-D
> >>
> >> On Wed, Jan 13, 2010 at 11:09 AM, Ananth T. Sarathy
> >> <an...@gmail.com> wrote:
> >> > Master log
> >> >
> >> > http://pastebin.com/m469d1b39
> >> >
> >> > Zookeeper log
> >> > http://pastebin.com/m47f0503
> >> >
> >> > region server
> >> >
> >> > http://pastebin.com/m305fab14
> >> >
> >> > Ananth T Sarathy
> >> >
> >> >
> >> > On Wed, Jan 13, 2010 at 2:02 PM, Jean-Daniel Cryans <
> jdcryans@apache.org
> >> >wrote:
> >> >
> >> >> Looks like your master didn't register itself in zookeeper, you
> should
> >> >> look in its log.
> >> >>
> >> >> J-D
> >> >>
> >> >> On Wed, Jan 13, 2010 at 10:59 AM, Ananth T. Sarathy
> >> >> <an...@gmail.com> wrote:
> >> >> > ok, we got that to work and zookeeper is coming up, but now I am
> >> getting
> >> >> > something else... the regionserver are connecting cause  of
> >> >> >
> >> >> > 2010-01-13 13:57:56,029 WARN
> >> >> > org.apache.hadoop.hbase.regionserver.HRegionServer: Unable to read
> >> master
> >> >> > address from ZooKeeper. Retrying. Error was:
> >> >> > java.io.IOException:
> >> >> org.apache.zookeeper.KeeperException$NoNodeException:
> >> >> > KeeperErrorCode = NoNode for /hbase/master
> >> >> >        at
> >> >> >
> >> >>
> >>
> org.apache.hadoop.hbase.zookeeper.ZooKeeperWrapper.readAddressOrThrow(ZooKeeperWrapper.java:332)
> >> >> >        at
> >> >> >
> >> >>
> >>
> org.apache.hadoop.hbase.zookeeper.ZooKeeperWrapper.readMasterAddressOrThrow(ZooKeeperWrapper.java:240)
> >> >> >        at
> >> >> >
> >> >>
> >>
> org.apache.hadoop.hbase.regionserver.HRegionServer.getMaster(HRegionServer.java:1339)
> >> >> >        at
> >> >> >
> >> >>
> >>
> org.apache.hadoop.hbase.regionserver.HRegionServer.reportForDuty(HRegionServer.java:1371)
> >> >> >        at
> >> >> >
> >> >>
> >>
> org.apache.hadoop.hbase.regionserver.HRegionServer.run(HRegionServer.java:427)
> >> >> >        at java.lang.Thread.run(Thread.java:636)
> >> >> > Caused by: org.apache.zookeeper.KeeperException$NoNodeException:
> >> >> > KeeperErrorCode = NoNode for /hbase/master
> >> >> >        at
> >> >> >
> org.apache.zookeeper.KeeperException.create(KeeperException.java:102)
> >> >> >        at
> >> >> >
> org.apache.zookeeper.KeeperException.create(KeeperException.java:42)
> >> >> >        at
> org.apache.zookeeper.ZooKeeper.getData(ZooKeeper.java:892)
> >> >> >        at
> >> >> >
> >> >>
> >>
> org.apache.hadoop.hbase.zookeeper.ZooKeeperWrapper.readAddressOrThrow(ZooKeeperWrapper.java:328)
> >> >> >        ... 5 more
> >> >> >
> >> >> >
> >> >> > any ideas?
> >> >> > Ananth T Sarathy
> >> >> >
> >> >> >
> >> >> > On Wed, Jan 13, 2010 at 12:52 PM, Jean-Daniel Cryans <
> >> >> jdcryans@apache.org>wrote:
> >> >> >
> >> >> >> HBase 0.20.2 and previous only checked one address against the
> list
> >> >> >> that is provided, the one returned was the default Java knew of.
> It
> >> >> >> seems that in your case your /etc/hosts makes it that this
> machines
> >> >> >> resolves itself only as localhost. You can:
> >> >> >>
> >> >> >> 1) Try to fix your network configuration to have your machine
> always
> >> >> >> resolve by its hostname first, or
> >> >> >>
> >> >> >> 2) Use HBase 0.20.3RC1 which contains a fix that tries harder to
> >> match
> >> >> >> the address. You can get it here:
> >> >> >> http://people.apache.org/~jdcryans/hbase-0.20.3-candidate-1/
> >> >> >>
> >> >> >> Sorry for that,
> >> >> >>
> >> >> >> J-D
> >> >> >>
> >> >> >> On Wed, Jan 13, 2010 at 9:43 AM, Ananth T. Sarathy
> >> >> >> <an...@gmail.com> wrote:
> >> >> >> > I have Hbase.env set to manage Zookeeper. When I try to start
> >> hbase,
> >> >> the
> >> >> >> > zookeeper out says
> >> >> >> >
> >> >> >> > java.io.IOException: Could not find my address: localhost in
> list
> >> of
> >> >> >> > ZooKeeper quorum servers
> >> >> >> >        at
> >> >> >> >
> >> >> >>
> >> >>
> >>
> org.apache.hadoop.hbase.zookeeper.HQuorumPeer.writeMyID(HQuorumPeer.java:128)
> >> >> >> >        at
> >> >> >> >
> >> >>
> org.apache.hadoop.hbase.zookeeper.HQuorumPeer.main(HQuorumPeer.java:67)
> >> >> >> > ~
> >> >> >> >
> >> >> >> > in my hbase-site.xml
> >> >> >> >
> >> >> >> >  <property>
> >> >> >> >   <name>hbase.zookeeper.quorum</name>
> >> >> >> >   <value>gs2,gs3,gs4</value>
> >> >> >> >   <description>Comma separated list of servers in the ZooKeeper
> >> >> Quorum.
> >> >> >> >   For example, "host1.mydomain.com,host2.mydomain.com,
> >> >> host3.mydomain.com
> >> >> >> ".
> >> >> >> >   By default this is set to localhost for local and
> >> pseudo-distributed
> >> >> >> > modes
> >> >> >> >   of operation. For a fully-distributed setup, this should be
> set
> >> to a
> >> >> >> full
> >> >> >> >   list of ZooKeeper quorum servers. If HBASE_MANAGES_ZK is set
> in
> >> >> >> > hbase-env.sh
> >> >> >> >   this is the list of servers which we will start/stop ZooKeeper
> >> on.
> >> >> >> >   </description>
> >> >> >> >  </property>
> >> >> >> >
> >> >> >> > in my /etc/hosts
> >> >> >> >
> >> >> >> > # hostname gs2 added to /etc/hosts by anaconda
> >> >> >> > 127.0.0.1   localhost localhost.localdomain localhost4
> >> >> >> > localhost4.localdomain4 gs2
> >> >> >> > ::1         localhost localhost.localdomain localhost6
> >> >> >> > localhost6.localdomain6 gs2
> >> >> >> >
> >> >> >> > 192.168.20.101 gs1
> >> >> >> > 192.168.20.102 gs2
> >> >> >> > 192.168.20.103 gs3
> >> >> >> > 192.168.20.104 gs4
> >> >> >> > 192.168.20.105 gs5
> >> >> >> > 192.168.20.106 gs6
> >> >> >> > 192.168.20.107 gs7
> >> >> >> > 192.168.20.108 gs8
> >> >> >> > 192.168.20.110 gs10
> >> >> >> > 192.168.20.111 gs11
> >> >> >> > 192.168.20.112 gs12
> >> >> >> > 192.168.20.113 gs13
> >> >> >> > 192.168.20.114 gs14
> >> >> >> > 192.168.20.115 gs15
> >> >> >> > 192.168.20.116 gs16
> >> >> >> > 192.168.20.117 gs17
> >> >> >> >
> >> >> >> > am I missing something here? Why does it insist on localhost in
> the
> >> >> >> quorum
> >> >> >> > list? What do i need to do to unconfuse it?
> >> >> >> >
> >> >> >> >
> >> >> >> > Ananth T Sarathy
> >> >> >> >
> >> >> >>
> >> >> >
> >> >>
> >> >
> >>
> >
>

Re: Problem With Zookeeper

Posted by Jean-Daniel Cryans <jd...@apache.org>.
Well, it's just weird that your master would just "refuse" to start. Is
the process still there? If you jstack it, are there any threads
running?

You could also clean up everything and retry, but that's just the easy
way out :P

J-D

On Wed, Jan 13, 2010 at 11:23 AM, Ananth T. Sarathy
<an...@gmail.com> wrote:
> master.out is empty... Could something have been left broken by the previous
> issues? Are there files I should delete, or should I reformat my namenode?
>
> I don't have any data in these yet, so I can afford to blow things away, but
> I cleaned out the tmp dir already, so I am not sure what else I need to do.
> Ananth T Sarathy
>
>
> On Wed, Jan 13, 2010 at 2:14 PM, Jean-Daniel Cryans <jd...@apache.org> wrote:
>
>> If that's everything from your master log, then I would suggest you
>> take a look at the .out file (instead of .log) since it might be a
>> problem on startup.
>>
>> J-D
>>
>> On Wed, Jan 13, 2010 at 11:09 AM, Ananth T. Sarathy
>> <an...@gmail.com> wrote:
>> > Master log
>> >
>> > http://pastebin.com/m469d1b39
>> >
>> > Zookeeper log
>> > http://pastebin.com/m47f0503
>> >
>> > region server
>> >
>> > http://pastebin.com/m305fab14
>> >
>> > Ananth T Sarathy
>> >
>> >
>> > On Wed, Jan 13, 2010 at 2:02 PM, Jean-Daniel Cryans <jdcryans@apache.org
>> >wrote:
>> >
>> >> Looks like your master didn't register itself in zookeeper, you should
>> >> look in its log.
>> >>
>> >> J-D
>> >>
>> >> On Wed, Jan 13, 2010 at 10:59 AM, Ananth T. Sarathy
>> >> <an...@gmail.com> wrote:
>> >> > ok, we got that to work and zookeeper is coming up, but now I am
>> getting
>> >> > something else... the regionserver are connecting cause  of
>> >> >
>> >> > 2010-01-13 13:57:56,029 WARN
>> >> > org.apache.hadoop.hbase.regionserver.HRegionServer: Unable to read
>> master
>> >> > address from ZooKeeper. Retrying. Error was:
>> >> > java.io.IOException:
>> >> org.apache.zookeeper.KeeperException$NoNodeException:
>> >> > KeeperErrorCode = NoNode for /hbase/master
>> >> >        at
>> >> >
>> >>
>> org.apache.hadoop.hbase.zookeeper.ZooKeeperWrapper.readAddressOrThrow(ZooKeeperWrapper.java:332)
>> >> >        at
>> >> >
>> >>
>> org.apache.hadoop.hbase.zookeeper.ZooKeeperWrapper.readMasterAddressOrThrow(ZooKeeperWrapper.java:240)
>> >> >        at
>> >> >
>> >>
>> org.apache.hadoop.hbase.regionserver.HRegionServer.getMaster(HRegionServer.java:1339)
>> >> >        at
>> >> >
>> >>
>> org.apache.hadoop.hbase.regionserver.HRegionServer.reportForDuty(HRegionServer.java:1371)
>> >> >        at
>> >> >
>> >>
>> org.apache.hadoop.hbase.regionserver.HRegionServer.run(HRegionServer.java:427)
>> >> >        at java.lang.Thread.run(Thread.java:636)
>> >> > Caused by: org.apache.zookeeper.KeeperException$NoNodeException:
>> >> > KeeperErrorCode = NoNode for /hbase/master
>> >> >        at
>> >> > org.apache.zookeeper.KeeperException.create(KeeperException.java:102)
>> >> >        at
>> >> > org.apache.zookeeper.KeeperException.create(KeeperException.java:42)
>> >> >        at org.apache.zookeeper.ZooKeeper.getData(ZooKeeper.java:892)
>> >> >        at
>> >> >
>> >>
>> org.apache.hadoop.hbase.zookeeper.ZooKeeperWrapper.readAddressOrThrow(ZooKeeperWrapper.java:328)
>> >> >        ... 5 more
>> >> >
>> >> >
>> >> > any ideas?
>> >> > Ananth T Sarathy
>> >> >
>> >> >
>> >> > On Wed, Jan 13, 2010 at 12:52 PM, Jean-Daniel Cryans <
>> >> jdcryans@apache.org>wrote:
>> >> >
>> >> >> HBase 0.20.2 and previous only checked one address against the list
>> >> >> that is provided, the one returned was the default Java knew of. It
>> >> >> seems that in your case your /etc/hosts makes it that this machines
>> >> >> resolves itself only as localhost. You can:
>> >> >>
>> >> >> 1) Try to fix your network configuration to have your machine always
>> >> >> resolve by its hostname first, or
>> >> >>
>> >> >> 2) Use HBase 0.20.3RC1 which contains a fix that tries harder to
>> match
>> >> >> the address. You can get it here:
>> >> >> http://people.apache.org/~jdcryans/hbase-0.20.3-candidate-1/
>> >> >>
>> >> >> Sorry for that,
>> >> >>
>> >> >> J-D
>> >> >>
>> >> >> On Wed, Jan 13, 2010 at 9:43 AM, Ananth T. Sarathy
>> >> >> <an...@gmail.com> wrote:
>> >> >> > I have Hbase.env set to manage Zookeeper. When I try to start
>> hbase,
>> >> the
>> >> >> > zookeeper out says
>> >> >> >
>> >> >> > java.io.IOException: Could not find my address: localhost in list
>> of
>> >> >> > ZooKeeper quorum servers
>> >> >> >        at
>> >> >> >
>> >> >>
>> >>
>> org.apache.hadoop.hbase.zookeeper.HQuorumPeer.writeMyID(HQuorumPeer.java:128)
>> >> >> >        at
>> >> >> >
>> >> org.apache.hadoop.hbase.zookeeper.HQuorumPeer.main(HQuorumPeer.java:67)
>> >> >> > ~
>> >> >> >
>> >> >> > in my hbase-site.xml
>> >> >> >
>> >> >> >  <property>
>> >> >> >   <name>hbase.zookeeper.quorum</name>
>> >> >> >   <value>gs2,gs3,gs4</value>
>> >> >> >   <description>Comma separated list of servers in the ZooKeeper
>> >> Quorum.
>> >> >> >   For example, "host1.mydomain.com,host2.mydomain.com,
>> >> host3.mydomain.com
>> >> >> ".
>> >> >> >   By default this is set to localhost for local and
>> pseudo-distributed
>> >> >> > modes
>> >> >> >   of operation. For a fully-distributed setup, this should be set
>> to a
>> >> >> full
>> >> >> >   list of ZooKeeper quorum servers. If HBASE_MANAGES_ZK is set in
>> >> >> > hbase-env.sh
>> >> >> >   this is the list of servers which we will start/stop ZooKeeper
>> on.
>> >> >> >   </description>
>> >> >> >  </property>
>> >> >> >
>> >> >> > in my /etc/hosts
>> >> >> >
>> >> >> > # hostname gs2 added to /etc/hosts by anaconda
>> >> >> > 127.0.0.1   localhost localhost.localdomain localhost4
>> >> >> > localhost4.localdomain4 gs2
>> >> >> > ::1         localhost localhost.localdomain localhost6
>> >> >> > localhost6.localdomain6 gs2
>> >> >> >
>> >> >> > 192.168.20.101 gs1
>> >> >> > 192.168.20.102 gs2
>> >> >> > 192.168.20.103 gs3
>> >> >> > 192.168.20.104 gs4
>> >> >> > 192.168.20.105 gs5
>> >> >> > 192.168.20.106 gs6
>> >> >> > 192.168.20.107 gs7
>> >> >> > 192.168.20.108 gs8
>> >> >> > 192.168.20.110 gs10
>> >> >> > 192.168.20.111 gs11
>> >> >> > 192.168.20.112 gs12
>> >> >> > 192.168.20.113 gs13
>> >> >> > 192.168.20.114 gs14
>> >> >> > 192.168.20.115 gs15
>> >> >> > 192.168.20.116 gs16
>> >> >> > 192.168.20.117 gs17
>> >> >> >
>> >> >> > am I missing something here? Why does it insist on localhost in the
>> >> >> quorum
>> >> >> > list? What do i need to do to unconfuse it?
>> >> >> >
>> >> >> >
>> >> >> > Ananth T Sarathy
>> >> >> >
>> >> >>
>> >> >
>> >>
>> >
>>
>

Re: Problem With Zookeeper

Posted by "Ananth T. Sarathy" <an...@gmail.com>.
master.out is empty... Could something have been left broken by the previous
issues? Are there files I should delete, or should I reformat my namenode?

I don't have any data in these yet, so I can afford to blow things away, but
I cleaned out the tmp dir already, so I am not sure what else I need to do.
Ananth T Sarathy


On Wed, Jan 13, 2010 at 2:14 PM, Jean-Daniel Cryans <jd...@apache.org> wrote:

> If that's everything from your master log, then I would suggest you
> take a look at the .out file (instead of .log) since it might be a
> problem on startup.
>
> J-D
>
> On Wed, Jan 13, 2010 at 11:09 AM, Ananth T. Sarathy
> <an...@gmail.com> wrote:
> > Master log
> >
> > http://pastebin.com/m469d1b39
> >
> > Zookeeper log
> > http://pastebin.com/m47f0503
> >
> > region server
> >
> > http://pastebin.com/m305fab14
> >
> > Ananth T Sarathy
> >
> >
> > On Wed, Jan 13, 2010 at 2:02 PM, Jean-Daniel Cryans <jdcryans@apache.org
> >wrote:
> >
> >> Looks like your master didn't register itself in zookeeper, you should
> >> look in its log.
> >>
> >> J-D
> >>
> >> On Wed, Jan 13, 2010 at 10:59 AM, Ananth T. Sarathy
> >> <an...@gmail.com> wrote:
> >> > ok, we got that to work and zookeeper is coming up, but now I am
> getting
> >> > something else... the regionserver are connecting cause  of
> >> >
> >> > 2010-01-13 13:57:56,029 WARN
> >> > org.apache.hadoop.hbase.regionserver.HRegionServer: Unable to read
> master
> >> > address from ZooKeeper. Retrying. Error was:
> >> > java.io.IOException:
> >> org.apache.zookeeper.KeeperException$NoNodeException:
> >> > KeeperErrorCode = NoNode for /hbase/master
> >> >        at
> >> >
> >>
> org.apache.hadoop.hbase.zookeeper.ZooKeeperWrapper.readAddressOrThrow(ZooKeeperWrapper.java:332)
> >> >        at
> >> >
> >>
> org.apache.hadoop.hbase.zookeeper.ZooKeeperWrapper.readMasterAddressOrThrow(ZooKeeperWrapper.java:240)
> >> >        at
> >> >
> >>
> org.apache.hadoop.hbase.regionserver.HRegionServer.getMaster(HRegionServer.java:1339)
> >> >        at
> >> >
> >>
> org.apache.hadoop.hbase.regionserver.HRegionServer.reportForDuty(HRegionServer.java:1371)
> >> >        at
> >> >
> >>
> org.apache.hadoop.hbase.regionserver.HRegionServer.run(HRegionServer.java:427)
> >> >        at java.lang.Thread.run(Thread.java:636)
> >> > Caused by: org.apache.zookeeper.KeeperException$NoNodeException:
> >> > KeeperErrorCode = NoNode for /hbase/master
> >> >        at
> >> > org.apache.zookeeper.KeeperException.create(KeeperException.java:102)
> >> >        at
> >> > org.apache.zookeeper.KeeperException.create(KeeperException.java:42)
> >> >        at org.apache.zookeeper.ZooKeeper.getData(ZooKeeper.java:892)
> >> >        at
> >> >
> >>
> org.apache.hadoop.hbase.zookeeper.ZooKeeperWrapper.readAddressOrThrow(ZooKeeperWrapper.java:328)
> >> >        ... 5 more
> >> >
> >> >
> >> > any ideas?
> >> > Ananth T Sarathy
> >> >
> >> >
> >> > On Wed, Jan 13, 2010 at 12:52 PM, Jean-Daniel Cryans <
> >> jdcryans@apache.org>wrote:
> >> >
> >> >> HBase 0.20.2 and previous only checked one address against the list
> >> >> that is provided, the one returned was the default Java knew of. It
> >> >> seems that in your case your /etc/hosts makes it that this machines
> >> >> resolves itself only as localhost. You can:
> >> >>
> >> >> 1) Try to fix your network configuration to have your machine always
> >> >> resolve by its hostname first, or
> >> >>
> >> >> 2) Use HBase 0.20.3RC1 which contains a fix that tries harder to
> match
> >> >> the address. You can get it here:
> >> >> http://people.apache.org/~jdcryans/hbase-0.20.3-candidate-1/
> >> >>
> >> >> Sorry for that,
> >> >>
> >> >> J-D
> >> >>
> >> >> On Wed, Jan 13, 2010 at 9:43 AM, Ananth T. Sarathy
> >> >> <an...@gmail.com> wrote:
> >> >> > I have Hbase.env set to manage Zookeeper. When I try to start
> hbase,
> >> the
> >> >> > zookeeper out says
> >> >> >
> >> >> > java.io.IOException: Could not find my address: localhost in list
> of
> >> >> > ZooKeeper quorum servers
> >> >> >        at
> >> >> >
> >> >>
> >>
> org.apache.hadoop.hbase.zookeeper.HQuorumPeer.writeMyID(HQuorumPeer.java:128)
> >> >> >        at
> >> >> >
> >> org.apache.hadoop.hbase.zookeeper.HQuorumPeer.main(HQuorumPeer.java:67)
> >> >> > ~
> >> >> >
> >> >> > in my hbase-site.xml
> >> >> >
> >> >> >  <property>
> >> >> >   <name>hbase.zookeeper.quorum</name>
> >> >> >   <value>gs2,gs3,gs4</value>
> >> >> >   <description>Comma separated list of servers in the ZooKeeper
> >> Quorum.
> >> >> >   For example, "host1.mydomain.com,host2.mydomain.com,
> >> host3.mydomain.com
> >> >> ".
> >> >> >   By default this is set to localhost for local and
> pseudo-distributed
> >> >> > modes
> >> >> >   of operation. For a fully-distributed setup, this should be set
> to a
> >> >> full
> >> >> >   list of ZooKeeper quorum servers. If HBASE_MANAGES_ZK is set in
> >> >> > hbase-env.sh
> >> >> >   this is the list of servers which we will start/stop ZooKeeper
> on.
> >> >> >   </description>
> >> >> >  </property>
> >> >> >
> >> >> > in my /etc/hosts
> >> >> >
> >> >> > # hostname gs2 added to /etc/hosts by anaconda
> >> >> > 127.0.0.1   localhost localhost.localdomain localhost4
> >> >> > localhost4.localdomain4 gs2
> >> >> > ::1         localhost localhost.localdomain localhost6
> >> >> > localhost6.localdomain6 gs2
> >> >> >
> >> >> > 192.168.20.101 gs1
> >> >> > 192.168.20.102 gs2
> >> >> > 192.168.20.103 gs3
> >> >> > 192.168.20.104 gs4
> >> >> > 192.168.20.105 gs5
> >> >> > 192.168.20.106 gs6
> >> >> > 192.168.20.107 gs7
> >> >> > 192.168.20.108 gs8
> >> >> > 192.168.20.110 gs10
> >> >> > 192.168.20.111 gs11
> >> >> > 192.168.20.112 gs12
> >> >> > 192.168.20.113 gs13
> >> >> > 192.168.20.114 gs14
> >> >> > 192.168.20.115 gs15
> >> >> > 192.168.20.116 gs16
> >> >> > 192.168.20.117 gs17
> >> >> >
> >> >> > am I missing something here? Why does it insist on localhost in the
> >> >> quorum
> >> >> > list? What do i need to do to unconfuse it?
> >> >> >
> >> >> >
> >> >> > Ananth T Sarathy
> >> >> >
> >> >>
> >> >
> >>
> >
>

Re: Problem With Zookeeper

Posted by Jean-Daniel Cryans <jd...@apache.org>.
If that's everything from your master log, then I would suggest you
take a look at the .out file (instead of .log) since it might be a
problem on startup.
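
Something like the following could be used to check them, as a sketch only.
It assumes the .out files sit in HBase's logs directory and carry "master" in
their names; the path in the usage comment and the show_master_out helper
name are illustrative assumptions.

```shell
# Hypothetical helper: print the last lines of every master .out file
# found in the directory given as $1.
# The "*master*.out" pattern is an assumption about how the files are named.
show_master_out() {
  for f in "$1"/*master*.out; do
    [ -e "$f" ] || continue   # the glob matched nothing
    echo "== $f"
    tail -n 50 "$f"
  done
}

# Assumed usage (adjust the path to wherever the logs actually live):
#   show_master_out "$HBASE_HOME/logs"
```

An empty result means either no matching file exists or the files are empty,
so it is worth listing the directory as well.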

J-D

On Wed, Jan 13, 2010 at 11:09 AM, Ananth T. Sarathy
<an...@gmail.com> wrote:
> Master log
>
> http://pastebin.com/m469d1b39
>
> Zookeeper log
> http://pastebin.com/m47f0503
>
> region server
>
> http://pastebin.com/m305fab14
>
> Ananth T Sarathy
>
>
> On Wed, Jan 13, 2010 at 2:02 PM, Jean-Daniel Cryans <jd...@apache.org> wrote:
>
>> Looks like your master didn't register itself in zookeeper, you should
>> look in its log.
>>
>> J-D
>>
>> On Wed, Jan 13, 2010 at 10:59 AM, Ananth T. Sarathy
>> <an...@gmail.com> wrote:
>> > ok, we got that to work and zookeeper is coming up, but now I am getting
>> > something else... the regionserver are connecting cause  of
>> >
>> > 2010-01-13 13:57:56,029 WARN
>> > org.apache.hadoop.hbase.regionserver.HRegionServer: Unable to read master
>> > address from ZooKeeper. Retrying. Error was:
>> > java.io.IOException:
>> org.apache.zookeeper.KeeperException$NoNodeException:
>> > KeeperErrorCode = NoNode for /hbase/master
>> >        at
>> >
>> org.apache.hadoop.hbase.zookeeper.ZooKeeperWrapper.readAddressOrThrow(ZooKeeperWrapper.java:332)
>> >        at
>> >
>> org.apache.hadoop.hbase.zookeeper.ZooKeeperWrapper.readMasterAddressOrThrow(ZooKeeperWrapper.java:240)
>> >        at
>> >
>> org.apache.hadoop.hbase.regionserver.HRegionServer.getMaster(HRegionServer.java:1339)
>> >        at
>> >
>> org.apache.hadoop.hbase.regionserver.HRegionServer.reportForDuty(HRegionServer.java:1371)
>> >        at
>> >
>> org.apache.hadoop.hbase.regionserver.HRegionServer.run(HRegionServer.java:427)
>> >        at java.lang.Thread.run(Thread.java:636)
>> > Caused by: org.apache.zookeeper.KeeperException$NoNodeException:
>> > KeeperErrorCode = NoNode for /hbase/master
>> >        at
>> > org.apache.zookeeper.KeeperException.create(KeeperException.java:102)
>> >        at
>> > org.apache.zookeeper.KeeperException.create(KeeperException.java:42)
>> >        at org.apache.zookeeper.ZooKeeper.getData(ZooKeeper.java:892)
>> >        at
>> >
>> org.apache.hadoop.hbase.zookeeper.ZooKeeperWrapper.readAddressOrThrow(ZooKeeperWrapper.java:328)
>> >        ... 5 more
>> >
>> >
>> > any ideas?
>> > Ananth T Sarathy
>> >
>> >
>> > On Wed, Jan 13, 2010 at 12:52 PM, Jean-Daniel Cryans <
>> jdcryans@apache.org>wrote:
>> >
>> >> HBase 0.20.2 and previous only checked one address against the list
>> >> that is provided, the one returned was the default Java knew of. It
>> >> seems that in your case your /etc/hosts makes it that this machines
>> >> resolves itself only as localhost. You can:
>> >>
>> >> 1) Try to fix your network configuration to have your machine always
>> >> resolve by its hostname first, or
>> >>
>> >> 2) Use HBase 0.20.3RC1 which contains a fix that tries harder to match
>> >> the address. You can get it here:
>> >> http://people.apache.org/~jdcryans/hbase-0.20.3-candidate-1/
>> >>
>> >> Sorry for that,
>> >>
>> >> J-D
>> >>
>> >> On Wed, Jan 13, 2010 at 9:43 AM, Ananth T. Sarathy
>> >> <an...@gmail.com> wrote:
>> >> > I have Hbase.env set to manage Zookeeper. When I try to start hbase,
>> the
>> >> > zookeeper out says
>> >> >
>> >> > java.io.IOException: Could not find my address: localhost in list of
>> >> > ZooKeeper quorum servers
>> >> >        at
>> >> >
>> >>
>> org.apache.hadoop.hbase.zookeeper.HQuorumPeer.writeMyID(HQuorumPeer.java:128)
>> >> >        at
>> >> >
>> org.apache.hadoop.hbase.zookeeper.HQuorumPeer.main(HQuorumPeer.java:67)
>> >> > ~
>> >> >
>> >> > in my hbase-site.xml
>> >> >
>> >> >  <property>
>> >> >   <name>hbase.zookeeper.quorum</name>
>> >> >   <value>gs2,gs3,gs4</value>
>> >> >   <description>Comma separated list of servers in the ZooKeeper
>> Quorum.
>> >> >   For example, "host1.mydomain.com,host2.mydomain.com,
>> host3.mydomain.com
>> >> ".
>> >> >   By default this is set to localhost for local and pseudo-distributed
>> >> > modes
>> >> >   of operation. For a fully-distributed setup, this should be set to a
>> >> full
>> >> >   list of ZooKeeper quorum servers. If HBASE_MANAGES_ZK is set in
>> >> > hbase-env.sh
>> >> >   this is the list of servers which we will start/stop ZooKeeper on.
>> >> >   </description>
>> >> >  </property>
>> >> >
>> >> > in my /etc/hosts
>> >> >
>> >> > # hostname gs2 added to /etc/hosts by anaconda
>> >> > 127.0.0.1   localhost localhost.localdomain localhost4
>> >> > localhost4.localdomain4 gs2
>> >> > ::1         localhost localhost.localdomain localhost6
>> >> > localhost6.localdomain6 gs2
>> >> >
>> >> > 192.168.20.101 gs1
>> >> > 192.168.20.102 gs2
>> >> > 192.168.20.103 gs3
>> >> > 192.168.20.104 gs4
>> >> > 192.168.20.105 gs5
>> >> > 192.168.20.106 gs6
>> >> > 192.168.20.107 gs7
>> >> > 192.168.20.108 gs8
>> >> > 192.168.20.110 gs10
>> >> > 192.168.20.111 gs11
>> >> > 192.168.20.112 gs12
>> >> > 192.168.20.113 gs13
>> >> > 192.168.20.114 gs14
>> >> > 192.168.20.115 gs15
>> >> > 192.168.20.116 gs16
>> >> > 192.168.20.117 gs17
>> >> >
>> >> > am I missing something here? Why does it insist on localhost in the
>> >> quorum
>> >> > list? What do i need to do to unconfuse it?
>> >> >
>> >> >
>> >> > Ananth T Sarathy
>> >> >
>> >>
>> >
>>
>

Re: Problem With Zookeeper

Posted by "Ananth T. Sarathy" <an...@gmail.com>.
Master log

http://pastebin.com/m469d1b39

Zookeeper log
http://pastebin.com/m47f0503

region server

http://pastebin.com/m305fab14

Ananth T Sarathy


On Wed, Jan 13, 2010 at 2:02 PM, Jean-Daniel Cryans <jd...@apache.org> wrote:

> Looks like your master didn't register itself in zookeeper, you should
> look in its log.
>
> J-D

Re: Problem With Zookeeper

Posted by Jean-Daniel Cryans <jd...@apache.org>.
It looks like your master didn't register itself in ZooKeeper. You should
look in the master's log to see why.

J-D

On Wed, Jan 13, 2010 at 10:59 AM, Ananth T. Sarathy
<an...@gmail.com> wrote:
> OK, we got that to work and ZooKeeper is coming up, but now I am getting
> something else: the region servers aren't connecting because of
>
> 2010-01-13 13:57:56,029 WARN org.apache.hadoop.hbase.regionserver.HRegionServer: Unable to read master address from ZooKeeper. Retrying. Error was:
> java.io.IOException: org.apache.zookeeper.KeeperException$NoNodeException: KeeperErrorCode = NoNode for /hbase/master
>        at org.apache.hadoop.hbase.zookeeper.ZooKeeperWrapper.readAddressOrThrow(ZooKeeperWrapper.java:332)
>        at org.apache.hadoop.hbase.zookeeper.ZooKeeperWrapper.readMasterAddressOrThrow(ZooKeeperWrapper.java:240)
>        at org.apache.hadoop.hbase.regionserver.HRegionServer.getMaster(HRegionServer.java:1339)
>        at org.apache.hadoop.hbase.regionserver.HRegionServer.reportForDuty(HRegionServer.java:1371)
>        at org.apache.hadoop.hbase.regionserver.HRegionServer.run(HRegionServer.java:427)
>        at java.lang.Thread.run(Thread.java:636)
> Caused by: org.apache.zookeeper.KeeperException$NoNodeException: KeeperErrorCode = NoNode for /hbase/master
>        at org.apache.zookeeper.KeeperException.create(KeeperException.java:102)
>        at org.apache.zookeeper.KeeperException.create(KeeperException.java:42)
>        at org.apache.zookeeper.ZooKeeper.getData(ZooKeeper.java:892)
>        at org.apache.hadoop.hbase.zookeeper.ZooKeeperWrapper.readAddressOrThrow(ZooKeeperWrapper.java:328)
>        ... 5 more
>
>
> any ideas?
> Ananth T Sarathy

Re: Problem With Zookeeper

Posted by "Ananth T. Sarathy" <an...@gmail.com>.
OK, we got that to work and ZooKeeper is coming up, but now I am getting
something else: the region servers aren't connecting because of

2010-01-13 13:57:56,029 WARN org.apache.hadoop.hbase.regionserver.HRegionServer: Unable to read master address from ZooKeeper. Retrying. Error was:
java.io.IOException: org.apache.zookeeper.KeeperException$NoNodeException: KeeperErrorCode = NoNode for /hbase/master
        at org.apache.hadoop.hbase.zookeeper.ZooKeeperWrapper.readAddressOrThrow(ZooKeeperWrapper.java:332)
        at org.apache.hadoop.hbase.zookeeper.ZooKeeperWrapper.readMasterAddressOrThrow(ZooKeeperWrapper.java:240)
        at org.apache.hadoop.hbase.regionserver.HRegionServer.getMaster(HRegionServer.java:1339)
        at org.apache.hadoop.hbase.regionserver.HRegionServer.reportForDuty(HRegionServer.java:1371)
        at org.apache.hadoop.hbase.regionserver.HRegionServer.run(HRegionServer.java:427)
        at java.lang.Thread.run(Thread.java:636)
Caused by: org.apache.zookeeper.KeeperException$NoNodeException: KeeperErrorCode = NoNode for /hbase/master
        at org.apache.zookeeper.KeeperException.create(KeeperException.java:102)
        at org.apache.zookeeper.KeeperException.create(KeeperException.java:42)
        at org.apache.zookeeper.ZooKeeper.getData(ZooKeeper.java:892)
        at org.apache.hadoop.hbase.zookeeper.ZooKeeperWrapper.readAddressOrThrow(ZooKeeperWrapper.java:328)
        ... 5 more


Any ideas?
Ananth T Sarathy


On Wed, Jan 13, 2010 at 12:52 PM, Jean-Daniel Cryans <jd...@apache.org> wrote:

> HBase 0.20.2 and previous only checked one address against the list
> that is provided, the one returned was the default Java knew of. It
> seems that in your case your /etc/hosts makes it that this machines
> resolves itself only as localhost. You can:
>
> 1) Try to fix your network configuration to have your machine always
> resolve by its hostname first, or
>
> 2) Use HBase 0.20.3RC1 which contains a fix that tries harder to match
> the address. You can get it here:
> http://people.apache.org/~jdcryans/hbase-0.20.3-candidate-1/
>
> Sorry for that,
>
> J-D

Re: Problem With Zookeeper

Posted by Jean-Daniel Cryans <jd...@apache.org>.
HBase 0.20.2 and earlier checked only one address against the quorum
list you provide: the default one that Java reports for the machine. In
your case, it seems that /etc/hosts makes this machine resolve itself
only as localhost. You can:

1) Try to fix your network configuration to have your machine always
resolve by its hostname first, or

2) Use HBase 0.20.3RC1 which contains a fix that tries harder to match
the address. You can get it here:
http://people.apache.org/~jdcryans/hbase-0.20.3-candidate-1/
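
To make the difference between the two behaviours concrete, here is a minimal sketch. It is not the HQuorumPeer source; the class and method names (QuorumAddressCheck, indexInQuorum, allLocalNames) are made up for illustration, and the quorum list is the one from your hbase-site.xml. It compares matching a single default name against the quorum list with matching every name and address the machine answers to, which is roughly the idea behind "tries harder".

```java
import java.net.InetAddress;
import java.net.NetworkInterface;
import java.net.SocketException;
import java.net.UnknownHostException;
import java.util.Arrays;
import java.util.Collections;
import java.util.Enumeration;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;

// Illustrative only: hypothetical names, not the actual HBase implementation.
class QuorumAddressCheck {

    // Returns the position of the first quorum entry that matches one of the
    // given local names (case-insensitive), or -1 if nothing matches.
    static int indexInQuorum(List<String> quorum, Set<String> localNames) {
        for (int i = 0; i < quorum.size(); i++) {
            for (String name : localNames) {
                if (quorum.get(i).trim().equalsIgnoreCase(name)) {
                    return i;
                }
            }
        }
        return -1;
    }

    // Collects the default host name plus the names and addresses of every
    // network interface on this machine.
    static Set<String> allLocalNames() throws SocketException, UnknownHostException {
        Set<String> names = new LinkedHashSet<String>();
        InetAddress local = InetAddress.getLocalHost();
        names.add(local.getHostName());
        names.add(local.getCanonicalHostName());
        Enumeration<NetworkInterface> nics = NetworkInterface.getNetworkInterfaces();
        if (nics != null) {
            for (NetworkInterface nic : Collections.list(nics)) {
                for (InetAddress addr : Collections.list(nic.getInetAddresses())) {
                    names.add(addr.getHostName());
                    names.add(addr.getHostAddress());
                }
            }
        }
        return names;
    }

    public static void main(String[] args) throws Exception {
        // Quorum list as configured in hbase.zookeeper.quorum.
        List<String> quorum = Arrays.asList("gs2", "gs3", "gs4");

        // Single-name check: only the one name the JVM reports by default.
        String defaultName = InetAddress.getLocalHost().getHostName();
        System.out.println("default name: " + defaultName);
        System.out.println("single-name match index: "
                + indexInQuorum(quorum, Collections.singleton(defaultName)));

        // Broader check: any name or address of any local interface.
        System.out.println("all-names match index: "
                + indexInQuorum(quorum, allLocalNames()));
    }
}
```

If the single-name index comes back as -1 on gs2 while the all-names index does not, that points at name resolution on the machine (option 1) rather than at the quorum list itself.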

Sorry for that,

J-D

On Wed, Jan 13, 2010 at 9:43 AM, Ananth T. Sarathy
<an...@gmail.com> wrote:
> I have Hbase.env set to manage Zookeeper. When I try to start hbase, the
> zookeeper out says
>
> java.io.IOException: Could not find my address: localhost in list of ZooKeeper quorum servers
>        at org.apache.hadoop.hbase.zookeeper.HQuorumPeer.writeMyID(HQuorumPeer.java:128)
>        at org.apache.hadoop.hbase.zookeeper.HQuorumPeer.main(HQuorumPeer.java:67)
> ~
>
> in my hbase-site.xml
>
>  <property>
>   <name>hbase.zookeeper.quorum</name>
>   <value>gs2,gs3,gs4</value>
>   <description>Comma separated list of servers in the ZooKeeper Quorum.
>   For example, "host1.mydomain.com,host2.mydomain.com,host3.mydomain.com".
>   By default this is set to localhost for local and pseudo-distributed modes
>   of operation. For a fully-distributed setup, this should be set to a full
>   list of ZooKeeper quorum servers. If HBASE_MANAGES_ZK is set in hbase-env.sh
>   this is the list of servers which we will start/stop ZooKeeper on.
>   </description>
>  </property>
>
> in my /etc/hosts
>
> # hostname gs2 added to /etc/hosts by anaconda
> 127.0.0.1   localhost localhost.localdomain localhost4 localhost4.localdomain4 gs2
> ::1         localhost localhost.localdomain localhost6 localhost6.localdomain6 gs2
>
> 192.168.20.101 gs1
> 192.168.20.102 gs2
> 192.168.20.103 gs3
> 192.168.20.104 gs4
> 192.168.20.105 gs5
> 192.168.20.106 gs6
> 192.168.20.107 gs7
> 192.168.20.108 gs8
> 192.168.20.110 gs10
> 192.168.20.111 gs11
> 192.168.20.112 gs12
> 192.168.20.113 gs13
> 192.168.20.114 gs14
> 192.168.20.115 gs15
> 192.168.20.116 gs16
> 192.168.20.117 gs17
>
> am I missing something here? Why does it insist on localhost in the quorum
> list? What do i need to do to unconfuse it?
>
>
> Ananth T Sarathy
>