You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@hbase.apache.org by Rural Hunter <ru...@gmail.com> on 2014/04/09 15:57:52 UTC

hbase region server reboot steps

Hi,

I have one region server which needs to be rebooted for server 
maintenance. The server hosts both the hadoop and hbase 
slave(hadoop2-hbase 0.96). What is the recommended steps to reboot it 
without impacting hbase service?

Re: hbase region server reboot steps

Posted by Jean-Marc Spaggiari <je...@spaggiari.org>.
Hi Rural,

Decomission the node, stop the processes, and reboot. You can look at the
scripts in bin/ to help you with that. Like bin/graceful_stop.sh.

JM


2014-04-09 9:57 GMT-04:00 Rural Hunter <ru...@gmail.com>:

> Hi,
>
> I have one region server which needs to be rebooted for server
> maintenance. The server hosts both the hadoop and hbase slave(hadoop2-hbase
> 0.96). What is the recommended steps to reboot it without impacting hbase
> service?
>

Re: hbase region server reboot steps

Posted by Rural Hunter <ru...@gmail.com>.
Yes, I've already stopped the balancer and manually moved the regions to 
other servers. Now I'm decommissioning the dfs data node on the server. 
After that I will reboot the server.

于 2014/4/9 22:28, Jean-Marc Spaggiari 写道:
> Hum.
>
> Disable load balancer, and move all the regions manually to other hosts
> using the shell? Then hard restart it?
>
> JM
>
>

Re: hbase region server reboot steps

Posted by Ted Yu <yu...@gmail.com>.
Rural:
Take a look at:
http://hbase.apache.org/book.html#decommission

especially 15.3.1.1


On Wed, Apr 9, 2014 at 8:28 AM, Jean-Marc Spaggiari <jean-marc@spaggiari.org
> wrote:

> Hum.
>
> Disable load balancer, and move all the regions manually to other hosts
> using the shell? Then hard restart it?
>
> JM
>
>
> 2014-04-09 10:26 GMT-04:00 Rural Hunter <ru...@gmail.com>:
>
> > Actually I have to do a hard reboot. Let me provide more info about the
> > problem: Except the ssh service(ssh error is:
> ssh_exchange_identification:
> > Connection closed by remote host) and local login problem, other services
> > are running fine on the server(including http/ftp/hbase/hadoop etc).
> >
> > 于 2014/4/9 22:14, Rural Hunter 写道:
> >
> >  Thanks. What if I'm not able to login the region server(both ssh and
> >> local)? I have to reboot and check the server because of this serious
> >> problem.
> >>
> >> 于 2014/4/9 22:01, Ted Yu 写道:
> >>
> >>> You can use bin/graceful_stop.sh to stop the region server process.
> >>>
> >>> # Move regions off a server then stop it.  Optionally restart and
> reload.
> >>> # Turn off the balancer before running this script.
> >>>
> >>> After that, you can stop hadoop (datanode, etc)
> >>>
> >>>
> >>>
> >>
> >
>

Re: hbase region server reboot steps

Posted by Jean-Marc Spaggiari <je...@spaggiari.org>.
Hum.

Disable load balancer, and move all the regions manually to other hosts
using the shell? Then hard restart it?

JM


2014-04-09 10:26 GMT-04:00 Rural Hunter <ru...@gmail.com>:

> Actually I have to do a hard reboot. Let me provide more info about the
> problem: Except the ssh service(ssh error is: ssh_exchange_identification:
> Connection closed by remote host) and local login problem, other services
> are running fine on the server(including http/ftp/hbase/hadoop etc).
>
> 于 2014/4/9 22:14, Rural Hunter 写道:
>
>  Thanks. What if I'm not able to login the region server(both ssh and
>> local)? I have to reboot and check the server because of this serious
>> problem.
>>
>> 于 2014/4/9 22:01, Ted Yu 写道:
>>
>>> You can use bin/graceful_stop.sh to stop the region server process.
>>>
>>> # Move regions off a server then stop it.  Optionally restart and reload.
>>> # Turn off the balancer before running this script.
>>>
>>> After that, you can stop hadoop (datanode, etc)
>>>
>>>
>>>
>>
>

Re: hbase region server reboot steps

Posted by Rural Hunter <ru...@gmail.com>.
Actually I have to do a hard reboot. Let me provide more info about the 
problem: Except the ssh service(ssh error is: 
ssh_exchange_identification: Connection closed by remote host) and local 
login problem, other services are running fine on the server(including 
http/ftp/hbase/hadoop etc).

于 2014/4/9 22:14, Rural Hunter 写道:
> Thanks. What if I'm not able to login the region server(both ssh and 
> local)? I have to reboot and check the server because of this serious 
> problem.
>
> 于 2014/4/9 22:01, Ted Yu 写道:
>> You can use bin/graceful_stop.sh to stop the region server process.
>>
>> # Move regions off a server then stop it.  Optionally restart and 
>> reload.
>> # Turn off the balancer before running this script.
>>
>> After that, you can stop hadoop (datanode, etc)
>>
>>
>


Re: hbase region server reboot steps

Posted by Rural Hunter <ru...@gmail.com>.
Thanks. What if I'm not able to login the region server(both ssh and 
local)? I have to reboot and check the server because of this serious 
problem.

于 2014/4/9 22:01, Ted Yu 写道:
> You can use bin/graceful_stop.sh to stop the region server process.
>
> # Move regions off a server then stop it.  Optionally restart and reload.
> # Turn off the balancer before running this script.
>
> After that, you can stop hadoop (datanode, etc)
>
>


Re: hbase region server reboot steps

Posted by Ted Yu <yu...@gmail.com>.
You can use bin/graceful_stop.sh to stop the region server process.

# Move regions off a server then stop it.  Optionally restart and reload.
# Turn off the balancer before running this script.

After that, you can stop hadoop (datanode, etc)


On Wed, Apr 9, 2014 at 7:57 AM, Rural Hunter <ru...@gmail.com> wrote:

> Hi,
>
> I have one region server which needs to be rebooted for server
> maintenance. The server hosts both the hadoop and hbase slave(hadoop2-hbase
> 0.96). What is the recommended steps to reboot it without impacting hbase
> service?
>