Posted to user@hbase.apache.org by Rural Hunter <ru...@gmail.com> on 2014/04/09 15:57:52 UTC
hbase region server reboot steps
Hi,
I have one region server that needs to be rebooted for server
maintenance. The server hosts both the Hadoop and HBase
slaves (hadoop2-hbase 0.96). What are the recommended steps to reboot it
without impacting the HBase service?
Re: hbase region server reboot steps
Posted by Jean-Marc Spaggiari <je...@spaggiari.org>.
Hi Rural,
Decommission the node, stop the processes, and reboot. The scripts in bin/
can help you with that, for example bin/graceful_stop.sh.
JM
2014-04-09 9:57 GMT-04:00 Rural Hunter <ru...@gmail.com>:
> Hi,
>
> I have one region server that needs to be rebooted for server
> maintenance. The server hosts both the Hadoop and HBase slaves (hadoop2-hbase
> 0.96). What are the recommended steps to reboot it without impacting the HBase
> service?
>
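A rough sketch of the graceful_stop.sh route suggested above, not a tested procedure for this cluster. By default it only prints the commands it would run; setting DRY_RUN=0 executes them. HBASE_HOME, its default path, and the hostname rs1.example.com are placeholders assumed for illustration. balance_switch is the HBase shell command for turning the balancer on or off.

```shell
#!/usr/bin/env bash
# Sketch only: drain one region server before a reboot.
# HBASE_HOME and the hostname are assumed placeholders, not values from this thread.
HBASE_HOME="${HBASE_HOME:-/opt/hbase}"
DRY_RUN="${DRY_RUN:-1}"   # 1 = print commands only, 0 = actually run them

# Print the command, and execute it only when DRY_RUN=0.
run() {
  echo "+ $*"
  if [ "$DRY_RUN" = "0" ]; then
    "$@"
  fi
}

# Toggle the balancer by piping a command into the HBase shell.
set_balancer() {
  echo "+ echo \"balance_switch $1\" | $HBASE_HOME/bin/hbase shell"
  if [ "$DRY_RUN" = "0" ]; then
    echo "balance_switch $1" | "$HBASE_HOME/bin/hbase" shell
  fi
}

drain_region_server() {
  local host="$1"
  set_balancer false                              # the script expects the balancer to be off
  run "$HBASE_HOME/bin/graceful_stop.sh" "$host"  # move regions off the host, then stop it
}

# Example (dry run): show what would be executed for a hypothetical host.
drain_region_server "rs1.example.com"
```

Once the machine is back, the region server would presumably be started again and the balancer switched back on with balance_switch true.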
Re: hbase region server reboot steps
Posted by Rural Hunter <ru...@gmail.com>.
Yes, I've already stopped the balancer and manually moved the regions to
other servers. Now I'm decommissioning the dfs data node on the server.
After that I will reboot the server.
On 2014/4/9 22:28, Jean-Marc Spaggiari wrote:
> Hum.
>
> Disable the load balancer and move all the regions manually to other hosts
> using the shell? Then hard-restart it?
>
> JM
>
>
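For the DataNode decommissioning step mentioned above, a minimal sketch, assuming the cluster already has dfs.hosts.exclude configured in hdfs-site.xml and that this runs where the hdfs client can reach the NameNode. EXCLUDE_FILE, its default path, and the hostname dn1.example.com are placeholders for illustration only. The hdfs command is only printed unless DRY_RUN=0 is set.

```shell
#!/usr/bin/env bash
# Sketch only: mark one DataNode for decommissioning.
# EXCLUDE_FILE must be whatever file dfs.hosts.exclude points to in hdfs-site.xml;
# the default path and the hostname below are assumed placeholders.
EXCLUDE_FILE="${EXCLUDE_FILE:-/tmp/dfs.exclude.example}"
DRY_RUN="${DRY_RUN:-1}"   # 1 = print the hdfs command only, 0 = run it

# Add the host to the exclude file once (no duplicate lines).
add_to_excludes() {
  local host="$1"
  touch "$EXCLUDE_FILE"
  if ! grep -qxF "$host" "$EXCLUDE_FILE"; then
    echo "$host" >> "$EXCLUDE_FILE"
  fi
}

# Ask the NameNode to re-read its include/exclude lists.
refresh_nodes() {
  echo "+ hdfs dfsadmin -refreshNodes"
  if [ "$DRY_RUN" = "0" ]; then
    hdfs dfsadmin -refreshNodes
  fi
}

decommission_datanode() {
  add_to_excludes "$1"
  refresh_nodes
  # Progress can then be watched with: hdfs dfsadmin -report
}

decommission_datanode "dn1.example.com"
```

The node is safe to reboot only after the report shows it as decommissioned, since HDFS needs time to re-replicate its blocks.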
Re: hbase region server reboot steps
Posted by Ted Yu <yu...@gmail.com>.
Rural:
Take a look at:
http://hbase.apache.org/book.html#decommission
especially section 15.3.1.1.
On Wed, Apr 9, 2014 at 8:28 AM, Jean-Marc Spaggiari <jean-marc@spaggiari.org> wrote:
> Hum.
>
> Disable the load balancer and move all the regions manually to other hosts
> using the shell? Then hard-restart it?
>
> JM
>
>
> 2014-04-09 10:26 GMT-04:00 Rural Hunter <ru...@gmail.com>:
>
> > Actually I have to do a hard reboot. Let me provide more info about the
> > problem: Except for the ssh service (ssh error is:
> > ssh_exchange_identification: Connection closed by remote host) and the
> > local login problem, other services are running fine on the server
> > (including http/ftp/hbase/hadoop etc.).
> >
> > On 2014/4/9 22:14, Rural Hunter wrote:
> >
> >> Thanks. What if I'm not able to log in to the region server (both ssh and
> >> local)? I have to reboot and check the server because of this serious
> >> problem.
> >>
> >> On 2014/4/9 22:01, Ted Yu wrote:
> >>
> >>> You can use bin/graceful_stop.sh to stop the region server process.
> >>>
> >>> # Move regions off a server then stop it. Optionally restart and reload.
> >>> # Turn off the balancer before running this script.
> >>>
> >>> After that, you can stop hadoop (datanode, etc)
> >>>
> >>>
> >>>
> >>
> >
>
Re: hbase region server reboot steps
Posted by Jean-Marc Spaggiari <je...@spaggiari.org>.
Hum.
Disable the load balancer and move all the regions manually to other hosts
using the shell? Then hard-restart it?
JM
2014-04-09 10:26 GMT-04:00 Rural Hunter <ru...@gmail.com>:
> Actually I have to do a hard reboot. Let me provide more info about the
> problem: Except for the ssh service (ssh error is: ssh_exchange_identification:
> Connection closed by remote host) and the local login problem, other services
> are running fine on the server (including http/ftp/hbase/hadoop etc.).
>
> On 2014/4/9 22:14, Rural Hunter wrote:
>
>> Thanks. What if I'm not able to log in to the region server (both ssh and
>> local)? I have to reboot and check the server because of this serious
>> problem.
>>
>> On 2014/4/9 22:01, Ted Yu wrote:
>>
>>> You can use bin/graceful_stop.sh to stop the region server process.
>>>
>>> # Move regions off a server then stop it. Optionally restart and reload.
>>> # Turn off the balancer before running this script.
>>>
>>> After that, you can stop hadoop (datanode, etc)
>>>
>>>
>>>
>>
>
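The manual route suggested above (balancer off, then move regions by hand) could be scripted roughly as follows. This sketch only generates HBase shell commands; it does not contact the cluster. balance_switch and move are HBase shell commands, while the encoded region names and the destination server name used here are made-up placeholders.

```shell
#!/usr/bin/env bash
# Sketch only: emit HBase shell commands that move regions to another server.
# The encoded region names and the destination server name are made-up
# placeholders; real values come from the master web UI or status output.

# Usage: emit_moves DEST_SERVER REGION...
# DEST_SERVER has the form 'host,port,startcode'.
emit_moves() {
  local dest="$1"
  shift
  echo "balance_switch false"        # keep the balancer from moving regions back
  local region
  for region in "$@"; do
    echo "move '$region', '$dest'"
  done
}

emit_moves "rs2.example.com,60020,1397000000000" \
  "0123456789abcdef0123456789abcdef" \
  "fedcba9876543210fedcba9876543210"
```

The generated lines can be reviewed first and then piped into hbase shell. Generating text rather than calling the shell directly keeps the step easy to inspect before anything changes on the cluster.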
Re: hbase region server reboot steps
Posted by Rural Hunter <ru...@gmail.com>.
Actually I have to do a hard reboot. Let me provide more info about the
problem: Except for the ssh service (ssh error is:
ssh_exchange_identification: Connection closed by remote host) and the local
login problem, other services are running fine on the server (including
http/ftp/hbase/hadoop etc.).
On 2014/4/9 22:14, Rural Hunter wrote:
> Thanks. What if I'm not able to log in to the region server (both ssh and
> local)? I have to reboot and check the server because of this serious
> problem.
>
> On 2014/4/9 22:01, Ted Yu wrote:
>> You can use bin/graceful_stop.sh to stop the region server process.
>>
>> # Move regions off a server then stop it. Optionally restart and reload.
>> # Turn off the balancer before running this script.
>>
>> After that, you can stop hadoop (datanode, etc)
>>
>>
>
Re: hbase region server reboot steps
Posted by Rural Hunter <ru...@gmail.com>.
Thanks. What if I'm not able to log in to the region server (both ssh and
local)? I have to reboot and check the server because of this serious
problem.
On 2014/4/9 22:01, Ted Yu wrote:
> You can use bin/graceful_stop.sh to stop the region server process.
>
> # Move regions off a server then stop it. Optionally restart and reload.
> # Turn off the balancer before running this script.
>
> After that, you can stop hadoop (datanode, etc)
>
>
Re: hbase region server reboot steps
Posted by Ted Yu <yu...@gmail.com>.
You can use bin/graceful_stop.sh to stop the region server process.
# Move regions off a server then stop it. Optionally restart and reload.
# Turn off the balancer before running this script.
After that, you can stop Hadoop (datanode, etc.).
On Wed, Apr 9, 2014 at 7:57 AM, Rural Hunter <ru...@gmail.com> wrote:
> Hi,
>
> I have one region server that needs to be rebooted for server
> maintenance. The server hosts both the Hadoop and HBase slaves (hadoop2-hbase
> 0.96). What are the recommended steps to reboot it without impacting the HBase
> service?
>