You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hbase.apache.org by "Enis Soztutar (JIRA)" <ji...@apache.org> on 2015/10/23 01:12:27 UTC

[jira] [Updated] (HBASE-14682) CM restore functionality for regionservers is broken

     [ https://issues.apache.org/jira/browse/HBASE-14682?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Enis Soztutar updated HBASE-14682:
----------------------------------
    Attachment: hbase-14682_v1.patch

Let me try out this patch. 

> CM restore functionality for regionservers is broken
> ----------------------------------------------------
>
>                 Key: HBASE-14682
>                 URL: https://issues.apache.org/jira/browse/HBASE-14682
>             Project: HBase
>          Issue Type: Bug
>            Reporter: Enis Soztutar
>            Assignee: Enis Soztutar
>         Attachments: hbase-14682_v1.patch
>
>
> In the {{DistributedHBaseCluster.restoreClusterStatus()}}, we are starting a regionserver for backup masters. Seems to be a copy-paste error from HBASE-12429. 
> This is causing further issues on our test rig since we are starting a regionserver where only a backup master were before:
> {code}
> INFO  [main] hbase.HBaseClusterManager: Executing remote command: ps aux | grep proc_master | grep -v grep | tr -s ' ' | cut -d ' ' -f2 , hostname:10.0.0.99
> INFO  [main] util.Shell: Executing full command [/usr/bin/ssh  -o StrictHostKeyChecking=no 10.0.0.99 "sudo su - hbase -c \"ps aux | grep proc_master | grep -v grep | tr -s ' ' | cut -d ' ' -f2\""]
> INFO  [main] hbase.HBaseClusterManager: Executed remote command, exit code:0 , output:13244
> INFO  [main] hbase.HBaseClusterManager: Executing remote command: ps aux | grep proc_regionserver | grep -v grep | tr -s ' ' | cut -d ' ' -f2 , hostname:10.0.0.99
> INFO  [main] util.Shell: Executing full command [/usr/bin/ssh  -o StrictHostKeyChecking=no 10.0.0.99 "sudo su - hbase -c \"ps aux | grep proc_regionserver | grep -v grep | tr -s ' ' | cut -d ' ' -f2\""]
> INFO  [main] hbase.HBaseClusterManager: Executed remote command, exit code:0 , output:
> INFO  [main] hbase.HBaseCluster: Restoring cluster - starting initial region server: 10.0.0.99:16000
> INFO  [main] hbase.HBaseCluster: Starting RS on: 10.0.0.99
> INFO  [main] hbase.HBaseClusterManager: Executing remote command: /usr/hdp/2.3.3.0-2971/hbase/bin/hbase-daemon.sh --config /tmp/hbaseConf start regionserver , hostname:10.0.0.99
> INFO  [main] util.Shell: Executing full command [/usr/bin/ssh  -o StrictHostKeyChecking=no 10.0.0.99 "sudo su - hbase -c \"/usr/hdp/2.3.3.0-2971/hbase/bin/hbase-daemon.sh --config /tmp/hbaseConf start reg
> INFO  [main] hbase.HBaseClusterManager: Executed remote command, exit code:0 , output:starting regionserver, logging to /var/log/hbase/hbase-hbase-regionserver-zk0-hb23-h.out
> INFO  [main] hbase.HBaseClusterManager: Executing remote command: ps aux | grep proc_regionserver | grep -v grep | tr -s ' ' | cut -d ' ' -f2 , hostname:10.0.0.126
> INFO  [main] util.Shell: Executing full command [/usr/bin/ssh  -o StrictHostKeyChecking=no 10.0.0.126 "sudo su - hbase -c \"ps aux | grep proc_regionserver | grep -v grep | tr -s ' ' | cut -d ' ' -f2\""]
> INFO  [main] hbase.HBaseClusterManager: Executed remote command, exit code:0 , output:
> INFO  [main] hbase.HBaseCluster: Added new HBaseAdmin
> INFO  [main] hbase.HBaseCluster: Restoring cluster - done
> DEBUG [main] hbase.IntegrationTestBase: Done restoring the cluster
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)