You are viewing a plain text version of this content. The canonical link for it is here.
Posted to common-dev@hadoop.apache.org by "Jim Kellerman (JIRA)" <ji...@apache.org> on 2007/10/02 02:20:50 UTC

[jira] Updated: (HADOOP-1960) [hbase] If a region server cannot talk to the master after several attempts, it should shut itself down

     [ https://issues.apache.org/jira/browse/HADOOP-1960?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jim Kellerman updated HADOOP-1960:
----------------------------------

    Attachment: patch.txt

TestMasterAbort
- New test

MiniHBaseCluster
- Add getter that returns the HMaster object

TestRegionServerAbort
- Add check for scanner == null before trying to close it

TestSplit
- Enclose test body in try catch block so that exceptions can be
  dumped to the console at the point in the test where they occur.

HRegionServer
- If unable to communicate with the master for more than the lease
  timeout interval abort server.

HMaster
- Add abort method
- If aborting,  ignores region server reports for 1 1/2 times lease period


> [hbase] If a region server cannot talk to the master after several attempts, it should shut itself down
> -------------------------------------------------------------------------------------------------------
>
>                 Key: HADOOP-1960
>                 URL: https://issues.apache.org/jira/browse/HADOOP-1960
>             Project: Hadoop
>          Issue Type: Improvement
>          Components: contrib/hbase
>    Affects Versions: 0.15.0
>            Reporter: Jim Kellerman
>            Assignee: Jim Kellerman
>             Fix For: 0.15.0
>
>         Attachments: patch.txt
>
>
> If a region server cannot contact the master after a configurable number of tries, it should shut itself down.
> If the region server cannot contact the master,
> - if the master is alive but the network is partitioned, the master will probably time out the region server's lease and try to recover the server's log and reassign the regions the server is serving.
> - if the master has died, and subsequently restarts, it will be reassigning regions anyway, so the region server should stop serving the regions.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.