You are viewing a plain text version of this content. The canonical link for it is here.
Posted to notifications@accumulo.apache.org by "John Vines (JIRA)" <ji...@apache.org> on 2017/06/21 17:30:00 UTC
[jira] [Created] (ACCUMULO-4663) ShutdownTServer attempts shutdown
over and over again, can end up blocking migrations
John Vines created ACCUMULO-4663:
------------------------------------
Summary: ShutdownTServer attempts shutdown over and over again, can end up blocking migrations
Key: ACCUMULO-4663
URL: https://issues.apache.org/jira/browse/ACCUMULO-4663
Project: Accumulo
Issue Type: Bug
Affects Versions: 1.7.2
Reporter: John Vines
Also affects 1.7.1
ACCUMULO-1259 identified a problem with it repeatedly invoking master.shutdownTServer. One side effect of this is a race where a server goes down, gets removed from the online tablet sets, etc. and then gets re-added to the serversToShutdown set. This will cause the balancer to not balance due to shutdown in progress and never gets rectified. Only workaround is to restart the master (or bring that server back up, I'm guessing).
ACCUMULO-3897 attempted to fix that problem by attempting shutdown once and only once. It does this by setting a local boolean. But because we do not reserialize our fate repos between isReady calls, this boolean effectively is reset between each check, making it pointless.
I believe there are 2 problems here- 1 is that ShutdownTServer.requestedShutdown is not implemented correctly
2 is we should have a mechanism to remove from serversToShutdown any server that is not present.
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)