Posted to issues@aurora.apache.org by "Kevin Sweeney (JIRA)" <ji...@apache.org> on 2015/08/31 21:27:45 UTC

[jira] [Created] (AURORA-1463) Consider forcing a reconciliation immediately after a backup restore

Kevin Sweeney created AURORA-1463:
-------------------------------------

             Summary: Consider forcing a reconciliation immediately after a backup restore
                 Key: AURORA-1463
                 URL: https://issues.apache.org/jira/browse/AURORA-1463
             Project: Aurora
          Issue Type: Story
          Components: Reliability, Scheduler
            Reporter: Kevin Sweeney


Based on an IRC conversation, it is not obvious to users that a scheduler backup restore only becomes fully effective after the next reconciliation run. For example, restoring a full cluster's backup when the set of slaves is empty leaves the user with many tasks marked RUNNING that are not actually running.

My suggestion is to add a flag to the admin commit_recovery command that forces an immediate reconciliation once the new scheduler has come back up. It may also make sense to add a reconcile_now command.
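
To make the proposal concrete, here is a toy model of the behaviour being asked for. It is only a sketch and not Aurora code: the `Status` enum, the `reconcile` helper, and the `force_reconcile` parameter are hypothetical names invented for illustration, and marking unknown tasks LOST is an assumption about what a reconciliation pass would conclude.

```python
from enum import Enum


class Status(Enum):
    RUNNING = "RUNNING"
    LOST = "LOST"


def reconcile(restored, live_task_ids):
    """Return a copy of `restored` in which every RUNNING task that no
    slave knows about is marked LOST (assumed reconciliation outcome)."""
    result = {}
    for task_id, status in restored.items():
        if status is Status.RUNNING and task_id not in live_task_ids:
            result[task_id] = Status.LOST
        else:
            result[task_id] = status
    return result


def commit_recovery(restored, live_task_ids, force_reconcile=False):
    """Hypothetical commit step: without the flag the restored state is
    taken as-is; with it, reconciliation runs immediately."""
    if force_reconcile:
        return reconcile(restored, live_task_ids)
    return dict(restored)


# Restoring a full cluster's backup onto an empty set of slaves.
backup = {"task-1": Status.RUNNING, "task-2": Status.RUNNING}
stale = commit_recovery(backup, live_task_ids=set())
fresh = commit_recovery(backup, live_task_ids=set(), force_reconcile=True)
```

In this model `stale` still reports both tasks as RUNNING until some later reconciliation happens, while `fresh` reflects the empty cluster right away, which is the difference the proposed flag is meant to expose.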



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)