Posted to dev@ambari.apache.org by "Yusaku Sako (JIRA)" <ji...@apache.org> on 2015/09/23 00:26:04 UTC

[jira] [Reopened] (AMBARI-12951) Prompt user to save checkpoint before shutdown if last checkpoint is too old

     [ https://issues.apache.org/jira/browse/AMBARI-12951?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Yusaku Sako reopened AMBARI-12951:
----------------------------------

Reopening.
The time interval used to decide that the NN edit log is too old is hard-coded, which makes automated testing difficult.

> Prompt user to save checkpoint before shutdown if last checkpoint is too old
> ----------------------------------------------------------------------------
>
>                 Key: AMBARI-12951
>                 URL: https://issues.apache.org/jira/browse/AMBARI-12951
>             Project: Ambari
>          Issue Type: Task
>          Components: ambari-web
>    Affects Versions: 2.1.2
>            Reporter: Xi Wang
>            Assignee: Xi Wang
>            Priority: Critical
>             Fix For: 2.1.2
>
>         Attachments: AMBARI-12951.patch, AMBARI-12951.patch, Popup saying No JMX data .png, Popup to create new checkpoint.png
>
>
> If the admin attempts to shut down the HDFS NameNode(s) through Ambari in any of the following situations:
> * HDFS service > Stop / Restart All components
> * HDFS summary/configs page > Restart Affected
> * Host Details > NameNode Stop / Restart
> * Host Details > Stop/Restart all components
> * Stop All Services
> * Hosts Actions > All/Selected/Filtered Hosts > Hosts > Stop/Restart All Components
> a check should be run to determine when the last HDFS checkpoint was created.
> If the checkpoint was created more than 12 hours ago, Ambari should prompt the user to save/create a checkpoint first.
> "The last HDFS checkpoint is older than 12 hours.  Make sure that you have taken a checkpoint before proceeding.  Otherwise, the NameNode(s) can take a very long time to start up."
> The check should be skipped if there's no NameNode to shut down (i.e., all NameNodes are already down) as a result of the operation.
> The user should be warned with instructions to create a new checkpoint; it is up to the user whether to create one. The user can always proceed by clicking the "Next" button.
> The Active NameNode JMX is checked to figure out when the last checkpoint was taken.  If the Active NameNode JMX is not available for whatever reason, check for any available NameNode JMX (Standby).
> If no JMX info is available, then something is wrong.  In this case, display a warning along the lines of "Could not determine the age of the last HDFS checkpoint.  Please ensure that you have a recent checkpoint.  Otherwise, the NameNode(s) can take a very long time to start up."
> *Limitations*
> * NameNode shutdowns that are part of any wizard are not covered by this RMP (e.g., NameNode HA wizard)
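
The decision logic in the quoted description can be sketched roughly as below. This is only an illustration in Python, not the ambari-web implementation: the NameNodeJmx class, its fields, the check_checkpoint function, and the returned strings are all hypothetical names. The 12-hour limit comes from the issue text; passing it as a parameter (max_age_ms) is an assumption about one way the hard-coded interval could be made testable.

```python
from dataclasses import dataclass
from typing import Optional, Sequence

# The 12-hour limit from the issue text, in milliseconds.
# Exposing it as a default parameter is an assumption, not Ambari's design.
DEFAULT_MAX_CHECKPOINT_AGE_MS = 12 * 60 * 60 * 1000


@dataclass
class NameNodeJmx:
    """Hypothetical, simplified view of one NameNode's JMX data."""
    role: str  # "active" or "standby"
    last_checkpoint_time_ms: Optional[int]  # None when JMX is unavailable


def check_checkpoint(
    will_stop_namenode: bool,
    namenodes: Sequence[NameNodeJmx],
    now_ms: int,
    max_age_ms: int = DEFAULT_MAX_CHECKPOINT_AGE_MS,
) -> str:
    """Return "skip", "ok", "too_old", or "unknown"."""
    # Skip the check when the operation shuts down no running NameNode.
    if not will_stop_namenode:
        return "skip"

    # Prefer the Active NameNode; fall back to any other (Standby).
    # sorted() is stable, so the relative order of the rest is kept.
    ordered = sorted(namenodes, key=lambda nn: nn.role != "active")
    for nn in ordered:
        if nn.last_checkpoint_time_ms is not None:
            age_ms = now_ms - nn.last_checkpoint_time_ms
            # "More than 12 hours ago" is read as a strict comparison.
            return "too_old" if age_ms > max_age_ms else "ok"

    # No JMX data from any NameNode: the age cannot be determined.
    return "unknown"
```

Under these assumptions, "too_old" would map to the "older than 12 hours" prompt and "unknown" to the "Could not determine the age" warning. An automated test could pass a small max_age_ms instead of waiting for a checkpoint to age.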



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)