Posted to hdfs-dev@hadoop.apache.org by "Aaron T. Myers (Created) (JIRA)" <ji...@apache.org> on 2012/01/16 08:47:53 UTC
[jira] [Created] (HDFS-2795) HA: Standby NN takes a long time to
recover from a dead DN starting up
HA: Standby NN takes a long time to recover from a dead DN starting up
----------------------------------------------------------------------
Key: HDFS-2795
URL: https://issues.apache.org/jira/browse/HDFS-2795
Project: Hadoop HDFS
Issue Type: Sub-task
Components: data-node, ha, name-node
Affects Versions: HA branch (HDFS-1623)
Reporter: Aaron T. Myers
Assignee: Aaron T. Myers
Priority: Critical
To reproduce:
# Start an HA cluster with a DN.
# Write several blocks to the FS with replication 1.
# Shut down the DN.
# Wait for the NNs to declare the DN dead. All blocks will be under-replicated.
# Restart the DN.
Note that when the DN restarts, the active NN immediately gets all block locations from the DN's initial block report (BR). The standby NN does not; instead, it slowly adds block locations for a subset of the previously missing blocks on each DN heartbeat.
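
To give a rough sense of how long the standby NN could lag behind, here is a back-of-the-envelope sketch. It is not taken from the HDFS code: the function name and the fixed `blocks_per_heartbeat` subset size are hypothetical (this report does not say how many locations are added per heartbeat), and the 3-second interval assumes the default `dfs.heartbeat.interval` is unchanged.

```python
from math import ceil

# HDFS default for dfs.heartbeat.interval, in seconds; assumed unchanged here.
HEARTBEAT_INTERVAL_S = 3


def standby_recovery_estimate(missing_blocks, blocks_per_heartbeat):
    """Estimate heartbeats and seconds until the standby knows every location.

    blocks_per_heartbeat is a hypothetical fixed subset size; the report
    does not say how many locations are added on each heartbeat.
    """
    if missing_blocks < 0:
        raise ValueError("missing_blocks must be non-negative")
    if blocks_per_heartbeat <= 0:
        raise ValueError("blocks_per_heartbeat must be positive")
    # Each heartbeat recovers at most blocks_per_heartbeat locations,
    # so the last partial batch still costs one full heartbeat.
    heartbeats = ceil(missing_blocks / blocks_per_heartbeat)
    return heartbeats, heartbeats * HEARTBEAT_INTERVAL_S


# Example with made-up numbers: 10,000 blocks, 100 locations per heartbeat.
heartbeats, seconds = standby_recovery_estimate(10_000, 100)
print(f"{heartbeats} heartbeats, about {seconds} s")
```

With these made-up inputs the standby would need 100 heartbeats (about 300 seconds) to catch up, whereas the active NN has every location as soon as the initial block report is processed.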
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Resolved] (HDFS-2795) HA: Standby NN takes a long time to
recover from a dead DN starting up
Posted by "Todd Lipcon (Resolved) (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HDFS-2795?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Todd Lipcon resolved HDFS-2795.
-------------------------------
Resolution: Fixed
Fix Version/s: HA branch (HDFS-1623)
Hadoop Flags: Reviewed
Committed to branch. I fixed the "20" to "5", good catch.
> HA: Standby NN takes a long time to recover from a dead DN starting up
> ----------------------------------------------------------------------
>
> Key: HDFS-2795
> URL: https://issues.apache.org/jira/browse/HDFS-2795
> Project: Hadoop HDFS
> Issue Type: Sub-task
> Components: data-node, ha, name-node
> Affects Versions: HA branch (HDFS-1623)
> Reporter: Aaron T. Myers
> Assignee: Todd Lipcon
> Priority: Critical
> Fix For: HA branch (HDFS-1623)
>
> Attachments: hdfs-2795.txt