Posted to hdfs-dev@hadoop.apache.org by "Chackaravarthy (JIRA)" <ji...@apache.org> on 2016/05/04 18:41:12 UTC

[jira] [Created] (HDFS-10365) FullBlockReports retransmission delays NN startup time in large cluster.

Chackaravarthy created HDFS-10365:
-------------------------------------

             Summary: FullBlockReports retransmission delays NN startup time in large cluster.
                 Key: HDFS-10365
                 URL: https://issues.apache.org/jira/browse/HDFS-10365
             Project: Hadoop HDFS
          Issue Type: Bug
          Components: hdfs
    Affects Versions: 2.6.0
         Environment: version - hadoop-2.6.0
DN - 1200 nodes
            Reporter: Chackaravarthy
            Priority: Critical


Whenever the NN is restarted, it takes a very long time to return to a stable state: the last contact time stays above 1 to 2 minutes continuously for around 3 to 4 hours. This is mainly because most of the DNs hit the timeout (60s) in the blockReport (FBR) RPC call and then keep sending the FBR again.
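
To make the effect described above concrete, here is a rough back-of-the-envelope sketch in Java. It is not taken from the Hadoop code base. The class and method names are hypothetical, and it assumes the NN handles full block reports strictly one at a time, with a fixed per-report processing time, while all DNs send their first FBR at the same moment. The 1200 DNs and the 60s timeout come from this report; the 200 ms per-report cost is an invented figure used only for illustration.

```java
public class FbrTimeoutSketch {

    /**
     * Counts the DNs whose first FBR would not be answered within the RPC
     * timeout, under the simplifying assumptions stated above.
     */
    static int timedOutInFirstRound(int dataNodes, long serviceMillis, long timeoutMillis) {
        if (serviceMillis <= 0) {
            throw new IllegalArgumentException("serviceMillis must be positive");
        }
        // The report at queue position k (1-based) completes at k * serviceMillis,
        // so only the first timeoutMillis / serviceMillis reports finish in time.
        long answeredInTime = timeoutMillis / serviceMillis;
        return (int) Math.max(0L, dataNodes - answeredInTime);
    }

    /** Time the NN needs to work through one FBR from every DN, with no retransmissions. */
    static long drainMillis(int dataNodes, long serviceMillis) {
        return dataNodes * serviceMillis;
    }

    public static void main(String[] args) {
        int dataNodes = 1200;          // cluster size from the report
        long timeoutMillis = 60_000L;  // FBR RPC timeout from the report
        long serviceMillis = 200L;     // hypothetical per-report processing cost

        System.out.println("Drain time without retransmission (ms): "
                + drainMillis(dataNodes, serviceMillis));
        System.out.println("DNs timing out in the first round: "
                + timedOutInFirstRound(dataNodes, serviceMillis, timeoutMillis));
    }
}
```

Under these assumed numbers, one FBR per DN would take 240s to process, so any DN queued behind the first 300 would hit the 60s timeout and send its report again, adding more work to a NN that is already behind.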



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
