You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Marcelo Vanzin (JIRA)" <ji...@apache.org> on 2015/10/09 03:26:29 UTC

[jira] [Created] (SPARK-11020) HistoryServer fails to come up if HDFS takes too long to come out of safe mode

Marcelo Vanzin created SPARK-11020:
--------------------------------------

             Summary: HistoryServer fails to come up if HDFS takes too long to come out of safe mode
                 Key: SPARK-11020
                 URL: https://issues.apache.org/jira/browse/SPARK-11020
             Project: Spark
          Issue Type: Bug
          Components: Spark Core
    Affects Versions: 1.5.1, 1.6.0
            Reporter: Marcelo Vanzin


When HDFS is starting up, it starts in safe mode until the NN is able to read the whole fs image and initialize everything. For a really large NN that can take a while.

If the HS is started at the same time, it may give up trying to check whether the event log directory exists, and exit. That's a little sub-optimal; the HS could wait until HDFS came out of safe mode instead.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org