You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@hadoop.apache.org by Alberto Chiusole <al...@gmail.com> on 2016/12/05 19:32:46 UTC
Small mistake (?) in doc about HA with Journal Nodes
Hi all,
I'm Alberto Chiusole, an Italian computer science student and
open-source fan.
I'm currently performing a small research to expose to my fellow
students the Hadoop project, and this is my first post in this ML.
I think I spotted I small mistake in the HDFS documentation regarding
achieving HA with the Quorum Journal Manager [1], section "Hardware
resources", paragraph "JournalNode machines": it's stated:
"""
The JournalNode daemon is relatively lightweight, so these daemons may
reasonably be collocated on machines with other Hadoop daemons, for
example NameNodes, the JobTracker, (...)
"""
Is "NameNodes" a typo and you meant "DataNode" instead? Aren't the
JournalNodes meant to survive in case of a failure of the NameNodes? Why
should I place a JournalNode on the same machine that contains the log I
need to synchronize?
Moreover I have a quick question on the same topic: why do you suggest
to place an odd numbers of machines as JournalNodes in order to increase
the Fault Tolerance?
Regards,
Alberto Chiusole
[1]:
https://hadoop.apache.org/docs/stable/hadoop-project-dist/hadoop-hdfs/HDFSHighAvailabilityWithQJM.html#Hardware_resources
---------------------------------------------------------------------
To unsubscribe, e-mail: user-unsubscribe@hadoop.apache.org
For additional commands, e-mail: user-help@hadoop.apache.org
Re: Small mistake (?) in doc about HA with Journal Nodes
Posted by Ravi Prakash <ra...@gmail.com>.
Hi Alberto!
The assumption is that *multiple* machines could be running the Namenode
process. Only one of them would be active, while the other Namenode
processes would be in Standby mode.
The number of machines is suggested to be odd so that its easier to form
consensus. To handle the failure of k machines, 2k+1 is usually the number
of QJMs you'd need.
HTH
Ravi
On Mon, Dec 5, 2016 at 11:32 AM, Alberto Chiusole <
alberto.chiusole95@gmail.com> wrote:
> Hi all,
> I'm Alberto Chiusole, an Italian computer science student and open-source
> fan.
> I'm currently performing a small research to expose to my fellow students
> the Hadoop project, and this is my first post in this ML.
>
> I think I spotted I small mistake in the HDFS documentation regarding
> achieving HA with the Quorum Journal Manager [1], section "Hardware
> resources", paragraph "JournalNode machines": it's stated:
> """
> The JournalNode daemon is relatively lightweight, so these daemons may
> reasonably be collocated on machines with other Hadoop daemons, for example
> NameNodes, the JobTracker, (...)
> """
>
> Is "NameNodes" a typo and you meant "DataNode" instead? Aren't the
> JournalNodes meant to survive in case of a failure of the NameNodes? Why
> should I place a JournalNode on the same machine that contains the log I
> need to synchronize?
>
>
> Moreover I have a quick question on the same topic: why do you suggest to
> place an odd numbers of machines as JournalNodes in order to increase the
> Fault Tolerance?
>
>
> Regards,
> Alberto Chiusole
>
>
> [1]: https://hadoop.apache.org/docs/stable/hadoop-project-dist/
> hadoop-hdfs/HDFSHighAvailabilityWithQJM.html#Hardware_resources
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: user-unsubscribe@hadoop.apache.org
> For additional commands, e-mail: user-help@hadoop.apache.org
>
>