You are viewing a plain text version of this content. The canonical link for it is here.

Posted to common-issues@hadoop.apache.org by "shifeng (JIRA)" <ji...@apache.org> on 2015/07/02 05:37:04 UTC

[jira] [Commented] (HADOOP-692) Rack-aware Replica Placement

    [ https://issues.apache.org/jira/browse/HADOOP-692?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14611413#comment-14611413 ] 

shifeng commented on HADOOP-692:
--------------------------------

is this question resolved?
I searched but found no answer.

> Rack-aware Replica Placement
> ----------------------------
>
>                 Key: HADOOP-692
>                 URL: https://issues.apache.org/jira/browse/HADOOP-692
>             Project: Hadoop Common
>          Issue Type: Improvement
>    Affects Versions: 0.10.1
>            Reporter: Hairong Kuang
>            Assignee: Hairong Kuang
>             Fix For: 0.11.0
>
>         Attachments: Rack_aware_HDFS_proposal.pdf, rack.patch
>
>
> This issue assumes that HDFS runs on a cluster of computers that spread across many racks. Communication between two nodes on different racks needs to go through switches. Bandwidth in/out of a rack may be less than the total bandwidth of machines in the rack. The purpose of rack-aware replica placement is to improve data reliability, availability, and network bandwidth utilization. The basic idea is that each data node determines to which rack it belongs at the startup time and notifies the name node of the rack id upon registration. The name node maintains a rackid-to-datanode map and tries to place replicas across racks.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)