Posted to common-issues@hadoop.apache.org by "Ravi Prakash (JIRA)" <ji...@apache.org> on 2011/02/15 21:33:57 UTC

[jira] Commented: (HADOOP-5094) Show dead nodes information in dfsadmin -report

    [ https://issues.apache.org/jira/browse/HADOOP-5094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12994988#comment-12994988 ] 

Ravi Prakash commented on HADOOP-5094:
--------------------------------------

Hi Gary,

I've only just joined the Hadoop team and will be writing automated tests for the test cases you've listed. Please excuse my naivete if I'm way off. Can you please clarify what you mean by "stop a node"? Is that done using the hadoop-daemon.sh script? Won't the namenode mark the datanode as dead only after 10 minutes? Should my test be that long-lived?

Cheers
Ravi.
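
For reference on the timing question above: as far as I know, the namenode derives its dead-node timeout from the heartbeat recheck interval and the datanode heartbeat interval, roughly as 2 * recheck + 10 * heartbeat. The sketch below only illustrates that arithmetic. The function and parameter names are hypothetical, and the defaults (5-minute recheck, 3-second heartbeat) are assumptions that should be checked against the Hadoop version under test.

```python
def dead_node_timeout_ms(recheck_interval_ms=5 * 60 * 1000, heartbeat_interval_s=3):
    """Approximate time (ms) without heartbeats before a datanode is marked dead.

    Assumed formula: 2 * recheck interval + 10 * heartbeat interval.
    Parameter names and defaults are illustrative, not Hadoop config keys.
    """
    return 2 * recheck_interval_ms + 10 * 1000 * heartbeat_interval_s


# With the assumed defaults this comes to 10.5 minutes.
print(dead_node_timeout_ms() / 60000.0)

# A test cluster configured with a 1 s recheck and a 1 s heartbeat
# would expire a stopped datanode after about 12 seconds.
print(dead_node_timeout_ms(recheck_interval_ms=1000, heartbeat_interval_s=1) / 1000.0)
```

If both intervals can be lowered in the test cluster's configuration, the test would not need to run for the full ten minutes.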


> Show dead nodes information in dfsadmin -report
> -----------------------------------------------
>
>                 Key: HADOOP-5094
>                 URL: https://issues.apache.org/jira/browse/HADOOP-5094
>             Project: Hadoop Common
>          Issue Type: Improvement
>    Affects Versions: 0.18.2
>            Reporter: Jim Huang
>            Assignee: Jakob Homan
>            Priority: Minor
>             Fix For: 0.21.0
>
>         Attachments: DfsAdminDeadNode_testCases.html, DfsAdminDeadNode_testCases.html, HADOOP-5094.patch, HADOOP-5094.patch, HADOOP-5094.patch
>
>
> As part of the operations team's responsibility to bring back dead nodes, it would be good to have a quick way to obtain a list of dead data nodes.
> The current way is to scrape the namenode web UI page and parse that information, but this creates load on the namenode.
> In search of a less costly way, I noticed that dfsadmin -report only lists data nodes with State: "In Service" and "Decommission in progress".
> I am asking for a cheap way to obtain a list of dead nodes.
> In addition, can the following requests be reviewed as further enhancements and changes to dfsadmin -report?
> - Consistent output formatting: "Remaining raw bytes:" for the data nodes should have a space between the exact value and the parenthesized value.
> Sample:
> Total raw bytes: 3842232975360 (3.49 TB)
> Remaining raw bytes: 146090593065(136.06 GB)
> Used raw bytes: 3240864964620 (2.95 TB)
> - Include the running version of Hadoop.  
> - What is the meaning of "Total effective bytes"?
> - Display the hostname instead of the IP address for the data node (toggle option?)
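
The formatting request in the list above (a space between the exact byte count and the parenthesized value) can be sketched as follows. This is an illustration in Python, not the dfsadmin code: the helper names are hypothetical, and the use of 1024-based units with two decimals is an assumption inferred from the sample values.

```python
UNITS = ["B", "KB", "MB", "GB", "TB", "PB"]


def human_readable(num_bytes):
    """Format a byte count with two decimals, using 1024-based units (assumed)."""
    value = float(num_bytes)
    unit = 0
    while value >= 1024 and unit < len(UNITS) - 1:
        value /= 1024
        unit += 1
    return "%.2f %s" % (value, UNITS[unit])


def report_line(label, num_bytes):
    """Build one report line, always with a space before the parenthesis."""
    return "%s: %d (%s)" % (label, num_bytes, human_readable(num_bytes))


print(report_line("Total raw bytes", 3842232975360))
print(report_line("Remaining raw bytes", 146090593065))
print(report_line("Used raw bytes", 3240864964620))
```

Routing every line through a single helper such as report_line is one way to keep the spacing uniform across all three fields.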

-- 
This message is automatically generated by JIRA.
-
For more information on JIRA, see: http://www.atlassian.com/software/jira