You are viewing a plain text version of this content. The canonical link for it is here.
Posted to common-dev@hadoop.apache.org by "Amar Kamat (JIRA)" <ji...@apache.org> on 2009/03/18 12:29:51 UTC

[jira] Created: (HADOOP-5526) Provide an admin page displaying events in the cluster along with cluster status/health

Provide an admin page displaying events in the cluster along with cluster status/health
---------------------------------------------------------------------------------------

                 Key: HADOOP-5526
                 URL: https://issues.apache.org/jira/browse/HADOOP-5526
             Project: Hadoop Core
          Issue Type: New Feature
          Components: mapred
            Reporter: Amar Kamat


Here are few things that will help admins understand whats happening in the cluster
# Events updates
  ## recently added tracker
  ## lost trackers
  ## recently submitted jobs
  ## user updates
  ## killed/failed attempts/tasks 
  ## killed jobs and the reason
  ## recent exceptions like oom etc
  ## expired tasks
  ## recovery manager updates
  ## memory/cpu usage
  ## black listing of tracker 
  ## killing of maps based on fetch failures
  ## info about why some jobs was rejected(acls, max tasks)/failed(failures)/killed (user)
  ## etc
# Status :
  ## tracker health and status
  ## User status
    ### num jobs submitted
    ### total time the cluster was used
    ### success/failed/killed history
  ## job status
     ### task completion events
     ### recently scheduled tasks
     ### progress
     ### killed/failed/success history
  ## space on the box where the jt is running
  ## etc
# Config :
  ## slot info
  ## acl info
  ## etc

----
Graphical views and auto updation would be cool. Raising alarms upon certain events would be super cool.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.