You are viewing a plain text version of this content. The canonical link for it is here.
Posted to common-dev@hadoop.apache.org by "Sreekanth Ramakrishnan (JIRA)" <ji...@apache.org> on 2008/09/10 13:12:44 UTC

[jira] Updated: (HADOOP-3930) Decide how to integrate scheduler info into CLI and job tracker web page

     [ https://issues.apache.org/jira/browse/HADOOP-3930?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Sreekanth Ramakrishnan updated HADOOP-3930:
-------------------------------------------

    Attachment: HADOOP-3930-2.patch

Attaching a patch with following changes according to Owens and Hemanth's comments:
Added following method to _JobSubmissionProtocol_

{code}
public JobQueueInfo getJobQueueInfo(String queue);
public JobQueueInfo[] getJobQueueInfos();
public JobStatus[] getAllJobs(String queue);
{code}

Added a new method to _TaskScheduler_

{code}
public abstract Collection<JobInProgress> getJobs(String queueName);
{code}

Added a new class to encapsulate the Scheduling information related to Job Queues :: _JobQueueInfo_

Added new jsp page to display queue details and list of jobs held by the queue along with the Queue Scheduling Information:  _jobqueue_details.jsp_

Refactored Job Table generation into a new class in _org.apache.hadoop.mapred.JSPUtil_

Added new command line options in the _JobClient.java_

Currently the patch has no test case attached alongwith it. Would be attaching them soon.


> Decide how to integrate scheduler info into CLI and job tracker web page
> ------------------------------------------------------------------------
>
>                 Key: HADOOP-3930
>                 URL: https://issues.apache.org/jira/browse/HADOOP-3930
>             Project: Hadoop Core
>          Issue Type: Improvement
>          Components: mapred
>    Affects Versions: 0.19.0
>            Reporter: Matei Zaharia
>            Assignee: Sreekanth Ramakrishnan
>         Attachments: 3930-1.patch, HADOOP-3930-2.patch, mockup.JPG
>
>
> We need a way for job schedulers such as HADOOP-3445 and HADOOP-3476 to provide info to display on the JobTracker web interface and in the CLI. The main things needed seem to be:
> * A way for schedulers to provide info to show in a column on the web UI and in the CLI - something as simple as a single string, or a map<string, int> for multiple parameters.
> * Some sorting order for jobs - maybe a method to sort a list of jobs.
> Let's figure out what the best way to do this is and implement it in the existing schedulers.
> My first-order proposal at an API: Augment the TaskScheduler with
> * public Map<String, String> getSchedulingInfo(JobInProgress job) -- returns key-value pairs which are displayed in columns on the web UI or the CLI for the list of jobs.
> * public Map<String, String> getSchedulingInfo(String queue) -- returns key-value pairs which are displayed in columns on the web UI or the CLI for the list of queues.
> * public Collection<JobInProgress> getJobs(String queueName) -- returns the list of jobs in a given queue, sorted by a scheduler-specific order (the order it wants to run them in / schedule the next task in / etc).
> * public List<String> getQueues();

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.