Posted to general@hadoop.apache.org by Tim Hawkins <ti...@bejant.com> on 2009/01/13 01:19:16 UTC
New User Questions
I have recently started working with Hadoop and have a few
questions I would like to put to the community.
1. Is there a remotely accessible interface to the JobTracker, so that I
can create an application that tracks jobs remotely? For example,
something like a REST or XML-RPC interface.
2. Is it possible to attach additional metadata to a job for tracking
purposes? We have a set of tasks that each result in a large number of
jobs, and I would like to tag each job with an overall task ID.
A use case would be a Nutch crawl: our application uses Nutch to
scan domains on demand, each scan is a distinct task, and Nutch spawns
a set of MapReduce jobs for each stage of the crawl.
I would like to be able to determine the completion status of a set of
Nutch crawls through an interface to the JobTracker.