Posted to common-dev@hadoop.apache.org by "Todd Lipcon (JIRA)" <ji...@apache.org> on 2010/02/17 07:28:27 UTC

[jira] Created: (HADOOP-6571) /stacks servlet no longer has correct content-type

/stacks servlet no longer has correct content-type
--------------------------------------------------

                 Key: HADOOP-6571
                 URL: https://issues.apache.org/jira/browse/HADOOP-6571
             Project: Hadoop Common
          Issue Type: Bug
    Affects Versions: 0.21.0, 0.22.0
            Reporter: Todd Lipcon


The /stacks servlet output is displayed completely incorrectly because it is served with the wrong content type. It should be text/plain, but is text/html instead.
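
To illustrate why the content type matters, here is a minimal sketch in Python. It is not Hadoop's servlet code; the names `dump_stacks`, `stacks_response`, and `StacksHandler` are hypothetical. It serves a thread dump and labels it text/plain so that a browser keeps the line breaks and indentation. Labeled text/html, the same bytes would be reflowed, with whitespace collapsed.

```python
import sys
import traceback
from http.server import BaseHTTPRequestHandler


def dump_stacks() -> str:
    """Return a plain-text dump of the stack of every live thread."""
    lines = []
    for thread_id, frame in sys._current_frames().items():
        lines.append(f"Thread {thread_id}:")
        lines.extend(entry.rstrip("\n") for entry in traceback.format_stack(frame))
        lines.append("")
    return "\n".join(lines)


def stacks_response():
    """Build (status, headers, body) for a /stacks-style endpoint."""
    body = dump_stacks().encode("utf-8")
    headers = {
        # text/plain tells the browser to show the bytes as-is.
        "Content-Type": "text/plain; charset=utf-8",
        "Content-Length": str(len(body)),
    }
    return 200, headers, body


class StacksHandler(BaseHTTPRequestHandler):
    """Hypothetical handler that answers every GET with the thread dump."""

    def do_GET(self):
        status, headers, body = stacks_response()
        self.send_response(status)
        for name, value in headers.items():
            self.send_header(name, value)
        self.end_headers()
        self.wfile.write(body)
```

To try it locally, one could run `HTTPServer(("127.0.0.1", 8000), StacksHandler).serve_forever()` from `http.server` and open the page in a browser; the port is an arbitrary choice.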

Re: Question on job scheduling

Posted by arun kumar <ar...@yahoo.com>.

Thanks for the reply. I am currently trying to move all the blocks, and their replicas, of the input files only to a specified location. That is, just before job start-up, I check the input files' locations and move their blocks and replicas to the desired/highly efficient data nodes, thereby making sure that only these nodes execute the job (I am assuming this because I believe each block will be operated upon by the nearest available map process only).
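
The assumption in parentheses can be checked against a toy model of locality-preferring assignment. This is a sketch only, not the JobTracker's code: the function names, node names, and block IDs are hypothetical, and it assumes a scheduler that prefers a local block but still hands a non-local block to any node that asks for work.

```python
from typing import Dict, List, Optional, Set, Tuple


def pick_block_for_node(node: str,
                        pending: List[str],
                        replicas: Dict[str, Set[str]]) -> Optional[str]:
    """Pick the next block for a node that has a free map slot.

    Prefer a block with a replica on `node`; otherwise fall back to the
    first pending block, which the node would read over the network.
    """
    for block in pending:
        if node in replicas[block]:
            return block
    return pending[0] if pending else None


def simulate(nodes_asking_for_work: List[str],
             replicas: Dict[str, Set[str]]) -> Dict[str, Tuple[str, bool]]:
    """Map each block to (node that runs it, whether the read is local)."""
    pending = sorted(replicas)
    assignment: Dict[str, Tuple[str, bool]] = {}
    for node in nodes_asking_for_work:
        block = pick_block_for_node(node, pending, replicas)
        if block is None:
            break
        pending.remove(block)
        assignment[block] = (node, node in replicas[block])
    return assignment
```

With every replica placed on two hypothetical fast nodes, `simulate(["fast1", "slow1", "fast2"], {"blk_1": {"fast1", "fast2"}, "blk_2": {"fast1", "fast2"}, "blk_3": {"fast1", "fast2"}})` still assigns `blk_2` to `slow1` as a non-local read. Under this model, moving the blocks biases where tasks run but does not by itself restrict the job to the chosen nodes.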

In your reply you mentioned that some of the work should be initiated from the client. Is it the JobClient class you are referring to?

Thanks,
Arun

--- On Fri, 2/19/10, Wang Xu <gn...@gmail.com> wrote:

From: Wang Xu <gn...@gmail.com>
Subject: Re: Question on job scheduling
To: common-dev@hadoop.apache.org
Date: Friday, February 19, 2010, 7:25 AM

Re: Question on job scheduling

Posted by Wang Xu <gn...@gmail.com>.

On Thu, Feb 18, 2010 at 12:00 AM, arun kumar <ar...@yahoo.com> wrote:
> My questions are:
> 1. Will such a change improve the performance? Considering the overhead caused by moving the data blocks.

In some special cases it might improve performance, but that depends
on your application.

> 2. I believe I will have to start from the NameNode to move the blocks. If anyone can give me a brief explanation on the process to implement this or even sources to find information on this it would be very helpful.

I think some of the work might be initiated from the client. Could you
describe what you want to do in more detail?
 1. Do you want to specify which datanodes store specific blocks, or do
you only want some blocks to be located together?
 2. Do you want to specify the location of all the replicas of a block,
or only of one of the replicas?

-- 
Wang Xu
Stephen Leacock  - "I detest life-insurance agents: they always argue
that I shall some day die, which is not so." -
http://www.brainyquote.com/quotes/authors/s/stephen_leacock.html

Question on job scheduling

Posted by arun kumar <ar...@yahoo.com>.

All,

I am a newbie trying to learn the Hadoop internals, and I have a few questions about something I am trying to implement. I am still learning, so these questions may seem silly or may even be entirely wrong; any guidance is highly appreciated.

I am working on release 0.18.3 and trying to do some job scheduling. That is, instead of placing the data blocks on specific/desired nodes when the input files are copied into HDFS, I am trying to move the blocks from their original locations to these desired/highly efficient nodes just before job submission.

My questions are:
1. Will such a change improve performance, considering the overhead caused by moving the data blocks?
2. I believe I will have to start from the NameNode to move the blocks. If anyone can give me a brief explanation of the process for implementing this, or even point me to sources of information on it, that would be very helpful.
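
Question 1 can be framed as a back-of-the-envelope break-even check. The sketch below is only a rough model under stated assumptions: every replica of every input block is copied once, the copies share one aggregate bandwidth figure, and the job run times before and after the move are known estimates. All function names and numbers are hypothetical.

```python
def move_time_seconds(input_bytes: int,
                      replication: int,
                      bandwidth_bytes_per_s: float) -> float:
    """Time to copy every replica of the input once at the given bandwidth."""
    return input_bytes * replication / bandwidth_bytes_per_s


def worth_moving(input_bytes: int,
                 replication: int,
                 bandwidth_bytes_per_s: float,
                 job_seconds_before: float,
                 job_seconds_after: float) -> bool:
    """True if moving the blocks and then running the job beats running it in place."""
    overhead = move_time_seconds(input_bytes, replication, bandwidth_bytes_per_s)
    return overhead + job_seconds_after < job_seconds_before
```

For example, 10,000,000,000 bytes of input with replication 3 at 100,000,000 bytes/s costs 300 seconds to move. That pays off if the job drops from 900 to 500 seconds, but not if it only drops to 700 seconds.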

Thanks,
Arun