You are viewing a plain text version of this content. The canonical link for it is here.
Posted to common-user@hadoop.apache.org by Camilo Arango <ca...@gmail.com> on 2007/12/11 05:08:21 UTC

Hadoop performance patch

Hi,
I saw this interesting patch in Jira that improves performance:

http://issues.apache.org/jira/browse/HADOOP-249

does the latest version of hadoop already include this fix?

Thanks,

Camilo A.

Re: Hadoop performance patch

Posted by Johan Oskarsson <jo...@oskarsson.nu>.
I believe the last comment on that issue by Owen O'Malley is still valid:

"Most of these things except for the jvm reuse have been done in other 
patches. I've only left this bug open because of the jvm reuse issues, 
but the patch is currently useless because the code has changed so much 
in the last year. In fact, it would be relatively tricky to get right 
with the capturing of stdout/stderr from the tasks."

Personally I'd like to see the option of the tasktracker starting one 
jvm per job and run multiple tasks in that instead of one per task. But 
as Owen says, it would be a pretty big change.

/Johan

Camilo Arango wrote:
> Hi,
> I saw this interesting patch in Jira that improves performance:
> 
> http://issues.apache.org/jira/browse/HADOOP-249
> 
> does the latest version of hadoop already include this fix?
> 
> Thanks,
> 
> Camilo A.
>