You are viewing a plain text version of this content. The canonical link for it is here.

Posted to mapreduce-issues@hadoop.apache.org by "Andrew Johnson (JIRA)" <ji...@apache.org> on 2015/03/06 14:31:39 UTC

[jira] [Commented] (MAPREDUCE-6222) HistoryServer Hangs Processing Large Jobs

    [ https://issues.apache.org/jira/browse/MAPREDUCE-6222?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14350337#comment-14350337 ] 

Andrew Johnson commented on MAPREDUCE-6222:
-------------------------------------------

One thought I had regarding #1 was using Avro binary serialization instead of Avro-JSON.

I like the pagination idea and I'd definitely be interested in trying that out.

> HistoryServer Hangs Processing Large Jobs
> -----------------------------------------
>
>                 Key: MAPREDUCE-6222
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-6222
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>            Reporter: Andrew Johnson
>         Attachments: JHS New Display Top.png, JHS Original Display Top.png, MAPREDUCE-6222.001.patch, head.jhist, historyserver_jstack.txt
>
>
> I'm encountering an issue with the Mapreduce HistoryServer processing the history files for large jobs.  This has come up several times with for jobs with around 60000 total tasks.  When the HistoryServer loads the .jhist file from HDFS for a job of that size (which is usually around 500 Mb), the HistoryServer's CPU usage spiked and the UI became unresponsive.  After about 10 minutes I restarted the HistoryServer and it was behaving normally again.
> The cluster is running CDH 5.3 (2.5.0-cdh5.3.0).  I've attached the output of jstack from a time this was occurring.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)