Posted to dev@pig.apache.org by "Cheolsoo Park (JIRA)" <ji...@apache.org> on 2014/06/28 04:24:24 UTC
[jira] [Created] (PIG-4043) JobClient.getMap/ReduceTaskReports() causes OOM for jobs with a large number of tasks
Cheolsoo Park created PIG-4043:
----------------------------------
Summary: JobClient.getMap/ReduceTaskReports() causes OOM for jobs with a large number of tasks
Key: PIG-4043
URL: https://issues.apache.org/jira/browse/PIG-4043
Project: Pig
Issue Type: Bug
Reporter: Cheolsoo Park
Assignee: Cheolsoo Park
Fix For: 0.14.0
With Hadoop 2.4, I often see the Pig client fail with an OOM error when a job has many tasks (~100K) and the client runs with a 1GB heap.
The heap dump (attached) shows that TaskReport[] occupies more than 90% of the heap space at the time of the OOM.
The problem is that JobClient.getMap/ReduceTaskReports() returns an array of TaskReport objects, which can be huge if the number of tasks is large.
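As a rough sanity check on the numbers above, the sketch below estimates how much heap each TaskReport would have to retain on average for the arrays to fill 90% of a 1GB heap at ~100K tasks. It is back-of-envelope arithmetic only: the class and method names are hypothetical (not part of Pig or Hadoop), 1GB is taken as 2^30 bytes, the task count is taken as exactly 100,000, and all reports are assumed to be held in memory at once.

```java
// Hypothetical helper, not part of Pig or Hadoop: back-of-envelope arithmetic only.
public class TaskReportHeapEstimate {

    // Average bytes retained per TaskReport, given the heap size, the fraction
    // of the heap held by TaskReport[] and the number of reports in memory.
    static long bytesPerReport(long heapBytes, double heapFraction, long reportCount) {
        if (reportCount <= 0) {
            throw new IllegalArgumentException("reportCount must be positive");
        }
        return (long) (heapBytes * heapFraction / reportCount);
    }

    // Heap (in MiB) needed to hold reportCount reports at the given average size.
    static long heapMiBFor(long reportCount, long bytesPerReport) {
        return reportCount * bytesPerReport / (1024L * 1024L);
    }

    public static void main(String[] args) {
        long heapBytes = 1L << 30;     // assumption: "1GB heap" means 2^30 bytes
        double heapFraction = 0.90;    // from the heap dump: more than 90%
        long reportCount = 100_000L;   // assumption: "~100K" tasks, all held at once

        long perReport = bytesPerReport(heapBytes, heapFraction, reportCount);
        System.out.println("approx. bytes per TaskReport: " + perReport);
        System.out.println("approx. MiB for 200K reports: "
                + heapMiBFor(200_000L, perReport));
    }
}
```

Under these assumptions each report retains on the order of several kilobytes, so the memory needed by the returned arrays grows linearly with the task count and a fixed client heap is eventually exhausted.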