You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pig.apache.org by "Rohini Palaniswamy (JIRA)" <ji...@apache.org> on 2016/04/12 01:14:25 UTC
[jira] [Updated] (PIG-4844) Tez AM runs out of memory when vertex
has high number of outputs
[ https://issues.apache.org/jira/browse/PIG-4844?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Rohini Palaniswamy updated PIG-4844:
------------------------------------
Attachment: PIG-4844-2.patch
Attached patch has the pigContext change removed. Will post that as part of PIG-4866.
> Tez AM runs out of memory when vertex has high number of outputs
> ----------------------------------------------------------------
>
> Key: PIG-4844
> URL: https://issues.apache.org/jira/browse/PIG-4844
> Project: Pig
> Issue Type: Improvement
> Reporter: Rohini Palaniswamy
> Assignee: Rohini Palaniswamy
> Fix For: 0.16.0
>
> Attachments: PIG-4844-1-nowhitespacechanges.patch, PIG-4844-1.patch, PIG-4844-2.patch
>
>
> AM runs out of memory when trying to respond to getTask() calls from container for a vertex with large number of outputs (usually the case with multi-query when you group by on multiple dimensions). Problem is with the size of payload config associated with PigProcessor, Input and Output. When there is >10 outputs size of the payload considerably increases causing memory pressure.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)