Posted to dev@pig.apache.org by "Rohini Palaniswamy (JIRA)" <ji...@apache.org> on 2016/04/12 01:14:25 UTC

[jira] [Updated] (PIG-4844) Tez AM runs out of memory when vertex has high number of outputs

     [ https://issues.apache.org/jira/browse/PIG-4844?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Rohini Palaniswamy updated PIG-4844:
------------------------------------
    Attachment: PIG-4844-2.patch

The attached patch has the pigContext change removed. I will post that change as part of PIG-4866.

> Tez AM runs out of memory when vertex has high number of outputs
> ----------------------------------------------------------------
>
>                 Key: PIG-4844
>                 URL: https://issues.apache.org/jira/browse/PIG-4844
>             Project: Pig
>          Issue Type: Improvement
>            Reporter: Rohini Palaniswamy
>            Assignee: Rohini Palaniswamy
>             Fix For: 0.16.0
>
>         Attachments: PIG-4844-1-nowhitespacechanges.patch, PIG-4844-1.patch, PIG-4844-2.patch
>
>
>   The AM runs out of memory when responding to getTask() calls from containers for a vertex with a large number of outputs (usually the case with multi-query when you group by multiple dimensions). The problem is the size of the payload config associated with PigProcessor, Input and Output. When there are more than 10 outputs, the size of the payload increases considerably, causing memory pressure.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)