You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Mithun Radhakrishnan (JIRA)" <ji...@apache.org> on 2017/10/10 06:13:00 UTC

[jira] [Created] (HIVE-17754) InputJobInfo in Pig UDFContext is heavyweight, and causes OOMs in Tez AMs

Mithun Radhakrishnan created HIVE-17754:
-------------------------------------------

             Summary: InputJobInfo in Pig UDFContext is heavyweight, and causes OOMs in Tez AMs
                 Key: HIVE-17754
                 URL: https://issues.apache.org/jira/browse/HIVE-17754
             Project: Hive
          Issue Type: Bug
          Components: HCatalog
    Affects Versions: 2.2.0, 3.0.0
            Reporter: Mithun Radhakrishnan
            Assignee: Mithun Radhakrishnan


HIVE-9845 dealt with reducing the size of HCat split-info, to improve job-launch times for Pig/HCat jobs.
For large Pig queries that scan a large number of Hive partitions, it was found that the Pig {{UDFContext}} stored full-fat HCat {{InputJobInfo}} objects, thus blowing out the Pig Tez AM. Since this information is already stored in the {{HCatSplit}}, the serialization of {{InputJobInfo}} can be spared.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)