You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Sahil Takiar (JIRA)" <ji...@apache.org> on 2017/09/28 19:10:00 UTC

[jira] [Created] (HIVE-17638) SparkDynamicPartitionPruner loads all partition metadata into memory

Sahil Takiar created HIVE-17638:
-----------------------------------

             Summary: SparkDynamicPartitionPruner loads all partition metadata into memory
                 Key: HIVE-17638
                 URL: https://issues.apache.org/jira/browse/HIVE-17638
             Project: Hive
          Issue Type: Sub-task
          Components: Spark
            Reporter: Sahil Takiar


The {{SparkDynamicPartitionPruner}} first loads the contents of each partition pruning file into memory, and then prunes all the partitions from the {{MapWork}}. This can cause increased memory pressure on the HoS Remote Driver because it requires loading all the partition metadata into memory. It would be more efficient if pruning of partitions was done while scanning the files, so that all the partition metadata doesn't need to be buffered in memory.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)