Posted to dev@hive.apache.org by "Lefty Leverenz (JIRA)" <ji...@apache.org> on 2014/08/09 21:33:12 UTC

[jira] [Commented] (HIVE-4248) Implement a memory manager for ORC

    [ https://issues.apache.org/jira/browse/HIVE-4248?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14091872#comment-14091872 ] 

Lefty Leverenz commented on HIVE-4248:
--------------------------------------

This issue added the configuration parameter *hive.exec.orc.memory.pool* to HiveConf.java in release 0.11.0. It is documented in the wiki here:

* [Configuration Properties -- hive.exec.orc.memory.pool | https://cwiki.apache.org/confluence/display/Hive/Configuration+Properties#ConfigurationProperties-hive.exec.orc.memory.pool] 

> Implement a memory manager for ORC
> ----------------------------------
>
>                 Key: HIVE-4248
>                 URL: https://issues.apache.org/jira/browse/HIVE-4248
>             Project: Hive
>          Issue Type: New Feature
>          Components: Serializers/Deserializers
>            Reporter: Owen O'Malley
>            Assignee: Owen O'Malley
>             Fix For: 0.11.0
>
>         Attachments: HIVE-4248.D9993.1.patch, HIVE-4248.D9993.2.patch, HIVE-4248.D9993.4.patch
>
>
> With the large default stripe size (256MB) and dynamic partitions, it is quite easy for users to run out of memory when writing ORC files. We probably need a solution that keeps track of the total number of concurrent ORC writers and divides the available heap space between them. 
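
For illustration only, the idea in the description above (track the concurrent ORC writers and divide a bounded share of the heap between them) could be sketched roughly as follows. This is not the code from the attached patches: the class name MemoryPoolSketch and its methods are hypothetical, and the sketch assumes that the pool is a fixed fraction of the maximum heap (the role played by hive.exec.orc.memory.pool) and that each writer's requested stripe buffer is scaled down proportionally once the combined requests exceed the pool.

```java
import java.util.HashMap;
import java.util.Map;

/**
 * Hypothetical sketch of a shared memory pool for concurrent writers.
 * Names and structure are assumptions for illustration, not Hive's actual code.
 */
public class MemoryPoolSketch {
    private final long poolBytes;                       // heap share reserved for all writers
    private final Map<String, Long> requestedByWriter = new HashMap<>();
    private long totalRequested = 0;                    // sum of all registered requests

    /** poolFraction stands in for a setting such as hive.exec.orc.memory.pool. */
    public MemoryPoolSketch(long maxHeapBytes, double poolFraction) {
        this.poolBytes = (long) (maxHeapBytes * poolFraction);
    }

    /** Registers a writer (keyed by output path) and the stripe buffer it would like. */
    public synchronized void addWriter(String path, long requestedBytes) {
        Long previous = requestedByWriter.put(path, requestedBytes);
        totalRequested += requestedBytes - (previous == null ? 0L : previous);
    }

    /** Unregisters a writer, e.g. when its file is closed. */
    public synchronized void removeWriter(String path) {
        Long previous = requestedByWriter.remove(path);
        if (previous != null) {
            totalRequested -= previous;
        }
    }

    /** Returns 1.0 while the requests fit in the pool, otherwise pool / total requested. */
    public synchronized double allocationScale() {
        if (totalRequested <= poolBytes) {
            return 1.0;
        }
        return (double) poolBytes / totalRequested;
    }

    /** Returns the scaled-down buffer budget for one registered writer. */
    public synchronized long budgetFor(String path) {
        Long requested = requestedByWriter.get(path);
        if (requested == null) {
            throw new IllegalArgumentException("Unknown writer: " + path);
        }
        return (long) (requested * allocationScale());
    }

    public static void main(String[] args) {
        long stripe = 256L * 1024 * 1024;               // the 256MB default stripe size
        MemoryPoolSketch pool =
            new MemoryPoolSketch(Runtime.getRuntime().maxMemory(), 0.5);
        // Simulate dynamic partitioning opening several writers at once.
        for (int i = 0; i < 8; i++) {
            pool.addWriter("partition-" + i, stripe);
        }
        System.out.println("scale = " + pool.allocationScale());
        System.out.println("per-writer budget = " + pool.budgetFor("partition-0"));
    }
}
```

With a scheme like this, opening more writers shrinks each writer's budget instead of growing total memory use, so each writer would need to flush its stripe earlier when its budget drops.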



--
This message was sent by Atlassian JIRA
(v6.2#6252)