Posted to dev@hive.apache.org by "Jimmy Xiang (JIRA)" <ji...@apache.org> on 2014/12/15 23:03:13 UTC

[jira] [Updated] (HIVE-8843) Release RDD cache when Hive query is done [Spark Branch]

     [ https://issues.apache.org/jira/browse/HIVE-8843?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jimmy Xiang updated HIVE-8843:
------------------------------
    Status: Patch Available  (was: Open)

> Release RDD cache when Hive query is done [Spark Branch]
> --------------------------------------------------------
>
>                 Key: HIVE-8843
>                 URL: https://issues.apache.org/jira/browse/HIVE-8843
>             Project: Hive
>          Issue Type: Sub-task
>          Components: Spark
>            Reporter: Xuefu Zhang
>            Assignee: Jimmy Xiang
>         Attachments: HIVE-8843.1-spark.patch
>
>
> In some multi-insert cases, RDD.cache() is called to improve performance. An RDD is specific to a SparkContext, but the cache is useful only for the duration of the query. Thus, once the query has executed, we need to release the cache by calling RDD.unpersist().
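
The release-on-completion idea described above can be sketched roughly as follows. This is only an illustration and not the code in the attached patch: CachedHandle, QueryCacheTracker, FakeRdd and ReleaseCacheDemo are hypothetical names, and FakeRdd stands in for a real Spark RDD so that the sketch runs without Spark on the classpath. In Spark itself, the RDD method that drops a cached RDD is unpersist().

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-in for a cached Spark RDD; with Spark on the
// classpath the equivalent call would be unpersist() on the RDD itself.
interface CachedHandle {
    void unpersist();
}

// Hypothetical per-query tracker: remembers every handle cached on behalf
// of one query and releases them all when the query finishes.
final class QueryCacheTracker implements AutoCloseable {
    private final List<CachedHandle> cached = new ArrayList<>();

    // Record a handle that was cached for this query and hand it back.
    <T extends CachedHandle> T register(T handle) {
        cached.add(handle);
        return handle;
    }

    // Number of handles still tracked for this query.
    int size() {
        return cached.size();
    }

    // Release everything cached for this query, then forget the handles.
    @Override
    public void close() {
        for (CachedHandle handle : cached) {
            handle.unpersist();
        }
        cached.clear();
    }
}

// Toy handle that only records whether it is still cached.
final class FakeRdd implements CachedHandle {
    final String name;
    boolean isCached = true;

    FakeRdd(String name) {
        this.name = name;
    }

    @Override
    public void unpersist() {
        isCached = false;
    }
}

final class ReleaseCacheDemo {
    public static void main(String[] args) {
        FakeRdd shared = new FakeRdd("shared-input");
        // try-with-resources calls close() even if the query throws.
        try (QueryCacheTracker tracker = new QueryCacheTracker()) {
            tracker.register(shared);
            // ... run the multi-insert query against the cached input ...
        }
        // prints "shared-input cached after query: false"
        System.out.println(shared.name + " cached after query: " + shared.isCached);
    }
}
```

Tying the release to the query's scope, rather than to the SparkContext, means a long-lived context does not accumulate cached data from queries that have already finished.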



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)