You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Ning Zhang (JIRA)" <ji...@apache.org> on 2011/03/03 09:54:37 UTC

[jira] Commented: (HIVE-2026) Parallelize UpdateInputAccessTimeHook

    [ https://issues.apache.org/jira/browse/HIVE-2026?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13001923#comment-13001923 ] 

Ning Zhang commented on HIVE-2026:
----------------------------------

review board: https://reviews.apache.org/r/460/

> Parallelize UpdateInputAccessTimeHook
> -------------------------------------
>
>                 Key: HIVE-2026
>                 URL: https://issues.apache.org/jira/browse/HIVE-2026
>             Project: Hive
>          Issue Type: Improvement
>            Reporter: Ning Zhang
>            Assignee: Ning Zhang
>         Attachments: HIVE-2026.patch
>
>
> UpdateInputAccessTimeHook is usually used as a pre-execution hook to update the metastore's lastAccessTime field of input partition/table. If a query touches a large number of partitions, this hooks takes a long time to execute. One approach is to make the hook itself to run in a separate thread. But it is hard to guarantee backward compatibility in semantics in case of exceptions encountered in the hook execution. This task takes another approach to parallelize the hook itself (update multiple partitions concurrently), but execute each pre-hook in sequential order. 

-- 
This message is automatically generated by JIRA.
-
For more information on JIRA, see: http://www.atlassian.com/software/jira