Posted to issues@hive.apache.org by "Sercan Tekin (Jira)" <ji...@apache.org> on 2023/05/04 13:12:00 UTC

[jira] [Commented] (HIVE-27317) Temporary (local) session files cleanup improvements

    [ https://issues.apache.org/jira/browse/HIVE-27317?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17719319#comment-17719319 ] 

Sercan Tekin commented on HIVE-27317:
-------------------------------------

Created a patch for the master branch.

With the patch, when *ClearDanglingScratchDir* identifies dangling sessions, the corresponding files/dirs in *HiveConf.ConfVars.LOCALSCRATCHDIR* are cleaned up as well.

Added a unit test as well.
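
For illustration only, here is a minimal sketch of the kind of local cleanup described above. It is not the code from the attached patch. It assumes that local leftovers of a session are the entries directly under the local scratch directory whose names start with the session ID (as in the listing quoted below), and that the set of dangling session IDs has already been determined elsewhere. The class name LocalScratchCleanupSketch and the method cleanLocalScratch are hypothetical, and the scratch location is passed in as a plain path rather than read from HiveConf.

```java
import java.io.IOException;
import java.nio.file.DirectoryStream;
import java.nio.file.FileVisitResult;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.SimpleFileVisitor;
import java.nio.file.attribute.BasicFileAttributes;
import java.util.ArrayList;
import java.util.List;
import java.util.Set;

// Hypothetical helper, not part of Hive: removes local leftovers of dangling sessions.
public class LocalScratchCleanupSketch {

    /**
     * Deletes every entry directly under localScratchDir whose name starts with
     * one of the given session IDs (assumption: the session directory and its
     * *.pipeout files share the session ID as a name prefix).
     * Returns the top-level paths that were removed.
     */
    public static List<Path> cleanLocalScratch(Path localScratchDir,
                                               Set<String> danglingSessionIds) throws IOException {
        List<Path> removed = new ArrayList<>();
        if (!Files.isDirectory(localScratchDir)) {
            return removed; // nothing to clean
        }

        // Collect matches first so nothing is deleted while the directory is being iterated.
        List<Path> candidates = new ArrayList<>();
        try (DirectoryStream<Path> entries = Files.newDirectoryStream(localScratchDir)) {
            for (Path entry : entries) {
                String name = entry.getFileName().toString();
                for (String sessionId : danglingSessionIds) {
                    if (name.startsWith(sessionId)) {
                        candidates.add(entry);
                        break;
                    }
                }
            }
        }

        for (Path candidate : candidates) {
            deleteRecursively(candidate);
            removed.add(candidate);
        }
        return removed;
    }

    // Deletes a file, or a directory together with everything below it.
    private static void deleteRecursively(Path root) throws IOException {
        Files.walkFileTree(root, new SimpleFileVisitor<Path>() {
            @Override
            public FileVisitResult visitFile(Path file, BasicFileAttributes attrs) throws IOException {
                Files.delete(file);
                return FileVisitResult.CONTINUE;
            }

            @Override
            public FileVisitResult postVisitDirectory(Path dir, IOException exc) throws IOException {
                if (exc != null) {
                    throw exc;
                }
                Files.delete(dir); // directory is empty at this point
                return FileVisitResult.CONTINUE;
            }
        });
    }
}
```

Matching on the full session ID as a prefix keeps entries that belong to other sessions untouched. A real implementation would presumably also need to decide how to handle entries that cannot be deleted, which this sketch simply surfaces as an IOException.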

> Temporary (local) session files cleanup improvements
> ----------------------------------------------------
>
>                 Key: HIVE-27317
>                 URL: https://issues.apache.org/jira/browse/HIVE-27317
>             Project: Hive
>          Issue Type: Improvement
>            Reporter: Sercan Tekin
>            Assignee: Sercan Tekin
>            Priority: Major
>         Attachments: HIVE-27317.patch
>
>
> When a Hive session is killed, the shutdown hook has no chance to clean up tmp files.
> There is a Hive service that cleans up residual files (https://issues.apache.org/jira/browse/HIVE-13429), and its execution was later scheduled inside HS2 (https://issues.apache.org/jira/browse/HIVE-15068) to make sure no temp files are left behind. However, this service cleans up only HDFS temp files; residual files/dirs remain in the *HiveConf.ConfVars.LOCALSCRATCHDIR* location, as follows:
> {code:java}
> > ll /tmp/user/97c4ef50-5e80-480e-a6f0-4f779050852b*
> drwx------ 2 user user 4096 Oct 29 10:09 97c4ef50-5e80-480e-a6f0-4f779050852b
> -rw------- 1 user user    0 Oct 29 10:09 97c4ef50-5e80-480e-a6f0-4f779050852b10571819313894728966.pipeout
> -rw------- 1 user user    0 Oct 29 10:09 97c4ef50-5e80-480e-a6f0-4f779050852b16013956055489853961.pipeout
> -rw------- 1 user user    0 Oct 29 10:09 97c4ef50-5e80-480e-a6f0-4f779050852b4383913570068173450.pipeout
> -rw------- 1 user user    0 Oct 29 10:09 97c4ef50-5e80-480e-a6f0-4f779050852b889740171428672108.pipeout {code}



--
This message was sent by Atlassian Jira
(v8.20.10#820010)