Posted to issues@hive.apache.org by "Sercan Tekin (Jira)" <ji...@apache.org> on 2023/05/04 13:12:00 UTC
[jira] [Commented] (HIVE-27317) Temporary (local) session files cleanup improvements
[ https://issues.apache.org/jira/browse/HIVE-27317?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17719319#comment-17719319 ]
Sercan Tekin commented on HIVE-27317:
-------------------------------------
Created a patch for the master branch.
With the patch, when *ClearDanglingScratchDir* identifies dangling sessions, it also cleans up their files/dirs in *HiveConf.ConfVars.LOCALSCRATCHDIR*.
Added a unit test as well.
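For readers who want the idea in concrete form, here is a minimal sketch of the local cleanup step. It is not the code in HIVE-27317.patch. It assumes the dangling session IDs have already been determined elsewhere and that every local entry belonging to a session has a name starting with that session ID, as in the listing in the issue description. The function name clean_local_scratch and its parameters are hypothetical.

```python
import shutil
from pathlib import Path


def clean_local_scratch(local_scratch_dir, dangling_session_ids):
    """Remove local entries whose names start with a dangling session ID.

    Hypothetical helper: the session IDs are assumed to come from a separate
    dangling-session check. Returns the sorted names of the removed entries.
    """
    root = Path(local_scratch_dir)
    # Ignore empty IDs, which would otherwise match every entry.
    ids = [sid for sid in dangling_session_ids if sid]
    removed = []
    if not root.is_dir():
        return removed
    for entry in root.iterdir():
        # Matches the session directory "<id>" and files such as "<id>123.pipeout".
        if any(entry.name.startswith(sid) for sid in ids):
            if entry.is_dir() and not entry.is_symlink():
                shutil.rmtree(entry)
            else:
                entry.unlink()
            removed.append(entry.name)
    return sorted(removed)
```

Calling clean_local_scratch("/tmp/user", ["97c4ef50-5e80-480e-a6f0-4f779050852b"]) would remove the session directory and the matching .pipeout files while leaving entries of other sessions untouched.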
> Temporary (local) session files cleanup improvements
> ----------------------------------------------------
>
> Key: HIVE-27317
> URL: https://issues.apache.org/jira/browse/HIVE-27317
> Project: Hive
> Issue Type: Improvement
> Reporter: Sercan Tekin
> Assignee: Sercan Tekin
> Priority: Major
> Attachments: HIVE-27317.patch
>
>
> When a Hive session is killed, the shutdown hook has no chance to clean up temporary files.
> A Hive service for cleaning residual files was added in https://issues.apache.org/jira/browse/HIVE-13429, and its execution was later scheduled inside HS2 in https://issues.apache.org/jira/browse/HIVE-15068 so that no temp files are left behind. However, this service cleans up only HDFS temp files; residual files/dirs still remain in the *HiveConf.ConfVars.LOCALSCRATCHDIR* location, as follows:
> {code:java}
> > ll /tmp/user/97c4ef50-5e80-480e-a6f0-4f779050852b*
> drwx------ 2 user user 4096 Oct 29 10:09 97c4ef50-5e80-480e-a6f0-4f779050852b
> -rw------- 1 user user 0 Oct 29 10:09 97c4ef50-5e80-480e-a6f0-4f779050852b10571819313894728966.pipeout
> -rw------- 1 user user 0 Oct 29 10:09 97c4ef50-5e80-480e-a6f0-4f779050852b16013956055489853961.pipeout
> -rw------- 1 user user 0 Oct 29 10:09 97c4ef50-5e80-480e-a6f0-4f779050852b4383913570068173450.pipeout
> -rw------- 1 user user 0 Oct 29 10:09 97c4ef50-5e80-480e-a6f0-4f779050852b889740171428672108.pipeout {code}
--
This message was sent by Atlassian Jira
(v8.20.10#820010)