You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hive.apache.org by "Peter Vary (Jira)" <ji...@apache.org> on 2020/01/08 10:44:00 UTC

[jira] [Comment Edited] (HIVE-20948) Eliminate file rename in compactor

    [ https://issues.apache.org/jira/browse/HIVE-20948?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17010561#comment-17010561 ] 

Peter Vary edited comment on HIVE-20948 at 1/8/20 10:43 AM:
------------------------------------------------------------

[~asomani]: Even after fixing HIVE-21164 the CRUD compactor has an extra move step which should be avoided if possible. CC: [~lpinter]


was (Author: pvary):
[~asomani]: Even after fixing HIVE-21164 the CRUD compactor has an extra move step which could be avoided if possible. 

> Eliminate file rename in compactor
> ----------------------------------
>
>                 Key: HIVE-20948
>                 URL: https://issues.apache.org/jira/browse/HIVE-20948
>             Project: Hive
>          Issue Type: Bug
>          Components: Transactions
>    Affects Versions: 4.0.0
>            Reporter: Eugene Koifman
>            Priority: Major
>
> Once HIVE-20823 is committed, we should investigate if it's possible to have compactor write directly to base_x_cZ or delta_x_y_cZ.  
> For query based compaction: can we control location of temp table dir?  We support external temp tables so this may work but we'd need to have non-acid insert create files with {{bucket_xxxxx}} names.
>  
> For MR/Tez/LLAP based (should this be done at all?), need to figure out how retries of tasks will work.  Just like we currently generate an MR job to compact, we should be able to generate a Tez job.
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)