Posted to issues@hive.apache.org by "Vaibhav Gumashta (JIRA)" <ji...@apache.org> on 2019/07/10 09:26:00 UTC

[jira] [Commented] (HIVE-21164) ACID: explore how we can avoid a move step during inserts/compaction

    [ https://issues.apache.org/jira/browse/HIVE-21164?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16881886#comment-16881886 ] 

Vaibhav Gumashta commented on HIVE-21164:
-----------------------------------------

[~gopalv] I think this is ready for review now. The test run is still pending, but I've fixed a bunch of issues from the previous run and verified the fixes locally. The union and multi-file spray (more buckets than reducers) cases might need another look, but I'll wait for the test run and go from there. The Review Board request is linked to the JIRA. Thanks!

> ACID: explore how we can avoid a move step during inserts/compaction
> --------------------------------------------------------------------
>
>                 Key: HIVE-21164
>                 URL: https://issues.apache.org/jira/browse/HIVE-21164
>             Project: Hive
>          Issue Type: Bug
>          Components: Transactions
>    Affects Versions: 3.1.1
>            Reporter: Vaibhav Gumashta
>            Assignee: Vaibhav Gumashta
>            Priority: Major
>         Attachments: HIVE-21164.1.patch, HIVE-21164.2.patch, HIVE-21164.3.patch, HIVE-21164.4.patch, HIVE-21164.5.patch, HIVE-21164.6.patch
>
>
> Currently, we write compacted data to a temporary location and then move the files to their final location, which is an expensive operation on some cloud file systems. Now that HIVE-20823 is in, it can control the visibility of compacted data to readers. We can therefore perhaps avoid the temporary location and write compacted data directly to the intended final path.
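
The difference between the two approaches in the description can be sketched roughly as follows. This is a toy illustration on a local file system, not Hive code: the function names, the `_tmp` directory, the `base_N` file names, and the in-memory `visible` set are all hypothetical. The `visible` set merely stands in for whatever reader-side visibility control HIVE-20823 provides.

```python
import os
import tempfile


def compact_with_move(table_dir, name, rows):
    """Write-then-move: stage output in a temporary directory, then rename it."""
    tmp_dir = os.path.join(table_dir, "_tmp")  # hypothetical staging location
    os.makedirs(tmp_dir, exist_ok=True)
    tmp_path = os.path.join(tmp_dir, name)
    with open(tmp_path, "w") as f:
        f.writelines(row + "\n" for row in rows)
    final_path = os.path.join(table_dir, name)
    # Cheap on a local disk; on some cloud stores a move can be expensive.
    os.rename(tmp_path, final_path)
    return final_path


def compact_direct(table_dir, name, rows, visible):
    """Direct write: output goes straight to the final path, and readers
    only see it once it has been marked visible."""
    final_path = os.path.join(table_dir, name)
    with open(final_path, "w") as f:
        f.writelines(row + "\n" for row in rows)
    visible.add(name)  # assumption: stand-in for the real visibility control
    return final_path


def readable_files(table_dir, visible):
    """Readers ignore anything that has not been marked visible."""
    return sorted(n for n in os.listdir(table_dir) if n in visible)


if __name__ == "__main__":
    table_dir = tempfile.mkdtemp()
    visible = set()
    compact_with_move(table_dir, "base_1", ["a", "b"])
    compact_direct(table_dir, "base_2", ["c"], visible)
    print(readable_files(table_dir, visible))
```

In the direct-write variant no move is needed. Correctness instead depends on readers consistently honoring the visibility check while the file is still being written.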



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)