You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hive.apache.org by "Aron Hamvas (Jira)" <ji...@apache.org> on 2020/02/14 14:55:00 UTC

[jira] [Resolved] (HIVE-22639) Bucket file name does not match bucket id after query based major compaction

     [ https://issues.apache.org/jira/browse/HIVE-22639?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Aron Hamvas resolved HIVE-22639.
--------------------------------
    Target Version/s:   (was: 4.0.0)
          Resolution: Won't Do

This was resolved when minor crud compaction was delivered by [~lpinter].

> Bucket file name does not match bucket id after query based major compaction
> ----------------------------------------------------------------------------
>
>                 Key: HIVE-22639
>                 URL: https://issues.apache.org/jira/browse/HIVE-22639
>             Project: Hive
>          Issue Type: Bug
>          Components: Hive
>    Affects Versions: 3.0.0, 3.1.0
>            Reporter: Aron Hamvas
>            Assignee: Aron Hamvas
>            Priority: Major
>         Attachments: HIVE-22639.1.patch, HIVE-22639.2.patch, HIVE-22639.patch
>
>
> While debugging 
> {{TestCrudCompactorOnTez#testCompactionWithSchemaEvolutionAndBuckets()}}, it has come to my attention, that even though before compaction, the file name of the single bucket in the delta directories is {{bucket_00001}}, in the new base, the name of the new single bucket file is {{bucket_00000}}. At the same time, the bucket value in the ROW__ID of the records remain the same and suggest that the bucket id is 1. 
> So the bucket id and the file name do not match. This could lead to problems.
> The test itself does not reveal this issue, although I think that the tests should check this, too. At the same time, the tests assume the exact bucket id value in cases where it cannot be predicted and fail, even though the bucket it does not change after the compaction, so the check should really pass.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)