Posted to dev@hive.apache.org by "Eugene Koifman (JIRA)" <ji...@apache.org> on 2017/05/19 20:59:04 UTC

[jira] [Created] (HIVE-16722) Converting non-acid table to acid should perform validation

Eugene Koifman created HIVE-16722:
-------------------------------------

             Summary: Converting non-acid table to acid should perform validation
                 Key: HIVE-16722
                 URL: https://issues.apache.org/jira/browse/HIVE-16722
             Project: Hive
          Issue Type: Sub-task
    Affects Versions: 1.0.0
            Reporter: Eugene Koifman


Converting a non-acid table to acid only performs metadata validation (in _TransactionalValidationListener_).
The data read code path only understands certain directory layouts and file names, and generally ignores files that don't match the expected format.

In Hive, directory layout and bucket file naming are poorly enforced, especially in older releases.

Need to add a validation step on 
{noformat}
alter table T SET TBLPROPERTIES ('transactional'='true')
{noformat}
to scan the file system and report any possible data loss scenarios.

Currently Acid understands bucket file names like "00000_0" and (with HIVE-16177) "00000_0_copy1" etc. at the root of the partition.
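
A rough sketch of what such a scan could look like, to make the proposal concrete. This is not Hive code: it walks a local directory as a stand-in for the partition location, the function name find_suspect_files is hypothetical, and the regular expression is an assumption inferred only from the two example names above, so the real reader may accept a different set of names.

```python
import os
import re

# Assumed pattern, inferred only from the example names in this issue
# ("00000_0" and "00000_0_copy1"); the actual reader rules may differ.
BUCKET_FILE = re.compile(r"^\d+_\d+(_copy\d+)?$")


def find_suspect_files(partition_root):
    """Return sorted relative paths of files that the assumed rules would skip.

    A file is flagged if its name does not look like a bucket file, or if it
    sits in a subdirectory instead of directly at the root of the partition.
    """
    suspects = []
    for dirpath, _dirnames, filenames in os.walk(partition_root):
        # relpath yields "." only for the partition root itself
        at_root = os.path.relpath(dirpath, partition_root) == os.curdir
        for name in filenames:
            if at_root and BUCKET_FILE.match(name):
                continue  # recognized bucket file at the partition root
            full_path = os.path.join(dirpath, name)
            suspects.append(os.path.relpath(full_path, partition_root))
    return sorted(suspects)
```

A validation step along these lines would run over each partition (or the table root for unpartitioned tables) and reject or warn on the ALTER TABLE if the returned list is non-empty, since every entry is a file the reader might silently ignore after conversion.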



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)