You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Eugene Koifman (JIRA)" <ji...@apache.org> on 2017/07/31 17:39:00 UTC

[jira] [Created] (HIVE-17214) check/fix conversion of non-acid to acid

Eugene Koifman created HIVE-17214:
-------------------------------------

             Summary: check/fix conversion of non-acid to acid
                 Key: HIVE-17214
                 URL: https://issues.apache.org/jira/browse/HIVE-17214
             Project: Hive
          Issue Type: Sub-task
          Components: Transactions
            Reporter: Eugene Koifman
            Assignee: Eugene Koifman


bucketed tables have stricter rules for file layout on disk - bucket files are direct children of a partition directory.

for un-bucketed tables I'm not sure there are any rules
for example, CTAS with Tez + Union operator creates 1 directory for each leg of the union

Supposedly Hive can read table by picking all files recursively.  
Can it also write (other than CTAS example above) arbitrarily?
Does it mean Acid write can also write anywhere?

Figure out what can be supported and how can existing layout can be checked?  Examining a full "ls -l -R" for a large table could be expensive. 



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)