Posted to dev@hive.apache.org by "Eugene Koifman (JIRA)" <ji...@apache.org> on 2018/02/27 20:16:00 UTC

[jira] [Created] (HIVE-18814) Support Add Partition For Acid tables

Eugene Koifman created HIVE-18814:
-------------------------------------

             Summary: Support Add Partition For Acid tables
                 Key: HIVE-18814
                 URL: https://issues.apache.org/jira/browse/HIVE-18814
             Project: Hive
          Issue Type: New Feature
          Components: Transactions
            Reporter: Eugene Koifman
            Assignee: Eugene Koifman


[https://cwiki.apache.org/confluence/display/Hive/LanguageManual%2BDDL#LanguageManualDDL-AddPartitions]

The Add Partition command creates a {{Partition}} metadata object and sets its location to the directory containing the data files.

In current master (Hive 3.0), Add Partition on an acid table doesn't fail, and at read time the data is decorated with ROW__ID, but the original transaction is 0.  I suspect that in earlier Hive versions this will throw or return no data.

One option is to follow the Load Data approach: create a new delta_x_x/ directory and move/copy the data there.
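
A minimal sketch of this first option, for illustration only. It is plain Python against a local filesystem, not Hive code. The helper names (delta_dir_name, add_partition_via_delta) are hypothetical, the write ID is assumed to have been allocated elsewhere, and the 7-digit zero padding of the directory name is an assumption about the delta naming convention.

```python
import shutil
from pathlib import Path


def delta_dir_name(write_id: int) -> str:
    # Assumption: both ends of the delta range are the same write ID,
    # zero-padded to 7 digits, e.g. delta_0000005_0000005.
    return f"delta_{write_id:07d}_{write_id:07d}"


def add_partition_via_delta(source_dir: Path, partition_dir: Path, write_id: int) -> Path:
    """Move every data file from source_dir into a fresh delta directory
    under partition_dir and return the delta directory path."""
    delta = partition_dir / delta_dir_name(write_id)
    # exist_ok=False: refuse to reuse a delta directory for the same write ID.
    delta.mkdir(parents=True, exist_ok=False)
    for f in sorted(source_dir.iterdir()):
        if f.is_file():
            shutil.move(str(f), str(delta / f.name))
    return delta
```

After the move, the data lives inside the table tree, so the partition location can point at partition_dir rather than at the original directory.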

Another is to allocate a new writeid and save it in the Partition metadata.  This could then be used to decorate the data with ROW__IDs.  This avoids the move/copy but keeps the data "outside" of the table tree, which makes it more likely that the data will be modified in some way; that can really break things if it happens after a SQL update/delete has been run on this data.
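
A rough sketch of this second option, again in plain Python rather than Hive code. PartitionMeta, its write_id field, and decorate are hypothetical names. The (write ID, bucket, row) triple is a simplification of ROW__ID, and the bucket is treated as a plain integer here.

```python
from dataclasses import dataclass
from typing import Iterable, Iterator, NamedTuple, Tuple


class RowId(NamedTuple):
    write_id: int
    bucket: int
    row_id: int


@dataclass
class PartitionMeta:
    location: str   # directory that stays outside the table tree
    write_id: int   # hypothetical field: write ID allocated at Add Partition time


def decorate(rows: Iterable[str], part: PartitionMeta, bucket: int = 0) -> Iterator[Tuple[RowId, str]]:
    # Assign each row a synthetic ID from the partition's saved write ID
    # and the row's position in the file, instead of a write ID of 0.
    for i, row in enumerate(rows):
        yield RowId(part.write_id, bucket, i), row
```

Because the IDs are derived from row position, they stay stable only while the underlying files are unchanged, which is the risk described above.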

--
This message was sent by Atlassian JIRA
(v7.6.3#76005)