Posted to dev@hive.apache.org by "Eugene Koifman (JIRA)" <ji...@apache.org> on 2018/02/09 01:15:00 UTC

[jira] [Created] (HIVE-18662) hive.acid.key.index is missing entries

Eugene Koifman created HIVE-18662:
-------------------------------------

             Summary: hive.acid.key.index is missing entries
                 Key: HIVE-18662
                 URL: https://issues.apache.org/jira/browse/HIVE-18662
             Project: Hive
          Issue Type: Bug
          Components: Transactions
            Reporter: Eugene Koifman


OrcRecordUpdater.KeyIndexBuilder stores an index in the ORC footer in which each entry is the last ROW__ID of a stripe.  In acid1 this is used to filter the events from a delta file when merging with part of the base.

 

As can be seen in {{TestTxnCommands.testVersioning()}} (added in HIVE-18659), the {{hive.acid.key.index}} is empty.

 

This is because very little data is written, so WriterImpl.flushStripe() is not called except when {{WriterImpl.close()}} is called.  In the latter, {{WriterCallback.preFooterWrite()}} is called before {{preStripeWrite}}, and so KeyIndexBuilder.preFooterWrite() records nothing in {{hive.acid.key.index}}.
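
The ordering problem described above can be illustrated with a small, self-contained simulation. This is only a sketch: the Callback interface, ToyKeyIndexBuilder and closeSmallFile() are hypothetical stand-ins written for this example, not the real ORC or Hive classes, and the index is reduced to a list of the last key seen per stripe.

```java
import java.util.ArrayList;
import java.util.List;

class KeyIndexOrderingSketch {

    /** Hypothetical stand-in for a writer callback; not the real ORC interface. */
    interface Callback {
        void preStripeWrite();
        void preFooterWrite();
    }

    /** Toy index builder: tracks the last key written and records it once per stripe. */
    static class ToyKeyIndexBuilder implements Callback {
        private final List<Long> stripeLastKeys = new ArrayList<>();
        private long lastKey = -1;
        private String footerValue;

        void addKey(long key) {
            lastKey = key;
        }

        @Override
        public void preStripeWrite() {
            // One index entry per stripe: the last key that went into it.
            stripeLastKeys.add(lastKey);
        }

        @Override
        public void preFooterWrite() {
            // Snapshot whatever entries exist at this moment into the "footer".
            footerValue = stripeLastKeys.toString();
        }

        String footerValue() {
            return footerValue;
        }
    }

    /**
     * Simulates closing a writer that held too little data to flush a stripe
     * earlier, so the only stripe flush happens during close.
     */
    static String closeSmallFile(boolean footerCallbackFirst) {
        ToyKeyIndexBuilder builder = new ToyKeyIndexBuilder();
        builder.addKey(0);
        builder.addKey(1);
        if (footerCallbackFirst) {
            // Ordering reported in this issue: footer callback runs first.
            builder.preFooterWrite();
            builder.preStripeWrite();
        } else {
            // Ordering under which the stripe entry reaches the footer.
            builder.preStripeWrite();
            builder.preFooterWrite();
        }
        return builder.footerValue();
    }

    public static void main(String[] args) {
        System.out.println("footer callback first: " + closeSmallFile(true));
        System.out.println("stripe callback first: " + closeSmallFile(false));
    }
}
```

With the footer callback first, the snapshot is taken before any stripe entry exists, which matches the empty {{hive.acid.key.index}} seen in the test. The sketch does not claim how the real fix should be made; it only shows why the order of the two callbacks matters for a single-stripe file.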


