Posted to dev@hive.apache.org by "Alan Gates (JIRA)" <ji...@apache.org> on 2014/07/31 17:56:38 UTC

[jira] [Created] (HIVE-7571) RecordUpdater should read virtual columns from row

Alan Gates created HIVE-7571:
--------------------------------

             Summary: RecordUpdater should read virtual columns from row
                 Key: HIVE-7571
                 URL: https://issues.apache.org/jira/browse/HIVE-7571
             Project: Hive
          Issue Type: Sub-task
          Components: Transactions
    Affects Versions: 0.13.0
            Reporter: Alan Gates
            Assignee: Alan Gates


Currently RecordUpdater.update and delete take the rowid and original transaction as parameters.  These values are already present in the row as part of the new ROW__ID virtual column added in HIVE-7513, and thus can be read by the writer from there.  The writer will already have to handle skipping ROW__ID when writing, so it needs to be aware of that column anyway.

We could instead read the values from ROW__ID and then remove it from the object inspector in FileSinkOperator, but this would be hard in the vectorized case, where rows are handled 10k at a time.

For these reasons it makes more sense to do this work in the writer.
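
To make the proposal concrete, here is a minimal Java sketch of a writer that pulls the original transaction and rowid out of the row itself rather than taking them as parameters. All names here (RowIdSketch, RowId, Row, Updater, LoggingUpdater) are hypothetical stand-ins and not Hive's actual classes or signatures. The sketch also assumes ROW__ID sits at field 0 of the row, which the issue does not specify.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical stand-ins for illustration only; these are not Hive's classes.
final class RowIdSketch {

    /** Stand-in for the value carried by the ROW__ID virtual column. */
    static final class RowId {
        final long originalTransaction;
        final long rowId;

        RowId(long originalTransaction, long rowId) {
            this.originalTransaction = originalTransaction;
            this.rowId = rowId;
        }
    }

    /** A row whose field 0 is assumed to hold ROW__ID, followed by user columns. */
    static final class Row {
        final Object[] fields;

        Row(RowId id, Object... userColumns) {
            fields = new Object[userColumns.length + 1];
            fields[0] = id;
            System.arraycopy(userColumns, 0, fields, 1, userColumns.length);
        }
    }

    /** Writer-side interface with no rowid or original-transaction parameters. */
    interface Updater {
        void update(long currentTransaction, Row row);

        void delete(long currentTransaction, Row row);
    }

    /** Records what it would write, so the behavior is easy to inspect. */
    static final class LoggingUpdater implements Updater {
        static final int ROW_ID_INDEX = 0; // assumption: ROW__ID is the first field
        final List<String> log = new ArrayList<>();

        @Override
        public void update(long currentTransaction, Row row) {
            // Read the identifying values from the row itself.
            RowId id = (RowId) row.fields[ROW_ID_INDEX];
            // Skip ROW__ID when writing: only the user columns are payload.
            Object[] payload = Arrays.copyOfRange(
                    row.fields, ROW_ID_INDEX + 1, row.fields.length);
            log.add("UPDATE txn=" + currentTransaction
                    + " origTxn=" + id.originalTransaction
                    + " rowId=" + id.rowId
                    + " data=" + Arrays.toString(payload));
        }

        @Override
        public void delete(long currentTransaction, Row row) {
            // A delete needs only the identifying values, not the payload.
            RowId id = (RowId) row.fields[ROW_ID_INDEX];
            log.add("DELETE txn=" + currentTransaction
                    + " origTxn=" + id.originalTransaction
                    + " rowId=" + id.rowId);
        }
    }
}
```

In this shape the caller passes rows through untouched, and the one component that already has to know about ROW__ID (the writer) both reads it and skips it on output.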



--
This message was sent by Atlassian JIRA
(v6.2#6252)