You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hive.apache.org by "Sergey Shelukhin (JIRA)" <ji...@apache.org> on 2017/09/06 22:34:00 UTC

[jira] [Updated] (HIVE-12631) LLAP: support ORC ACID tables

     [ https://issues.apache.org/jira/browse/HIVE-12631?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Sergey Shelukhin updated HIVE-12631:
------------------------------------
    Attachment: HIVE-12631.27.patch

Updated the patch. Not sure why the config was set in UpdateDeleteSemanticAnalyzer so I commented that out for now. I looked a bit at the CVB-VRB-CVB-VRB conversion, given that handling a selected vector after ACID reader requires copying stuff, it doesn't seem ideal. Can be handled in a followup. Either a selected vector can be added to CVB and the ACID merger thing made operate on both (the code is common between the two), or LLAPRecordReader can be enabled to accept VRBs directly.

> LLAP: support ORC ACID tables
> -----------------------------
>
>                 Key: HIVE-12631
>                 URL: https://issues.apache.org/jira/browse/HIVE-12631
>             Project: Hive
>          Issue Type: Bug
>          Components: llap, Transactions
>            Reporter: Sergey Shelukhin
>            Assignee: Teddy Choi
>         Attachments: HIVE-12631.10.patch, HIVE-12631.10.patch, HIVE-12631.11.patch, HIVE-12631.11.patch, HIVE-12631.12.patch, HIVE-12631.13.patch, HIVE-12631.15.patch, HIVE-12631.16.patch, HIVE-12631.17.patch, HIVE-12631.18.patch, HIVE-12631.19.patch, HIVE-12631.1.patch, HIVE-12631.20.patch, HIVE-12631.21.patch, HIVE-12631.22.patch, HIVE-12631.23.patch, HIVE-12631.24.patch, HIVE-12631.25.patch, HIVE-12631.26.patch, HIVE-12631.27.patch, HIVE-12631.2.patch, HIVE-12631.3.patch, HIVE-12631.4.patch, HIVE-12631.5.patch, HIVE-12631.6.patch, HIVE-12631.7.patch, HIVE-12631.8.patch, HIVE-12631.8.patch, HIVE-12631.9.patch
>
>
> LLAP uses a completely separate read path in ORC to allow for caching and parallelization of reads and processing. This path does not support ACID. As far as I remember ACID logic is embedded inside ORC format; we need to refactor it to be on top of some interface, if practical; or just port it to LLAP read path.
> Another consideration is how the logic will work with cache. The cache is currently low-level (CB-level in ORC), so we could just use it to read bases and deltas (deltas should be cached with higher priority) and merge as usual. We could also cache merged representation in future.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)