Posted to issues@hive.apache.org by "Ádám Szita (Jira)" <ji...@apache.org> on 2020/07/10 13:05:00 UTC

[jira] [Updated] (HIVE-23824) LLAP - add API to look up ORC metadata for certain Path

     [ https://issues.apache.org/jira/browse/HIVE-23824?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ádám Szita updated HIVE-23824:
------------------------------
    Description: 
LLAP IO supports caching, but currently this is only done via LlapRecordReader, i.e. by using splits in the traditional MapReduce way.

At times it would be worthwhile to leverage caching for files on certain paths that are not necessarily associated with a record reader directly. One example is the caching of ACID delete delta files, which are currently read without caching.

With this patch we'd extend the LLAP API and offer another entry point for retrieving metadata of ORC files.
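
To illustrate the idea only, here is a rough sketch of what a path-keyed metadata lookup could look like. Every name in it (PathMetadataCacheSketch, MetadataCache, OrcMetadataStub, getMetadataForPath) and the example path are hypothetical and not taken from Hive or from the patch; a plain in-memory map stands in for the LLAP cache, and a loader function stands in for reading the ORC file tail.

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.atomic.AtomicInteger;
import java.util.function.Function;

// Hypothetical sketch: none of these names come from Hive or the HIVE-23824 patch.
public class PathMetadataCacheSketch {

    /** Stand-in for the ORC metadata (e.g. the file tail) that would be cached. */
    static final class OrcMetadataStub {
        final String path;

        OrcMetadataStub(String path) {
            this.path = path;
        }
    }

    /** Path-keyed entry point: no split or record reader is needed to ask for metadata. */
    static final class MetadataCache {
        // Assumption: a simple in-memory map stands in for the real LLAP metadata cache.
        private final Map<String, OrcMetadataStub> cache = new ConcurrentHashMap<>();
        // Assumption: the loader stands in for actually reading the file's metadata.
        private final Function<String, OrcMetadataStub> loader;

        MetadataCache(Function<String, OrcMetadataStub> loader) {
            this.loader = loader;
        }

        /** Returns cached metadata for the path, loading it on the first request only. */
        OrcMetadataStub getMetadataForPath(String path) {
            return cache.computeIfAbsent(path, loader);
        }
    }

    public static void main(String[] args) {
        AtomicInteger fileReads = new AtomicInteger();
        MetadataCache cache = new MetadataCache(p -> {
            fileReads.incrementAndGet(); // count how often the "file" is really read
            return new OrcMetadataStub(p);
        });

        // Illustrative path only; a delete delta file is the use case named above.
        String deleteDelta = "/warehouse/t/delete_delta_0000001_0000001/bucket_00000";
        cache.getMetadataForPath(deleteDelta);
        cache.getMetadataForPath(deleteDelta);

        System.out.println("file reads: " + fileReads.get()); // prints "file reads: 1"
    }
}
```

In this sketch the second lookup for the same delete delta path is served from the map, which is the behaviour the new entry point is meant to make possible for readers that do not go through LlapRecordReader.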

> LLAP - add API to look up ORC metadata for certain Path
> -------------------------------------------------------
>
>                 Key: HIVE-23824
>                 URL: https://issues.apache.org/jira/browse/HIVE-23824
>             Project: Hive
>          Issue Type: Improvement
>            Reporter: Ádám Szita
>            Assignee: Ádám Szita
>            Priority: Major
>
> LLAP IO supports caching, but currently this is only done via LlapRecordReader, i.e. by using splits in the traditional MapReduce way.
> At times it would be worthwhile to leverage caching for files on certain paths that are not necessarily associated with a record reader directly. One example is the caching of ACID delete delta files, which are currently read without caching.
> With this patch we'd extend the LLAP API and offer another entry point for retrieving metadata of ORC files.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)