You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hive.apache.org by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2020/07/10 13:07:00 UTC

[jira] [Work logged] (HIVE-23824) LLAP - add API to look up ORC metadata for certain Path

     [ https://issues.apache.org/jira/browse/HIVE-23824?focusedWorklogId=457155&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-457155 ]

ASF GitHub Bot logged work on HIVE-23824:
-----------------------------------------

                Author: ASF GitHub Bot
            Created on: 10/Jul/20 13:06
            Start Date: 10/Jul/20 13:06
    Worklog Time Spent: 10m 
      Work Description: szlta opened a new pull request #1238:
URL: https://github.com/apache/hive/pull/1238


   ## NOTICE
   
   Please create an issue in ASF JIRA before opening a pull request,
   and you need to set the title of the pull request which starts with
   the corresponding JIRA issue number. (e.g. HIVE-XXXXX: Fix a typo in YYY)
   For more details, please see https://cwiki.apache.org/confluence/display/Hive/HowToContribute
   


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


Issue Time Tracking
-------------------

            Worklog Id:     (was: 457155)
    Remaining Estimate: 0h
            Time Spent: 10m

> LLAP - add API to look up ORC metadata for certain Path
> -------------------------------------------------------
>
>                 Key: HIVE-23824
>                 URL: https://issues.apache.org/jira/browse/HIVE-23824
>             Project: Hive
>          Issue Type: Improvement
>            Reporter: Ádám Szita
>            Assignee: Ádám Szita
>            Priority: Major
>          Time Spent: 10m
>  Remaining Estimate: 0h
>
> LLAP IO supports caching but currently this is only done via LlapRecordReader / using splits, aka good old mapreduce way.
> At certain times it would worth to leverage the caching of files on certain paths, that are not necessarily associated with a record reader directly. An example of this could be the caching of ACID delete delta files, as they are currently being read without caching.
> With this patch we'd extend the LLAP API and offer another entry point for retrieving metadata of ORC files.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)