You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@hudi.apache.org by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2021/08/05 20:38:00 UTC

[jira] [Commented] (HUDI-2281) add metadata client to read snapshot and incremental information

    [ https://issues.apache.org/jira/browse/HUDI-2281?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17394282#comment-17394282 ] 

ASF GitHub Bot commented on HUDI-2281:
--------------------------------------

satishkotha opened a new pull request #3417:
URL: https://github.com/apache/hudi/pull/3417


   
   ## What is the purpose of the pull request
   Provide generic APIs to 
   * get all modified partitions since a specified commit time
   * get all data files written as part of latest commit
    
   We have different consumers using these APIs to achieve cross-dc consistency and improving efficiency for single region compute
   
   ## Brief change log
   * Add HoodieSnapshotMetadataClient to fetch data file paths written as part of a commit
   * Add HoodieIncrementalMetadataClient to fetch partitions modified since last commit
   
   ## Verify this pull request
   This change added tests
   
   ## Committer checklist
   
    - [ ] Has a corresponding JIRA in PR title & commit
    
    - [ ] Commit message is descriptive of the change
    
    - [ ] CI is green
   
    - [ ] Necessary doc changes done or have another open PR
          
    - [ ] For large changes, please consider breaking it into sub-tasks under an umbrella JIRA.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


> add metadata client to read snapshot and incremental information
> ----------------------------------------------------------------
>
>                 Key: HUDI-2281
>                 URL: https://issues.apache.org/jira/browse/HUDI-2281
>             Project: Apache Hudi
>          Issue Type: New Feature
>            Reporter: satish
>            Assignee: satish
>            Priority: Major
>
> We have usecases to
>  
>  * get all modified partitions since a specified commit time
>  * get all data files written as part of latest commit
>  
> Provide more high level generic API so different consumers can build on top of these interfaces



--
This message was sent by Atlassian Jira
(v8.3.4#803005)