Posted to commits@hudi.apache.org by GitBox <gi...@apache.org> on 2020/05/28 22:08:25 UTC

[GitHub] [hudi] vinothchandar commented on issue #1675: [SUPPORT] Get all changed records from an incremental query rather than the latest one

vinothchandar commented on issue #1675:
URL: https://github.com/apache/hudi/issues/1675#issuecomment-635634106


   @abhibhat98 Thanks for the thought-provoking questions. Table history is something we already support via the CLI tool.
   
   >> Hudi has the history of everything, it can look up by times, why can't it look up by the key? Or, is it something by design that Hudi doesn't intend to do.
   Typically, key-value stores (HBase, Bigtable) are able to do this because they have an effective index to fetch keys out. Hudi is slowly getting there (see RFC-08/RFC-15 if interested), and when we do have such means, we can start providing such lookups. Today, if you don't care about performance, you can just do what you did above with a where clause: `where _hoodie_record_key in (<list_of_keys_interested_in>)`
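
As a rough illustration of that where clause, here is a minimal sketch that builds the query string for a list of keys. It assumes the Hudi table is exposed to some SQL engine under the name `hudi_table`; that name, the helper `build_key_lookup_query`, and the sample keys are hypothetical. Only the `_hoodie_record_key` column comes from the comment above. Without a key index, such a query would likely scan the table, so it fits the "don't care about performance" case only.

```python
def build_key_lookup_query(table, keys):
    """Build a SQL query that filters a table on Hudi's record key column.

    `table` is a hypothetical view/table name; `keys` is the list of record
    keys of interest.
    """
    if not keys:
        raise ValueError("at least one record key is required")
    # Double any single quotes so each key stays a valid SQL string literal.
    quoted = ", ".join("'" + str(k).replace("'", "''") + "'" for k in keys)
    return f"SELECT * FROM {table} WHERE _hoodie_record_key IN ({quoted})"


# Example with made-up keys; pass the resulting string to your SQL engine.
query = build_key_lookup_query("hudi_table", ["key-001", "key-002"])
print(query)
```

The resulting string could then be run through whatever SQL interface is already used to query the table.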
   
   


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org