Posted to commits@hudi.apache.org by GitBox <gi...@apache.org> on 2019/12/09 15:40:11 UTC

[GitHub] [incubator-hudi] nsivabalan opened a new pull request #1091: [HUDI-389] Fixing Index look up to return partitions for a given key along with fileId with Global Bloom

nsivabalan opened a new pull request #1091: [HUDI-389] Fixing Index look up to return partitions for a given key along with fileId with Global Bloom
URL: https://github.com/apache/incubator-hudi/pull/1091
 
 
   ## What is the purpose of the pull request
   
   Fix the index lookup so that, with Global Bloom, it returns the partition along with the fileId for a given key.
   
   Use-case: 
   If a record is updated with a partition different from the one where it currently exists, Global Bloom throws an exception, as described [here](https://issues.apache.org/jira/browse/HUDI-389). This patch fixes that.
   
   Essentially, in HoodieGlobalBloomIndex#tagLocationBacktoRecords, each record is now tagged with the correct fileId and partition, not just the fileId (as was the case before this patch). To achieve this, I had to change the IndexFileFilter interface so that, instead of getMatchingFiles(String partitionPath, String recordKey) returning a Set<FileId>, the lookup returns a Set<Pair<PartitionPath, fileId>>.
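
   The interface change described above can be pictured with a minimal, self-contained sketch. It is not the code from this patch: `FileKeyRange` and `GlobalFileFilterSketch` are hypothetical stand-ins, the key-range check is a simplification, and `Map.Entry<String, String>` stands in for the `Pair` of partition path and fileId.

```java
import java.util.HashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;

// Hypothetical stand-in for one file's key range; not Hudi's actual class.
final class FileKeyRange {
    final String partitionPath;
    final String fileId;
    final String minKey;
    final String maxKey;

    FileKeyRange(String partitionPath, String fileId, String minKey, String maxKey) {
        this.partitionPath = partitionPath;
        this.fileId = fileId;
        this.minKey = minKey;
        this.maxKey = maxKey;
    }

    // True when recordKey falls inside [minKey, maxKey] in lexicographic order.
    boolean mayContain(String recordKey) {
        return minKey.compareTo(recordKey) <= 0 && recordKey.compareTo(maxKey) <= 0;
    }
}

// Hypothetical global filter: it scans files in every partition and reports
// where each candidate file lives, instead of returning bare file ids.
final class GlobalFileFilterSketch {
    private final List<FileKeyRange> allFiles;

    GlobalFileFilterSketch(List<FileKeyRange> allFiles) {
        this.allFiles = allFiles;
    }

    // Returns (partitionPath, fileId) pairs. The incoming partitionPath is
    // deliberately ignored, because a global index looks across partitions.
    Set<Map.Entry<String, String>> getMatchingFiles(String partitionPath, String recordKey) {
        Set<Map.Entry<String, String>> matches = new HashSet<>();
        for (FileKeyRange file : allFiles) {
            if (file.mayContain(recordKey)) {
                matches.add(Map.entry(file.partitionPath, file.fileId));
            }
        }
        return matches;
    }
}
```

   With the pair in hand, the caller knows which partition a candidate file belongs to even when the incoming record names a different partition.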
   
   ## Brief change log
   
     - Change the IndexFileFilter interface to return Set<Pair<PartitionPath, fileId>>
     - Fix Global Bloom to tag records with the correct partitionPath from the index lookup rather than from the passed-in records (HoodieKey).
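
   The tagging rule in the second item can be sketched as follows. This is an illustration under stated assumptions, not the patch itself: `TaggedRecord` and `GlobalTaggingSketch` are hypothetical names, and the handling of keys with no index match (keep the incoming partition, leave the fileId unset) is an assumption made for the example.

```java
import java.util.Map;
import java.util.Optional;

// Hypothetical result type: a record key plus the location it was tagged with.
final class TaggedRecord {
    final String recordKey;
    final String partitionPath;
    final String fileId; // null when the key was not found in the index

    TaggedRecord(String recordKey, String partitionPath, String fileId) {
        this.recordKey = recordKey;
        this.partitionPath = partitionPath;
        this.fileId = fileId;
    }
}

final class GlobalTaggingSketch {
    // indexLocation holds (partitionPath, fileId) from the index lookup, if any.
    static TaggedRecord tag(String recordKey,
                            String incomingPartitionPath,
                            Optional<Map.Entry<String, String>> indexLocation) {
        if (indexLocation.isPresent()) {
            // Existing record: use the partition reported by the index lookup,
            // not the partition carried by the incoming record.
            Map.Entry<String, String> location = indexLocation.get();
            return new TaggedRecord(recordKey, location.getKey(), location.getValue());
        }
        // Assumption: an unknown key keeps its incoming partition and has no fileId.
        return new TaggedRecord(recordKey, incomingPartitionPath, null);
    }
}
```

   For example, a record that arrives with partition `2019/12/09` but whose key already lives in `2019/12/01` is tagged with `2019/12/01` and the existing fileId.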
   
   Tests:
   
   Added a test in TestHoodieClientOnCopyOnWriteStorage#testUpsertToDiffPartitionGlobaIndex.
