You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@hudi.apache.org by GitBox <gi...@apache.org> on 2021/05/13 14:44:28 UTC

[GitHub] [hudi] lw309637554 edited a comment on pull request #2918: [HUDI-1877] Add support in clustering to not change record location

lw309637554 edited a comment on pull request #2918:
URL: https://github.com/apache/hudi/pull/2918#issuecomment-840609544


   > > @satishkotha hello , have some doubt
   > > 
   > > 1. Just see add a test strategy . Will a formal strategy be added later?
   > > 2. This PR is to support which Index?
   > > 3. If every file group just transfrom to a same name file group. If the small files  can not merge ?
   > 
   > @lw309637554
   > 
   > 1. Yes, the actual strategy can be added easily if we agree on high level change
   > 2. This is to support HBaseIndex, which does not support update for record location
   > 3. yes, you are right. merging strategy cannot be applied to tables that use HBaseIndex. We can still local 'file-level' sorting i.e., sorting records in each data file by specified column so only one block (row group) needs to be read for queries.
   > 
   > Let me know if you any other questions/comments.
   @satishkotha 
   high level change is OK . Just have a other two comments
   1.  ".withClusteringMaxBytesInGroup(10) // set small number so each file is considered as separate clustering group" , Can we add aother config
   2. If the sort will support in HoodieCreateFixedHandle?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org