Posted to notifications@accumulo.apache.org by "keith-turner (via GitHub)" <gi...@apache.org> on 2023/04/14 02:29:25 UTC

[GitHub] [accumulo] keith-turner commented on pull request #3286: Add support for storing ranges in DataFileValue

keith-turner commented on PR #3286:
URL: https://github.com/apache/accumulo/pull/3286#issuecomment-1507842193

   > Will those values stay as they currently are (size and numEntries for the entire file) or do we need to set those values based on the Ranges provided?
   
   We can probably just keep a single number per file, possibly summing it on merge.  Suppose we have the following:
   
    * Tablet T1 covering range R1 has file F1 with est size S1
    * Tablet T2 covering range R2 has file F1 with est size S2
   
   If we merge T1 and T2 into T3, then T3 could have an estimated size of (S1+S2) for file F1 and ranges R1,R2 for F1.  One case I can think of where we may want to store S1 and S2 instead of summing them is future splits of a merged tablet. For example, if we split T3, how will we estimate the size of F1 in each child tablet?
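
As a rough illustration of the two options, here is a minimal Java sketch. It is not Accumulo's `DataFileValue` API: the class `MergeEstimateSketch`, its methods, the range labels used as map keys, and the sizes 100 and 250 are all hypothetical and chosen only to make the trade-off concrete. Each tablet's entry for file F1 is modeled as a map from a range label to the estimated size for that range.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical sketch only; none of these names exist in Accumulo.
final class MergeEstimateSketch {

  // Option 2: on merge, keep one estimate per range (S1 for R1, S2 for R2).
  // If both tablets happen to reference the same range, the estimates are added.
  static Map<String, Long> mergePerRange(Map<String, Long> a, Map<String, Long> b) {
    Map<String, Long> merged = new LinkedHashMap<>(a);
    b.forEach((range, size) -> merged.merge(range, size, Long::sum));
    return merged;
  }

  // Option 1: collapse everything to a single number per file (S1 + S2).
  // After this, the individual S1 and S2 values can no longer be recovered.
  static long summedEstimate(Map<String, Long> perRange) {
    long total = 0;
    for (long size : perRange.values()) {
      total += size;
    }
    return total;
  }

  public static void main(String[] args) {
    // Made-up estimates for file F1 in tablets T1 and T2.
    Map<String, Long> t1 = new LinkedHashMap<>();
    t1.put("R1", 100L);
    Map<String, Long> t2 = new LinkedHashMap<>();
    t2.put("R2", 250L);

    // Merged tablet T3 keeps both ranges for F1.
    Map<String, Long> t3 = mergePerRange(t1, t2);

    System.out.println("summed estimate for F1 in T3: " + summedEstimate(t3));
    System.out.println("per-range estimates for F1 in T3: " + t3);
  }
}
```

With the per-range map, a later split of T3 that falls on the R1/R2 boundary could hand each child its original estimate back. With only the summed number, a split would need some other heuristic to divide the total.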
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: notifications-unsubscribe@accumulo.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org