You are viewing a plain text version of this content. The canonical link for it is here.
Posted to notifications@accumulo.apache.org by GitBox <gi...@apache.org> on 2019/03/13 20:28:57 UTC

[GitHub] [accumulo] keith-turner opened a new issue #1033: Default Compaction strategy is sub-optimal

keith-turner opened a new issue #1033: Default Compaction strategy is sub-optimal
URL: https://github.com/apache/accumulo/issues/1033
 
 
   Consider a tablet with the following files.  If the compaction ratio is 3 then all files would meet the criteria for compaction.  However if the max files to compact is 10, then the files C4 and F[5-d] will be selected for compaction.  This is very suboptimal over time.  It would be much better if a subset of files that met the compaction ratio criteria were returned.  For example C[2-4] and F[5-b] could be selected, which is 10 files that meet the ratio criteria.  Another possibility is only selecting only the F files, which meet the criteria and is less than max files. 
   
   | File | Size |
   |------|-------|
   | C1 | 100M |
   | C2 | 100M |
   | C3 | 100M |
   | C4 | 100M |
   | F5 | 1M |
   | F6 | 1M |
   | F7 | 1M |
   | F8 | 1M |
   | F9 | 1M |
   | Fa | 1M |
   | Fb | 1M |
   | Fc | 1M |
   | Fd | 1M |
   

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
users@infra.apache.org


With regards,
Apache Git Services