You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@kudu.apache.org by "Will Berkeley (JIRA)" <ji...@apache.org> on 2019/03/04 22:14:00 UTC

[jira] [Created] (KUDU-2725) RollingDiskRowSetWriter create rowsets that are bigger than the target rowset size

Will Berkeley created KUDU-2725:
-----------------------------------

             Summary: RollingDiskRowSetWriter create rowsets that are bigger than the target rowset size
                 Key: KUDU-2725
                 URL: https://issues.apache.org/jira/browse/KUDU-2725
             Project: Kudu
          Issue Type: Improvement
    Affects Versions: 1.9.0
            Reporter: Will Berkeley


The diskrowset writer create rowsets that are bigger than the target rowset size, with the excess proportional to the number of columns that compress poorly. For example, modifying loadgen to create a table with 280 columns and then using the {{--use_random}} flag, I saw rowsets that were in excess of 80MB. This is a problem because the budget for compactions is 128MB, so rowsets that are that big can never participate in a compaction.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)