You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@kudu.apache.org by "Will Berkeley (JIRA)" <ji...@apache.org> on 2019/03/04 22:15:00 UTC

[jira] [Commented] (KUDU-2725) RollingDiskRowSetWriter create rowsets that are bigger than the target rowset size

    [ https://issues.apache.org/jira/browse/KUDU-2725?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16783828#comment-16783828 ] 

Will Berkeley commented on KUDU-2725:
-------------------------------------

One can workaround this problem by increasing the target rowset size {{--budgeted_compaction_target_rowset_size}}.

> RollingDiskRowSetWriter create rowsets that are bigger than the target rowset size
> ----------------------------------------------------------------------------------
>
>                 Key: KUDU-2725
>                 URL: https://issues.apache.org/jira/browse/KUDU-2725
>             Project: Kudu
>          Issue Type: Improvement
>    Affects Versions: 1.9.0
>            Reporter: Will Berkeley
>            Priority: Major
>
> The diskrowset writer create rowsets that are bigger than the target rowset size, with the excess proportional to the number of columns that compress poorly. For example, modifying loadgen to create a table with 280 columns and then using the {{--use_random}} flag, I saw rowsets that were in excess of 80MB. This is a problem because the budget for compactions is 128MB, so rowsets that are that big can never participate in a compaction.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)