You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@kudu.apache.org by "Will Berkeley (JIRA)" <ji...@apache.org> on 2019/03/04 22:15:00 UTC

[jira] [Comment Edited] (KUDU-2725) RollingDiskRowSetWriter create rowsets that are bigger than the target rowset size

    [ https://issues.apache.org/jira/browse/KUDU-2725?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16783828#comment-16783828 ] 

Will Berkeley edited comment on KUDU-2725 at 3/4/19 10:14 PM:
--------------------------------------------------------------

One can workaround this problem by increasing the compaction budget {{--tablet_compaction_budget_mb}}.


was (Author: wdberkeley):
One can workaround this problem by increasing the target rowset size {{--budgeted_compaction_target_rowset_size}}.

> RollingDiskRowSetWriter create rowsets that are bigger than the target rowset size
> ----------------------------------------------------------------------------------
>
>                 Key: KUDU-2725
>                 URL: https://issues.apache.org/jira/browse/KUDU-2725
>             Project: Kudu
>          Issue Type: Improvement
>    Affects Versions: 1.9.0
>            Reporter: Will Berkeley
>            Priority: Major
>
> The diskrowset writer create rowsets that are bigger than the target rowset size, with the excess proportional to the number of columns that compress poorly. For example, modifying loadgen to create a table with 280 columns and then using the {{--use_random}} flag, I saw rowsets that were in excess of 80MB. This is a problem because the budget for compactions is 128MB, so rowsets that are that big can never participate in a compaction.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)