Posted to issues@kudu.apache.org by "Todd Lipcon (JIRA)" <ji...@apache.org> on 2017/03/06 21:02:33 UTC

[jira] [Updated] (KUDU-686) Delta apply optimizations

     [ https://issues.apache.org/jira/browse/KUDU-686?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Todd Lipcon updated KUDU-686:
-----------------------------
    Component/s: perf

> Delta apply optimizations
> -------------------------
>
>                 Key: KUDU-686
>                 URL: https://issues.apache.org/jira/browse/KUDU-686
>             Project: Kudu
>          Issue Type: Improvement
>          Components: perf, tablet
>    Affects Versions: M4.5
>            Reporter: David Alves
>            Priority: Trivial
>
> We currently iterate over each delta file several times: once for deletes, and then once for each of the columns.
> It seems that, when selecting all the columns, it would be more efficient to apply the deltas to all columns at the same time. This might or might not be advantageous, depending on the number of columns projected. Todd also suggested that whether this is an advantage depends on whether there are predicates being pushed down.
> We could likely also merge the updates and deletes into a single iteration, or at least avoid applying the mutations if the row will end up deleted (right now we still apply the updates even when we find that the row will be deleted).
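
To make the proposal concrete, here is a minimal sketch in Python that contrasts the two strategies. It is not Kudu code: the delta record layout `(row_idx, kind, column, value)`, the function names, and the assumption that deltas arrive grouped by row index are all hypothetical, chosen only for illustration. It also ignores reinserts after a delete and any interaction with pushed-down predicates.

```python
from itertools import groupby
from operator import itemgetter

# Hypothetical delta record: (row_idx, kind, column, value).
# kind is "update" or "delete"; column and value are None for deletes.
# Assumption: deltas are grouped by row_idx, in application order.


def apply_per_column(rows, deltas, projection):
    """Baseline: one pass for deletes, then one pass per projected column."""
    passes = 1
    deleted = {row_idx for row_idx, kind, _, _ in deltas if kind == "delete"}

    out = [dict(r) for r in rows]  # copy so the base rows stay untouched
    for col in projection:
        passes += 1
        for row_idx, kind, c, val in deltas:
            # Updates are applied even to rows that are already known
            # to be deleted, which is wasted work.
            if kind == "update" and c == col:
                out[row_idx][col] = val

    result = {
        i: {c: r[c] for c in projection}
        for i, r in enumerate(out)
        if i not in deleted
    }
    return result, passes


def apply_single_pass(rows, deltas, projection):
    """Proposed: one pass over the deltas, skipping updates to deleted rows."""
    wanted = set(projection)
    out = {i: {c: r[c] for c in projection} for i, r in enumerate(rows)}

    for row_idx, group in groupby(deltas, key=itemgetter(0)):
        group = list(group)
        if any(kind == "delete" for _, kind, _, _ in group):
            # The row ends up deleted, so none of its updates are applied.
            del out[row_idx]
            continue
        for _, _, col, val in group:
            if col in wanted:
                out[row_idx][col] = val
    return out, 1


rows = [{"a": 1, "b": 2}, {"a": 3, "b": 4}, {"a": 5, "b": 6}]
deltas = [
    (0, "update", "a", 10),
    (1, "update", "b", 40),
    (1, "delete", None, None),
    (2, "update", "a", 50),
    (2, "update", "b", 60),
]

baseline, baseline_passes = apply_per_column(rows, deltas, ["a", "b"])
merged, merged_passes = apply_single_pass(rows, deltas, ["a", "b"])
```

In this toy model both strategies yield the same surviving rows, but the baseline makes one pass per projected column plus one for deletes, while the merged version makes a single pass. With a narrow projection the difference shrinks, which matches the caveat in the description that the benefit depends on how many columns are projected.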



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)