You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@kafka.apache.org by "Vicky Papavasileiou (Jira)" <ji...@apache.org> on 2022/09/21 15:19:00 UTC

[jira] [Created] (KAFKA-14251) Improve CPU usage of self-joins by sacrificing order

Vicky Papavasileiou created KAFKA-14251:
-------------------------------------------

             Summary: Improve CPU usage of self-joins by sacrificing order
                 Key: KAFKA-14251
                 URL: https://issues.apache.org/jira/browse/KAFKA-14251
             Project: Kafka
          Issue Type: Improvement
            Reporter: Vicky Papavasileiou


The current self-join operator implementation ensures that records in the output follow the same order as if the join was implemented using an inner-join. To achieve this, the self-join operator needs to use two loops, each doing a window store fetch, to simulate the left-hand side of the join probing the join and the right-hand side probing the join. 

As an optimization, if we don't care about the ordering of the join results, we can avoid doing two loops and instead do one where the window store fetch will use the union of the left and righ-side windows.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)