You are viewing a plain text version of this content. The canonical link for it is here.
Posted to jira@kafka.apache.org by "Guozhang Wang (JIRA)" <ji...@apache.org> on 2017/11/30 01:05:43 UTC

[jira] [Created] (KAFKA-6286) Consider reusing materialized store for multi-same-stream join

Guozhang Wang created KAFKA-6286:
------------------------------------

             Summary: Consider reusing materialized store for multi-same-stream join
                 Key: KAFKA-6286
                 URL: https://issues.apache.org/jira/browse/KAFKA-6286
             Project: Kafka
          Issue Type: Bug
          Components: streams
            Reporter: Guozhang Wang


Imagine the following streams application:

{code}
stream1.join(stream2...)..join(stream2)
{code}

Each join will result in {{stream2}} being materialized into a separate store. Arguably such multi-joins that involves the same stream multiple times is rare, but it worth considering if we can optimize such cases.

One thing to note, though, is that in our DSL parser today we do "put into store first, and then query the other store second", which means if we share the same store it would result in duplicates as the matching would already see the newly put records in the second join.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)