You are viewing a plain text version of this content. The canonical link for it is here.
Posted to jira@kafka.apache.org by "Guozhang Wang (JIRA)" <ji...@apache.org> on 2017/11/30 01:05:43 UTC
[jira] [Created] (KAFKA-6286) Consider reusing materialized store
for multi-same-stream join
Guozhang Wang created KAFKA-6286:
------------------------------------
Summary: Consider reusing materialized store for multi-same-stream join
Key: KAFKA-6286
URL: https://issues.apache.org/jira/browse/KAFKA-6286
Project: Kafka
Issue Type: Bug
Components: streams
Reporter: Guozhang Wang
Imagine the following streams application:
{code}
stream1.join(stream2...)..join(stream2)
{code}
Each join will result in {{stream2}} being materialized into a separate store. Arguably such multi-joins that involves the same stream multiple times is rare, but it worth considering if we can optimize such cases.
One thing to note, though, is that in our DSL parser today we do "put into store first, and then query the other store second", which means if we share the same store it would result in duplicates as the matching would already see the newly put records in the second join.
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)