You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@kafka.apache.org by "Boyang Chen (JIRA)" <ji...@apache.org> on 2018/04/30 21:22:00 UTC

[jira] [Created] (KAFKA-6840) support windowing in ktable API

Boyang Chen created KAFKA-6840:
----------------------------------

             Summary: support windowing in ktable API
                 Key: KAFKA-6840
                 URL: https://issues.apache.org/jira/browse/KAFKA-6840
             Project: Kafka
          Issue Type: Improvement
          Components: streams
    Affects Versions: 1.1.0
            Reporter: Boyang Chen
            Assignee: Boyang Chen


The StreamsBuilder provides table() API to materialize a changelog topic into a local key-value store (KTable), which is very convenient. However, current underlying implementation does not support materializing one topic to a windowed key-value store, which in certain cases would be very useful. 

To make up the gap, we proposed a new API in StreamsBuilder that could get a windowed Ktable.

The table() API in StreamsBuilder looks like this:

public synchronized <K, V> KTable<K, V> table(final String topic,

                                                  final Consumed<K, V> consumed,

                                                  final Materialized<K, V, KeyValueStore<Bytes, byte[]>> materialized) {

        Objects.requireNonNull(topic, "topic can't be null");

        Objects.requireNonNull(consumed, "consumed can't be null");

        Objects.requireNonNull(materialized, "materialized can't be null");

        materialized.withKeySerde(consumed.keySerde).withValueSerde(consumed.valueSerde);

        return internalStreamsBuilder.table(topic,

                                            new ConsumedInternal<>(consumed),

                                            new MaterializedInternal<>(materialized, internalStreamsBuilder, topic + "-"));

    }

 

Where we could see that the store type is given as KeyValueStore. There is no flexibility to change it to WindowStore.

 

To maintain compatibility of the existing API, we have two options to define a new API:

1.Overload existing KTable struct

public synchronized <K, V> KTable<Windowed<K>, V> windowedTable(final String topic,

                                                  final Consumed<K, V> consumed,

                                                  final Materialized<K, V, WindowStore<Bytes, byte[]>> materialized);

 

This could give developer an alternative to use windowed table instead. However, this implies that we need to make sure all the KTable logic still works as expected, such as join, aggregation, etc, so the challenge would be making sure all current KTable logics work.

 

2.Define a new type called WindowedKTable

public synchronized <K, V> WindowedKTable<K, V> windowedTable(final String topic,

                                                  final Consumed<K, V> consumed,

                                                  final Materialized<K, V, WindowStore<Bytes, byte[]>> materialized);

The benefit of doing this is that we don’t need to worry about the existing functionality of KTable. However, the cost is to introduce redundancy of common operation logic. When upgrading common functionality, we need to take care of both types.

We could fill in more details in the KIP. Right now I would like to hear some feedbacks on the two approaches, thank you!



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)