You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@kafka.apache.org by "Kumud Kumar Srivatsava Tirupati (Jira)" <ji...@apache.org> on 2022/06/04 03:24:00 UTC

[jira] [Resolved] (KAFKA-13926) Proposal to have "HasField" predicate for kafka connect

     [ https://issues.apache.org/jira/browse/KAFKA-13926?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Kumud Kumar Srivatsava Tirupati resolved KAFKA-13926.
-----------------------------------------------------
    Resolution: Won't Fix

Dropping in favor of improving the existing SMTs as per the discussion.

https://lists.apache.org/thread/odbj7793plyz7xxyy6d71c3xn7zng49f

> Proposal to have "HasField" predicate for kafka connect
> -------------------------------------------------------
>
>                 Key: KAFKA-13926
>                 URL: https://issues.apache.org/jira/browse/KAFKA-13926
>             Project: Kafka
>          Issue Type: Improvement
>          Components: KafkaConnect
>            Reporter: Kumud Kumar Srivatsava Tirupati
>            Assignee: Kumud Kumar Srivatsava Tirupati
>            Priority: Major
>
> Hello,
> Today's connect predicates enables checks on the record metadata. However, this can be limiting considering {*}many inbuilt and custom transformations that we (community) use are more key/value centric{*}.
> Some use-cases this can solve:
>  * Data type conversions of certain pre-identified fields for records coming across datasets only if those fields exist. [Ex: TimestampConverter can be run only if the specified date field exists irrespective of the record metadata]
>  * Skip running certain transform if a given field does/does not exist. A lot of inbuilt transforms raise exceptions (Ex: InsertField transform if the field already exists) thereby breaking the task. Giving this control enable users to consciously configure for such cases.
>  * Even though some inbuilt transforms explicitly handle these cases, it would still be an unnecessary pass-through loop.
>  * Considering each connector usually deals with multiple datasets (Even 100s for a database CDC connector), metadata-centric predicate checking will be somewhat limiting when we talk about such pre-identified custom metadata fields in the records.
> I know some of these cases can be handled within the transforms itself but that defeats the purpose of having predicates.
> We have built this predicate for us and it is found to be extremely helpful. Please let me know your thoughts on the same so that I can raise a PR.
>  
> KIP: https://cwiki.apache.org/confluence/display/KAFKA/KIP-845%3A+%27HasField%27+predicate+for+kafka+connect



--
This message was sent by Atlassian Jira
(v8.20.7#820007)