Posted to issues@spark.apache.org by "Marcelo Vanzin (Jira)" <ji...@apache.org> on 2019/09/04 17:19:00 UTC

[jira] [Assigned] (SPARK-25151) Apply Apache Commons Pool to KafkaDataConsumer

     [ https://issues.apache.org/jira/browse/SPARK-25151?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Marcelo Vanzin reassigned SPARK-25151:
--------------------------------------

    Assignee: Jungtaek Lim

> Apply Apache Commons Pool to KafkaDataConsumer
> ----------------------------------------------
>
>                 Key: SPARK-25151
>                 URL: https://issues.apache.org/jira/browse/SPARK-25151
>             Project: Spark
>          Issue Type: Improvement
>          Components: Structured Streaming
>    Affects Versions: 3.0.0
>            Reporter: Jungtaek Lim
>            Assignee: Jungtaek Lim
>            Priority: Major
>
> KafkaDataConsumer contains its own logic for caching InternalKafkaConsumer, which looks like it could be simplified by applying Apache Commons Pool. The benefits of applying Apache Commons Pool are as follows:
>  * We can get rid of synchronization on the KafkaDataConsumer object while acquiring and returning InternalKafkaConsumer.
>  * We can extract the object pool feature out of the class, so that the behavior of the pool can be tested easily. Currently the caching logic has no detailed tests; the existing tests only cover reported issues.
>  * We can get various statistics for the object pool, and can also enable JMX for the pool.
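
To make the benefits listed above concrete, here is a minimal sketch of a keyed object pool written with only the Java standard library. It is not the Apache Commons Pool API and not the code proposed for Spark. The class name SimpleKeyedPool, its method names, and the counters are all hypothetical, chosen only to illustrate acquiring and returning cached objects without locking an outer object, testing the pool in isolation, and reading basic statistics.

```java
import java.util.Map;
import java.util.Queue;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ConcurrentLinkedQueue;
import java.util.concurrent.atomic.AtomicLong;
import java.util.function.Function;

/**
 * Hypothetical stand-in for a keyed object pool.
 * Illustration only: this is NOT the Apache Commons Pool API.
 */
final class SimpleKeyedPool<K, V> {
    // Idle objects per key; concurrent collections avoid a global lock.
    private final Map<K, Queue<V>> idle = new ConcurrentHashMap<>();
    // Creates a new object for a key when no idle one is available.
    private final Function<K, V> factory;

    // Simple statistics, in the spirit of what a pooling library exposes.
    private final AtomicLong created = new AtomicLong();
    private final AtomicLong borrowed = new AtomicLong();
    private final AtomicLong returned = new AtomicLong();

    SimpleKeyedPool(Function<K, V> factory) {
        this.factory = factory;
    }

    /** Takes an idle object for the key, or creates one if none is idle. */
    V borrowObject(K key) {
        borrowed.incrementAndGet();
        Queue<V> queue = idle.get(key);
        V obj = (queue == null) ? null : queue.poll();
        if (obj == null) {
            created.incrementAndGet();
            obj = factory.apply(key);
        }
        return obj;
    }

    /** Puts an object back so a later borrow for the same key can reuse it. */
    void returnObject(K key, V obj) {
        returned.incrementAndGet();
        idle.computeIfAbsent(key, k -> new ConcurrentLinkedQueue<>()).add(obj);
    }

    long createdCount() { return created.get(); }

    long borrowedCount() { return borrowed.get(); }

    long returnedCount() { return returned.get(); }

    /** Number of idle objects currently held for the key. */
    int idleCount(K key) {
        Queue<V> queue = idle.get(key);
        return (queue == null) ? 0 : queue.size();
    }
}
```

Because the pool is a separate class, a test can borrow an object, return it, borrow again for the same key, and check that the same instance comes back and that only one object was created. A real pooling library additionally handles eviction, validation, capacity limits, and JMX registration, none of which this sketch attempts.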



--
This message was sent by Atlassian Jira
(v8.3.2#803003)
