You are viewing a plain text version of this content. The canonical link for it is here.

Posted to reviews@spark.apache.org by GitBox <gi...@apache.org> on 2019/05/20 14:59:44 UTC

[GitHub] [spark] HeartSaVioR commented on issue #24613: [SPARK-27549][SS] Add support for committing kafka offsets per batch for supporting external tooling

HeartSaVioR commented on issue #24613: [SPARK-27549][SS] Add support for committing kafka offsets per batch for supporting external tooling
URL: https://github.com/apache/spark/pull/24613#issuecomment-494024240
 
 
   > Is there a way for the user to optionally provide a full group.id per Spark Query?
   
   You can provide a full `group.id` starting at Spark 3.0.0, but the option should be used with full aware of the impacts. Spark fault-tolerance guarantees come from reliable source and sharing consumer group with other app will simply break it - that's why Spark creates unique consumer group id.
   
   https://github.com/apache/spark/blob/master/docs/structured-streaming-kafka-integration.md
   (search `kafka.group.id` in above link)
   
   `query ID` might be considered as unique group id since it can provide both unique and continuous, but it should consider the case where multiple Kafka sources are being used in same query.

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
users@infra.apache.org


With regards,
Apache Git Services

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org