Posted to issues@spark.apache.org by "Sasaki Toru (JIRA)" <ji...@apache.org> on 2017/03/22 05:47:41 UTC

[jira] [Commented] (SPARK-20052) Some InputDStream needs closing processing after processing all batches when graceful shutdown

    [ https://issues.apache.org/jira/browse/SPARK-20052?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15935789#comment-15935789 ] 

Sasaki Toru commented on SPARK-20052:
-------------------------------------

Sorry, my explanation was not clear.

This ticket is related to SPARK-20050.
When graceful shutdown is enabled, JobGenerator#stop calls InputDStream#stop first and only then waits for all remaining batches to finish,
but the Kafka 0.10 DirectStream should commit offsets after all batches have been processed.

So I think an additional step (what I called "closing processing") is needed after all batches have been processed.
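
To make the ordering concrete, here is a minimal toy model in plain Python. It is only a sketch and not Spark code: ToyInputStream, close_after_all_batches and graceful_shutdown are hypothetical names invented for illustration, and the model assumes only what is stated above (stop is called first, the remaining batches are processed afterwards).

```python
# Toy model of the graceful-shutdown ordering described above.
# Nothing here is Spark API; every class and method name is hypothetical.

class ToyInputStream:
    def __init__(self):
        self.events = []              # order in which things happened
        self.processed_offsets = []   # offsets whose batches have finished
        self.committed_offsets = []   # offsets reported back to the source

    def stop(self):
        # Stands in for InputDStream#stop: called before the remaining
        # batches are processed, so there is nothing new to commit yet.
        self.events.append("stop")

    def process_batch(self, offset):
        self.events.append(f"batch {offset}")
        self.processed_offsets.append(offset)

    def close_after_all_batches(self):
        # Hypothetical "closing processing" hook: runs after every batch
        # has finished, so it can commit all processed offsets.
        self.events.append("close")
        self.committed_offsets = list(self.processed_offsets)


def graceful_shutdown(stream, pending_offsets, call_closing_hook):
    # Assumed order: stop the input first, then drain the pending batches.
    stream.stop()
    for offset in pending_offsets:
        stream.process_batch(offset)
    if call_closing_hook:
        stream.close_after_all_batches()
    return stream


if __name__ == "__main__":
    without_hook = graceful_shutdown(ToyInputStream(), [1, 2], False)
    with_hook = graceful_shutdown(ToyInputStream(), [1, 2], True)
    print(without_hook.events, without_hook.committed_offsets)
    print(with_hook.events, with_hook.committed_offsets)
```

In this model the last batches are processed in both runs, but their offsets are committed only when the extra hook runs after them.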


> Some InputDStream needs closing processing after processing all batches when graceful shutdown
> ----------------------------------------------------------------------------------------------
>
>                 Key: SPARK-20052
>                 URL: https://issues.apache.org/jira/browse/SPARK-20052
>             Project: Spark
>          Issue Type: Improvement
>          Components: DStreams
>    Affects Versions: 2.2.0
>            Reporter: Sasaki Toru
>
> Some classes that extend InputDStream need closing processing after all batches have been processed when graceful shutdown is enabled
> (e.g. when using Kafka as a data source, the processed offsets need to be committed to the Kafka broker).
> InputDStream has a 'stop' method to stop receiving data, but this method is called before the last batches generated for graceful shutdown are processed.


