Posted to issues@spark.apache.org by "Shahbaz Hussain (JIRA)" <ji...@apache.org> on 2018/02/12 14:15:00 UTC

[jira] [Created] (SPARK-23397) Scheduling delay causes Spark Streaming to miss batches.

Shahbaz Hussain created SPARK-23397:
---------------------------------------

             Summary: Scheduling delay causes Spark Streaming to miss batches.
                 Key: SPARK-23397
                 URL: https://issues.apache.org/jira/browse/SPARK-23397
             Project: Spark
          Issue Type: Bug
          Components: DStreams
    Affects Versions: 2.2.1
            Reporter: Shahbaz Hussain


* For complex Spark (Scala) DStream-based applications that create many jobs per batch (for example, 40), batches are not created at their scheduled times. For example, if I start a Spark Streaming application with a batch interval of 20 seconds and the application creates about 40 jobs per batch, the next batch is not created 20 seconds after the previous batch's job creation time.
 * This is because job creation is single-threaded: if the job creation delay is greater than the batch interval, batch execution misses its schedule.
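
To illustrate the timing described above, here is a minimal sketch in plain Python. It is a toy model, not Spark code and not how Spark's scheduler is implemented. The function name `simulate_job_generation` and the figure of 600 ms to create one job are assumptions made only for illustration; the 20 second batch interval and the 40 jobs per batch come from the report. The model assumes a single job-creation thread that cannot start on a batch until it has finished creating the jobs of the previous one.

```python
def simulate_job_generation(batch_interval_ms, jobs_per_batch, job_creation_ms, num_batches):
    """Toy model of single-threaded job creation (hypothetical, not Spark internals).

    Returns one (scheduled_ms, actual_start_ms, delay_ms) tuple per batch.
    """
    results = []
    generator_free_at = 0  # time at which the single creation thread becomes idle
    creation_time = jobs_per_batch * job_creation_ms  # time to create all jobs of one batch

    for i in range(num_batches):
        scheduled = i * batch_interval_ms
        # The thread can only start this batch once the previous batch's jobs are created.
        actual_start = max(scheduled, generator_free_at)
        generator_free_at = actual_start + creation_time
        results.append((scheduled, actual_start, actual_start - scheduled))
    return results


if __name__ == "__main__":
    # 20 s interval and 40 jobs are from the report; 600 ms per job is an assumed value.
    for scheduled, actual, delay in simulate_job_generation(20_000, 40, 600, 5):
        print(f"scheduled={scheduled} ms  started={actual} ms  delay={delay} ms")
```

Under these assumed numbers, creating one batch's jobs takes 24 seconds, which is longer than the 20 second interval, so in this model each batch starts 4 seconds later than the one before it, relative to its schedule, and the delay never recovers. If job creation took less than the batch interval, the model would show no delay.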



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
