You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Jeff Nadler (JIRA)" <ji...@apache.org> on 2016/09/14 15:40:20 UTC

[jira] [Created] (SPARK-17539) Streaming Backpressure Starves DirectStream When Used In Combination With Receivers

Jeff Nadler created SPARK-17539:
-----------------------------------

             Summary: Streaming Backpressure Starves DirectStream When Used In Combination With Receivers
                 Key: SPARK-17539
                 URL: https://issues.apache.org/jira/browse/SPARK-17539
             Project: Spark
          Issue Type: Bug
          Components: Streaming
    Affects Versions: 1.5.0
         Environment: Spark 1.5.0 under CDH 5.5.1 w/ Kafka 0.8.2.1
            Reporter: Jeff Nadler


Backpressure seems to work fine for Direct Streams when used alone.

We use Receivers (x4) to consume Topic A, and a Direct Stream to consume Topic B in a single Streaming app.    Both go through updateStateByKey and are later joined.

In this configuration, backpressure works as expected for the Receivers but not for the Direct Stream.   After running for some time there is no scheduling delay - there is some at startup, but it burns off in a minute or two.    

The Direct Stream will get 'stuck' throttled at 1 msg / partition / sec.   As soon as I turn off backpressure this problem goes away.  



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org