You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Cody Koeninger (JIRA)" <ji...@apache.org> on 2017/01/26 13:44:24 UTC

[jira] [Commented] (SPARK-19361) kafka.maxRatePerPartition for compacted topic cause exception

    [ https://issues.apache.org/jira/browse/SPARK-19361?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15839683#comment-15839683 ] 

Cody Koeninger commented on SPARK-19361:
----------------------------------------

Compacted topics in general don't work with direct stream, this isn't unique to maxRate.

There's an existing ticket on this,  https://issues.apache.org/jira/browse/SPARK-17147 , with work in progress code.  If this feature is important to you, go help test it,

> kafka.maxRatePerPartition for compacted topic cause exception
> -------------------------------------------------------------
>
>                 Key: SPARK-19361
>                 URL: https://issues.apache.org/jira/browse/SPARK-19361
>             Project: Spark
>          Issue Type: Bug
>          Components: DStreams
>    Affects Versions: 1.6.1
>            Reporter: Natalia Gorchakova
>
> creating DirectKafkaInputDStream with param spark.streaming.kafka.maxRatePerPartition for compacted topic cause exception:
> ERROR [Executor task launch worker-2] executor.Executor: Exception in task 1.0 in stage 2.0 (TID 22)
> java.lang.AssertionError: assertion failed: Got 3740923 > ending offset 2428156 for topic COMPACTED.KAFKA.TOPIC partition 6 start 2228156. This should not happen, and indicates a message may have been skipped
>   at scala.Predef$.assert(Predef.scala:179)
>   at org.apache.spark.streaming.kafka.KafkaRDD$KafkaRDDIterator.getNext(KafkaRDD.scala:217)
> as KafkaRDD expect maxOffset in batch <= startOffset + maxRatePerPartition*secondsInBatch. While for compacted topic some offsets can be missing. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org