Posted to issues@spark.apache.org by "Natalia Gorchakova (JIRA)" <ji...@apache.org> on 2017/01/25 11:43:26 UTC

[jira] [Created] (SPARK-19361) kafka.maxRatePerPartition for compacted topic cause exception

Natalia Gorchakova created SPARK-19361:
------------------------------------------

             Summary: kafka.maxRatePerPartition for compacted topic cause exception
                 Key: SPARK-19361
                 URL: https://issues.apache.org/jira/browse/SPARK-19361
             Project: Spark
          Issue Type: Bug
          Components: DStreams
    Affects Versions: 1.6.1
            Reporter: Natalia Gorchakova


Creating a DirectKafkaInputDStream on a compacted topic with the parameter spark.streaming.kafka.maxRatePerPartition set causes an exception:

ERROR [Executor task launch worker-2] executor.Executor: Exception in task 1.0 in stage 2.0 (TID 22)
java.lang.AssertionError: assertion failed: Got 3740923 > ending offset 2428156 for topic COMPACTED.KAFKA.TOPIC partition 6 start 2228156. This should not happen, and indicates a message may have been skipped
  at scala.Predef$.assert(Predef.scala:179)
  at org.apache.spark.streaming.kafka.KafkaRDD$KafkaRDDIterator.getNext(KafkaRDD.scala:217)

This happens because KafkaRDD expects the maximum offset in a batch to be <= startOffset + maxRatePerPartition * secondsInBatch, whereas in a compacted topic some offsets can be missing.
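
The mismatch can be illustrated with a small standalone simulation. This is only a sketch of the arithmetic described above, not Spark's actual code: the function names, the rate of 20000 messages per second, the 10 second batch, the latest offset, and the list of surviving offsets are all hypothetical, chosen so that the start and ending offsets match the ones in the log (2428156 - 2228156 = 200000).

```python
def clamp_until_offset(start_offset, latest_offset, max_rate_per_partition, batch_seconds):
    """Ending offset of a batch: at most rate * seconds offsets past the start."""
    return min(latest_offset, start_offset + max_rate_per_partition * batch_seconds)


def read_batch_strict(surviving_offsets, start_offset, until_offset):
    """Simplified model of a reader that assumes offsets are contiguous.

    It expects to land exactly on until_offset; seeing a larger offset
    first is treated as a skipped message.
    """
    consumed = []
    for offset in sorted(surviving_offsets):
        if offset < start_offset:
            continue
        if offset == until_offset:
            break
        if offset > until_offset:
            raise AssertionError(f"Got {offset} > ending offset {until_offset}")
        consumed.append(offset)
    return consumed


def read_batch_tolerant(surviving_offsets, start_offset, until_offset):
    """Hypothetical gap-tolerant variant: stop at the first offset >= until_offset."""
    return [o for o in sorted(surviving_offsets) if start_offset <= o < until_offset]


# Offsets taken from the log; rate, batch length and latest offset are assumed.
start = 2228156
until = clamp_until_offset(start, latest_offset=5000000,
                           max_rate_per_partition=20000, batch_seconds=10)

# After compaction only a few offsets survive; 2428156 itself is gone.
surviving = [2228156, 2300000, 3740923]

try:
    read_batch_strict(surviving, start, until)
    strict_error = None
except AssertionError as err:
    strict_error = str(err)

tolerant_result = read_batch_tolerant(surviving, start, until)
```

In this model the strict reader fails as soon as it sees offset 3740923, because the computed ending offset 2428156 no longer exists in the partition. The tolerant variant simply ends the batch at the first offset at or beyond the ending offset; it is shown as one possible way to handle gaps, not as the fix adopted by the project.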



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
