You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@kafka.apache.org by "Jason Rosenberg (JIRA)" <ji...@apache.org> on 2013/08/27 03:21:56 UTC

[jira] [Created] (KAFKA-1025) Producer.send should provide recoverability info on failiure

Jason Rosenberg created KAFKA-1025:
--------------------------------------

             Summary: Producer.send should provide recoverability info on failiure
                 Key: KAFKA-1025
                 URL: https://issues.apache.org/jira/browse/KAFKA-1025
             Project: Kafka
          Issue Type: Bug
    Affects Versions: 0.8
            Reporter: Jason Rosenberg


Currently, in 0.8, the Producer.send() method either succeeds, or fails by throwing an Exception.

There are several exceptions that can be thrown, including:

FailedToSendException
QueueFullException
ClassCastExeption

These are all sub-classes of RuntimeException.

Under the covers, the producer will retry sending messages up to a maximum number of times (according to the message.send.max.retries property).  Internally, the producer may decide which sorts of failures are recoverable, and will retry those.  Alternatively (via an upcoming change, see KAFKA-998), it may decide to not retry at all, if the error is not recoverable.

The problem is, if FailedToSendException is returned, the caller to Producer.send doesn't have a way to decide if a send failed due to an unrecoverable error, or failed after exhausting a maximum number of retries.

A caller may want to decide to retry more times, perhaps after waiting a while.  But it should know first whether it's even likely that the failure is retryable.

An example of this might be a if the message size is too large (represented internally as a MessageSizeTooLargeException).  In this case, it is not recoverable, but it is still wrapped as a FailedToSendException, and should not be retried.

So the suggestion is to make clear in the api javadoc (or scaladoc) for Producer.send, the set of exception types that can be thrown (so that we don't have to search through source code to find them).  And add exception types, or perhaps fields within FailedToSendException, so that it's possible to reason about whether retrying might make sense.

Currently, in addition, I've found that Producer.send can throw a QueueFullException in async mode (this should be a retryable exception, after time has elapsed, etc.), and also a ClassCastException, if there's a misconfiguration between the configured Encoder and the message data type.  I suspect there are other RuntimeExceptions that can also be thrown (e.g. NullPointerException if the message/topic are null).



--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira