You are viewing a plain text version of this content. The canonical link for it is here.
Posted to users@kafka.apache.org by Elmar Weber <in...@elmarweber.org> on 2017/11/01 16:18:07 UTC

Client Offset Deleted by Broker without cause

Hello,

I had this morning the issue that a client offset got deleted from a 
broker as it seems.

(Kafka 0.11.0.1 with patch for KAFKA-6030 on top)

It happened in a test environment and the pipeline stage got re-deployed 
multiple times, during one restart, when the consumer reconnected, it 
didn't get any offset and started from the beginning as per it's 
configuration. The queue didn't receive any new events in the last 48h, 
so any restart should not do anything.

Here the relevant consumer logs:


2017-10-31 17:15:08.453 Successfully joined group 
events-inst-agg-stream.aggregation.v1 with generation 5
2017-11-01 14:29:46.554 Successfully joined group 
events-inst-agg-stream.aggregation.v1 with generation 7
2017-11-01 14:51:14.639 Successfully joined group 
events-inst-agg-stream.aggregation.v1 with generation 9
2017-11-01 14:51:19.068 Committing 
Map(GroupTopicPartition(events-inst-agg-stream.aggregation.v1,events.lg,0) 
-> 3830)
2017-11-01 14:51:24.083 Committing 
Map(GroupTopicPartition(events-inst-agg-stream.aggregation.v1,events.lg,0) 
-> 11339)

You can see the restarts at 17:15 yesterday and 14:29 today were normal. 
Then the restart at 14:51 started reading from the beginning (we log 
committed offsets).

The relevant leading broker did some "stuff" that was different between 
14:29 and 14:51. The full logs are below.
 From what I can see it deleted a segment from the consumer offsets log, 
and the next time the consumer connected it got no offset.

I can provide the logs of the other kafka nodes if it is useful. I also 
attached screenshots from the log viewer in case it's easier to read.

I found https://issues.apache.org/jira/browse/KAFKA-5600 which looked 
related, but it's fixed in 0.11.0.1.

Any other ideas what the issue could be?


[2017-11-01 14:09:46,805] INFO [Group Metadata Manager on Broker 1]: 
Removed 0 expired offsets in 1 milliseconds. 
(kafka.coordinator.group.GroupMetadataManager)
[2017-11-01 14:19:46,805] INFO [Group Metadata Manager on Broker 1]: 
Removed 0 expired offsets in 1 milliseconds. 
(kafka.coordinator.group.GroupMetadataManager)
[2017-11-01 14:24:23,627] INFO [GroupCoordinator 1]: Preparing to 
rebalance group events-inst-agg-stream.aggregation.v1 with old 
generation 5 (__consumer_offsets-8) 
(kafka.coordinator.group.GroupCoordinator)
[2017-11-01 14:24:23,627] INFO [GroupCoordinator 1]: Group 
events-inst-agg-stream.aggregation.v1 with generation 6 is now empty 
(__consumer_offsets-8) (kafka.coordinator.group.GroupCoordinator)
[2017-11-01 14:29:46,540] INFO [GroupCoordinator 1]: Preparing to 
rebalance group events-inst-agg-stream.aggregation.v1 with old 
generation 6 (__consumer_offsets-8) 
(kafka.coordinator.group.GroupCoordinator)
[2017-11-01 14:29:46,546] INFO [GroupCoordinator 1]: Stabilized group 
events-inst-agg-stream.aggregation.v1 generation 7 
(__consumer_offsets-8) (kafka.coordinator.group.GroupCoordinator)
[2017-11-01 14:29:46,551] INFO [GroupCoordinator 1]: Assignment received 
from leader for group events-inst-agg-stream.aggregation.v1 for 
generation 7 (kafka.coordinator.group.GroupCoordinator)
[2017-11-01 14:29:46,832] INFO Rolled new log segment for 
'__consumer_offsets-5' in 26 ms. (kafka.log.Log)
[2017-11-01 14:29:46,833] INFO [Group Metadata Manager on Broker 1]: 
Removed 3 expired offsets in 29 milliseconds. 
(kafka.coordinator.group.GroupMetadataManager)
[2017-11-01 14:30:23,642] INFO [GroupCoordinator 1]: Preparing to 
rebalance group key-stats-kafka-stream.stats.v1 with old generation 11 
(__consumer_offsets-5) (kafka.coordinator.group.GroupCoordinator)
[2017-11-01 14:30:23,644] INFO [GroupCoordinator 1]: Group 
key-stats-kafka-stream.stats.v1 with generation 12 is now empty 
(__consumer_offsets-5) (kafka.coordinator.group.GroupCoordinator)
[2017-11-01 14:30:54,731] INFO Deleting segment 0 from log 
__consumer_offsets-5. (kafka.log.Log)
[2017-11-01 14:30:54,767] INFO Deleting index 
/var/lib/kafka/data/topics/__consumer_offsets-5/00000000000000000000.index.deleted 
(kafka.log.OffsetIndex)
[2017-11-01 14:30:54,767] INFO Deleting index 
/var/lib/kafka/data/topics/__consumer_offsets-5/00000000000000000000.timeindex.deleted 
(kafka.log.TimeIndex)
[2017-11-01 14:36:15,590] INFO [GroupCoordinator 1]: Preparing to 
rebalance group key-stats-kafka-stream.stats.v1 with old generation 12 
(__consumer_offsets-5) (kafka.coordinator.group.GroupCoordinator)
[2017-11-01 14:36:15,594] INFO [GroupCoordinator 1]: Stabilized group 
key-stats-kafka-stream.stats.v1 generation 13 (__consumer_offsets-5) 
(kafka.coordinator.group.GroupCoordinator)
[2017-11-01 14:36:15,612] INFO [GroupCoordinator 1]: Assignment received 
from leader for group key-stats-kafka-stream.stats.v1 for generation 13 
(kafka.coordinator.group.GroupCoordinator)
[2017-11-01 14:39:46,805] INFO [Group Metadata Manager on Broker 1]: 
Removed 0 expired offsets in 1 milliseconds. 
(kafka.coordinator.group.GroupMetadataManager)
[2017-11-01 14:49:46,804] INFO [Group Metadata Manager on Broker 1]: 
Removed 0 expired offsets in 0 milliseconds. 
(kafka.coordinator.group.GroupMetadataManager)
[2017-11-01 14:50:45,546] INFO [GroupCoordinator 1]: Preparing to 
rebalance group events-inst-agg-stream.aggregation.v1 with old 
generation 7 (__consumer_offsets-8) 
(kafka.coordinator.group.GroupCoordinator)
[2017-11-01 14:50:45,546] INFO [GroupCoordinator 1]: Group 
events-inst-agg-stream.aggregation.v1 with generation 8 is now empty 
(__consumer_offsets-8) (kafka.coordinator.group.GroupCoordinator)
[2017-11-01 14:51:14,625] INFO [GroupCoordinator 1]: Preparing to 
rebalance group events-inst-agg-stream.aggregation.v1 with old 
generation 8 (__consumer_offsets-8) 
(kafka.coordinator.group.GroupCoordinator)
[2017-11-01 14:51:14,627] INFO [GroupCoordinator 1]: Stabilized group 
events-inst-agg-stream.aggregation.v1 generation 9 
(__consumer_offsets-8) (kafka.coordinator.group.GroupCoordinator)
[2017-11-01 14:51:14,635] INFO [GroupCoordinator 1]: Assignment received 
from leader for group events-inst-agg-stream.aggregation.v1 for 
generation 9 (kafka.coordinator.group.GroupCoordinator)
[2017-11-01 14:52:50,193] INFO Rolled new log segment for 
'events.lg.aggregated-0' in 5 ms. (kafka.log.Log)




Re: Client Offset Deleted by Broker without cause

Posted by Elmar Weber <in...@elmarweber.org>.
Right, they are:
http://cupenya.com/files/tmp/hgd23121/consumer-logs.png
http://cupenya.com/files/tmp/hgd23121/kafka-broker-logs.png

(it's a bit more than in the pasted mail)

On 11/01/2017 5:35 PM, Ted Yu wrote:
> bq. attached screenshots from the log viewer
> 
> The screenshots didn't go through. Consider using 3rd party site.
> 
> On Wed, Nov 1, 2017 at 9:18 AM, Elmar Weber <in...@elmarweber.org> wrote:
> 
>> Hello,
>>
>> I had this morning the issue that a client offset got deleted from a
>> broker as it seems.
>>
>> (Kafka 0.11.0.1 with patch for KAFKA-6030 on top)
>>
>> It happened in a test environment and the pipeline stage got re-deployed
>> multiple times, during one restart, when the consumer reconnected, it
>> didn't get any offset and started from the beginning as per it's
>> configuration. The queue didn't receive any new events in the last 48h, so
>> any restart should not do anything.
>>
>> Here the relevant consumer logs:
>>
>>
>> 2017-10-31 17:15:08.453 Successfully joined group
>> events-inst-agg-stream.aggregation.v1 with generation 5
>> 2017-11-01 14:29:46.554 Successfully joined group
>> events-inst-agg-stream.aggregation.v1 with generation 7
>> 2017-11-01 14:51:14.639 Successfully joined group
>> events-inst-agg-stream.aggregation.v1 with generation 9
>> 2017-11-01 14:51:19.068 Committing Map(GroupTopicPartition(events
>> -inst-agg-stream.aggregation.v1,events.lg,0) -> 3830)
>> 2017-11-01 14:51:24.083 Committing Map(GroupTopicPartition(events
>> -inst-agg-stream.aggregation.v1,events.lg,0) -> 11339)
>>
>> You can see the restarts at 17:15 yesterday and 14:29 today were normal.
>> Then the restart at 14:51 started reading from the beginning (we log
>> committed offsets).
>>
>> The relevant leading broker did some "stuff" that was different between
>> 14:29 and 14:51. The full logs are below.
>>  From what I can see it deleted a segment from the consumer offsets log,
>> and the next time the consumer connected it got no offset.
>>
>> I can provide the logs of the other kafka nodes if it is useful. I also
>> attached screenshots from the log viewer in case it's easier to read.
>>
>> I found https://issues.apache.org/jira/browse/KAFKA-5600 which looked
>> related, but it's fixed in 0.11.0.1.
>>
>> Any other ideas what the issue could be?
>>
>>
>> [2017-11-01 14:09:46,805] INFO [Group Metadata Manager on Broker 1]:
>> Removed 0 expired offsets in 1 milliseconds. (kafka.coordinator.group.Group
>> MetadataManager)
>> [2017-11-01 14:19:46,805] INFO [Group Metadata Manager on Broker 1]:
>> Removed 0 expired offsets in 1 milliseconds. (kafka.coordinator.group.Group
>> MetadataManager)
>> [2017-11-01 14:24:23,627] INFO [GroupCoordinator 1]: Preparing to
>> rebalance group events-inst-agg-stream.aggregation.v1 with old generation
>> 5 (__consumer_offsets-8) (kafka.coordinator.group.GroupCoordinator)
>> [2017-11-01 14:24:23,627] INFO [GroupCoordinator 1]: Group
>> events-inst-agg-stream.aggregation.v1 with generation 6 is now empty
>> (__consumer_offsets-8) (kafka.coordinator.group.GroupCoordinator)
>> [2017-11-01 14:29:46,540] INFO [GroupCoordinator 1]: Preparing to
>> rebalance group events-inst-agg-stream.aggregation.v1 with old generation
>> 6 (__consumer_offsets-8) (kafka.coordinator.group.GroupCoordinator)
>> [2017-11-01 14:29:46,546] INFO [GroupCoordinator 1]: Stabilized group
>> events-inst-agg-stream.aggregation.v1 generation 7 (__consumer_offsets-8)
>> (kafka.coordinator.group.GroupCoordinator)
>> [2017-11-01 14:29:46,551] INFO [GroupCoordinator 1]: Assignment received
>> from leader for group events-inst-agg-stream.aggregation.v1 for
>> generation 7 (kafka.coordinator.group.GroupCoordinator)
>> [2017-11-01 14:29:46,832] INFO Rolled new log segment for
>> '__consumer_offsets-5' in 26 ms. (kafka.log.Log)
>> [2017-11-01 14:29:46,833] INFO [Group Metadata Manager on Broker 1]:
>> Removed 3 expired offsets in 29 milliseconds. (kafka.coordinator.group.Group
>> MetadataManager)
>> [2017-11-01 14:30:23,642] INFO [GroupCoordinator 1]: Preparing to
>> rebalance group key-stats-kafka-stream.stats.v1 with old generation 11
>> (__consumer_offsets-5) (kafka.coordinator.group.GroupCoordinator)
>> [2017-11-01 14:30:23,644] INFO [GroupCoordinator 1]: Group
>> key-stats-kafka-stream.stats.v1 with generation 12 is now empty
>> (__consumer_offsets-5) (kafka.coordinator.group.GroupCoordinator)
>> [2017-11-01 14:30:54,731] INFO Deleting segment 0 from log
>> __consumer_offsets-5. (kafka.log.Log)
>> [2017-11-01 14:30:54,767] INFO Deleting index
>> /var/lib/kafka/data/topics/__consumer_offsets-5/00000000000000000000.index.deleted
>> (kafka.log.OffsetIndex)
>> [2017-11-01 14:30:54,767] INFO Deleting index
>> /var/lib/kafka/data/topics/__consumer_offsets-5/00000000000000000000.timeindex.deleted
>> (kafka.log.TimeIndex)
>> [2017-11-01 14:36:15,590] INFO [GroupCoordinator 1]: Preparing to
>> rebalance group key-stats-kafka-stream.stats.v1 with old generation 12
>> (__consumer_offsets-5) (kafka.coordinator.group.GroupCoordinator)
>> [2017-11-01 14:36:15,594] INFO [GroupCoordinator 1]: Stabilized group
>> key-stats-kafka-stream.stats.v1 generation 13 (__consumer_offsets-5)
>> (kafka.coordinator.group.GroupCoordinator)
>> [2017-11-01 14:36:15,612] INFO [GroupCoordinator 1]: Assignment received
>> from leader for group key-stats-kafka-stream.stats.v1 for generation 13
>> (kafka.coordinator.group.GroupCoordinator)
>> [2017-11-01 14:39:46,805] INFO [Group Metadata Manager on Broker 1]:
>> Removed 0 expired offsets in 1 milliseconds. (kafka.coordinator.group.Group
>> MetadataManager)
>> [2017-11-01 14:49:46,804] INFO [Group Metadata Manager on Broker 1]:
>> Removed 0 expired offsets in 0 milliseconds. (kafka.coordinator.group.Group
>> MetadataManager)
>> [2017-11-01 14:50:45,546] INFO [GroupCoordinator 1]: Preparing to
>> rebalance group events-inst-agg-stream.aggregation.v1 with old generation
>> 7 (__consumer_offsets-8) (kafka.coordinator.group.GroupCoordinator)
>> [2017-11-01 14:50:45,546] INFO [GroupCoordinator 1]: Group
>> events-inst-agg-stream.aggregation.v1 with generation 8 is now empty
>> (__consumer_offsets-8) (kafka.coordinator.group.GroupCoordinator)
>> [2017-11-01 14:51:14,625] INFO [GroupCoordinator 1]: Preparing to
>> rebalance group events-inst-agg-stream.aggregation.v1 with old generation
>> 8 (__consumer_offsets-8) (kafka.coordinator.group.GroupCoordinator)
>> [2017-11-01 14:51:14,627] INFO [GroupCoordinator 1]: Stabilized group
>> events-inst-agg-stream.aggregation.v1 generation 9 (__consumer_offsets-8)
>> (kafka.coordinator.group.GroupCoordinator)
>> [2017-11-01 14:51:14,635] INFO [GroupCoordinator 1]: Assignment received
>> from leader for group events-inst-agg-stream.aggregation.v1 for
>> generation 9 (kafka.coordinator.group.GroupCoordinator)
>> [2017-11-01 14:52:50,193] INFO Rolled new log segment for
>> 'events.lg.aggregated-0' in 5 ms. (kafka.log.Log)
>>
>>
>>
>>
> 


Re: Client Offset Deleted by Broker without cause

Posted by Ted Yu <yu...@gmail.com>.
bq. attached screenshots from the log viewer

The screenshots didn't go through. Consider using 3rd party site.

On Wed, Nov 1, 2017 at 9:18 AM, Elmar Weber <in...@elmarweber.org> wrote:

> Hello,
>
> I had this morning the issue that a client offset got deleted from a
> broker as it seems.
>
> (Kafka 0.11.0.1 with patch for KAFKA-6030 on top)
>
> It happened in a test environment and the pipeline stage got re-deployed
> multiple times, during one restart, when the consumer reconnected, it
> didn't get any offset and started from the beginning as per it's
> configuration. The queue didn't receive any new events in the last 48h, so
> any restart should not do anything.
>
> Here the relevant consumer logs:
>
>
> 2017-10-31 17:15:08.453 Successfully joined group
> events-inst-agg-stream.aggregation.v1 with generation 5
> 2017-11-01 14:29:46.554 Successfully joined group
> events-inst-agg-stream.aggregation.v1 with generation 7
> 2017-11-01 14:51:14.639 Successfully joined group
> events-inst-agg-stream.aggregation.v1 with generation 9
> 2017-11-01 14:51:19.068 Committing Map(GroupTopicPartition(events
> -inst-agg-stream.aggregation.v1,events.lg,0) -> 3830)
> 2017-11-01 14:51:24.083 Committing Map(GroupTopicPartition(events
> -inst-agg-stream.aggregation.v1,events.lg,0) -> 11339)
>
> You can see the restarts at 17:15 yesterday and 14:29 today were normal.
> Then the restart at 14:51 started reading from the beginning (we log
> committed offsets).
>
> The relevant leading broker did some "stuff" that was different between
> 14:29 and 14:51. The full logs are below.
> From what I can see it deleted a segment from the consumer offsets log,
> and the next time the consumer connected it got no offset.
>
> I can provide the logs of the other kafka nodes if it is useful. I also
> attached screenshots from the log viewer in case it's easier to read.
>
> I found https://issues.apache.org/jira/browse/KAFKA-5600 which looked
> related, but it's fixed in 0.11.0.1.
>
> Any other ideas what the issue could be?
>
>
> [2017-11-01 14:09:46,805] INFO [Group Metadata Manager on Broker 1]:
> Removed 0 expired offsets in 1 milliseconds. (kafka.coordinator.group.Group
> MetadataManager)
> [2017-11-01 14:19:46,805] INFO [Group Metadata Manager on Broker 1]:
> Removed 0 expired offsets in 1 milliseconds. (kafka.coordinator.group.Group
> MetadataManager)
> [2017-11-01 14:24:23,627] INFO [GroupCoordinator 1]: Preparing to
> rebalance group events-inst-agg-stream.aggregation.v1 with old generation
> 5 (__consumer_offsets-8) (kafka.coordinator.group.GroupCoordinator)
> [2017-11-01 14:24:23,627] INFO [GroupCoordinator 1]: Group
> events-inst-agg-stream.aggregation.v1 with generation 6 is now empty
> (__consumer_offsets-8) (kafka.coordinator.group.GroupCoordinator)
> [2017-11-01 14:29:46,540] INFO [GroupCoordinator 1]: Preparing to
> rebalance group events-inst-agg-stream.aggregation.v1 with old generation
> 6 (__consumer_offsets-8) (kafka.coordinator.group.GroupCoordinator)
> [2017-11-01 14:29:46,546] INFO [GroupCoordinator 1]: Stabilized group
> events-inst-agg-stream.aggregation.v1 generation 7 (__consumer_offsets-8)
> (kafka.coordinator.group.GroupCoordinator)
> [2017-11-01 14:29:46,551] INFO [GroupCoordinator 1]: Assignment received
> from leader for group events-inst-agg-stream.aggregation.v1 for
> generation 7 (kafka.coordinator.group.GroupCoordinator)
> [2017-11-01 14:29:46,832] INFO Rolled new log segment for
> '__consumer_offsets-5' in 26 ms. (kafka.log.Log)
> [2017-11-01 14:29:46,833] INFO [Group Metadata Manager on Broker 1]:
> Removed 3 expired offsets in 29 milliseconds. (kafka.coordinator.group.Group
> MetadataManager)
> [2017-11-01 14:30:23,642] INFO [GroupCoordinator 1]: Preparing to
> rebalance group key-stats-kafka-stream.stats.v1 with old generation 11
> (__consumer_offsets-5) (kafka.coordinator.group.GroupCoordinator)
> [2017-11-01 14:30:23,644] INFO [GroupCoordinator 1]: Group
> key-stats-kafka-stream.stats.v1 with generation 12 is now empty
> (__consumer_offsets-5) (kafka.coordinator.group.GroupCoordinator)
> [2017-11-01 14:30:54,731] INFO Deleting segment 0 from log
> __consumer_offsets-5. (kafka.log.Log)
> [2017-11-01 14:30:54,767] INFO Deleting index
> /var/lib/kafka/data/topics/__consumer_offsets-5/00000000000000000000.index.deleted
> (kafka.log.OffsetIndex)
> [2017-11-01 14:30:54,767] INFO Deleting index
> /var/lib/kafka/data/topics/__consumer_offsets-5/00000000000000000000.timeindex.deleted
> (kafka.log.TimeIndex)
> [2017-11-01 14:36:15,590] INFO [GroupCoordinator 1]: Preparing to
> rebalance group key-stats-kafka-stream.stats.v1 with old generation 12
> (__consumer_offsets-5) (kafka.coordinator.group.GroupCoordinator)
> [2017-11-01 14:36:15,594] INFO [GroupCoordinator 1]: Stabilized group
> key-stats-kafka-stream.stats.v1 generation 13 (__consumer_offsets-5)
> (kafka.coordinator.group.GroupCoordinator)
> [2017-11-01 14:36:15,612] INFO [GroupCoordinator 1]: Assignment received
> from leader for group key-stats-kafka-stream.stats.v1 for generation 13
> (kafka.coordinator.group.GroupCoordinator)
> [2017-11-01 14:39:46,805] INFO [Group Metadata Manager on Broker 1]:
> Removed 0 expired offsets in 1 milliseconds. (kafka.coordinator.group.Group
> MetadataManager)
> [2017-11-01 14:49:46,804] INFO [Group Metadata Manager on Broker 1]:
> Removed 0 expired offsets in 0 milliseconds. (kafka.coordinator.group.Group
> MetadataManager)
> [2017-11-01 14:50:45,546] INFO [GroupCoordinator 1]: Preparing to
> rebalance group events-inst-agg-stream.aggregation.v1 with old generation
> 7 (__consumer_offsets-8) (kafka.coordinator.group.GroupCoordinator)
> [2017-11-01 14:50:45,546] INFO [GroupCoordinator 1]: Group
> events-inst-agg-stream.aggregation.v1 with generation 8 is now empty
> (__consumer_offsets-8) (kafka.coordinator.group.GroupCoordinator)
> [2017-11-01 14:51:14,625] INFO [GroupCoordinator 1]: Preparing to
> rebalance group events-inst-agg-stream.aggregation.v1 with old generation
> 8 (__consumer_offsets-8) (kafka.coordinator.group.GroupCoordinator)
> [2017-11-01 14:51:14,627] INFO [GroupCoordinator 1]: Stabilized group
> events-inst-agg-stream.aggregation.v1 generation 9 (__consumer_offsets-8)
> (kafka.coordinator.group.GroupCoordinator)
> [2017-11-01 14:51:14,635] INFO [GroupCoordinator 1]: Assignment received
> from leader for group events-inst-agg-stream.aggregation.v1 for
> generation 9 (kafka.coordinator.group.GroupCoordinator)
> [2017-11-01 14:52:50,193] INFO Rolled new log segment for
> 'events.lg.aggregated-0' in 5 ms. (kafka.log.Log)
>
>
>
>