You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@kafka.apache.org by "Brian Hawkins (Jira)" <ji...@apache.org> on 2021/03/31 00:18:00 UTC

[jira] [Created] (KAFKA-12585) FencedInstanceIdException can cause heartbeat thread to never be closed

Brian Hawkins created KAFKA-12585:
-------------------------------------

             Summary: FencedInstanceIdException can cause heartbeat thread to never be closed
                 Key: KAFKA-12585
                 URL: https://issues.apache.org/jira/browse/KAFKA-12585
             Project: Kafka
          Issue Type: Bug
          Components: clients
    Affects Versions: 2.7.0, 2.5.1
            Reporter: Brian Hawkins


The bug has been there since static consumers was introduced.

The problem is all within AbstractCoordinator.java

If a FencedInstanceIdException is throw and onFailure (line 1406) is called by a thread other than the heartbeat thread this will occur.  

In the onFailure callback the heartbeatThread.failed is set and the heartbeatThread is disabled, but the actual thread is waiting on line 1350 (AbstractCoordinator.this.wait())

Sometime later pollHeartbeat is called (line 316).  The check for hasFailed is true so it sets heartbeatThread = null without freeing the thread and now it will never be closed.

 

I have verified this within a debuger using two clients that create read and close over and over again using the same group and instance id.  I tested this with 2.5.1 but found the same code bug to be in the latest master branch, the above line numbers are for the latest in github.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)