You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@cassandra.apache.org by Philippe Dupont <pd...@teads.tv> on 2014/01/02 16:54:28 UTC

Nodetool removenode => Stream stuck

Hi guys,

Recently a node in our ring became unreachable due to some AWS EC2 issue,
and we decided to remove it using "nodetool removenode" command.
As we are using Vnodes on a 28 node cluster, the removenode command
generates a lot of streams between all nodes. The problem is that some
stream becomes stuck.
It's the third time we use the removenode command and each time, some
stream was stuck

The solution we found is to force the removenode (nodetool removenode
force) and then restart the nodes with stream pending to get rid of them

Is there an explanation of this issue?

Thanks,

Philippe

Re: Nodetool removenode => Stream stuck

Posted by Robert Coli <rc...@eventbrite.com>.
On Thu, Jan 2, 2014 at 7:54 AM, Philippe Dupont <pd...@teads.tv> wrote:

> As we are using Vnodes on a 28 node cluster, the removenode command
> generates a lot of streams between all nodes. The problem is that some
> stream becomes stuck.
> It's the third time we use the removenode command and each time, some
> stream was stuck
>

What version of Cassandra? If 1.2.x, streaming has been rewritten
significantly in 2.0, and so in theory should be less likely to do this.
Note that I do not recommend upgrading to 2.0 in production at this time.


> The solution we found is to force the removenode (nodetool removenode
> force) and then restart the nodes with stream pending to get rid of them
>

The solution you found means that the streaming part of removenode was
interrupted, which means you should repair ASAP.

https://issues.apache.org/jira/browse/CASSANDRA-3486

Is the ticket where your input as to the desirability of a facility to
stop/restart streaming operations without restarting nodes is welcome.

=Rob