You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@ignite.apache.org by "Andrew Medvedev (JIRA)" <ji...@apache.org> on 2018/07/05 17:49:00 UTC

[jira] [Created] (IGNITE-8943) Deactivation in large cluster hangs during rebalance

Andrew Medvedev created IGNITE-8943:
---------------------------------------

             Summary: Deactivation in large cluster hangs during rebalance
                 Key: IGNITE-8943
                 URL: https://issues.apache.org/jira/browse/IGNITE-8943
             Project: Ignite
          Issue Type: Bug
    Affects Versions: 2.5
            Reporter: Andrew Medvedev


In large cluster (> 100 nodes) deactivation during rebalance does not finish (we have waited for at least an hour).

All nodes log "Start deactivation process" message, but on all nodes exchange-worker threads in cluster are WAITING with this stack:

"exchange-worker-#152%DPL_GRID%DplGridNodeName%" #520 daemon prio=5 os_prio=0 tid=0x00007f5cfb2b6000 nid=0x2dc7bc waiting on condition [0x00007f59a05b2000]
   java.lang.Thread.State: WAITING (parking)
  at sun.misc.Unsafe.park(Native Method)
  at java.util.concurrent.locks.LockSupport.park(LockSupport.java:304)
  at org.apache.ignite.internal.util.future.GridFutureAdapter.get0(GridFutureAdapter.java:177)
  at org.apache.ignite.internal.util.future.GridFutureAdapter.get(GridFutureAdapter.java:140)
  at org.apache.ignite.internal.processors.cache.query.continuous.CacheContinuousQueryManager.cancelInternalQuery(CacheContinuousQueryManager.java:575)
  at org.apache.ignite.internal.processors.datastructures.DataStructuresProcessor.onKernalStop(DataStructuresProcessor.java:253)
  at org.apache.ignite.internal.processors.datastructures.DataStructuresProcessor.onDeActivate(DataStructuresProcessor.java:299)
  at org.apache.ignite.internal.processors.cache.distributed.dht.preloader.GridDhtPartitionsExchangeFuture.onClusterStateChangeRequest(GridDhtPartitionsExchangeFuture.java:954)
  at org.apache.ignite.internal.processors.cache.distributed.dht.preloader.GridDhtPartitionsExchangeFuture.init(GridDhtPartitionsExchangeFuture.java:650)
  at org.apache.ignite.internal.processors.cache.GridCachePartitionExchangeManager$ExchangeWorker.body0(GridCachePartitionExchangeManager.java:2452)
  at org.apache.ignite.internal.processors.cache.GridCachePartitionExchangeManager$ExchangeWorker.body(GridCachePartitionExchangeManager.java:2332)
  at org.apache.ignite.internal.util.worker.GridWorker.run(GridWorker.java:110)
  at java.lang.Thread.run(Thread.java:745)

This is reproducible



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)