Posted to commits@cassandra.apache.org by "Roland Otta (JIRA)" <ji...@apache.org> on 2017/04/14 09:14:41 UTC

[jira] [Commented] (CASSANDRA-13450) repair thread stuck

    [ https://issues.apache.org/jira/browse/CASSANDRA-13450?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15968817#comment-15968817 ] 

Roland Otta commented on CASSANDRA-13450:
-----------------------------------------

The last repair-related message in the debug log is from yesterday, and it is accompanied by an exception:

{noformat}
ERROR [ValidationExecutor:733] 2017-04-13 07:42:52,122 Validator.java:261 - Failed creating a merkle tree for [repair #f790a3b0-200b-11e7-832d-63f3cc1716ba on bds/adcounter_total, [(6083618763864780186,6090596758865971039], (5488178912202442861,5490938324381352290]]], /192.168.0.26 (see log for details)
ERROR [ValidationExecutor:733] 2017-04-13 07:42:52,123 CassandraDaemon.java:217 - Exception in thread Thread[ValidationExecutor:733,1,main]
java.lang.NullPointerException: null
        at org.apache.cassandra.service.ActiveRepairService$ParentRepairSession.getActiveSSTables(ActiveRepairService.java:495) ~[apache-cassandra-3.7.jar:3.7]
        at org.apache.cassandra.service.ActiveRepairService$ParentRepairSession.access$300(ActiveRepairService.java:451) ~[apache-cassandra-3.7.jar:3.7]
        at org.apache.cassandra.service.ActiveRepairService.currentlyRepairing(ActiveRepairService.java:338) ~[apache-cassandra-3.7.jar:3.7]
        at org.apache.cassandra.db.compaction.CompactionManager.getSSTablesToValidate(CompactionManager.java:1320) ~[apache-cassandra-3.7.jar:3.7]
        at org.apache.cassandra.db.compaction.CompactionManager.doValidationCompaction(CompactionManager.java:1215) ~[apache-cassandra-3.7.jar:3.7]
        at org.apache.cassandra.db.compaction.CompactionManager.access$700(CompactionManager.java:81) ~[apache-cassandra-3.7.jar:3.7]
        at org.apache.cassandra.db.compaction.CompactionManager$11.call(CompactionManager.java:844) ~[apache-cassandra-3.7.jar:3.7]
        at java.util.concurrent.FutureTask.run(FutureTask.java:266) ~[na:1.8.0_77]
        at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) ~[na:1.8.0_77]
        at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) [na:1.8.0_77]
        at java.lang.Thread.run(Thread.java:745) [na:1.8.0_77]
{noformat}
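
A possible connection, stated purely as an assumption and not confirmed by these logs: if a validation task dies with an exception like the one above and no result is ever delivered to whatever is waiting on it, an untimed wait would park indefinitely, much like the Repair#202:3 thread in the quoted stack trace below. The following is a minimal sketch of that pattern using only JDK classes. The class and method names are hypothetical, and this is not Cassandra's code.

```java
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.TimeoutException;

public class StuckFutureDemo {

    /** Hypothetical stand-in for a validation task that dies before reporting back. */
    static CompletableFuture<String> validationThatNeverReports() {
        CompletableFuture<String> result = new CompletableFuture<>();
        Thread worker = new Thread(() -> {
            // Simulated failure: the task dies here, so nothing ever calls
            // result.complete(...) or result.completeExceptionally(...).
            throw new NullPointerException("simulated validation failure");
        }, "ValidationExecutor-demo");
        // Swallow the exception, as if it were only written to a log.
        worker.setUncaughtExceptionHandler((t, e) -> { });
        worker.start();
        return result;
    }

    /** Returns true if the future finished (normally or exceptionally) within the timeout. */
    static boolean finishesWithin(CompletableFuture<?> future, long millis)
            throws InterruptedException {
        try {
            future.get(millis, TimeUnit.MILLISECONDS);
            return true;
        } catch (ExecutionException e) {
            return true;   // completed, but with a failure
        } catch (TimeoutException e) {
            return false;  // an untimed get() would still be parked at this point
        }
    }

    public static void main(String[] args) throws Exception {
        System.out.println("finished: " + finishesWithin(validationThatNeverReports(), 200));
    }
}
```

Running `main` prints `finished: false`. The timed `get` is used only so that the demo terminates; with an untimed wait the calling thread would stay in WAITING state.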

> repair thread stuck
> -------------------
>
>                 Key: CASSANDRA-13450
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-13450
>             Project: Cassandra
>          Issue Type: Bug
>         Environment: cassandra 3.7
>            Reporter: Roland Otta
>
> We sometimes see stuck repair threads in our production system.
> This is the corresponding stack trace:
> {noformat}
> Name: Repair#202:3
> State: WAITING on com.google.common.util.concurrent.AbstractFuture$Sync@56d7b0ab
> Total blocked: 0  Total waited: 1
> Stack trace: 
> sun.misc.Unsafe.park(Native Method)
> java.util.concurrent.locks.LockSupport.park(LockSupport.java:175)
> java.util.concurrent.locks.AbstractQueuedSynchronizer.parkAndCheckInterrupt(AbstractQueuedSynchronizer.java:836)
> java.util.concurrent.locks.AbstractQueuedSynchronizer.doAcquireSharedInterruptibly(AbstractQueuedSynchronizer.java:997)
> java.util.concurrent.locks.AbstractQueuedSynchronizer.acquireSharedInterruptibly(AbstractQueuedSynchronizer.java:1304)
> com.google.common.util.concurrent.AbstractFuture$Sync.get(AbstractFuture.java:285)
> com.google.common.util.concurrent.AbstractFuture.get(AbstractFuture.java:116)
> com.google.common.util.concurrent.Uninterruptibles.getUninterruptibly(Uninterruptibles.java:137)
> com.google.common.util.concurrent.Futures.getUnchecked(Futures.java:1509)
> org.apache.cassandra.repair.RepairJob.run(RepairJob.java:160)
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
> java.lang.Thread.run(Thread.java:745)
> {noformat}



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)