You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@cassandra.apache.org by "Ignace Desimpel (JIRA)" <ji...@apache.org> on 2018/01/12 09:01:00 UTC
[jira] [Created] (CASSANDRA-14164) Calling
StorageService.loadNewSSTables function results in deadlock with compaction
background task
Ignace Desimpel created CASSANDRA-14164:
-------------------------------------------
Summary: Calling StorageService.loadNewSSTables function results in deadlock with compaction background task
Key: CASSANDRA-14164
URL: https://issues.apache.org/jira/browse/CASSANDRA-14164
Project: Cassandra
Issue Type: Bug
Components: Compaction, Tools
Environment: code
Reporter: Ignace Desimpel
Priority: Blocker
Fix For: 2.2.x, 3.0.x
Attachments: Stack1.txt
Tested on version 2.2.11 (but seems like trunck 3.x is still the same for the related code path), using nodetool refresh for restoring a snapshot
Calling StorageService.loadNewSSTables function results in deadlock with compaction background task.
because :
From StorageService class , function public void loadNewSSTables(String ksName, String cfName) a call is made to ColumnFamilyStore class , function public static synchronized void loadNewSSTables(String ksName, String cfName) and then a call to Keyspace class, function public static Keyspace open(String keyspaceName)
getting to the function private static Keyspace open(String keyspaceName, Schema schema, boolean loadSSTables)
finally trying to get a lock by synchronized (Keyspace.class)
So inside the ColumnFamilyStore class lock, there is an attempt to get the lock on the Keyspace.class
Now at the same time I have the thread OptionalTasks executing the ColumnFamilyStore.getBackgroundCompactionTaskSubmitter() task.
The thread task is also calling Keyspace.open function, already progressed as far as getting the lock on Keyspace class.
But then the call also initializes the column families and thus is calling on class ColumnFamilyStore the public static synchronized ColumnFamilyStore createColumnFamilyStore ...
Result : the external call on loadNewSSTables blocks the internal compaction background task.
So function 1 locks A and then B
And function 2 locks B and then A
leading to deadlock (due to incorrect order of locking objects)
Regards,
Ignace
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscribe@cassandra.apache.org
For additional commands, e-mail: commits-help@cassandra.apache.org