Posted to notifications@iotdb.apache.org by "刘珍 (Jira)" <ji...@apache.org> on 2022/11/04 06:47:00 UTC
[jira] [Created] (IOTDB-4851) 1rep 3C 5D , remove 1 datanode , region migration failed(Unknown)
刘珍 created IOTDB-4851:
-------------------------
Summary: 1rep 3C 5D , remove 1 datanode , region migration failed(Unknown)
Key: IOTDB-4851
URL: https://issues.apache.org/jira/browse/IOTDB-4851
Project: Apache IoTDB
Issue Type: Bug
Components: mpp-cluster
Affects Versions: 0.14.0-SNAPSHOT
Reporter: 刘珍
Assignee: Gaofei Cao
Attachments: image-2022-11-04-14-40-54-764.png
Version: m_1103_f857667
{color:#DE350B}1 replica{color}, 3C5D
schema_region_consensus_protocol_class=org.apache.iotdb.consensus.ratis.RatisConsensus
data_region_consensus_protocol_class=org.apache.iotdb.consensus.multileader.{color:#DE350B}MultiLeaderConsensus{color}
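Scaled in by 1 node (ip76). The node was removed from the cluster successfully, but:
Problem 1: the data on the removed node was not migrated successfully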
!image-2022-11-04-14-40-54-764.png!
Problem 2: the datanode process on the removed node does not exit, and the datanode log keeps printing:
2022-11-04 13:52:56,659 [7@group-000200000006-SegmentedRaftLogWorker] INFO o.a.r.s.r.s.SegmentedRaftLogWorker:345 - 7@group-000200000006-SegmentedRaftLogWorker was interrupted, exiting. There are 0 tasks remaining in the queue.
2022-11-04 13:52:56,659 [7@group-000200000001-SegmentedRaftLogWorker] INFO o.a.r.s.r.s.SegmentedRaftLogWorker:345 - 7@group-000200000001-SegmentedRaftLogWorker was interrupted, exiting. There are 0 tasks remaining in the queue.
2022-11-04 13:52:56,684 [7-impl-thread2] INFO o.a.r.s.r.s.SegmentedRaftLogWorker:255 - 7@group-000200000006-SegmentedRaftLogWorker close()
2022-11-04 13:52:56,701 [7-impl-thread3] INFO o.a.r.s.r.s.SegmentedRaftLogWorker:255 - 7@group-000200000001-SegmentedRaftLogWorker close()
2022-11-04 13:52:56,703 [JvmPauseMonitor0] INFO o.a.r.u.JvmPauseMonitor:111 - JvmPauseMonitor-7: Stopped
2022-11-04 13:52:56,705 [pool-24-IoTDB-DataNodeInternalRPC-Processor-146] INFO o.a.i.c.s.ThriftService:158 - IoTDB: closing Multi Leader consensus Service...
2022-11-04 13:52:56,707 [pool-24-IoTDB-DataNodeInternalRPC-Processor-146] INFO o.a.i.c.s.ThriftService:165 - IoTDB: close Multi Leader consensus Service successfully
2022-11-04 13:52:56,707 [pool-24-IoTDB-DataNodeInternalRPC-Processor-146] INFO o.a.i.c.s.RegisterManager:67 - deregister all service.
2022-11-04 13:53:23,771 [DataNodeInternalRPC-Service] ERROR o.a.t.s.TThreadPoolServer:144 - Shutdown is not done after 60SECONDS
2022-11-04 13:53:23,778 [MPPDataExchangeRPC-Service] ERROR o.a.t.s.TThreadPoolServer:144 - Shutdown is not done after 60SECONDS
2022-11-04 13:53:54,354 [Thread-0] WARN o.a.i.d.e.s.DataRegion:1848 - root.test.g_2-21 has spent 60s to wait for closing all TsFiles.
2022-11-04 13:54:54,354 [Thread-0] WARN o.a.i.d.e.s.DataRegion:1848 - root.test.g_2-21 has spent 120s to wait for closing all TsFiles.
2022-11-04 13:55:54,354 [Thread-0] WARN o.a.i.d.e.s.DataRegion:1848 - root.test.g_2-21 has spent 180s to wait for closing all TsFiles.
2022-11-04 13:56:54,355 [Thread-0] WARN o.a.i.d.e.s.DataRegion:1848 - root.test.g_2-21 has spent 240s to wait for closing all TsFiles.
2022-11-04 13:57:54,355 [Thread-0] WARN o.a.i.d.e.s.DataRegion:1848 - root.test.g_2-21 has spent 300s to wait for closing all TsFiles.
2022-11-04 13:58:54,355 [Thread-0] WARN o.a.i.d.e.s.DataRegion:1848 - root.test.g_2-21 has spent 360s to wait for closing all TsFiles.
2022-11-04 13:59:54,356 [Thread-0] WARN o.a.i.d.e.s.DataRegion:1848 - root.test.g_2-21 has spent 420s to wait for closing all TsFiles.
2022-11-04 14:00:54,356 [Thread-0] WARN o.a.i.d.e.s.DataRegion:1848 - root.test.g_2-21 has spent 480s to wait for closing all TsFiles.
2022-11-04 14:01:54,357 [Thread-0] WARN o.a.i.d.e.s.DataRegion:1848 - root.test.g_2-21 has spent 540s to wait for closing all TsFiles.
2022-11-04 14:02:54,357 [Thread-0] WARN o.a.i.d.e.s.DataRegion:1848 - root.test.g_2-21 has spent 600s to wait for closing all TsFiles.
2022-11-04 14:03:54,357 [Thread-0] WARN o.a.i.d.e.s.DataRegion:1848 - root.test.g_2-21 has spent 660s to wait for closing all TsFiles.
2022-11-04 14:04:54,358 [Thread-0] WARN o.a.i.d.e.s.DataRegion:1848 - root.test.g_2-21 has spent 720s to wait for closing all TsFiles.
2022-11-04 14:05:54,358 [Thread-0] WARN o.a.i.d.e.s.DataRegion:1848 - root.test.g_2-21 has spent 780s to wait for closing all TsFiles.
2022-11-04 14:06:54,358 [Thread-0] WARN o.a.i.d.e.s.DataRegion:1848 - root.test.g_2-21 has spent 840s to wait for closing all TsFiles.
2022-11-04 14:07:54,358 [Thread-0] WARN o.a.i.d.e.s.DataRegion:1848 - root.test.g_2-21 has spent 900s to wait for closing all TsFiles.
2022-11-04 14:08:54,359 [Thread-0] WARN o.a.i.d.e.s.DataRegion:1848 - root.test.g_2-21 has spent 960s to wait for closing all TsFiles.
2022-11-04 14:09:54,359 [Thread-0] WARN o.a.i.d.e.s.DataRegion:1848 - root.test.g_2-21 has spent 1020s to wait for closing all TsFiles.
2022-11-04 14:10:54,359 [Thread-0] WARN o.a.i.d.e.s.DataRegion:1848 - root.test.g_2-21 has spent 1080s to wait for closing all TsFiles.
2022-11-04 14:11:54,360 [Thread-0] WARN o.a.i.d.e.s.DataRegion:1848 - root.test.g_2-21 has spent 1140s to wait for closing all TsFiles.
2022-11-04 14:12:54,360 [Thread-0] WARN o.a.i.d.e.s.DataRegion:1848 - root.test.g_2-21 has spent 1200s to wait for closing all TsFiles.
2022-11-04 14:13:54,360 [Thread-0] WARN o.a.i.d.e.s.DataRegion:1848 - root.test.g_2-21 has spent 1260s to wait for closing all TsFiles.
2022-11-04 14:14:54,360 [Thread-0] WARN o.a.i.d.e.s.DataRegion:1848 - root.test.g_2-21 has spent 1320s to wait for closing all TsFiles.
2022-11-04 14:15:54,361 [Thread-0] WARN o.a.i.d.e.s.DataRegion:1848 - root.test.g_2-21 has spent 1380s to wait for closing all TsFiles.
2022-11-04 14:16:54,361 [Thread-0] WARN o.a.i.d.e.s.DataRegion:1848 - root.test.g_2-21 has spent 1440s to wait for closing all TsFiles.
……
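For triage, the repeated DataRegion warnings above can be condensed into one line per region. The following is a minimal Python sketch and not part of IoTDB: the function name longest_close_waits and the regular expression are assumptions derived only from the log format shown in this report.

```python
import re

# Assumed pattern: matches only the "has spent Ns to wait for closing all
# TsFiles" warning in the format seen in the log excerpt of this report.
WAIT_RE = re.compile(
    r"^(?P<ts>\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}),\d{3} .*? WARN .*? - "
    r"(?P<region>\S+) has spent (?P<secs>\d+)s to wait for closing all TsFiles\.$"
)


def longest_close_waits(lines):
    """Return {region: (max_wait_seconds, timestamp_of_that_line)}."""
    waits = {}
    for line in lines:
        m = WAIT_RE.match(line.strip())
        if not m:
            continue  # ignore every other log line
        secs = int(m.group("secs"))
        region = m.group("region")
        # Keep only the longest wait reported for each region.
        if region not in waits or secs > waits[region][0]:
            waits[region] = (secs, m.group("ts"))
    return waits


# Three lines copied from the log excerpt in this report.
sample = [
    "2022-11-04 13:53:23,771 [DataNodeInternalRPC-Service] ERROR o.a.t.s.TThreadPoolServer:144 - Shutdown is not done after 60SECONDS",
    "2022-11-04 13:53:54,354 [Thread-0] WARN o.a.i.d.e.s.DataRegion:1848 - root.test.g_2-21 has spent 60s to wait for closing all TsFiles.",
    "2022-11-04 14:16:54,361 [Thread-0] WARN o.a.i.d.e.s.DataRegion:1848 - root.test.g_2-21 has spent 1440s to wait for closing all TsFiles.",
]
result = longest_close_waits(sample)
print(result)
```

To run it against a full datanode log, pass an open file object instead of the sample list; the log file location is deployment-specific and is not assumed here.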
Test procedure
1. Start the cluster on 192.168.10.72~75
ConfigNode 72,73,74
MAX_HEAP_SIZE="8G"
Common
max_connection_for_internal_service=300
query_timeout_threshold=3600000
schema_region_consensus_protocol_class=org.apache.iotdb.consensus.ratis.RatisConsensus
data_region_consensus_protocol_class=org.apache.iotdb.consensus.multileader.MultiLeaderConsensus
schema_replication_factor=1
data_replication_factor=1
DataNode
MAX_HEAP_SIZE="256G"
MAX_DIRECT_MEMORY_SIZE="32G"
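2. Write data with bm
See the attachment for the configuration
3. After the write completes, with no other client operations, scale in ip76
Detailed logs are on the machine: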
/data/mpp_test/m_1103_f857667
--
This message was sent by Atlassian Jira
(v8.20.10#820010)