Posted to notifications@iotdb.apache.org by "Jinrui Zhang (Jira)" <ji...@apache.org> on 2022/11/18 02:47:00 UTC

[jira] [Commented] (IOTDB-4825) [ multiLeader ] ERROR o.a.i.d.m.p.s.FixedRateFragInsStateTracker:114 - error happened while fetching query state

    [ https://issues.apache.org/jira/browse/IOTDB-4825?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17635614#comment-17635614 ] 

Jinrui Zhang commented on IOTDB-4825:
-------------------------------------

According to the log and the write results, this appears to be an occasional issue caused by the network connection. Resolving as Won't Fix.
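
For readers following the stack trace in the quoted description: the IOException there wraps an InterruptedException raised while a thread was blocked waiting to borrow a client from a pool. The sketch below reproduces only that pattern, using java.util.concurrent. It is not IoTDB's ClientManager or commons-pool2 code; the class name, the String "clients", and the timeout are hypothetical stand-ins chosen for illustration.

```java
import java.io.IOException;
import java.util.concurrent.LinkedBlockingDeque;
import java.util.concurrent.TimeUnit;

public class BorrowInterruptSketch {

    // Hypothetical stand-in for a client pool. It is left empty on purpose,
    // so every borrow has to wait, as if no idle client were available.
    static final LinkedBlockingDeque<String> idleClients = new LinkedBlockingDeque<>();

    // Waits up to timeoutMs for an idle client. An interrupt during the wait
    // is wrapped in an IOException, mirroring the shape of the logged error.
    static String borrowClient(String endpoint, long timeoutMs) throws IOException {
        try {
            String client = idleClients.pollFirst(timeoutMs, TimeUnit.MILLISECONDS);
            if (client == null) {
                throw new IOException("Borrow client from pool for node " + endpoint + " timed out.");
            }
            return client;
        } catch (InterruptedException e) {
            // Restore the interrupt flag so callers can still observe it.
            Thread.currentThread().interrupt();
            throw new IOException("Borrow client from pool for node " + endpoint + " failed.", e);
        }
    }

    // Starts a worker that blocks in borrowClient, interrupts it, and returns
    // the class name of the cause attached to the resulting IOException.
    static String demoInterruptedBorrow() throws InterruptedException {
        final String[] cause = new String[1];
        Thread worker = new Thread(() -> {
            try {
                borrowClient("192.168.10.66:9003", 10_000);
            } catch (IOException e) {
                cause[0] = e.getCause() == null ? "none" : e.getCause().getClass().getName();
            }
        });
        worker.start();
        Thread.sleep(200);   // give the worker time to start waiting
        worker.interrupt();  // simulate the scheduler cancelling the task
        worker.join();       // join makes the worker's write to cause[0] visible
        return cause[0];
    }

    public static void main(String[] args) throws Exception {
        System.out.println(demoInterruptedBorrow());
    }
}
```

In this sketch the interrupt, not the remote node, is what ends the wait. Whether the same holds for the reported error depends on what interrupted the scheduled task in the real cluster, which the log alone does not show.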

> [ multiLeader ] ERROR o.a.i.d.m.p.s.FixedRateFragInsStateTracker:114 - error happened while fetching query state
> ----------------------------------------------------------------------------------------------------------------
>
>                 Key: IOTDB-4825
>                 URL: https://issues.apache.org/jira/browse/IOTDB-4825
>             Project: Apache IoTDB
>          Issue Type: Bug
>          Components: mpp-cluster
>    Affects Versions: 0.14.0-SNAPSHOT
>            Reporter: 刘珍
>            Assignee: Jinrui Zhang
>            Priority: Major
>         Attachments: iotdb_4825.conf, screenshot-1.png
>
>
> master_1101_bc0e88b
> 3rep , 3C3D
> schema region : ratis
> data region : multiLeader
> The DataNode on ip62 logged an ERROR during writing (all nodes are RUNNING):
> 2022-11-01 17:09:25,158 [pool-23-IoTDB-MPPCoordinatorScheduled-1] ERROR o.a.i.d.m.p.s.FixedRateFragInsStateTracker:114 - error happened while fetching query state
> java.io.IOException: Borrow client from pool for node TEndPoint(ip:192.168.10.66, port:9003) failed.
>         at org.apache.iotdb.commons.client.ClientManager.borrowClient(ClientManager.java:61)
>         at org.apache.iotdb.db.mpp.plan.scheduler.AbstractFragInsStateTracker.fetchState(AbstractFragInsStateTracker.java:82)
>         at org.apache.iotdb.db.mpp.plan.scheduler.FixedRateFragInsStateTracker.fetchStateAndUpdate(FixedRateFragInsStateTracker.java:98)
>         at org.apache.iotdb.commons.concurrent.threadpool.ScheduledExecutorUtil.lambda$scheduleAtFixedRate$0(ScheduledExecutorUtil.java:153)
>         at org.apache.iotdb.commons.concurrent.WrappedRunnable$1.runMayThrow(WrappedRunnable.java:44)
>         at org.apache.iotdb.commons.concurrent.WrappedRunnable.run(WrappedRunnable.java:29)
>         at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
>         at java.util.concurrent.FutureTask.runAndReset(FutureTask.java:308)
>         at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$301(ScheduledThreadPoolExecutor.java:180)
>         at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:294)
>         at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
>         at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
>         at java.lang.Thread.run(Thread.java:748)
> Caused by: java.lang.InterruptedException: null
>         at java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject.reportInterruptAfterWait(AbstractQueuedSynchronizer.java:2014)
>         at java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject.awaitNanos(AbstractQueuedSynchronizer.java:2088)
>         at org.apache.commons.pool2.impl.LinkedBlockingDeque.pollFirst(LinkedBlockingDeque.java:937)
>         at org.apache.commons.pool2.impl.LinkedBlockingDeque.pollFirst(LinkedBlockingDeque.java:956)
>         at org.apache.commons.pool2.impl.GenericKeyedObjectPool.borrowObject(GenericKeyedObjectPool.java:449)
>         at org.apache.commons.pool2.impl.GenericKeyedObjectPool.borrowObject(GenericKeyedObjectPool.java:350)
>         at org.apache.iotdb.commons.client.ClientManager.borrowClient(ClientManager.java:50)
>         ... 12 common frames omitted
> Test procedure:
> 1. 192.168.10.62/66/68 72C256GB
> ConfigNode
> MAX_HEAP_SIZE="8G"
> Common
> query_timeout_threshold=36000000
> schema_region_consensus_protocol_class=org.apache.iotdb.consensus.ratis.RatisConsensus
> data_region_consensus_protocol_class=org.apache.iotdb.consensus.multileader.MultiLeaderConsensus
> schema_replication_factor=3
> data_replication_factor=3
> partition_region_ratis_request_timeout_ms=1200000
> schema_region_ratis_request_timeout_ms=1200000
> data_region_ratis_request_timeout_ms=1200000
> partition_region_ratis_max_retry_attempts=1
> schema_region_ratis_max_retry_attempts=1
> data_region_ratis_max_retry_attempts=1
> DataNode
> MAX_HEAP_SIZE="192G"
> MAX_DIRECT_MEMORY_SIZE="32G"
> 2. The benchmark (bm) runs on 192.168.10.64
> /data/liuzhen_test/weektest/benchmark_tool
> See the attachment for the configuration.
> All writes succeeded
>  !screenshot-1.png! 



--
This message was sent by Atlassian Jira
(v8.20.10#820010)