You are viewing a plain text version of this content. The canonical link for it is here.
Posted to hdfs-dev@hadoop.apache.org by "muyannian (JIRA)" <ji...@apache.org> on 2018/01/01 06:53:00 UTC
[jira] [Created] (HDFS-12971) DfsClint hang on hedged
getFromOneDataNode
muyannian created HDFS-12971:
--------------------------------
Summary: DfsClint hang on hedged getFromOneDataNode
Key: HDFS-12971
URL: https://issues.apache.org/jira/browse/HDFS-12971
Project: Hadoop HDFS
Issue Type: Bug
Components: hdfs, hdfs-client
Affects Versions: 2.7.0, 2.6.0
Environment: hadoop 2.6.0
<property>
<name>dfs.client.hedged.read.threadpool.size</name>
<value>512</value>
</property>
<property>
<name>dfs.client.hedged.read.threshold.millis</name>
<value>300</value>
</property>
Reporter: muyannian
Attachments: 1.jpg, 2.jpg, 3.jpg, 4.jpg, DFSInputStream-afterpatch.java, DFSInputStream-beforepatch.java, DFSInputStream.java.patch
when i used hdfs hedged read , i found DFSInputStream hang on getFirstToComplete method.
the reason is when something has exception on datanode,or namenode , for example FileNotFoundException, that may cause hang up for ever. future has finished but still call get "future = hedgedService.take()" method ,so cause hang up.
in the attatch file is my jstack and java patch.
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
---------------------------------------------------------------------
To unsubscribe, e-mail: hdfs-dev-unsubscribe@hadoop.apache.org
For additional commands, e-mail: hdfs-dev-help@hadoop.apache.org