Posted to hdfs-dev@hadoop.apache.org by "Wei-Chiu Chuang (JIRA)" <ji...@apache.org> on 2019/02/13 23:50:00 UTC
[jira] [Created] (HDFS-14276) [SBN read] Reduce tailing overhead
Wei-Chiu Chuang created HDFS-14276:
--------------------------------------
Summary: [SBN read] Reduce tailing overhead
Key: HDFS-14276
URL: https://issues.apache.org/jira/browse/HDFS-14276
Project: Hadoop HDFS
Issue Type: Improvement
Components: ha, namenode
Affects Versions: 3.3.0
Environment: Hardware: 4-node cluster, each node has 4 cores, Xeon 2.5 GHz, 25 GB memory.
Software: CentOS 7.4, CDH 6.0 + Consistent Reads from Standby, Kerberos, SSL, RPC encryption + Data Transfer Encryption.
Reporter: Wei-Chiu Chuang
Assignee: Wei-Chiu Chuang
Attachments: Screen Shot 2019-02-12 at 10.51.41 PM.png
When an Observer sets {{dfs.ha.tail-edits.period}} = {{0ms}}, it tails the edit log continuously in order to fetch the latest edits, but doing so incurs a lot of overhead.
Critically, the edit log tailer should _not_ update the NameDirSize metric every time it tails. The metric has nothing to do with fetching edits, and updating it involves a lot of directory space calculation.
The profiler suggests a non-trivial chunk of time is spent on this for nothing.
Other than this, the biggest overhead is in the communication with the JournalNodes (JNs), namely serializing and deserializing the messages sent to and received from them. I am looking for ways to reduce this cost because it is burning 30% of my CPU time even when the cluster is idle.
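One possible way to keep the NameDirSize update off the hot tailing path is to throttle it so that it runs at most once per interval, however often the tailer loops. The following is only a minimal sketch of that idea, not the actual Hadoop code or the eventual fix: {{ThrottledTask}}, {{TailerLoopDemo}}, the 60-second interval, and the injected clock are all hypothetical names and values chosen for illustration.

```java
import java.util.concurrent.atomic.AtomicInteger;
import java.util.function.LongSupplier;

/**
 * Hypothetical helper (not part of Hadoop): runs an expensive task at most
 * once per interval, no matter how often maybeRun() is called.
 */
class ThrottledTask {
    private final Runnable task;
    private final long minIntervalMs;
    private final LongSupplier clockMs; // injected so the demo is deterministic
    private long lastRunMs;
    private boolean hasRun = false;

    ThrottledTask(Runnable task, long minIntervalMs, LongSupplier clockMs) {
        this.task = task;
        this.minIntervalMs = minIntervalMs;
        this.clockMs = clockMs;
    }

    /**
     * Runs the task if it has never run or if the interval has elapsed.
     * Returns true when the task actually ran.
     */
    synchronized boolean maybeRun() {
        long now = clockMs.getAsLong();
        if (hasRun && now - lastRunMs < minIntervalMs) {
            return false; // too soon: skip the expensive work
        }
        task.run();
        lastRunMs = now;
        hasRun = true;
        return true;
    }
}

class TailerLoopDemo {
    public static void main(String[] args) {
        // Stand-in for the expensive directory size calculation.
        AtomicInteger metricUpdates = new AtomicInteger();
        long[] fakeNowMs = {0};

        // Assumed interval of 60 s; the right value is a tuning decision.
        ThrottledTask nameDirSizeUpdate =
            new ThrottledTask(metricUpdates::incrementAndGet, 60_000, () -> fakeNowMs[0]);

        // Simulate 1000 tailing iterations spaced 100 ms apart (100 s total).
        int iterations = 1000;
        for (int i = 0; i < iterations; i++) {
            fakeNowMs[0] = i * 100L;
            // ... fetching and applying edits would happen here ...
            nameDirSizeUpdate.maybeRun();
        }

        System.out.println("tail iterations=" + iterations
            + ", metric updates=" + metricUpdates.get());
        // prints: tail iterations=1000, metric updates=2
    }
}
```

In this simulation the expensive update runs twice across 1000 tailing iterations instead of 1000 times. Whether the metric should be throttled like this or moved out of the tailer entirely is a design choice this sketch does not settle.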
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)