You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user-zh@hbase.apache.org by ww...@sina.com on 2020/07/21 03:51:41 UTC

开启replication后,hbase读写性能下降问题

在开启hbase的replication后,集群的slowGetCount和slowPutCount都会大幅增长。请问这种情况怎么解决。
开启复制之后,日志里边还有大量的slow sync日志。日志如下:
2020-07-21 11:08:11,314 INFO org.apache.hadoop.hbase.regionserver.wal.AbstractFSWAL: Slow sync cost: 635 ms, current pipeline: [DatanodeInfoWithStorage[xxxxx:9866,DS-6522f59f-3ca9-46a9-b25a-5d86cdbb656e,SSD], DatanodeInfoWithStorage[xxxxx:9866,DS-8768f8b8-732f-4238-b51d-90ba83343353,SSD], DatanodeInfoWithStorage[xxxxx:9866,DS-a5792087-6c58-4600-a239-9a1f61f1c6c0,SSD]]2020-07-21 11:08:11,315 INFO org.apache.hadoop.hbase.regionserver.wal.AbstractFSWAL: Slow sync cost: 564 ms, current pipeline: [DatanodeInfoWithStorage[xxxxx:9866,DS-6522f59f-3ca9-46a9-b25a-5d86cdbb656e,SSD], DatanodeInfoWithStorage[xxxxx:9866,DS-8768f8b8-732f-4238-b51d-90ba83343353,SSD], DatanodeInfoWithStorage[xxxxx:9866,DS-a5792087-6c58-4600-a239-9a1f61f1c6c0,SSD]]2020-07-21 11:08:11,315 INFO org.apache.hadoop.hbase.regionserver.wal.AbstractFSWAL: Slow sync cost: 535 ms, current pipeline: [DatanodeInfoWithStorage[xxxxx:9866,DS-6522f59f-3ca9-46a9-b25a-5d86cdbb656e,SSD], DatanodeInfoWithStorage[xxxxx:9866,DS-8768f8b8-732f-4238-b51d-90ba83343353,SSD], DatanodeInfoWithStorage[xxxxx:9866,DS-a5792087-6c58-4600-a239-9a1f61f1c6c0,SSD]]

Re: 开启replication后,hbase读写性能下降问题

Posted by "张铎 (Duo Zhang)" <pa...@gmail.com>.
写入的很猛吗?需要看看瓶颈在哪,我们之前遇到过peer很多的情况下,replication对HDFS的压力很大,因为每个peer都要读一遍WAL。
你这个是SSD的话,看起来是replication本身对region server的压力很大?

zheng wang <18...@qq.com> 于2020年7月21日周二 下午12:05写道:

> 可以考虑对replication限速。
>
>
>
>
> ------------------&nbsp;原始邮件&nbsp;------------------
> 发件人:
>
> "user-zh@hbase.apache.orgww112925@sina.com"
>                                                       <ww112925@sina.com
> &gt;;
> 发送时间:&nbsp;2020年7月21日(星期二) 中午11:51
> 收件人:&nbsp;"user-zh"<user-zh@hbase.apache.org&gt;;
>
> 主题:&nbsp;开启replication后,hbase读写性能下降问题
>
>
>
>
> 在开启hbase的replication后,集群的slowGetCount和slowPutCount都会大幅增长。请问这种情况怎么解决。
> 开启复制之后,日志里边还有大量的slow sync日志。日志如下:
> 2020-07-21 11:08:11,314 INFO
> org.apache.hadoop.hbase.regionserver.wal.AbstractFSWAL: Slow sync cost: 635
> ms, current pipeline:
> [DatanodeInfoWithStorage[xxxxx:9866,DS-6522f59f-3ca9-46a9-b25a-5d86cdbb656e,SSD],
> DatanodeInfoWithStorage[xxxxx:9866,DS-8768f8b8-732f-4238-b51d-90ba83343353,SSD],
> DatanodeInfoWithStorage[xxxxx:9866,DS-a5792087-6c58-4600-a239-9a1f61f1c6c0,SSD]]2020-07-21
> 11:08:11,315 INFO org.apache.hadoop.hbase.regionserver.wal.AbstractFSWAL:
> Slow sync cost: 564 ms, current pipeline:
> [DatanodeInfoWithStorage[xxxxx:9866,DS-6522f59f-3ca9-46a9-b25a-5d86cdbb656e,SSD],
> DatanodeInfoWithStorage[xxxxx:9866,DS-8768f8b8-732f-4238-b51d-90ba83343353,SSD],
> DatanodeInfoWithStorage[xxxxx:9866,DS-a5792087-6c58-4600-a239-9a1f61f1c6c0,SSD]]2020-07-21
> 11:08:11,315 INFO org.apache.hadoop.hbase.regionserver.wal.AbstractFSWAL:
> Slow sync cost: 535 ms, current pipeline:
> [DatanodeInfoWithStorage[xxxxx:9866,DS-6522f59f-3ca9-46a9-b25a-5d86cdbb656e,SSD],
> DatanodeInfoWithStorage[xxxxx:9866,DS-8768f8b8-732f-4238-b51d-90ba83343353,SSD],
> DatanodeInfoWithStorage[xxxxx:9866,DS-a5792087-6c58-4600-a239-9a1f61f1c6c0,SSD]]

回复:开启replication后,hbase读写性能下降问题

Posted by zheng wang <18...@qq.com>.
可以考虑对replication限速。




------------------&nbsp;原始邮件&nbsp;------------------
发件人:                                                                                                                        "user-zh@hbase.apache.orgww112925@sina.com"                                                                                    <ww112925@sina.com&gt;;
发送时间:&nbsp;2020年7月21日(星期二) 中午11:51
收件人:&nbsp;"user-zh"<user-zh@hbase.apache.org&gt;;

主题:&nbsp;开启replication后,hbase读写性能下降问题




在开启hbase的replication后,集群的slowGetCount和slowPutCount都会大幅增长。请问这种情况怎么解决。
开启复制之后,日志里边还有大量的slow sync日志。日志如下:
2020-07-21 11:08:11,314 INFO org.apache.hadoop.hbase.regionserver.wal.AbstractFSWAL: Slow sync cost: 635 ms, current pipeline: [DatanodeInfoWithStorage[xxxxx:9866,DS-6522f59f-3ca9-46a9-b25a-5d86cdbb656e,SSD], DatanodeInfoWithStorage[xxxxx:9866,DS-8768f8b8-732f-4238-b51d-90ba83343353,SSD], DatanodeInfoWithStorage[xxxxx:9866,DS-a5792087-6c58-4600-a239-9a1f61f1c6c0,SSD]]2020-07-21 11:08:11,315 INFO org.apache.hadoop.hbase.regionserver.wal.AbstractFSWAL: Slow sync cost: 564 ms, current pipeline: [DatanodeInfoWithStorage[xxxxx:9866,DS-6522f59f-3ca9-46a9-b25a-5d86cdbb656e,SSD], DatanodeInfoWithStorage[xxxxx:9866,DS-8768f8b8-732f-4238-b51d-90ba83343353,SSD], DatanodeInfoWithStorage[xxxxx:9866,DS-a5792087-6c58-4600-a239-9a1f61f1c6c0,SSD]]2020-07-21 11:08:11,315 INFO org.apache.hadoop.hbase.regionserver.wal.AbstractFSWAL: Slow sync cost: 535 ms, current pipeline: [DatanodeInfoWithStorage[xxxxx:9866,DS-6522f59f-3ca9-46a9-b25a-5d86cdbb656e,SSD], DatanodeInfoWithStorage[xxxxx:9866,DS-8768f8b8-732f-4238-b51d-90ba83343353,SSD], DatanodeInfoWithStorage[xxxxx:9866,DS-a5792087-6c58-4600-a239-9a1f61f1c6c0,SSD]]