You are viewing a plain text version of this content. The canonical link for it is here.
Posted to hdfs-dev@hadoop.apache.org by Wei-Chiu Chuang <we...@apache.org> on 2019/10/16 17:46:29 UTC

Meeting notes from today's Hadoop storage community sync

Here's today's notes for future reference:
https://docs.google.com/document/d/1jXM5Ujvf-zhcyw_5kiQVx6g-HeKe-YGnFS_1-qFXomI/edit?usp=sharing
10/16/2019

Attendee:

Weichiu, Cynthia, Craig, Stephen, Akira, David

Stephen introduced upgrade domain, which was developed at Twitter. Cloudera
is going to support this feature in the next release. The feature was
developed a few years back and quite complete, so Cloudera is just adding
UI and verification/guardrails to support this feature.

Akira is interested in decommission and maintenance mode. Decomm is slow at
Y! Japan. Akira’s interested in maintenance mode too, but they are on 2.6.x
so can’t try yet.

Stephen introduced the decommissioning improvement project. Decommissioning
in practice has a few weird behavior and tend to be slow.

HDFS-14814  a new decommissioning monitor. It reduces NameNode lock holding
time, and spread replication load across DataNodes. It also gives priority
to dead nodes than decommissioning nodes. But it’s hard to simulate its
performance. It will have to run on a real large cluster to prove it works.
Looking for community members to pick it up and introduce it in some large
clusters to try out.

HDFS-14861 instead of letting the block to go to the end of replication
queue, iterator is reset periodically.

EC is not considered yet.


Next week we will have the Hadoop storage community sync for the APAC time
(PDT 10pm Wednesday, CST 1pm Thursday). Looking for topics.

Best,
Weichiu