Posted to hdfs-dev@hadoop.apache.org by "Gabor Bota (JIRA)" <ji...@apache.org> on 2018/08/24 15:02:00 UTC

[jira] [Created] (HDFS-13864) Service for FSNamesystem#clearCorruptLazyPersistFiles to iterate with writeLock

Gabor Bota created HDFS-13864:
---------------------------------

             Summary: Service for FSNamesystem#clearCorruptLazyPersistFiles to iterate with writeLock
                 Key: HDFS-13864
                 URL: https://issues.apache.org/jira/browse/HDFS-13864
             Project: Hadoop HDFS
          Issue Type: Improvement
            Reporter: Gabor Bota


In HDFS-13672 we agreed that the current implementation should be changed, but not in the way it was addressed in that issue.
This JIRA is a follow-up to HDFS-13672.

As a workaround, we can disable the scrubber via its interval setting when debugging. In real-world/customer environments there are no cases with so many corrupt lazy persist files that the scan becomes a problem.
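
A minimal sketch of that workaround, assuming the scrubber is controlled by the {{dfs.namenode.lazypersist.file.scrub.interval.sec}} key and that a zero interval disables it (in practice the key would be set in hdfs-site.xml rather than in client code):

{code:java}
import org.apache.hadoop.conf.Configuration;

public class DisableScrubberExample {
  public static void main(String[] args) {
    Configuration conf = new Configuration();
    // Assumption: a scrub interval of 0 disables LazyPersistFileScrubber.
    conf.setInt("dfs.namenode.lazypersist.file.scrub.interval.sec", 0);
  }
}
{code}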

We agreed that:
* holding the lock for a long time is an anti-pattern
* the common case here is that there are zero lazy persist files, so a better (though different) change would be to skip running this scrubber entirely when there aren't any (see the sketch after this list)
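
A minimal sketch of the early exit, with {{ScrubberSketch}} as an illustrative stand-in for FSNamesystem (none of these names are actual HDFS code):

{code:java}
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.locks.ReentrantReadWriteLock;

/** Illustrative stand-in for FSNamesystem, not actual HDFS code. */
class ScrubberSketch {
  private final ReentrantReadWriteLock fsLock = new ReentrantReadWriteLock();
  private final List<String> lazyPersistFiles = new ArrayList<>();

  void clearCorruptLazyPersistFiles() {
    // Common case: no lazy persist files exist, so return without ever
    // taking the namesystem write lock.
    if (lazyPersistFiles.isEmpty()) {
      return;
    }
    fsLock.writeLock().lock();
    try {
      // ... scan for corrupt lazy persist files and unlink them ...
    } finally {
      fsLock.writeLock().unlock();
    }
  }
}
{code}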

We had the following ideas:
* Create a service that iterates through a list of elements while holding the writeLock, applying a lambda function to each element (sketched after this list).
* What we need here is a tail iterator that resumes at the last processed element.
* Open question: should we disable {{clearCorruptLazyPersistFiles}} by default?
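
A rough sketch of such a service, combining both ideas: process the list in batches, releasing and reacquiring the write lock between batches, and resume from the last processed position. All names here ({{LockedBatchIterator}}, {{forEach}}) are made up for illustration:

{code:java}
import java.util.List;
import java.util.concurrent.locks.ReentrantReadWriteLock;
import java.util.function.Consumer;

/** Hypothetical sketch of the proposed service, not existing HDFS code. */
class LockedBatchIterator<T> {
  private final ReentrantReadWriteLock lock;
  private final int batchSize;

  LockedBatchIterator(ReentrantReadWriteLock lock, int batchSize) {
    this.lock = lock;
    this.batchSize = batchSize;
  }

  void forEach(List<T> elements, Consumer<T> action) {
    int next = 0; // index of the first unprocessed element (the "tail")
    while (next < elements.size()) {
      lock.writeLock().lock();
      try {
        int end = Math.min(next + batchSize, elements.size());
        for (; next < end; next++) {
          action.accept(elements.get(next)); // run the lambda per element
        }
      } finally {
        // Release between batches so other readers/writers can interleave.
        lock.writeLock().unlock();
      }
    }
  }
}
{code}

Note that an index-based resume like this assumes the list is stable between batches; if elements can be added or removed while the lock is released, the tail iterator would need to key on the last processed element itself rather than on a position.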



