Posted to hdfs-dev@hadoop.apache.org by "Bikas Saha (Created) (JIRA)" <ji...@apache.org> on 2012/02/08 09:42:57 UTC
[jira] [Created] (HDFS-2915) HA: TestFailureOfSharedDir.testFailureOfSharedDir() has race condition
HA: TestFailureOfSharedDir.testFailureOfSharedDir() has race condition
----------------------------------------------------------------------
Key: HDFS-2915
URL: https://issues.apache.org/jira/browse/HDFS-2915
Project: Hadoop HDFS
Issue Type: Sub-task
Components: name-node
Affects Versions: HA branch (HDFS-1623)
Reporter: Bikas Saha
Assignee: Bikas Saha
Priority: Minor
The test deletes the shared edits dir to simulate a failure. It then calls rollEditLogs() so that the deleted dir is used and the call fails with an IOException. Unfortunately, deleting the shared dir can put the NN in safe mode due to lack of space, which causes rollEditLogs() to throw a SafeModeException instead. The test catches this exception as an IOException, but the associated assert in the catch block fails.
This always happens in the debugger because the delay in stepping through causes the safe mode change to happen before rollEditLogs() gets called.
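A minimal, self-contained sketch of the race described above, for illustration only. It does not use the real Hadoop classes: FakeNameNode, catchBlockAssertHolds(), and both exception messages are hypothetical stand-ins, and the local SafeModeException only models the fact, stated above, that the exception is caught as an IOException.

```java
import java.io.IOException;

public class SharedDirRaceSketch {

    // Stand-in for the NN's safe mode exception. It extends IOException,
    // which is why a catch (IOException) clause also catches it.
    static class SafeModeException extends IOException {
        SafeModeException(String message) { super(message); }
    }

    // Hypothetical stand-in for the NameNode under test.
    static class FakeNameNode {
        private volatile boolean inSafeMode = false;

        // Models the safe mode change triggered by the missing shared dir.
        void enterSafeMode() { inSafeMode = true; }

        // Always fails, since the shared edits dir is gone. Which exception
        // is thrown depends on whether safe mode was entered first.
        void rollEditLogs() throws IOException {
            if (inSafeMode) {
                throw new SafeModeException("hypothetical: NN is in safe mode");
            }
            throw new IOException("hypothetical: shared edits dir failed");
        }
    }

    // Mirrors the shape of the test: catch IOException, then check that it is
    // the failure the test expected. Returns what the assert would evaluate to.
    static boolean catchBlockAssertHolds(FakeNameNode nn) {
        try {
            nn.rollEditLogs();
            return false; // the roll was expected to fail
        } catch (IOException ioe) {
            return ioe.getMessage().contains("shared edits dir failed");
        }
    }

    public static void main(String[] args) {
        // Fast path: the roll happens before the safe mode change.
        FakeNameNode fast = new FakeNameNode();
        System.out.println("roll before safe mode: assert holds = "
                + catchBlockAssertHolds(fast));

        // Slow path (e.g. stepping in a debugger): safe mode wins the race.
        FakeNameNode slow = new FakeNameNode();
        slow.enterSafeMode();
        System.out.println("roll after safe mode: assert holds = "
                + catchBlockAssertHolds(slow));
    }
}
```

In this sketch the first call reports that the assert holds and the second reports that it does not, even though both calls fail with something the catch block accepts as an IOException.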
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Resolved] (HDFS-2915) HA: TestFailureOfSharedDir.testFailureOfSharedDir() has race condition
Posted by "Jitendra Nath Pandey (Resolved) (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HDFS-2915?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Jitendra Nath Pandey resolved HDFS-2915.
----------------------------------------
Resolution: Fixed
Fix Version/s: HA branch (HDFS-1623)
Hadoop Flags: Reviewed
Committed. Thanks to Bikas!
> HA: TestFailureOfSharedDir.testFailureOfSharedDir() has race condition
> ----------------------------------------------------------------------
>
> Key: HDFS-2915
> URL: https://issues.apache.org/jira/browse/HDFS-2915
> Project: Hadoop HDFS
> Issue Type: Sub-task
> Components: name-node
> Affects Versions: HA branch (HDFS-1623)
> Reporter: Bikas Saha
> Assignee: Bikas Saha
> Priority: Minor
> Fix For: HA branch (HDFS-1623)
>
> Attachments: HDFS-2915.HDFS-1623.patch, HDFS-2915.HDFS-1623.patch
>
>
> The test deletes the shared edits dir to simulate a failure. It then calls rollEditLogs() so that the deleted dir is used and the call fails with an IOException. Unfortunately, deleting the shared dir can put the NN in safe mode due to lack of space, which causes rollEditLogs() to throw a SafeModeException instead. The test catches this exception as an IOException, but the associated assert in the catch block fails.
> This always happens in the debugger because the delay in stepping through causes the safe mode change to happen before rollEditLogs() gets called.