You are viewing a plain text version of this content. The canonical link for it is here.
Posted to common-dev@hadoop.apache.org by "Tsz Wo (Nicholas), SZE (JIRA)" <ji...@apache.org> on 2009/03/25 22:41:54 UTC

[jira] Commented: (HADOOP-5573) TestBackupNode sometimes fails

    [ https://issues.apache.org/jira/browse/HADOOP-5573?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12689248#action_12689248 ] 

Tsz Wo (Nicholas), SZE commented on HADOOP-5573:
------------------------------------------------

Here are more details:

- Unable to open edit log file .\build\test\data\dfs\name-backup1\current\edits (FSEditLog.java:open(371))
{noformat}
2009-03-24 17:36:39,421 WARN  namenode.FSNamesystem (FSEditLog.java:open(371)) - Unable to open edit log
 file d:\@sze\hadoop\latest\build\test\data\dfs\name-backup1\current\edits
2009-03-24 17:36:39,421 ERROR namenode.Checkpointer (Checkpointer.java:run(138)) - Exception in doCheckpoint: 
java.io.IOException: Could not locate checkpoint directories
	at org.apache.hadoop.hdfs.server.namenode.BackupStorage.loadCheckpoint(BackupStorage.java:157)
	at org.apache.hadoop.hdfs.server.namenode.Checkpointer.doCheckpoint(Checkpointer.java:232)
	at org.apache.hadoop.hdfs.server.namenode.Checkpointer.run(Checkpointer.java:134)
	at java.lang.Thread.run(Thread.java:619)
{noformat}

- NullPointerException at org.apache.hadoop.hdfs.server.namenode.EditLogBackupOutputStream.flushAndSync(EditLogBackupOutputStream.java:163)
{noformat}
2009-03-24 17:56:09,750 INFO  ipc.Server (Server.java:run(968)) - IPC Server handler 6 on 1441, call startCheckpoint(
NamenodeRegistration(xx.xx.xx.xx:50100, role=Backup Node)) from 127.0.0.1:1485: error: java.io.IOException: java.lang.NullPointerException
java.io.IOException: java.lang.NullPointerException
	at org.apache.hadoop.hdfs.server.namenode.EditLogBackupOutputStream.flushAndSync(EditLogBackupOutputStream.java:163)
	at org.apache.hadoop.hdfs.server.namenode.EditLogOutputStream.flush(EditLogOutputStream.java:83)
	at org.apache.hadoop.hdfs.server.namenode.FSEditLog.logSync(FSEditLog.java:989)
	at org.apache.hadoop.hdfs.server.namenode.FSNamesystem.startCheckpoint(FSNamesystem.java:4395)
	at org.apache.hadoop.hdfs.server.namenode.NameNode.startCheckpoint(NameNode.java:440)
	at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
	at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
	at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
	at java.lang.reflect.Method.invoke(Method.java:597)
	at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:508)
	at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:959)
	at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:955)
	at java.security.AccessController.doPrivileged(Native Method)
	at javax.security.auth.Subject.doAs(Subject.java:396)
	at org.apache.hadoop.ipc.Server$Handler.run(Server.java:953)
{noformat}

- Fatal Error : All storage directories are inaccessible.
{noformat}
2009-03-25 14:27:06,828 INFO  namenode.FSNamesystem (FSEditLog.java:printStatistics(1044))
 - Number of transactions: 0 Total time for transactions(ms): 0Number of transactions batched in Syncs: 0 Number of syncs: 0 SyncTimes(ms): 0 
2009-03-25 14:27:06,937 WARN  namenode.FSNamesystem (FSEditLog.java:close(420))
 - FSEditLog:close - failed to close stream d:\@sze\hadoop\testing\build\test\data\dfs\name-checkpoint1\current\edits
2009-03-25 14:27:06,937 ERROR namenode.FSNamesystem (FSEditLog.java:processIOError(506))
 - Unable to log edits to d:\@sze\hadoop\testing\build\test\data\dfs\name-checkpoint1\current\edits
2009-03-25 14:27:06,937 FATAL namenode.FSNamesystem (FSEditLog.java:processIOError(450))
 - Fatal Error : All storage directories are inaccessible.
2009-03-25 14:27:06,937 INFO  namenode.NameNode (NameNode.java:errorReport(421))
 - Error report from NamenodeRegistration(servicehot-dx.ds.corp.yahoo.com:50100, role=Checkpoint Node): Shutting down.
2009-03-25 14:27:06,953 WARN  namenode.DecommissionManager (DecommissionManager.java:run(67))
 - Monitor interrupted: java.lang.InterruptedException: sleep interrupted
2009-03-25 14:27:06,953 WARN  namenode.FSNamesystem (FSNamesystem.java:run(2346))
 - ReplicationMonitor thread received InterruptedException.java.lang.InterruptedException: sleep interrupted
Test org.apache.hadoop.hdfs.server.namenode.TestBackupNode FAILED (crashed)
{noformat}

> TestBackupNode sometimes fails
> ------------------------------
>
>                 Key: HADOOP-5573
>                 URL: https://issues.apache.org/jira/browse/HADOOP-5573
>             Project: Hadoop Core
>          Issue Type: Bug
>          Components: dfs
>            Reporter: Tsz Wo (Nicholas), SZE
>
> TestBackupNode may fail with different reasons:
> - Unable to open edit log file .\build\test\data\dfs\name-backup1\current\edits (FSEditLog.java:open(371))
> - NullPointerException at org.apache.hadoop.hdfs.server.namenode.EditLogBackupOutputStream.flushAndSync(EditLogBackupOutputStream.java:163)
> - Fatal Error : All storage directories are inaccessible.
> Will provide more information in the comments.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.