Posted to hdfs-dev@hadoop.apache.org by "Yongjun Zhang (JIRA)" <ji...@apache.org> on 2017/06/02 07:25:04 UTC

[jira] [Created] (HDFS-11914) Add more diagnosis info for fsimage transfer failure.

Yongjun Zhang created HDFS-11914:
------------------------------------

             Summary: Add more diagnosis info for fsimage transfer failure.
                 Key: HDFS-11914
                 URL: https://issues.apache.org/jira/browse/HDFS-11914
             Project: Hadoop HDFS
          Issue Type: Bug
            Reporter: Yongjun Zhang
            Assignee: Yongjun Zhang


We hit an fsimage download problem:

A client tried to download the fsimage and got:

 WARN org.apache.hadoop.security.UserGroupInformation: PriviledgedActionException as:hdfs (auth:SIMPLE) cause:java.io.IOException: File http://x.y.z:50070/imagetransfer?getimage=1&txid=latest received length xyz is not of the advertised size abc.

Basically, the client did not receive the full fsimage and finished prematurely without any exception being thrown; the problem surfaced only when it found that the size of the data received was smaller than expected. The client then closed the connection to the NN, which caused the NN to report

INFO org.apache.hadoop.hdfs.server.namenode.TransferFsImage: Connection closed by client
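
To illustrate the client-side symptom described above, here is a minimal sketch of a received-length check. It is not the actual TransferFsImage code: the class name ImageLengthCheck and the methods drain and verifyLength are hypothetical, and the 600 and 1000 byte sizes are made up for the demonstration. The point is that a truncated stream simply reaches end-of-stream without an exception, so the shortfall is noticed only when the byte count is compared with the advertised size.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;

// Hypothetical illustration only; not the Hadoop implementation.
final class ImageLengthCheck {

    /** Reads the stream to end-of-stream and returns the number of bytes read. */
    static long drain(InputStream in) throws IOException {
        byte[] buf = new byte[4096];
        long received = 0;
        int n;
        // A truncated transfer just hits end-of-stream early; no exception here.
        while ((n = in.read(buf)) != -1) {
            received += n;
        }
        return received;
    }

    /** Fails if the received byte count differs from the advertised size. */
    static void verifyLength(String url, long received, long advertised)
            throws IOException {
        if (received != advertised) {
            throw new IOException("File " + url + " received length " + received
                    + " is not of the advertised size " + advertised);
        }
    }

    public static void main(String[] args) {
        // Simulate a server that advertised 1000 bytes but delivered only 600.
        InputStream truncated = new ByteArrayInputStream(new byte[600]);
        try {
            long received = drain(truncated);
            verifyLength("http://x.y.z:50070/imagetransfer?getimage=1&txid=latest",
                    received, 1000);
        } catch (IOException e) {
            System.out.println(e.getMessage());
        }
    }
}
```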

This jira is to add more information to the logs to help debug the situation. Specifically, report the stack trace when the connection is closed, how much data has been sent at that point, etc.
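
One possible shape for the proposed diagnostics is sketched below. It is an illustration under assumptions, not the patch for this jira: the class CountingTransfer, the method copyWithDiagnostics, and the helper FailingOutputStream are hypothetical names, and java.util.logging is used only to keep the example self-contained (Hadoop uses its own logging setup). The sender counts the bytes it has written and, if the transfer fails, logs that count together with the exception's stack trace before rethrowing.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.OutputStream;
import java.util.logging.Level;
import java.util.logging.Logger;

// Hypothetical illustration only; not the Hadoop implementation.
final class CountingTransfer {
    private static final Logger LOG = Logger.getLogger("CountingTransfer");

    /** Copies in to out, logging bytes sent and the stack trace on failure. */
    static long copyWithDiagnostics(InputStream in, OutputStream out, long expected)
            throws IOException {
        byte[] buf = new byte[4096];
        long sent = 0;
        try {
            int n;
            while ((n = in.read(buf)) != -1) {
                out.write(buf, 0, n);
                sent += n; // counted only after the write succeeded
            }
            return sent;
        } catch (IOException e) {
            // Passing the exception to the logger records its stack trace.
            LOG.log(Level.WARNING, "Transfer failed after sending " + sent
                    + " of " + expected + " bytes", e);
            throw e;
        }
    }

    /** Test helper: simulates a peer that closes the connection after a limit. */
    static final class FailingOutputStream extends OutputStream {
        private final long limit;
        private long written = 0;

        FailingOutputStream(long limit) {
            this.limit = limit;
        }

        @Override
        public void write(int b) throws IOException {
            write(new byte[] {(byte) b}, 0, 1);
        }

        @Override
        public void write(byte[] b, int off, int len) throws IOException {
            if (written + len > limit) {
                throw new IOException("Connection closed by client");
            }
            written += len;
        }
    }

    public static void main(String[] args) {
        InputStream image = new ByteArrayInputStream(new byte[10000]);
        try {
            copyWithDiagnostics(image, new FailingOutputStream(5000), 10000);
        } catch (IOException e) {
            System.out.println("Transfer aborted: " + e.getMessage());
        }
    }
}
```

With a log line of this kind on the sending side, the byte count at the moment of failure can be compared against the "received length" reported by the client.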
 



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
