Posted to common-dev@hadoop.apache.org by "LinJi (JIRA)" <ji...@apache.org> on 2018/10/10 06:43:00 UTC

[jira] [Created] (HADOOP-15838) Copy files from SFTP to HDFS using DistCp failed with error

LinJi created HADOOP-15838:
------------------------------

             Summary: Copy files from SFTP to HDFS using DistCp failed with error
                 Key: HADOOP-15838
                 URL: https://issues.apache.org/jira/browse/HADOOP-15838
             Project: Hadoop Common
          Issue Type: Bug
          Components: tools/distcp
    Affects Versions: 2.7.2, 2.5.0
         Environment: Hadoop 2.5.0 + kerberos
            Reporter: LinJi
             Fix For: 2.7.5
         Attachments: 微信截图_20181010224316.png, 微信截图_20181010224330.png

1. When I run the command:
{code:java}
hadoop distcp sftp://mysftp:1qaz_@WSX@192.168.1.44:/upload/hosts /tmp/JOY{code}
 

I got the following error:
{noformat}
2018-10-10 22:31:37,799 INFO util.KerberosUtil: Using principal pattern: HTTP/_HOST
2018-10-10 22:31:39,055 INFO tools.DistCp: Input Options: DistCpOptions{atomicCommit=false, syncFolder=false, deleteMissing=false, ignoreFailures=false, maxMaps=20, sslConfigurationFile='null', copyStrategy='uniformsize', sourceFileListing=null, sourcePaths=[sftp://mysftp:1qaz_@WSX@192.168.1.44:/upload/hosts], targetPath=/tmp/JOY, targetPathExists=false}
2018-10-10 22:31:39,365 ERROR tools.DistCp: Exception encountered
java.io.IOException: Invalid host specified
        at org.apache.hadoop.fs.sftp.SFTPFileSystem.initialize(SFTPFileSystem.java:67)
        at org.apache.hadoop.fs.FileSystem.createFileSystem(FileSystem.java:2591)
        at org.apache.hadoop.fs.FileSystem.access$200(FileSystem.java:89)
        at org.apache.hadoop.fs.FileSystem$Cache.getInternal(FileSystem.java:2643)
        at org.apache.hadoop.fs.FileSystem$Cache.get(FileSystem.java:2625)
        at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:368)
        at org.apache.hadoop.fs.Path.getFileSystem(Path.java:296)
        at org.apache.hadoop.tools.GlobbedCopyListing.doBuildListing(GlobbedCopyListing.java:76)
        at org.apache.hadoop.tools.CopyListing.buildListing(CopyListing.java:84)
        at org.apache.hadoop.tools.DistCp.createInputFileListing(DistCp.java:353)
        at org.apache.hadoop.tools.DistCp.execute(DistCp.java:160)
        at org.apache.hadoop.tools.DistCp.run(DistCp.java:121)
        at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
        at org.apache.hadoop.tools.DistCp.main(DistCp.java:401)
{noformat}
 

2. When I run the command with the @ in the password percent-encoded as %40:
{code:java}
hadoop distcp sftp://mysftp:1qaz_%40WSX@192.168.1.44:/upload/hosts /tmp/JOY{code}
I got the following error:
{noformat}
2018-10-10 22:31:59,909 INFO util.KerberosUtil: Using principal pattern: HTTP/_HOST
2018-10-10 22:32:01,286 INFO tools.DistCp: Input Options: DistCpOptions{atomicCommit=false, syncFolder=false, deleteMissing=false, ignoreFailures=false, maxMaps=20, sslConfigurationFile='null', copyStrategy='uniformsize', sourceFileListing=null, sourcePaths=[sftp://mysftp:1qaz_%40WSX@192.168.1.44:/upload/hosts], targetPath=/tmp/JOY, targetPathExists=false}
2018-10-10 22:32:02,190 ERROR tools.DistCp: Exception encountered
java.io.IOException: SSH_MSG_DISCONNECT: 2 Too many authentication failures for mysftp
        at org.apache.hadoop.fs.sftp.SFTPFileSystem.connect(SFTPFileSystem.java:143)
        at org.apache.hadoop.fs.sftp.SFTPFileSystem.getFileStatus(SFTPFileSystem.java:371)
        at org.apache.hadoop.fs.Globber.getFileStatus(Globber.java:57)
        at org.apache.hadoop.fs.Globber.glob(Globber.java:252)
        at org.apache.hadoop.fs.FileSystem.globStatus(FileSystem.java:1623)
        at org.apache.hadoop.tools.GlobbedCopyListing.doBuildListing(GlobbedCopyListing.java:77)
        at org.apache.hadoop.tools.CopyListing.buildListing(CopyListing.java:84)
        at org.apache.hadoop.tools.DistCp.createInputFileListing(DistCp.java:353)
        at org.apache.hadoop.tools.DistCp.execute(DistCp.java:160)
        at org.apache.hadoop.tools.DistCp.run(DistCp.java:121)
        at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
        at org.apache.hadoop.tools.DistCp.main(DistCp.java:401)
{noformat}
The SFTP username is mysftp and the password is 1qaz_@WSX.
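
For reference, here is a minimal sketch of how plain java.net.URI splits the two source URIs above. It uses only the JDK and no Hadoop classes; the class name UriUserInfoDemo and its helper methods are made up for illustration. It assumes (the logs do not confirm this) that the SFTP file system takes the host and credentials from a URI parsed this way, and that the path string may be rebuilt through a multi-argument URI constructor, which quotes '%' itself. Under those assumptions the unescaped @ leaves the URI with no host, and a pre-encoded %40 could reach the server as the literal characters %40.

```java
import java.net.URI;
import java.net.URISyntaxException;

/** Illustration only: how java.net.URI splits the two source URIs from this report. */
class UriUserInfoDemo {

    static final String RAW = "sftp://mysftp:1qaz_@WSX@192.168.1.44:/upload/hosts";
    static final String ENCODED = "sftp://mysftp:1qaz_%40WSX@192.168.1.44:/upload/hosts";

    /** Parses a complete URI string; existing escapes such as %40 are preserved. */
    static URI parse(String uri) {
        return URI.create(uri);
    }

    /** Builds a URI from components; this constructor always quotes '%' as %25. */
    static URI build(String scheme, String authority, String path) {
        try {
            return new URI(scheme, authority, path, null, null);
        } catch (URISyntaxException e) {
            throw new IllegalArgumentException(e);
        }
    }

    /** Host and decoded user info as seen by a caller of getHost()/getUserInfo(). */
    static String describe(URI u) {
        return "host=" + u.getHost() + " userInfo=" + u.getUserInfo();
    }

    public static void main(String[] args) {
        // Unescaped '@' in the password: the authority is not a valid
        // user@host:port form, so host and user info are both null.
        System.out.println(describe(parse(RAW)));
        // prints: host=null userInfo=null

        // Percent-encoded '@': the string parses cleanly and decodes as intended.
        System.out.println(describe(parse(ENCODED)));
        // prints: host=192.168.1.44 userInfo=mysftp:1qaz_@WSX

        // Same authority passed as a component: '%' is quoted again, so the
        // decoded password contains the literal text %40.
        System.out.println(describe(build("sftp", "mysftp:1qaz_%40WSX@192.168.1.44:", "/upload/hosts")));
        // prints: host=192.168.1.44 userInfo=mysftp:1qaz_%40WSX
    }
}
```

If these assumptions hold, the first output would match the "Invalid host specified" error in case 1, and the third would be one possible explanation for the authentication failures in case 2.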

 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
