You are viewing a plain text version of this content. The canonical link for it is here.
Posted to common-issues@hadoop.apache.org by "LinJi (JIRA)" <ji...@apache.org> on 2018/10/10 06:46:00 UTC

[jira] [Updated] (HADOOP-15838) Copy files from SFTP to HDFS using DistCp failed with error

     [ https://issues.apache.org/jira/browse/HADOOP-15838?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

LinJi updated HADOOP-15838:
---------------------------
    Attachment:     (was: 微信截图_20181010224330.png)

> Copy files from SFTP to HDFS using DistCp failed with error
> -----------------------------------------------------------
>
>                 Key: HADOOP-15838
>                 URL: https://issues.apache.org/jira/browse/HADOOP-15838
>             Project: Hadoop Common
>          Issue Type: Bug
>          Components: tools/distcp
>    Affects Versions: 2.5.0, 2.7.2
>         Environment: Hadoop 2.5.0 + kerberos
>            Reporter: LinJi
>            Priority: Major
>              Labels: distcp, hadoop-tools
>             Fix For: 2.7.5
>
>   Original Estimate: 96h
>  Remaining Estimate: 96h
>
> 1. When I run command:
> {code:java}
> hadoop distcp sftp://mysftp:1qaz_@WSX@192.168.1.44:/upload/hosts /tmp/JOY{code}
>  
> I got error like:
>  
> {noformat}
> 2018-10-10 22:31:37,799 INFO util.KerberosUtil: Using principal pattern: HTTP/_HOST
> 2018-10-10 22:31:39,055 INFO tools.DistCp: Input Options: DistCpOptions{atomicCommit=false, syncFolder=false, deleteMissing=false, ignoreFailures=false, maxMaps=20, sslConfigurationFile='null', copyStrategy='uniformsize', sourceFileListing=null, sourcePaths=[sftp://mysftp:1qaz_@WSX@192.168.1.44:/upload/hosts], targetPath=/tmp/JOY, targetPathExists=false}
> 2018-10-10 22:31:39,365 ERROR tools.DistCp: Exception encountered
> java.io.IOException: Invalid host specified
>         at org.apache.hadoop.fs.sftp.SFTPFileSystem.initialize(SFTPFileSystem.java:67)
>         at org.apache.hadoop.fs.FileSystem.createFileSystem(FileSystem.java:2591)
>         at org.apache.hadoop.fs.FileSystem.access$200(FileSystem.java:89)
>         at org.apache.hadoop.fs.FileSystem$Cache.getInternal(FileSystem.java:2643)
>         at org.apache.hadoop.fs.FileSystem$Cache.get(FileSystem.java:2625)
>         at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:368)
>         at org.apache.hadoop.fs.Path.getFileSystem(Path.java:296)
>         at org.apache.hadoop.tools.GlobbedCopyListing.doBuildListing(GlobbedCopyListing.java:76)
>         at org.apache.hadoop.tools.CopyListing.buildListing(CopyListing.java:84)
>         at org.apache.hadoop.tools.DistCp.createInputFileListing(DistCp.java:353)
>         at org.apache.hadoop.tools.DistCp.execute(DistCp.java:160)
>         at org.apache.hadoop.tools.DistCp.run(DistCp.java:121)
>         at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
>         at org.apache.hadoop.tools.DistCp.main(DistCp.java:401)
> {noformat}
>  
> 2. When I run command:
> {code:java}
> hadoop distcp sftp://mysftp:1qaz_%40WSX@192.168.1.44:/upload/hosts /tmp/JOY{code}
> I got error like:
> {noformat}
> 2018-10-10 22:31:59,909 INFO util.KerberosUtil: Using principal pattern: HTTP/_HOST
> 2018-10-10 22:32:01,286 INFO tools.DistCp: Input Options: DistCpOptions{atomicCommit=false, syncFolder=false, deleteMissing=false, ignoreFailures=false, maxMaps=20, sslConfigurationFile='null', copyStrategy='uniformsize', sourceFileListing=null, sourcePaths=[sftp://mysftp:1qaz_%40WSX@192.168.1.44:/upload/hosts], targetPath=/tmp/JOY, targetPathExists=false}
> 2018-10-10 22:32:02,190 ERROR tools.DistCp: Exception encountered
> java.io.IOException: SSH_MSG_DISCONNECT: 2 Too many authentication failures for mysftp
>         at org.apache.hadoop.fs.sftp.SFTPFileSystem.connect(SFTPFileSystem.java:143)
>         at org.apache.hadoop.fs.sftp.SFTPFileSystem.getFileStatus(SFTPFileSystem.java:371)
>         at org.apache.hadoop.fs.Globber.getFileStatus(Globber.java:57)
>         at org.apache.hadoop.fs.Globber.glob(Globber.java:252)
>         at org.apache.hadoop.fs.FileSystem.globStatus(FileSystem.java:1623)
>         at org.apache.hadoop.tools.GlobbedCopyListing.doBuildListing(GlobbedCopyListing.java:77)
>         at org.apache.hadoop.tools.CopyListing.buildListing(CopyListing.java:84)
>         at org.apache.hadoop.tools.DistCp.createInputFileListing(DistCp.java:353)
>         at org.apache.hadoop.tools.DistCp.execute(DistCp.java:160)
>         at org.apache.hadoop.tools.DistCp.run(DistCp.java:121)
>         at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
>         at org.apache.hadoop.tools.DistCp.main(DistCp.java:401){noformat}
> The SFTP username is mysftp
> password is 1qaz_@WSX
>  



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: common-issues-unsubscribe@hadoop.apache.org
For additional commands, e-mail: common-issues-help@hadoop.apache.org