You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@reef.apache.org by "Rogan Carr (JIRA)" <ji...@apache.org> on 2017/09/24 05:56:00 UTC
[jira] [Created] (REEF-1892) HDFS File Copy only uses local HDFS
Rogan Carr created REEF-1892:
--------------------------------
Summary: HDFS File Copy only uses local HDFS
Key: REEF-1892
URL: https://issues.apache.org/jira/browse/REEF-1892
Project: REEF
Issue Type: Bug
Components: REEF.NET IO
Affects Versions: 0.17
Reporter: Rogan Carr
In REEF-1827 [1], the URI used to specify remote and local files were changed to use the "AbsolutePath". [2]
This means that a file specified as "hdfs://my/file" becomes "/my/file" and the hdfs:// is assumed by the `dfs` command.
This is fine if you are using vanilla HDFS, but for cases like Blob Storage in Azure, there is a special prefix, `wasb://` that is used instead of `hdfs://`. This means that the AbsolutePath method trims off the wasb, and this Copy() function instead attempts to download the file from the local HDFS instead of WASB.
We need to revisit this issue and keep the full path for copies while also keeping proper casing in the path.
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)