You are viewing a plain text version of this content. The canonical link for it is here.
Posted to common-issues@hadoop.apache.org by "Colin Patrick McCabe (JIRA)" <ji...@apache.org> on 2013/08/13 21:06:47 UTC
[jira] [Commented] (HADOOP-9865) FileContext.globStatus() has a
regression with respect to relative path
[ https://issues.apache.org/jira/browse/HADOOP-9865?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13738688#comment-13738688 ]
Colin Patrick McCabe commented on HADOOP-9865:
----------------------------------------------
{code}
+ String scheme = schemeFromPath(fixRelativePart(pathPattern));
+ String authority = authorityFromPath(fixRelativePart(pathPattern));
{code}
This is a good start, but the problem is that pathPattern is not actually a path-- it's a pattern. So it may be something like {/,a}/foo, which you can't really make into an absolute path in a sensible way.
I think the right fix is something like this:
{code}
diff --git hadoop-common-project/hadoop-common/src/main/java/org/apache/hadoop/fs/Globber.java hadoop-common-project/hadoop-common/src/main/java/org/apache/hadoop/fs/Globber.java
index ad28478..378311a 100644
--- hadoop-common-project/hadoop-common/src/main/java/org/apache/hadoop/fs/Globber.java
+++ hadoop-common-project/hadoop-common/src/main/java/org/apache/hadoop/fs/Globber.java
@@ -99,24 +99,24 @@ private Path fixRelativePart(Path path) {
}
private String schemeFromPath(Path path) throws IOException {
- String scheme = pathPattern.toUri().getScheme();
+ String scheme = path.toUri().getScheme();
if (scheme == null) {
if (fs != null) {
scheme = fs.getUri().getScheme();
} else {
- scheme = fc.getFSofPath(path).getUri().getScheme();
+ scheme = fc.getDefaultFileSystem().getUri().getScheme();
}
}
return scheme;
}
private String authorityFromPath(Path path) throws IOException {
- String authority = pathPattern.toUri().getAuthority();
+ String authority = path.toUri().getAuthority();
if (authority == null) {
if (fs != null) {
authority = fs.getUri().getAuthority();
} else {
- authority = fc.getFSofPath(path).getUri().getAuthority();
+ authority = fc.getDefaultFileSystem().getUri().getAuthority();
}
}
return authority ;
{code}
This probably needs more testing, including unit tests...
> FileContext.globStatus() has a regression with respect to relative path
> -----------------------------------------------------------------------
>
> Key: HADOOP-9865
> URL: https://issues.apache.org/jira/browse/HADOOP-9865
> Project: Hadoop Common
> Issue Type: Bug
> Affects Versions: 3.0.0, 2.3.0
> Reporter: Chuan Liu
> Assignee: Chuan Liu
> Attachments: HADOOP-9865-demo.patch
>
>
> I discovered the problem when running unit test TestMRJobClient on Windows. The cause is indirect in this case. In the unit test, we try to launch a job and list its status. The job failed, and caused the list command get a result of 0, which triggered the unit test assert. From the log and debug, the job failed because we failed to create the Jar with classpath (see code around {{FileUtil.createJarWithClassPath}}) in {{ContainerLaunch}}. This is a Windows specific step right now; so the test still passes on Linux. This step failed because we passed in a relative path to {{FileContext.globStatus()}} in {{FileUtil.createJarWithClassPath}}. The relevant log looks like the following.
> {noformat}
> 2013-08-12 16:12:05,937 WARN [ContainersLauncher #0] launcher.ContainerLaunch (ContainerLaunch.java:call(270)) - Failed to launch container.
> org.apache.hadoop.HadoopIllegalArgumentException: Path is relative
> at org.apache.hadoop.fs.Path.checkNotRelative(Path.java:74)
> at org.apache.hadoop.fs.FileContext.getFSofPath(FileContext.java:304)
> at org.apache.hadoop.fs.Globber.schemeFromPath(Globber.java:107)
> at org.apache.hadoop.fs.Globber.glob(Globber.java:128)
> at org.apache.hadoop.fs.FileContext$Util.globStatus(FileContext.java:1908)
> at org.apache.hadoop.fs.FileUtil.createJarWithClassPath(FileUtil.java:1247)
> at org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerLaunch.sanitizeEnv(ContainerLaunch.java:679)
> at org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerLaunch.call(ContainerLaunch.java:232)
> at org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerLaunch.call(ContainerLaunch.java:1)
> at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303)
> at java.util.concurrent.FutureTask.run(FutureTask.java:138)
> at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
> at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
> at java.lang.Thread.run(Thread.java:662)
> {noformat}
> I think this is a regression from HADOOP-9817. I modified some code and the unit test passed. (See the attached patch.) However, I think the impact is larger. I will add some unit tests to verify the behavior, and work on a more complete fix.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira