You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Marcelo Vanzin (JIRA)" <ji...@apache.org> on 2014/10/04 01:01:33 UTC

[jira] [Created] (SPARK-3788) Yarn dist cache code is not friendly to HDFS HA, Federation

Marcelo Vanzin created SPARK-3788:
-------------------------------------

             Summary: Yarn dist cache code is not friendly to HDFS HA, Federation
                 Key: SPARK-3788
                 URL: https://issues.apache.org/jira/browse/SPARK-3788
             Project: Spark
          Issue Type: Bug
          Components: YARN
            Reporter: Marcelo Vanzin


There are two bugs here.

1. The {{compareFs()}} method in ClientBase considers the 'host' part of the URI to be an actual host. In the case of HA and Federation, that's a namespace name, which doesn't resolve to anything. So in those cases, {{compareFs()}} always says the file systems are different.

2. In {{prepareLocalResources()}}, when adding a file to the distributed cache, that is done with the common FileSystem object instantiated at the start of the method. In the case of Federation that doesn't work: the qualified URL's scheme may differ from the non-qualified one, so the FileSystem instance will not work.

Fixes are pretty trivial.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org