You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Thomas Graves (JIRA)" <ji...@apache.org> on 2015/09/29 15:07:04 UTC

[jira] [Comment Edited] (SPARK-10858) YARN: archives/jar/files rename with # doesn't work unless scheme given

    [ https://issues.apache.org/jira/browse/SPARK-10858?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14935129#comment-14935129 ] 

Thomas Graves edited comment on SPARK-10858 at 9/29/15 1:06 PM:
----------------------------------------------------------------

yes its a bad thing as users don't know when # works.  It should work in all cases, file://, hdfs://. The default is file:// so I would expect it to act the same whether you specify the scheme or not since that is the default.

[~jerryshao]  what was the error you got in the failed case?  You escaped the # in the first case and not the second. what platform are you on?

I was assuming it was failing when the scheme was explicit because we are using getFragment() for perhaps it wasn't fully parsing the URI without the scheme.


was (Author: tgraves):
yes its a bad thing as users don't know when # works.  It should work in all cases, file://, hdfs://. The default is file:// so I would expect it to act the same whether you specify the scheme or not since that is the default.

[~jerryshao]  what was the error you got in the failed case?  You escaped the # in the first case and now the second. what platform are you on?

I was assuming it was failing when the scheme was explicit because we are using getFragment() for perhaps it wasn't fully parsing the URI without the scheme.

> YARN: archives/jar/files rename with # doesn't work unless scheme given
> -----------------------------------------------------------------------
>
>                 Key: SPARK-10858
>                 URL: https://issues.apache.org/jira/browse/SPARK-10858
>             Project: Spark
>          Issue Type: Bug
>          Components: YARN
>    Affects Versions: 1.5.1
>            Reporter: Thomas Graves
>            Priority: Minor
>
> The YARN distributed cache feature with --jars, --archives, --files where you can rename the file/archive using a # symbol only works if you explicitly include the scheme in the path:
> works:
> --jars file:///home/foo/my.jar#renamed.jar
> doesn't work:
> --jars /home/foo/my.jar#renamed.jar
> Exception in thread "main" java.io.FileNotFoundException: File file:/home/foo/my.jar#renamed.jar does not exist
>         at org.apache.hadoop.fs.RawLocalFileSystem.deprecatedGetFileStatus(RawLocalFileSystem.java:534)
>         at org.apache.hadoop.fs.RawLocalFileSystem.getFileLinkStatusInternal(RawLocalFileSystem.java:747)
>         at org.apache.hadoop.fs.RawLocalFileSystem.getFileStatus(RawLocalFileSystem.java:524)
>         at org.apache.hadoop.fs.FilterFileSystem.getFileStatus(FilterFileSystem.java:416)
>         at org.apache.hadoop.fs.FileUtil.copy(FileUtil.java:337)
>         at org.apache.hadoop.fs.FileUtil.copy(FileUtil.java:289)
>         at org.apache.spark.deploy.yarn.Client.copyFileToRemote(Client.scala:240)
>         at org.apache.spark.deploy.yarn.Client.org$apache$spark$deploy$yarn$Client$$distribute$1(Client.scala:329)
>         at org.apache.spark.deploy.yarn.Client$$anonfun$prepareLocalResources$6$$anonfun$apply$2.apply(Client.scala:393)
>         at org.apache.spark.deploy.yarn.Client$$anonfun$prepareLocalResources$6$$anonfun$apply$2.apply(Client.scala:392)
>         at scala.collection.IndexedSeqOptimized$class.foreach(IndexedSeqOptimized.scala:33)
>         at scala.collection.mutable.ArrayOps$ofRef.foreach(ArrayOps.scala:108)



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org