You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@oozie.apache.org by "jdnurmi (JIRA)" <ji...@apache.org> on 2013/03/01 01:59:13 UTC
[jira] [Commented] (OOZIE-1241) DistributedCache unqualification
prevents oozie from reading applications from non 'default' filesystems
[ https://issues.apache.org/jira/browse/OOZIE-1241?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13590122#comment-13590122 ]
jdnurmi commented on OOZIE-1241:
--------------------------------
>From fairly limited testing, this does seem to allow oozie on apache hadoop to interact with application.xml's and similar on S3;
To be clear, it was this email:
http://mail-archives.apache.org/mod_mbox/oozie-user/201212.mbox/browser
and a class path update hinted at by:
https://groups.google.com/a/cloudera.org/forum/?fromgroups=#!topic/cdh-user/rY99hCM2V_A
that allowed S3 to be used as a source for oozie workflows.
I have not experienced any obvious issues with this patch, but will update this ticket if I notice any.
> DistributedCache unqualification prevents oozie from reading applications from non 'default' filesystems
> --------------------------------------------------------------------------------------------------------
>
> Key: OOZIE-1241
> URL: https://issues.apache.org/jira/browse/OOZIE-1241
> Project: Oozie
> Issue Type: Bug
> Components: action
> Affects Versions: 3.3.1
> Environment: Sun Java6, oozie 3.3.0 (and 3.3.1), ubuntu lucid (+/-)
> Reporter: jdnurmi
> Priority: Minor
> Attachments: urifix.patch
>
>
> For example, a workflow config similar to:
> oozie.wf.application.path=s3n://foo/bar/baz
> Pretty much regardless of the workflow itself will yield something akin to
> "JA008: File does not exist: /bar/baz/oozie/mapred-job-launcher.jar"
> Chasing this down a bit, I ran across:
> http://mail-archives.apache.org/mod_mbox/oozie-user/201212.mbox/browser
> which seemed like a cogent analysis of the problem I was facing.
> The 1-line patch to be attached "fixes the glitch", and passes all tests on 3.3.0 (confirming 3.3.1), but I can't swear it doesn't introduce further problems.
> (There is an additional subtle quirk, in that a Path with an anchor that has been ToURI'd puts the anchor in the URI's /path/ as opposed to its fragment - causing it to be URL encoded later on in the process. By using new URI(path.toString()), the fragment is properly parsed).
> I am still waiting for a few workflows to run to see if this fixes "all" s3 access, but wanted to collect feedback early in case this is a bad path to travel down for other reasons. It at least appears to solve the case of your workflow.xml being in S3.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira