You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@oozie.apache.org by Panshul Whisper <ou...@gmail.com> on 2013/03/30 01:12:02 UTC

Error running pig from oozie workflow

Hello,

I am trying to run a pig script from within an oozie workflow. I am running
CDH4 and using Hue to create the workflow and execute it.  But I am getting
an error :

 Failing Oozie Launcher, Main class
[org.apache.oozie.action.hadoop.PigMain], main() threw exception,
org/apache/pig/Main
                              java.lang.NoClassDefFoundError:
org/apache/pig/Main
                              at
org.apache.oozie.action.hadoop.PigMain.runPigJob(PigMain.java:323)
                              at
org.apache.oozie.action.hadoop.PigMain.run(PigMain.java:218)


The workflow xml:

<workflow-app name="testing workflow" xmlns="uri:oozie:workflow:0.4">
    <start to="app10sample"/>
    <action name="app10sample">
        <pig>
            <job-tracker>${jobTracker}</job-tracker>
            <name-node>${nameNode}</name-node>
            <prepare>
                  <delete
path="${nameNode}/user/panshul/pig/output/app10samples/"/>
            </prepare>
            <script>/user/panshul/pig/steamSamples/app10sample.pig</script>
        </pig>
        <ok to="end"/>
        <error to="kill"/>
    </action>
    <kill name="kill">
        <message>Action failed, error
message[${wf:errorMessage(wf:lastErrorNode())}]</message>
    </kill>
    <end name="end"/>
</workflow-app>


Found this error message when I clicked on the job id of the pig task
submitted:

*Cannot access:
/user/panshul/oozie-oozi/0000003-130315174901117-oozie-oozi-W/app10sample--pig/output.
Note: you are a Hue admin but not a HDFS superuser (which is "hdfs").*

[Errno 2] File
/user/panshul/oozie-oozi/0000003-130315174901117-oozie-oozi-W/app10sample--pig/output
not found



The full error log is attached to the mail.
The full task tracker log is also attached to the mail.

Please help me find out the problem. I have no idea which way to proceed.

Thanking You,
-- 
Regards,
Ouch Whisper
010101010101

Re: Error running pig from oozie workflow

Posted by Ryota Egashira <eg...@yahoo-inc.com>.
Hi, Panshul
I'm not familiar with CDH/Hue,  but I saw this error when pig.jar is not provided.
The easiest way might be to copy pig.jar under /lib directory of workflow application path (directory where workflow.xml is located) on HDFS, then it should be automatically picked up by oozie.

Thanks
Ryota

From: Panshul Whisper <ou...@gmail.com>>
Reply-To: "user@oozie.apache.org<ma...@oozie.apache.org>" <us...@oozie.apache.org>>
Date: Friday, March 29, 2013 5:12 PM
To: "user@oozie.apache.org<ma...@oozie.apache.org>" <us...@oozie.apache.org>>
Subject: Error running pig from oozie workflow

Hello,

I am trying to run a pig script from within an oozie workflow. I am running CDH4 and using Hue to create the workflow and execute it.  But I am getting an error :


 Failing Oozie Launcher, Main class [org.apache.oozie.action.hadoop.PigMain], main() threw exception, org/apache/pig/Main
                              java.lang.NoClassDefFoundError: org/apache/pig/Main
                              at org.apache.oozie.action.hadoop.PigMain.runPigJob(PigMain.java:323)
                              at org.apache.oozie.action.hadoop.PigMain.run(PigMain.java:218)

The workflow xml:

<workflow-app name="testing workflow" xmlns="uri:oozie:workflow:0.4">
    <start to="app10sample"/>
    <action name="app10sample">
        <pig>
            <job-tracker>${jobTracker}</job-tracker>
            <name-node>${nameNode}</name-node>
            <prepare>
                  <delete path="${nameNode}/user/panshul/pig/output/app10samples/"/>
            </prepare>
            <script>/user/panshul/pig/steamSamples/app10sample.pig</script>
        </pig>
        <ok to="end"/>
        <error to="kill"/>
    </action>
    <kill name="kill">
        <message>Action failed, error message[${wf:errorMessage(wf:lastErrorNode())}]</message>
    </kill>
    <end name="end"/>
</workflow-app>


Found this error message when I clicked on the job id of the pig task submitted:


Cannot access: /user/panshul/oozie-oozi/0000003-130315174901117-oozie-oozi-W/app10sample--pig/output. Note: you are a Hue admin but not a HDFS superuser (which is "hdfs").

[Errno 2] File /user/panshul/oozie-oozi/0000003-130315174901117-oozie-oozi-W/app10sample--pig/output not found



The full error log is attached to the mail.
The full task tracker log is also attached to the mail.

Please help me find out the problem. I have no idea which way to proceed.

Thanking You,
--
Regards,
Ouch Whisper
010101010101