You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@pig.apache.org by "Barreto, Rafael" <ra...@thebackplane.com> on 2013/03/29 22:28:42 UTC

Unexpected error during execution

Hey people,

I'm facing a weird problem running a Pig script on EMR. It fails with
this error message:

2013-03-29 21:07:15,929 [main] ERROR org.apache.pig.tools.grunt.Grunt
- ERROR 2043: Unexpected error during execution.

However, the output is correctly produced and all jobs are reported to
be completed with success. In my investigation I ended up checking the
file

jobs/job_201303292045_0038_1364591135564_hadoop_PigLatin%253Aget%255Ftime%255Fon%255Fsite.pig

produced by the EMR logs. This is for the last job and it contains a
lot of messages like this:

MapAttempt TASK_TYPE="MAP" TASKID="task_201303292045_0038_m_000084"
TASK_ATTEMPT_ID="attempt_201303292045_0038_m_000084_1"
TASK_STATUS="FAILED" FINISH_TIME="1364591223800"
HOSTNAME="10\.6\.179\.195" ERROR="java\.io\.IOException: File already
exists:s3://backplane-analytics/batch_results/20130329-204202190520/get_time_on_site/part-m-00084
	at org\.apache\.hadoop\.fs\.s3native\.NativeS3FileSystem\.create(NativeS3FileSystem\.java:647)
	at org\.apache\.hadoop\.fs\.FileSystem\.create(FileSystem\.java:557)
	at org\.apache\.hadoop\.fs\.FileSystem\.create(FileSystem\.java:538)
	at org\.apache\.hadoop\.fs\.FileSystem\.create(FileSystem\.java:445)
	at org\.apache\.pig\.backend\.hadoop\.executionengine\.mapReduceLayer\.PigTextOutputFormat\.getRecordWriter(PigTextOutputFormat\.java:98)
	at org\.apache\.pig\.backend\.hadoop\.executionengine\.mapReduceLayer\.PigOutputFormat\.getRecordWriter(PigOutputFormat\.java:84)
	at org\.apache\.hadoop\.mapred\.MapTask$NewDirectOutputCollector\.<init>(MapTask\.java:635)
	at org\.apache\.hadoop\.mapred\.MapTask\.runNewMapper(MapTask\.java:760)
	at org\.apache\.hadoop\.mapred\.MapTask\.run(MapTask\.java:375)
	at org\.apache\.hadoop\.mapred\.Child$4\.run(Child\.java:255)
	at java\.security\.AccessController\.doPrivileged(Native Method)
	at javax\.security\.auth\.Subject\.doAs(Subject\.java:396)
	at org\.apache\.hadoop\.security\.UserGroupInformation\.doAs(UserGroupInformation\.java:1132)
	at org\.apache\.hadoop\.mapred\.Child\.main(Child\.java:249)

I don't have any clue to what could be happening since there's no
explicit error message.

Also, it's relavant to say I'm using datafu's StreamingQuantile UDF.
When I don't use this UDF, it runs ok. So... somehow, this is related
to StreamingQuantile, but since there's no error message, I can't
think of what could go wrong and fix it.

Do you guys have any clue about it?

Thanks a lot in advance,
Rafael