You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pig.apache.org by "Olga Natkovich (JIRA)" <ji...@apache.org> on 2010/08/20 02:30:18 UTC
[jira] Created: (PIG-1550) better error handling in casting
relations to scalars
better error handling in casting relations to scalars
-----------------------------------------------------
Key: PIG-1550
URL: https://issues.apache.org/jira/browse/PIG-1550
Project: Pig
Issue Type: Bug
Reporter: Olga Natkovich
Assignee: Thejas M Nair
Fix For: 0.8.0
I ran the following script:
Input data:
joe 100
sam 20
bob 134
Script:
A = load 'user_clicks' as (user: chararray, clicks: int);
B = group A by user;
C = foreach A generate group, SUM(A.clicks);
D = foreach A generate clicks/(double)C.$1;
dump C;
Since C contains more than 1 tuple, I expected to get an error which I did. However, the error was not very clear. When the job failed, I did see a valid error (however it lacked the error code): 210630 [main] ERROR org.apache.pig.tools.pigstats.PigStats - ERROR 0: Scalar has more than one row in the output
However at the end of processing, I saw a misleading error:
210709 [main] ERROR org.apache.pig.tools.grunt.Grunt - ERROR 2088: Unable to get results for: hdfs://wilbur20.labs.corp.sp1.yahoo.com:9020/tmp/temp818551960/tmp1063730945:org.apache.pig.impl.io.InterStorage
10/08/19 17:16:22 ERROR grunt.Grunt: ERROR 2088: Unable to get results for: hdfs://wilbur20.labs.corp.sp1.yahoo.com:9020/tmp/temp818551960/tmp1063730945:org.apache.pig.impl.io.InterStorage
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
[jira] Updated: (PIG-1550) better error handling in casting
relations to scalars
Posted by "Thejas M Nair (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/PIG-1550?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Thejas M Nair updated PIG-1550:
-------------------------------
Status: Resolved (was: Patch Available)
Resolution: Fixed
Patch committed to trunk and 0.8 branch.
> better error handling in casting relations to scalars
> -----------------------------------------------------
>
> Key: PIG-1550
> URL: https://issues.apache.org/jira/browse/PIG-1550
> Project: Pig
> Issue Type: Bug
> Reporter: Olga Natkovich
> Assignee: Thejas M Nair
> Fix For: 0.8.0
>
> Attachments: PIG-1550.1.patch
>
>
> I ran the following script:
> Input data:
> joe 100
> sam 20
> bob 134
> Script:
> A = load 'user_clicks' as (user: chararray, clicks: int);
> B = group A by user;
> C = foreach A generate group, SUM(A.clicks);
> D = foreach A generate clicks/(double)C.$1;
> dump C;
> Since C contains more than 1 tuple, I expected to get an error which I did. However, the error was not very clear. When the job failed, I did see a valid error (however it lacked the error code): 210630 [main] ERROR org.apache.pig.tools.pigstats.PigStats - ERROR 0: Scalar has more than one row in the output
> However at the end of processing, I saw a misleading error:
> 210709 [main] ERROR org.apache.pig.tools.grunt.Grunt - ERROR 2088: Unable to get results for: hdfs://wilbur20.labs.corp.sp1.yahoo.com:9020/tmp/temp818551960/tmp1063730945:org.apache.pig.impl.io.InterStorage
> 10/08/19 17:16:22 ERROR grunt.Grunt: ERROR 2088: Unable to get results for: hdfs://wilbur20.labs.corp.sp1.yahoo.com:9020/tmp/temp818551960/tmp1063730945:org.apache.pig.impl.io.InterStorage
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
[jira] Updated: (PIG-1550) better error handling in casting
relations to scalars
Posted by "Thejas M Nair (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/PIG-1550?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Thejas M Nair updated PIG-1550:
-------------------------------
Attachment: PIG-1550.1.patch
PIG-1550.1.patch
test-patch has succeeded . unit tests are still running.
[exec] +1 overall.
[exec]
[exec] +1 @author. The patch does not contain any @author tags.
[exec]
[exec] +1 tests included. The patch appears to include 3 new or modified tests.
[exec]
[exec] +1 javadoc. The javadoc tool did not generate any warning messages.
[exec]
[exec] +1 javac. The applied patch does not increase the total number of javac compiler warnings.
[exec]
[exec] +1 findbugs. The patch does not introduce any new Findbugs warnings.
[exec]
[exec] +1 release audit. The applied patch does not increase the total number of release audit warnings.
> better error handling in casting relations to scalars
> -----------------------------------------------------
>
> Key: PIG-1550
> URL: https://issues.apache.org/jira/browse/PIG-1550
> Project: Pig
> Issue Type: Bug
> Reporter: Olga Natkovich
> Assignee: Thejas M Nair
> Fix For: 0.8.0
>
> Attachments: PIG-1550.1.patch
>
>
> I ran the following script:
> Input data:
> joe 100
> sam 20
> bob 134
> Script:
> A = load 'user_clicks' as (user: chararray, clicks: int);
> B = group A by user;
> C = foreach A generate group, SUM(A.clicks);
> D = foreach A generate clicks/(double)C.$1;
> dump C;
> Since C contains more than 1 tuple, I expected to get an error which I did. However, the error was not very clear. When the job failed, I did see a valid error (however it lacked the error code): 210630 [main] ERROR org.apache.pig.tools.pigstats.PigStats - ERROR 0: Scalar has more than one row in the output
> However at the end of processing, I saw a misleading error:
> 210709 [main] ERROR org.apache.pig.tools.grunt.Grunt - ERROR 2088: Unable to get results for: hdfs://wilbur20.labs.corp.sp1.yahoo.com:9020/tmp/temp818551960/tmp1063730945:org.apache.pig.impl.io.InterStorage
> 10/08/19 17:16:22 ERROR grunt.Grunt: ERROR 2088: Unable to get results for: hdfs://wilbur20.labs.corp.sp1.yahoo.com:9020/tmp/temp818551960/tmp1063730945:org.apache.pig.impl.io.InterStorage
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
[jira] Commented: (PIG-1550) better error handling in casting
relations to scalars
Posted by "Olga Natkovich (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/PIG-1550?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12905731#action_12905731 ]
Olga Natkovich commented on PIG-1550:
-------------------------------------
+1, looks good
> better error handling in casting relations to scalars
> -----------------------------------------------------
>
> Key: PIG-1550
> URL: https://issues.apache.org/jira/browse/PIG-1550
> Project: Pig
> Issue Type: Bug
> Reporter: Olga Natkovich
> Assignee: Thejas M Nair
> Fix For: 0.8.0
>
> Attachments: PIG-1550.1.patch
>
>
> I ran the following script:
> Input data:
> joe 100
> sam 20
> bob 134
> Script:
> A = load 'user_clicks' as (user: chararray, clicks: int);
> B = group A by user;
> C = foreach A generate group, SUM(A.clicks);
> D = foreach A generate clicks/(double)C.$1;
> dump C;
> Since C contains more than 1 tuple, I expected to get an error which I did. However, the error was not very clear. When the job failed, I did see a valid error (however it lacked the error code): 210630 [main] ERROR org.apache.pig.tools.pigstats.PigStats - ERROR 0: Scalar has more than one row in the output
> However at the end of processing, I saw a misleading error:
> 210709 [main] ERROR org.apache.pig.tools.grunt.Grunt - ERROR 2088: Unable to get results for: hdfs://wilbur20.labs.corp.sp1.yahoo.com:9020/tmp/temp818551960/tmp1063730945:org.apache.pig.impl.io.InterStorage
> 10/08/19 17:16:22 ERROR grunt.Grunt: ERROR 2088: Unable to get results for: hdfs://wilbur20.labs.corp.sp1.yahoo.com:9020/tmp/temp818551960/tmp1063730945:org.apache.pig.impl.io.InterStorage
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
[jira] Commented: (PIG-1550) better error handling in casting
relations to scalars
Posted by "Olga Natkovich (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/PIG-1550?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12905717#action_12905717 ]
Olga Natkovich commented on PIG-1550:
-------------------------------------
I will review the patch
> better error handling in casting relations to scalars
> -----------------------------------------------------
>
> Key: PIG-1550
> URL: https://issues.apache.org/jira/browse/PIG-1550
> Project: Pig
> Issue Type: Bug
> Reporter: Olga Natkovich
> Assignee: Thejas M Nair
> Fix For: 0.8.0
>
> Attachments: PIG-1550.1.patch
>
>
> I ran the following script:
> Input data:
> joe 100
> sam 20
> bob 134
> Script:
> A = load 'user_clicks' as (user: chararray, clicks: int);
> B = group A by user;
> C = foreach A generate group, SUM(A.clicks);
> D = foreach A generate clicks/(double)C.$1;
> dump C;
> Since C contains more than 1 tuple, I expected to get an error which I did. However, the error was not very clear. When the job failed, I did see a valid error (however it lacked the error code): 210630 [main] ERROR org.apache.pig.tools.pigstats.PigStats - ERROR 0: Scalar has more than one row in the output
> However at the end of processing, I saw a misleading error:
> 210709 [main] ERROR org.apache.pig.tools.grunt.Grunt - ERROR 2088: Unable to get results for: hdfs://wilbur20.labs.corp.sp1.yahoo.com:9020/tmp/temp818551960/tmp1063730945:org.apache.pig.impl.io.InterStorage
> 10/08/19 17:16:22 ERROR grunt.Grunt: ERROR 2088: Unable to get results for: hdfs://wilbur20.labs.corp.sp1.yahoo.com:9020/tmp/temp818551960/tmp1063730945:org.apache.pig.impl.io.InterStorage
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
[jira] Updated: (PIG-1550) better error handling in casting
relations to scalars
Posted by "Thejas M Nair (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/PIG-1550?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Thejas M Nair updated PIG-1550:
-------------------------------
Status: Patch Available (was: Open)
> better error handling in casting relations to scalars
> -----------------------------------------------------
>
> Key: PIG-1550
> URL: https://issues.apache.org/jira/browse/PIG-1550
> Project: Pig
> Issue Type: Bug
> Reporter: Olga Natkovich
> Assignee: Thejas M Nair
> Fix For: 0.8.0
>
> Attachments: PIG-1550.1.patch
>
>
> I ran the following script:
> Input data:
> joe 100
> sam 20
> bob 134
> Script:
> A = load 'user_clicks' as (user: chararray, clicks: int);
> B = group A by user;
> C = foreach A generate group, SUM(A.clicks);
> D = foreach A generate clicks/(double)C.$1;
> dump C;
> Since C contains more than 1 tuple, I expected to get an error which I did. However, the error was not very clear. When the job failed, I did see a valid error (however it lacked the error code): 210630 [main] ERROR org.apache.pig.tools.pigstats.PigStats - ERROR 0: Scalar has more than one row in the output
> However at the end of processing, I saw a misleading error:
> 210709 [main] ERROR org.apache.pig.tools.grunt.Grunt - ERROR 2088: Unable to get results for: hdfs://wilbur20.labs.corp.sp1.yahoo.com:9020/tmp/temp818551960/tmp1063730945:org.apache.pig.impl.io.InterStorage
> 10/08/19 17:16:22 ERROR grunt.Grunt: ERROR 2088: Unable to get results for: hdfs://wilbur20.labs.corp.sp1.yahoo.com:9020/tmp/temp818551960/tmp1063730945:org.apache.pig.impl.io.InterStorage
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
[jira] Commented: (PIG-1550) better error handling in casting
relations to scalars
Posted by "Thejas M Nair (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/PIG-1550?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12905715#action_12905715 ]
Thejas M Nair commented on PIG-1550:
------------------------------------
Unit tests have succeeded. Patch is ready for review.
> better error handling in casting relations to scalars
> -----------------------------------------------------
>
> Key: PIG-1550
> URL: https://issues.apache.org/jira/browse/PIG-1550
> Project: Pig
> Issue Type: Bug
> Reporter: Olga Natkovich
> Assignee: Thejas M Nair
> Fix For: 0.8.0
>
> Attachments: PIG-1550.1.patch
>
>
> I ran the following script:
> Input data:
> joe 100
> sam 20
> bob 134
> Script:
> A = load 'user_clicks' as (user: chararray, clicks: int);
> B = group A by user;
> C = foreach A generate group, SUM(A.clicks);
> D = foreach A generate clicks/(double)C.$1;
> dump C;
> Since C contains more than 1 tuple, I expected to get an error which I did. However, the error was not very clear. When the job failed, I did see a valid error (however it lacked the error code): 210630 [main] ERROR org.apache.pig.tools.pigstats.PigStats - ERROR 0: Scalar has more than one row in the output
> However at the end of processing, I saw a misleading error:
> 210709 [main] ERROR org.apache.pig.tools.grunt.Grunt - ERROR 2088: Unable to get results for: hdfs://wilbur20.labs.corp.sp1.yahoo.com:9020/tmp/temp818551960/tmp1063730945:org.apache.pig.impl.io.InterStorage
> 10/08/19 17:16:22 ERROR grunt.Grunt: ERROR 2088: Unable to get results for: hdfs://wilbur20.labs.corp.sp1.yahoo.com:9020/tmp/temp818551960/tmp1063730945:org.apache.pig.impl.io.InterStorage
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.