You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pig.apache.org by "Olga Natkovich (JIRA)" <ji...@apache.org> on 2010/08/20 02:30:18 UTC

[jira] Created: (PIG-1550) better error handling in casting relations to scalars

better error handling in casting relations to scalars
-----------------------------------------------------

                 Key: PIG-1550
                 URL: https://issues.apache.org/jira/browse/PIG-1550
             Project: Pig
          Issue Type: Bug
            Reporter: Olga Natkovich
            Assignee: Thejas M Nair
             Fix For: 0.8.0


I ran the following script:

Input data:

joe     100
sam     20
bob     134

Script:

A = load 'user_clicks' as (user: chararray, clicks: int);
B = group A by user;
C = foreach A generate group, SUM(A.clicks);
D = foreach A generate clicks/(double)C.$1;
dump C;

Since C contains more than 1 tuple, I expected to get an error which I did. However, the error was not very clear. When the job failed, I did see a valid error (however it lacked the error code): 210630 [main] ERROR org.apache.pig.tools.pigstats.PigStats  - ERROR 0: Scalar has more than one row in the output
 However at the end of processing, I saw a misleading error:

210709 [main] ERROR org.apache.pig.tools.grunt.Grunt  - ERROR 2088: Unable to get results for: hdfs://wilbur20.labs.corp.sp1.yahoo.com:9020/tmp/temp818551960/tmp1063730945:org.apache.pig.impl.io.InterStorage
10/08/19 17:16:22 ERROR grunt.Grunt: ERROR 2088: Unable to get results for: hdfs://wilbur20.labs.corp.sp1.yahoo.com:9020/tmp/temp818551960/tmp1063730945:org.apache.pig.impl.io.InterStorage


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (PIG-1550) better error handling in casting relations to scalars

Posted by "Thejas M Nair (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/PIG-1550?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Thejas M Nair updated PIG-1550:
-------------------------------

        Status: Resolved  (was: Patch Available)
    Resolution: Fixed

Patch committed to trunk and 0.8 branch.


> better error handling in casting relations to scalars
> -----------------------------------------------------
>
>                 Key: PIG-1550
>                 URL: https://issues.apache.org/jira/browse/PIG-1550
>             Project: Pig
>          Issue Type: Bug
>            Reporter: Olga Natkovich
>            Assignee: Thejas M Nair
>             Fix For: 0.8.0
>
>         Attachments: PIG-1550.1.patch
>
>
> I ran the following script:
> Input data:
> joe     100
> sam     20
> bob     134
> Script:
> A = load 'user_clicks' as (user: chararray, clicks: int);
> B = group A by user;
> C = foreach A generate group, SUM(A.clicks);
> D = foreach A generate clicks/(double)C.$1;
> dump C;
> Since C contains more than 1 tuple, I expected to get an error which I did. However, the error was not very clear. When the job failed, I did see a valid error (however it lacked the error code): 210630 [main] ERROR org.apache.pig.tools.pigstats.PigStats  - ERROR 0: Scalar has more than one row in the output
>  However at the end of processing, I saw a misleading error:
> 210709 [main] ERROR org.apache.pig.tools.grunt.Grunt  - ERROR 2088: Unable to get results for: hdfs://wilbur20.labs.corp.sp1.yahoo.com:9020/tmp/temp818551960/tmp1063730945:org.apache.pig.impl.io.InterStorage
> 10/08/19 17:16:22 ERROR grunt.Grunt: ERROR 2088: Unable to get results for: hdfs://wilbur20.labs.corp.sp1.yahoo.com:9020/tmp/temp818551960/tmp1063730945:org.apache.pig.impl.io.InterStorage

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (PIG-1550) better error handling in casting relations to scalars

Posted by "Thejas M Nair (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/PIG-1550?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Thejas M Nair updated PIG-1550:
-------------------------------

    Attachment: PIG-1550.1.patch

PIG-1550.1.patch
test-patch has succeeded . unit tests are still running.
     [exec] +1 overall.
     [exec]
     [exec]     +1 @author.  The patch does not contain any @author tags.
     [exec]
     [exec]     +1 tests included.  The patch appears to include 3 new or modified tests.
     [exec]
     [exec]     +1 javadoc.  The javadoc tool did not generate any warning messages.
     [exec]
     [exec]     +1 javac.  The applied patch does not increase the total number of javac compiler warnings.
     [exec]
     [exec]     +1 findbugs.  The patch does not introduce any new Findbugs warnings.
     [exec]
     [exec]     +1 release audit.  The applied patch does not increase the total number of release audit warnings.


> better error handling in casting relations to scalars
> -----------------------------------------------------
>
>                 Key: PIG-1550
>                 URL: https://issues.apache.org/jira/browse/PIG-1550
>             Project: Pig
>          Issue Type: Bug
>            Reporter: Olga Natkovich
>            Assignee: Thejas M Nair
>             Fix For: 0.8.0
>
>         Attachments: PIG-1550.1.patch
>
>
> I ran the following script:
> Input data:
> joe     100
> sam     20
> bob     134
> Script:
> A = load 'user_clicks' as (user: chararray, clicks: int);
> B = group A by user;
> C = foreach A generate group, SUM(A.clicks);
> D = foreach A generate clicks/(double)C.$1;
> dump C;
> Since C contains more than 1 tuple, I expected to get an error which I did. However, the error was not very clear. When the job failed, I did see a valid error (however it lacked the error code): 210630 [main] ERROR org.apache.pig.tools.pigstats.PigStats  - ERROR 0: Scalar has more than one row in the output
>  However at the end of processing, I saw a misleading error:
> 210709 [main] ERROR org.apache.pig.tools.grunt.Grunt  - ERROR 2088: Unable to get results for: hdfs://wilbur20.labs.corp.sp1.yahoo.com:9020/tmp/temp818551960/tmp1063730945:org.apache.pig.impl.io.InterStorage
> 10/08/19 17:16:22 ERROR grunt.Grunt: ERROR 2088: Unable to get results for: hdfs://wilbur20.labs.corp.sp1.yahoo.com:9020/tmp/temp818551960/tmp1063730945:org.apache.pig.impl.io.InterStorage

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (PIG-1550) better error handling in casting relations to scalars

Posted by "Olga Natkovich (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/PIG-1550?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12905731#action_12905731 ] 

Olga Natkovich commented on PIG-1550:
-------------------------------------

+1, looks good

> better error handling in casting relations to scalars
> -----------------------------------------------------
>
>                 Key: PIG-1550
>                 URL: https://issues.apache.org/jira/browse/PIG-1550
>             Project: Pig
>          Issue Type: Bug
>            Reporter: Olga Natkovich
>            Assignee: Thejas M Nair
>             Fix For: 0.8.0
>
>         Attachments: PIG-1550.1.patch
>
>
> I ran the following script:
> Input data:
> joe     100
> sam     20
> bob     134
> Script:
> A = load 'user_clicks' as (user: chararray, clicks: int);
> B = group A by user;
> C = foreach A generate group, SUM(A.clicks);
> D = foreach A generate clicks/(double)C.$1;
> dump C;
> Since C contains more than 1 tuple, I expected to get an error which I did. However, the error was not very clear. When the job failed, I did see a valid error (however it lacked the error code): 210630 [main] ERROR org.apache.pig.tools.pigstats.PigStats  - ERROR 0: Scalar has more than one row in the output
>  However at the end of processing, I saw a misleading error:
> 210709 [main] ERROR org.apache.pig.tools.grunt.Grunt  - ERROR 2088: Unable to get results for: hdfs://wilbur20.labs.corp.sp1.yahoo.com:9020/tmp/temp818551960/tmp1063730945:org.apache.pig.impl.io.InterStorage
> 10/08/19 17:16:22 ERROR grunt.Grunt: ERROR 2088: Unable to get results for: hdfs://wilbur20.labs.corp.sp1.yahoo.com:9020/tmp/temp818551960/tmp1063730945:org.apache.pig.impl.io.InterStorage

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (PIG-1550) better error handling in casting relations to scalars

Posted by "Olga Natkovich (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/PIG-1550?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12905717#action_12905717 ] 

Olga Natkovich commented on PIG-1550:
-------------------------------------

I will review the patch


> better error handling in casting relations to scalars
> -----------------------------------------------------
>
>                 Key: PIG-1550
>                 URL: https://issues.apache.org/jira/browse/PIG-1550
>             Project: Pig
>          Issue Type: Bug
>            Reporter: Olga Natkovich
>            Assignee: Thejas M Nair
>             Fix For: 0.8.0
>
>         Attachments: PIG-1550.1.patch
>
>
> I ran the following script:
> Input data:
> joe     100
> sam     20
> bob     134
> Script:
> A = load 'user_clicks' as (user: chararray, clicks: int);
> B = group A by user;
> C = foreach A generate group, SUM(A.clicks);
> D = foreach A generate clicks/(double)C.$1;
> dump C;
> Since C contains more than 1 tuple, I expected to get an error which I did. However, the error was not very clear. When the job failed, I did see a valid error (however it lacked the error code): 210630 [main] ERROR org.apache.pig.tools.pigstats.PigStats  - ERROR 0: Scalar has more than one row in the output
>  However at the end of processing, I saw a misleading error:
> 210709 [main] ERROR org.apache.pig.tools.grunt.Grunt  - ERROR 2088: Unable to get results for: hdfs://wilbur20.labs.corp.sp1.yahoo.com:9020/tmp/temp818551960/tmp1063730945:org.apache.pig.impl.io.InterStorage
> 10/08/19 17:16:22 ERROR grunt.Grunt: ERROR 2088: Unable to get results for: hdfs://wilbur20.labs.corp.sp1.yahoo.com:9020/tmp/temp818551960/tmp1063730945:org.apache.pig.impl.io.InterStorage

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (PIG-1550) better error handling in casting relations to scalars

Posted by "Thejas M Nair (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/PIG-1550?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Thejas M Nair updated PIG-1550:
-------------------------------

    Status: Patch Available  (was: Open)

> better error handling in casting relations to scalars
> -----------------------------------------------------
>
>                 Key: PIG-1550
>                 URL: https://issues.apache.org/jira/browse/PIG-1550
>             Project: Pig
>          Issue Type: Bug
>            Reporter: Olga Natkovich
>            Assignee: Thejas M Nair
>             Fix For: 0.8.0
>
>         Attachments: PIG-1550.1.patch
>
>
> I ran the following script:
> Input data:
> joe     100
> sam     20
> bob     134
> Script:
> A = load 'user_clicks' as (user: chararray, clicks: int);
> B = group A by user;
> C = foreach A generate group, SUM(A.clicks);
> D = foreach A generate clicks/(double)C.$1;
> dump C;
> Since C contains more than 1 tuple, I expected to get an error which I did. However, the error was not very clear. When the job failed, I did see a valid error (however it lacked the error code): 210630 [main] ERROR org.apache.pig.tools.pigstats.PigStats  - ERROR 0: Scalar has more than one row in the output
>  However at the end of processing, I saw a misleading error:
> 210709 [main] ERROR org.apache.pig.tools.grunt.Grunt  - ERROR 2088: Unable to get results for: hdfs://wilbur20.labs.corp.sp1.yahoo.com:9020/tmp/temp818551960/tmp1063730945:org.apache.pig.impl.io.InterStorage
> 10/08/19 17:16:22 ERROR grunt.Grunt: ERROR 2088: Unable to get results for: hdfs://wilbur20.labs.corp.sp1.yahoo.com:9020/tmp/temp818551960/tmp1063730945:org.apache.pig.impl.io.InterStorage

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (PIG-1550) better error handling in casting relations to scalars

Posted by "Thejas M Nair (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/PIG-1550?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12905715#action_12905715 ] 

Thejas M Nair commented on PIG-1550:
------------------------------------

Unit tests have succeeded. Patch is ready for review.


> better error handling in casting relations to scalars
> -----------------------------------------------------
>
>                 Key: PIG-1550
>                 URL: https://issues.apache.org/jira/browse/PIG-1550
>             Project: Pig
>          Issue Type: Bug
>            Reporter: Olga Natkovich
>            Assignee: Thejas M Nair
>             Fix For: 0.8.0
>
>         Attachments: PIG-1550.1.patch
>
>
> I ran the following script:
> Input data:
> joe     100
> sam     20
> bob     134
> Script:
> A = load 'user_clicks' as (user: chararray, clicks: int);
> B = group A by user;
> C = foreach A generate group, SUM(A.clicks);
> D = foreach A generate clicks/(double)C.$1;
> dump C;
> Since C contains more than 1 tuple, I expected to get an error which I did. However, the error was not very clear. When the job failed, I did see a valid error (however it lacked the error code): 210630 [main] ERROR org.apache.pig.tools.pigstats.PigStats  - ERROR 0: Scalar has more than one row in the output
>  However at the end of processing, I saw a misleading error:
> 210709 [main] ERROR org.apache.pig.tools.grunt.Grunt  - ERROR 2088: Unable to get results for: hdfs://wilbur20.labs.corp.sp1.yahoo.com:9020/tmp/temp818551960/tmp1063730945:org.apache.pig.impl.io.InterStorage
> 10/08/19 17:16:22 ERROR grunt.Grunt: ERROR 2088: Unable to get results for: hdfs://wilbur20.labs.corp.sp1.yahoo.com:9020/tmp/temp818551960/tmp1063730945:org.apache.pig.impl.io.InterStorage

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.