You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pig.apache.org by "Dmitriy V. Ryaboy (Updated) (JIRA)" <ji...@apache.org> on 2012/01/13 23:24:39 UTC

[jira] [Updated] (PIG-2474) DuplicateForEachColumnRewrite fails in some pig scripts

     [ https://issues.apache.org/jira/browse/PIG-2474?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Dmitriy V. Ryaboy updated PIG-2474:
-----------------------------------

    Description: 
The following script fails:

{code}
l = load 'tmp/t' as (key:int, v1, v2);              
g = group l by key;                                 
f = foreach g { o = order l by v2; generate l.v1; };
describe f;
f: {key: int,v1: bytearray,v2: bytearray}
-- NOTE: this schema for f is the schema of o, not of l.v1 that we generate!
dump f
2012-01-13 22:19:09,947 [main] INFO  org.apache.pig.tools.pigstats.ScriptState - Pig features used in the script: GROUP_BY
2012-01-13 22:19:09,974 [main] ERROR org.apache.pig.tools.grunt.Grunt - ERROR 2000: Error processing rule DuplicateForEachColumnRewrite
{code}


  was:
The following script fails:

{code}
l = load 'tmp/t' as (key:int, v1, v2);              
g = group l by key;                                 
f = foreach g { o = order l by v2; generate l.v1; };
describe f;
f: {key: int,v1: bytearray,v2: bytearray}
-- NOTE: this schema for f is the schema of o, not of l.v1 that we generate!
dump f
2012-01-13 22:19:09,947 [main] INFO  org.apache.pig.tools.pigstats.ScriptState - Pig features used in the script: GROUP_BY
2012-01-13 22:19:09,974 [main] ERROR org.apache.pig.tools.grunt.Grunt - ERROR 2000: Error processing rule DuplicateForEachColumnRewrite



    
> DuplicateForEachColumnRewrite fails in some pig scripts
> -------------------------------------------------------
>
>                 Key: PIG-2474
>                 URL: https://issues.apache.org/jira/browse/PIG-2474
>             Project: Pig
>          Issue Type: Bug
>    Affects Versions: 0.9.2
>            Reporter: Dmitriy V. Ryaboy
>
> The following script fails:
> {code}
> l = load 'tmp/t' as (key:int, v1, v2);              
> g = group l by key;                                 
> f = foreach g { o = order l by v2; generate l.v1; };
> describe f;
> f: {key: int,v1: bytearray,v2: bytearray}
> -- NOTE: this schema for f is the schema of o, not of l.v1 that we generate!
> dump f
> 2012-01-13 22:19:09,947 [main] INFO  org.apache.pig.tools.pigstats.ScriptState - Pig features used in the script: GROUP_BY
> 2012-01-13 22:19:09,974 [main] ERROR org.apache.pig.tools.grunt.Grunt - ERROR 2000: Error processing rule DuplicateForEachColumnRewrite
> {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira