You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pig.apache.org by "liyunzhang_intel (JIRA)" <ji...@apache.org> on 2017/04/13 07:01:41 UTC

[jira] [Comment Edited] (PIG-5171) SecondarySort_7 is failing with spark exec type

    [ https://issues.apache.org/jira/browse/PIG-5171?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15967212#comment-15967212 ] 

liyunzhang_intel edited comment on PIG-5171 at 4/13/17 7:00 AM:
----------------------------------------------------------------

[~nkollar]: it is a problem of mr. I guess if change SecondarySort_7 like( change desc to asc) , it will pass because in spark it implements secondary sort in desc while in mr it does not in current case. 
{code}
          {
                    'num' => 7,
                    'java_params' => ['-Dpig.accumulative.batchsize=5'],
                    'pig' => q\register :FUNCPATH:/testudf.jar;
                            a = load ':INPATH:/singlefile/studenttab10k' as (name:chararray, age:int, gpa:float);
                            b = group a by age parallel 10;
                            c = foreach b {
                                d = order a by gpa, name asc;
                                generate group, org.apache.pig.test.udf.evalfunc.AllFirstLetter(d.gpa), org.apache.pig.test.udf.evalfunc.AllFirstLetter(d.name);
                            };
                            store c into ':OUTPATH:';\,
                    },
{code}

Can you help verify it? if it is a mr bug, file jira and close current jira.


was (Author: kellyzly):
[~nkollar]: it is a problem of mr. I guess if change SecondarySort_7 like( change desc to asc) , it will pass because in spark it implements secondary sort in desc while in mr it does not. 
{code}
          {
                    'num' => 7,
                    'java_params' => ['-Dpig.accumulative.batchsize=5'],
                    'pig' => q\register :FUNCPATH:/testudf.jar;
                            a = load ':INPATH:/singlefile/studenttab10k' as (name:chararray, age:int, gpa:float);
                            b = group a by age parallel 10;
                            c = foreach b {
                                d = order a by gpa, name asc;
                                generate group, org.apache.pig.test.udf.evalfunc.AllFirstLetter(d.gpa), org.apache.pig.test.udf.evalfunc.AllFirstLetter(d.name);
                            };
                            store c into ':OUTPATH:';\,
                    },
{code}

Can you help verify it? if it is a mr bug, file jira and close current jira.

> SecondarySort_7 is failing with spark exec type
> -----------------------------------------------
>
>                 Key: PIG-5171
>                 URL: https://issues.apache.org/jira/browse/PIG-5171
>             Project: Pig
>          Issue Type: Sub-task
>          Components: spark
>            Reporter: Nandor Kollar
>             Fix For: spark-branch
>
>
> different output produced



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)