You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pig.apache.org by "liyunzhang_intel (JIRA)" <ji...@apache.org> on 2017/03/10 08:18:04 UTC
[jira] [Comment Edited] (PIG-5167) Limit_4 is failing with spark
exec type
[ https://issues.apache.org/jira/browse/PIG-5167?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15904431#comment-15904431 ]
liyunzhang_intel edited comment on PIG-5167 at 3/10/17 8:17 AM:
----------------------------------------------------------------
[~knoguchi]: thanks your suggestion!
[~nkollar]: try to add sortArgs to original script, i have not tested it and hope it works
{code}
{
'num' => 4,
'pig' =>q\a = load ':INPATH:/singlefile/studentnulltab10k';
b = distinct a;
c = limit b 100;
store c into ':OUTPATH:';\,
+ 'sortArgs' => ['-t', ' ', '-k', '1,2'],
},
{code}
was (Author: kellyzly):
[~knoguchi]: thanks your suggestion!
[~szita]: try to add sortArgs to original script, i have not tested it and hope it works
{code}
{
'num' => 4,
'pig' =>q\a = load ':INPATH:/singlefile/studentnulltab10k';
b = distinct a;
c = limit b 100;
store c into ':OUTPATH:';\,
+ 'sortArgs' => ['-t', ' ', '-k', '1,2'],
},
{code}
> Limit_4 is failing with spark exec type
> ---------------------------------------
>
> Key: PIG-5167
> URL: https://issues.apache.org/jira/browse/PIG-5167
> Project: Pig
> Issue Type: Sub-task
> Components: spark
> Reporter: Nandor Kollar
> Assignee: Nandor Kollar
> Fix For: spark-branch
>
> Attachments: PIG-5167.patch
>
>
> results are different:
> {code}
> diff <(head -n 5 Limit_4.out/out_sorted) <(head -n 5 Limit_4_benchmark.out/out_sorted)
> 1,5c1,5
> < 50 3.00
> < 74 2.22
> < alice carson 66 2.42
> < alice quirinius 71 0.03
> < alice van buren 28 2.50
> ---
> > bob allen 0.28
> > bob allen 22 0.92
> > bob allen 25 2.54
> > bob allen 26 2.35
> > bob allen 27 2.17
> {code}
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)