You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pig.apache.org by "Koji Noguchi (JIRA)" <ji...@apache.org> on 2013/12/23 23:52:52 UTC
[jira] [Updated] (PIG-3641) Split "otherwise" producing incorrect
output when combined with ColumnPruning
[ https://issues.apache.org/jira/browse/PIG-3641?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Koji Noguchi updated PIG-3641:
------------------------------
Attachment: pig-3641_v01.patch
This issue is similar to PIG-3051 where projection expression was pointing to a different operator. In Pig-3051, it was about copied LOSort. Here, it's about LOSplitOutput for 'otherwise'.
Attaching a preliminary patch. Need to add testing.
> Split "otherwise" producing incorrect output when combined with ColumnPruning
> -----------------------------------------------------------------------------
>
> Key: PIG-3641
> URL: https://issues.apache.org/jira/browse/PIG-3641
> Project: Pig
> Issue Type: Bug
> Affects Versions: 0.10.0, 0.12.0, 0.11.1, 0.13.0
> Reporter: Koji Noguchi
> Assignee: Koji Noguchi
> Attachments: pig-3641_v01.patch
>
>
> Our user was observing incorrect outputs depending on if the query had intermediate output or not. Below is a simplified testcase I came up with.
> {noformat}
> knoguchi pig> cat test.txt
> 9,1,ignored
> 9,1,ignored
> 9,1,ignored
> knoguchi pig> cat bz-6590644/test.pig
> A = load 'test.txt' using PigStorage(',') as (a1:int, a2:int, a3:chararray);
> B = foreach A generate a1,a2;
> SPLIT B into C1 if a2 == 1, D1 otherwise;
> C2 = foreach C1 generate a2;
> store C2 into '/tmp/testC';
> store D1 into '/tmp/testD';
> knoguchi@nameother-lm pig>
> {noformat}
> Incorrect output shown below. /tmp/testD should be empty but somehow has data in it.
> {noformat}
> knoguchi@nameother-lm pig> cat /tmp/testC/part-m-00000
> 1
> 1
> 1
> knoguchi pig> cat /tmp/testD/part-m-00000
> 9 1
> 9 1
> 9 1
> knoguchi pig>
> {noformat}
--
This message was sent by Atlassian JIRA
(v6.1.5#6160)