Posted to dev@pig.apache.org by "Rohini Palaniswamy (JIRA)" <ji...@apache.org> on 2014/01/30 17:42:09 UTC
[jira] [Commented] (PIG-3641) Split "otherwise" producing incorrect
output when combined with ColumnPruning
[ https://issues.apache.org/jira/browse/PIG-3641?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13886759#comment-13886759 ]
Rohini Palaniswamy commented on PIG-3641:
-----------------------------------------
+1. I ran the full suite of unit tests and they pass, except for TestAutoLocalMode, whose failure is unrelated to this patch.
> Split "otherwise" producing incorrect output when combined with ColumnPruning
> -----------------------------------------------------------------------------
>
> Key: PIG-3641
> URL: https://issues.apache.org/jira/browse/PIG-3641
> Project: Pig
> Issue Type: Bug
> Affects Versions: 0.10.0, 0.12.0, 0.11.1, 0.13.0
> Reporter: Koji Noguchi
> Assignee: Koji Noguchi
> Attachments: pig-3641_v01.patch, pig-3641_v02_withe2etest.patch
>
>
> Our user observed incorrect output depending on whether the query had intermediate output. Below is a simplified test case I came up with.
> {noformat}
> knoguchi pig> cat test.txt
> 9,1,ignored
> 9,1,ignored
> 9,1,ignored
> knoguchi pig> cat bz-6590644/test.pig
> A = load 'test.txt' using PigStorage(',') as (a1:int, a2:int, a3:chararray);
> B = foreach A generate a1,a2;
> SPLIT B into C1 if a2 == 1, D1 otherwise;
> C2 = foreach C1 generate a2;
> store C2 into '/tmp/testC';
> store D1 into '/tmp/testD';
> knoguchi@nameother-lm pig>
> {noformat}
> The incorrect output is shown below. /tmp/testD should be empty, because every input row has a2 == 1 and therefore goes to C1, but it contains data.
> {noformat}
> knoguchi@nameother-lm pig> cat /tmp/testC/part-m-00000
> 1
> 1
> 1
> knoguchi pig> cat /tmp/testD/part-m-00000
> 9 1
> 9 1
> 9 1
> knoguchi pig>
> {noformat}
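
For reference, the expected behaviour of the test case can be modelled outside Pig. The following is a minimal Python sketch, not Pig code: the helper name split_otherwise is hypothetical, and it assumes the "otherwise" branch simply receives the rows that fail the condition (null handling is ignored, since the test data has no nulls).

```python
# Hypothetical model of the expected SPLIT ... OTHERWISE result for the
# test case above. This is an illustration only, not how Pig implements it.

def split_otherwise(rows, predicate):
    """Return (matched, otherwise): rows satisfying predicate, and the rest."""
    matched = [row for row in rows if predicate(row)]
    otherwise = [row for row in rows if not predicate(row)]
    return matched, otherwise

# B = foreach A generate a1, a2;  (the three rows of test.txt, a3 dropped)
b = [(9, 1), (9, 1), (9, 1)]

# SPLIT B into C1 if a2 == 1, D1 otherwise;
c1, d1 = split_otherwise(b, lambda row: row[1] == 1)

# C2 = foreach C1 generate a2;
c2 = [(a2,) for (_a1, a2) in c1]

print(c2)  # [(1,), (1,), (1,)]
print(d1)  # []
```

Under these assumptions D1 is empty, which matches the statement that /tmp/testD should contain no data.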
--
This message was sent by Atlassian JIRA
(v6.1.5#6160)