Posted to dev@pig.apache.org by "Daniel Dai (JIRA)" <ji...@apache.org> on 2017/05/25 07:06:04 UTC
[jira] [Commented] (PIG-5224) Extra foreach from ColumnPrune preventing Accumulator usage
[ https://issues.apache.org/jira/browse/PIG-5224?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16024320#comment-16024320 ]
Daniel Dai commented on PIG-5224:
---------------------------------
The inserted LOForEach removes all the columns that are not used later in the script. The next LOForEach does not necessarily do that. I believe the extra LOForEach is not there for performance reasons (the performance gain from removing several columns is debatable); it is there to keep ColumnPruner simpler.
> Extra foreach from ColumnPrune preventing Accumulator usage
> -----------------------------------------------------------
>
> Key: PIG-5224
> URL: https://issues.apache.org/jira/browse/PIG-5224
> Project: Pig
> Issue Type: Improvement
> Reporter: Koji Noguchi
> Assignee: Koji Noguchi
> Attachments: pig-5224-v0-testonly.patch, pig-5224-v1.patch
>
>
> {code}
> A = load 'input' as (id:int, fruit);
> B = foreach A generate id; -- to enable columnprune
> C = group B by id;
> D = foreach C {
>     o = order B by id;
>     generate org.apache.pig.test.utils.AccumulatorBagCount(o);
> }
> STORE D into ...
> {code}
> Pig fails to use Accumulator interface for this UDF.