You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pig.apache.org by "Brock Noland (JIRA)" <ji...@apache.org> on 2014/12/02 03:32:12 UTC
[jira] [Commented] (PIG-4239) "pig.output.lazy" not works in spark
mode
[ https://issues.apache.org/jira/browse/PIG-4239?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14230863#comment-14230863 ]
Brock Noland commented on PIG-4239:
-----------------------------------
Patch looks good to me! Nice work!
{noformat}
storeJobConf.set("mapred.output.dir", poStore.getSFile()
.getFileName());
{noformat}
I think we should use the {{FileOutputFormat.set*}} method here..
> "pig.output.lazy" not works in spark mode
> -----------------------------------------
>
> Key: PIG-4239
> URL: https://issues.apache.org/jira/browse/PIG-4239
> Project: Pig
> Issue Type: Bug
> Components: spark
> Reporter: liyunzhang_intel
> Assignee: liyunzhang_intel
> Attachments: PIG-4239.patch, lazy, lazy.pig
>
>
> If "pig.output.lazy" is "true", empty part files from the output will be omitted.
> steps to reproduce
> 1. set "pig.output.lazy" as “true" in $PIG_HOME/conf/pig.properties.
> 2. run following lazy.pig script in spark mode:
> cat lazy.pig
> a = load '/user/pig/lazy' using PigStorage();
> b = filter a by $0 == 'hey';
> c = store b into '/tmp/lazy.out';
> lazy.pig and lazy are attached
> 3. empty file "/tmp/lazy.out/part-rxxxx” is still generated, the empty file is expected not to be generated when "pig.output.lazy" is "true"
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)