You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pig.apache.org by "Lorand Bendig (JIRA)" <ji...@apache.org> on 2014/06/16 00:04:01 UTC

[jira] [Updated] (PIG-3365) Run as uber job if there is only one input split

     [ https://issues.apache.org/jira/browse/PIG-3365?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Lorand Bendig updated PIG-3365:
-------------------------------

    Attachment: PIG-3365.patch

Patch based on the comments above. When {{mapreduce.job.ubertask.maxbytes}} is not set, default block size is the block size of the job's staging dir file system. 
(similarly what Hadoop 2.4.0 does in {{JobImpl#makeUberDecision}}). I renamed the property to pig.opt.ubertask.hint and set it to true by default.

I can change {{opt.fetch}} to {{pig.opt.fetch}} but changing the others would likely break backward compatibility.


> Run as uber job if there is only one input split
> ------------------------------------------------
>
>                 Key: PIG-3365
>                 URL: https://issues.apache.org/jira/browse/PIG-3365
>             Project: Pig
>          Issue Type: Improvement
>            Reporter: Rohini Palaniswamy
>            Assignee: Lorand Bendig
>              Labels: Performance
>         Attachments: PIG-3365.patch
>
>
> Hadoop 2 has support for uber mode (mapreduce.job.ubertask.enable=true) which runs the map and reduce on Application Master itself and reduces the overhead of launching a separate map/reduce task. 



--
This message was sent by Atlassian JIRA
(v6.2#6252)