You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pig.apache.org by "Daniel Dai (JIRA)" <ji...@apache.org> on 2015/03/13 22:51:38 UTC

[jira] [Updated] (PIG-3751) Generating Splits in Tez should be configurable to AM or client

     [ https://issues.apache.org/jira/browse/PIG-3751?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Daniel Dai updated PIG-3751:
----------------------------
    Issue Type: Improvement  (was: Sub-task)
        Parent:     (was: PIG-3446)

> Generating Splits in Tez should be configurable to AM or client
> ---------------------------------------------------------------
>
>                 Key: PIG-3751
>                 URL: https://issues.apache.org/jira/browse/PIG-3751
>             Project: Pig
>          Issue Type: Improvement
>          Components: tez
>            Reporter: Rohini Palaniswamy
>            Assignee: Rohini Palaniswamy
>
> 1) TEZ-752 allows setting list of URIs to get delegation tokens. Set that to make Tez get delegation tokens and calculate input splits on AM
> 2) Try using Tez Grouping of input splits instead of pig.maxCombinedSplitSize grouping.
> Generating splits in AM is supposed to give performance boost. For those case where InputFormat or OutputFormat get delegation tokens and it is not possible to do that, then have a option to generate input splits on client. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)