Posted to dev@pig.apache.org by "Daniel Dai (JIRA)" <ji...@apache.org> on 2015/03/13 22:51:38 UTC
[jira] [Updated] (PIG-3751) Generating Splits in Tez should be configurable to AM or client
[ https://issues.apache.org/jira/browse/PIG-3751?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Daniel Dai updated PIG-3751:
----------------------------
Issue Type: Improvement (was: Sub-task)
Parent: (was: PIG-3446)
> Generating Splits in Tez should be configurable to AM or client
> ---------------------------------------------------------------
>
> Key: PIG-3751
> URL: https://issues.apache.org/jira/browse/PIG-3751
> Project: Pig
> Issue Type: Improvement
> Components: tez
> Reporter: Rohini Palaniswamy
> Assignee: Rohini Palaniswamy
>
> 1) TEZ-752 allows setting a list of URIs for which to get delegation tokens. Set that list so that Tez gets the delegation tokens and calculates input splits on the AM.
> 2) Try using Tez grouping of input splits instead of pig.maxCombinedSplitSize grouping.
> Generating splits on the AM is expected to give a performance boost. For cases where the InputFormat or OutputFormat gets delegation tokens itself and generating splits on the AM is therefore not possible, provide an option to generate input splits on the client.
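
For readers unfamiliar with size-based split combining, which is the kind of grouping that pig.maxCombinedSplitSize controls, here is a minimal sketch of the idea. It is only an illustration under simplifying assumptions: the function name combine_splits and its parameters are hypothetical, splits are represented by their sizes in bytes, and data locality is ignored. It is not Pig's or Tez's actual grouping code.

```python
from typing import List


def combine_splits(split_sizes: List[int], max_combined_size: int) -> List[List[int]]:
    """Greedily pack consecutive splits into groups of at most max_combined_size bytes.

    Hypothetical helper for illustration only. A split that is already
    larger than the limit ends up in a group of its own.
    """
    if max_combined_size <= 0:
        raise ValueError("max_combined_size must be positive")

    groups: List[List[int]] = []
    current: List[int] = []
    current_size = 0

    for size in split_sizes:
        # Close the current group if adding this split would exceed the limit.
        if current and current_size + size > max_combined_size:
            groups.append(current)
            current, current_size = [], 0
        current.append(size)
        current_size += size

    # Flush the last, partially filled group.
    if current:
        groups.append(current)
    return groups


if __name__ == "__main__":
    # Five input splits (sizes in bytes) combined under a 64-byte limit.
    print(combine_splits([10, 20, 30, 100, 5], 64))
```

Each resulting group would be handed to a single task, so fewer and larger groups mean fewer tasks. Tez grouping, as proposed in item 2, would replace this kind of client-side packing with grouping done by Tez itself.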
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)