Posted to issues@tez.apache.org by "Siddharth Seth (JIRA)" <ji...@apache.org> on 2015/10/10 00:46:05 UTC
[jira] [Created] (TEZ-2879) While grouping splits, allow an alternate list of preferred locations to be provided per split
Siddharth Seth created TEZ-2879:
-----------------------------------
Summary: While grouping splits, allow an alternate list of preferred locations to be provided per split
Key: TEZ-2879
URL: https://issues.apache.org/jira/browse/TEZ-2879
Project: Apache Tez
Issue Type: Improvement
Reporter: Siddharth Seth
Assignee: Siddharth Seth
Split locations - at least for FileInputSplits - are generally tied to the location on HDFS where the split resides.
There are situations in which this location is not necessarily the best place to process the split, e.g.:
- Clusters where compute and storage are separate.
- Systems which cache data, where cache affinity matters more than HDFS locality.
Providing an alternate list of preferred locations allows grouping to the preferred locations, instead of always grouping based on the locations specified in the split.
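As a rough illustration of the idea (not Tez code, and not the Tez grouping algorithm), the sketch below groups splits by an alternate location list when one is supplied, and falls back to the split's own locations otherwise. The names `Split`, `group_splits` and the `provider` callback are hypothetical and exist only for this example. Grouping by the first preferred host is a deliberate simplification; a real grouper would also weigh group sizes, rack locality and so on.

```python
from collections import defaultdict
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional, Sequence, Tuple


@dataclass(frozen=True)
class Split:
    """Hypothetical stand-in for an input split; not a Tez or Hadoop class."""
    path: str
    locations: Tuple[str, ...]  # hosts reported by the split itself


# Hypothetical hook: returns alternate preferred hosts for a split,
# or None / an empty sequence to fall back to the split's own locations.
LocationProvider = Callable[[Split], Optional[Sequence[str]]]


def group_splits(
    splits: Sequence[Split],
    provider: Optional[LocationProvider] = None,
) -> Dict[str, List[Split]]:
    """Group splits by their first preferred host (simplified)."""
    groups: Dict[str, List[Split]] = defaultdict(list)
    for split in splits:
        alternate = provider(split) if provider is not None else None
        hosts = list(alternate) if alternate else list(split.locations)
        key = hosts[0] if hosts else "*"  # "*" means no locality preference
        groups[key].append(split)
    return dict(groups)


splits = [
    Split("/data/a", ("dn1", "dn2")),
    Split("/data/b", ("dn1", "dn3")),
    Split("/data/c", ("dn2", "dn3")),
]

# Assumed cache placement: /data/a and /data/c are cached on host "cache1".
cache_hosts = {"/data/a": ["cache1"], "/data/c": ["cache1"]}

by_split_location = group_splits(splits)
by_cache_affinity = group_splits(splits, lambda s: cache_hosts.get(s.path))
```

With the split's own locations, /data/a and /data/b land in the "dn1" group and /data/c in "dn2". With the alternate list, /data/a and /data/c are grouped on "cache1", while /data/b, which has no alternate, stays on "dn1".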
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)