Posted to issues@tez.apache.org by "Ming Ma (JIRA)" <ji...@apache.org> on 2016/11/30 19:26:59 UTC
[jira] [Created] (TEZ-3552) Shuffle split array when size-based sorting is turned off
Ming Ma created TEZ-3552:
----------------------------
Summary: Shuffle split array when size-based sorting is turned off
Key: TEZ-3552
URL: https://issues.apache.org/jira/browse/TEZ-3552
Project: Apache Tez
Issue Type: Improvement
Reporter: Ming Ma
TEZ-3430 added the ability to skip size-based split sorting to improve job runtime. Further testing showed that, for certain inputs, the splits in the original array (before sorting) are not randomly distributed in size. So when split sorting is turned off, we should shuffle the split array instead of leaving it untouched. That will make the size distribution more even.
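
For illustration only, the proposed behavior could look roughly like the sketch below. This is not the Tez code: the class SplitOrdering, the Split stand-in, the order method and the sortBySize flag are all hypothetical names, and the real change would act on whatever split type and configuration switch TEZ-3430 introduced.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.Comparator;
import java.util.List;
import java.util.Random;

// Hypothetical sketch; none of these names come from the Tez code base.
class SplitOrdering {

    /** Stand-in for an input split; only its length matters here. */
    static final class Split {
        final String name;
        final long length;

        Split(String name, long length) {
            this.name = name;
            this.length = length;
        }
    }

    /**
     * Returns a reordered copy of the splits, leaving the input list untouched.
     * With sortBySize set, the largest splits come first. Otherwise the copy is
     * shuffled, so an input whose splits arrive clustered by size does not keep
     * that clustering.
     */
    static List<Split> order(List<Split> splits, boolean sortBySize, Random rng) {
        List<Split> result = new ArrayList<>(splits);
        if (sortBySize) {
            // Size-based ordering, largest first.
            result.sort(Comparator.comparingLong((Split s) -> s.length).reversed());
        } else {
            // Sorting is off: shuffle instead of doing nothing.
            Collections.shuffle(result, rng);
        }
        return result;
    }
}
```

Passing the Random in is a choice made for this sketch so that a fixed seed gives a reproducible order in tests; whether the actual change should seed the shuffle is not stated in this issue.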
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)