You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@tez.apache.org by "Tsuyoshi Ozawa (JIRA)" <ji...@apache.org> on 2015/07/08 04:28:04 UTC
[jira] [Created] (TEZ-2604) PipelinedSorter doesn't use number of
items when creating SortSpan
Tsuyoshi Ozawa created TEZ-2604:
-----------------------------------
Summary: PipelinedSorter doesn't use number of items when creating SortSpan
Key: TEZ-2604
URL: https://issues.apache.org/jira/browse/TEZ-2604
Project: Apache Tez
Issue Type: Bug
Reporter: Tsuyoshi Ozawa
Assignee: Tsuyoshi Ozawa
{quote}
int items = 1024*1024;
int perItem = 16;
if(span.length() != 0) {
items = span.length();
perItem = span.kvbuffer.limit()/items;
items = (int) ((span.capacity)/(METASIZE+perItem));
if(items > 1024*1024) {
// our goal is to have 1M splits and sort early
items = 1024*1024;
}
}
Preconditions.checkArgument(listIterator.hasNext(), "block iterator should not be empty");
span = new SortSpan((ByteBuffer)listIterator.next().clear(), (1024*1024),
perItem, ConfigUtils.getIntermediateOutputKeyComparator(this.conf));
{quote}
Should we use items instead of (1024*1024)?
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)