You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@tez.apache.org by "Rajesh Balamohan (JIRA)" <ji...@apache.org> on 2015/01/14 02:37:36 UTC
[jira] [Commented] (TEZ-1593) Refactor PipelinedSorter to remove
all MMAP based ByteBuffer references
[ https://issues.apache.org/jira/browse/TEZ-1593?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14276348#comment-14276348 ]
Rajesh Balamohan commented on TEZ-1593:
---------------------------------------
WIP patch needs a rebase to master?
> Refactor PipelinedSorter to remove all MMAP based ByteBuffer references
> -----------------------------------------------------------------------
>
> Key: TEZ-1593
> URL: https://issues.apache.org/jira/browse/TEZ-1593
> Project: Apache Tez
> Issue Type: Bug
> Affects Versions: 0.6.0
> Reporter: Gopal V
> Assignee: Gopal V
> Labels: Performance
> Attachments: TEZ-1593.1.patch, TEZ-1593.2-WIP.patch
>
>
> The current implementation of PipelinedSorter has a slow section which revolves around key comparisons - this was relevant when the implementation used direct byte buffers to back the kvbuffer.
> {code}
> kvbuffer.position(istart);
> kvbuffer.get(ki, 0, ilen);
> kvbuffer.position(jstart);
> kvbuffer.get(kj, 0, jlen);
> // sort by key
> final int cmp = comparator.compare(ki, 0, ilen, kj, 0, jlen);
> {code}
> The kvbuffer.get into the arrays ki and kj are the slowest part of the comparator operation.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)