Posted to issues@tez.apache.org by "Rajesh Balamohan (JIRA)" <ji...@apache.org> on 2014/11/06 02:37:34 UTC

[jira] [Resolved] (TEZ-1733) TezMerger should sort FileChunks on size when merging

     [ https://issues.apache.org/jira/browse/TEZ-1733?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Rajesh Balamohan resolved TEZ-1733.
-----------------------------------
          Resolution: Fixed
       Fix Version/s: 0.6.0
    Target Version/s:   (was: 0.5.2)
        Hadoop Flags: Reviewed

>>
Commit: 93687aeec59263d949e8a66d6d78f812b93f3bdf
>>
Committed to master.  Thanks [~pramachandran] and [~gopalv]

> TezMerger should sort FileChunks on size when merging
> -----------------------------------------------------
>
>                 Key: TEZ-1733
>                 URL: https://issues.apache.org/jira/browse/TEZ-1733
>             Project: Apache Tez
>          Issue Type: Bug
>    Affects Versions: 0.5.2
>            Reporter: Gopal V
>            Assignee: Prakash Ramachandran
>            Priority: Critical
>             Fix For: 0.6.0
>
>         Attachments: TEZ-1733.1.patch, TEZ-1733.1.patch, TEZ-1733.2.patch, TEZ-1733.3.patch
>
>
> MAPREDUCE-3685 fixed the Merger sort order for file chunks to use the decompressed size, to cut down on CPU and IO costs.
> TezMerger needs an equivalent sorted TreeSet which sorts the data by size.
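As a rough illustration of the request above, here is a minimal Java sketch of a TreeSet that orders merge inputs by size. It is not the committed patch: the ChunkSortSketch and Chunk classes, their fields, and the sortBySize helper are hypothetical stand-ins, and the size field is only assumed to hold the decompressed length. The comparator breaks ties on the path, because a TreeSet discards any element that compares as equal to one it already holds.

```java
import java.util.Arrays;
import java.util.Comparator;
import java.util.List;
import java.util.TreeSet;

// Hypothetical sketch; these are NOT the actual TezMerger or FileChunk classes.
final class ChunkSortSketch {

    // Stand-in for a merge input: a location plus its size in bytes.
    static final class Chunk {
        final String path;
        final long size; // assumed to be the decompressed length

        Chunk(String path, long size) {
            this.path = path;
            this.size = size;
        }
    }

    // Order by size first, then by path. Without the tie-breaker, two chunks
    // of equal size would compare as 0 and the TreeSet would keep only one.
    static final Comparator<Chunk> BY_SIZE =
        Comparator.comparingLong((Chunk c) -> c.size)
                  .thenComparing((Chunk c) -> c.path);

    // Returns the chunks in ascending size order.
    static TreeSet<Chunk> sortBySize(List<Chunk> chunks) {
        TreeSet<Chunk> sorted = new TreeSet<>(BY_SIZE);
        sorted.addAll(chunks);
        return sorted;
    }

    public static void main(String[] args) {
        List<Chunk> chunks = Arrays.asList(
            new Chunk("spill_2.out", 4096L),
            new Chunk("spill_0.out", 512L),
            new Chunk("spill_1.out", 512L));

        // Iteration visits the smallest chunks first.
        for (Chunk c : sortBySize(chunks)) {
            System.out.println(c.size + " " + c.path);
        }
    }
}
```

With this ordering a merger can pick its inputs from the front of the set, smallest first, which is the behaviour the description asks TezMerger to share with the MapReduce Merger.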



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)