Posted to issues@tajo.apache.org by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2015/10/29 09:40:27 UTC

[jira] [Commented] (TAJO-1271) Improve memory usage in HashShuffleFileWriteExec

    [ https://issues.apache.org/jira/browse/TAJO-1271?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14980056#comment-14980056 ] 

ASF GitHub Bot commented on TAJO-1271:
--------------------------------------

GitHub user jinossy opened a pull request:

    https://github.com/apache/tajo/pull/837

    TAJO-1271: Improve memory usage in HashShuffleFileWriteExec

    

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/jinossy/tajo TAJO-1271_1

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/tajo/pull/837.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #837
    
----
commit 00fe7c284b75bc69755a922144f3c9e784b887cb
Author: Jinho Kim <jh...@apache.org>
Date:   2015-10-29T05:58:39Z

    TAJO-1271: Improve memory usage in HashShuffleFileWriteExec

commit 8768d2f85e25ce0f5a12bae0a2c557d15ab59c41
Author: Jinho Kim <jh...@apache.org>
Date:   2015-10-29T06:10:04Z

    cleanup

commit 2dce0f44868f62df2970526d5608ef6b1b4f711e
Author: Jinho Kim <jh...@apache.org>
Date:   2015-10-29T08:02:47Z

    fix buffer overflow

----


> Improve memory usage in HashShuffleFileWriteExec
> ------------------------------------------------
>
>                 Key: TAJO-1271
>                 URL: https://issues.apache.org/jira/browse/TAJO-1271
>             Project: Tajo
>          Issue Type: Improvement
>          Components: Data Shuffle
>    Affects Versions: 0.9.0
>            Reporter: Jinho Kim
>            Assignee: Jinho Kim
>
> Currently, HashShuffleFileWriteExec keeps a list of cloned tuples and writes the tuples by count. This affects JVM memory usage.
> We should improve it as follows:
> * Keep the tuples in off-heap and reuse the row batch
> * Asynchronously write the hash partitions
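
The two proposed improvements above can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the code in the pull request: the class name AsyncPartitionWriter, the length-prefixed record layout, and the two-buffer swap are all hypothetical and are not taken from Tajo. Serialized tuples are copied into a direct (off-heap) ByteBuffer that is reused across flushes, and a full buffer is handed to a background thread for writing while appends continue into the spare buffer.

```java
import java.io.IOException;
import java.nio.ByteBuffer;
import java.nio.channels.FileChannel;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardOpenOption;
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

/** Hypothetical sketch of one hash partition's writer; not a Tajo class. */
class AsyncPartitionWriter implements AutoCloseable {
    private ByteBuffer active;   // off-heap buffer currently receiving tuples
    private ByteBuffer spare;    // off-heap buffer being written, or idle
    private final FileChannel channel;
    private final ExecutorService io;
    // At most one write is in flight; starts out already completed.
    private Future<?> pending = CompletableFuture.completedFuture(null);

    AsyncPartitionWriter(Path file, int capacityBytes, ExecutorService io) throws IOException {
        // Direct buffers live outside the JVM heap and are allocated only once.
        this.active = ByteBuffer.allocateDirect(capacityBytes);
        this.spare = ByteBuffer.allocateDirect(capacityBytes);
        this.channel = FileChannel.open(file, StandardOpenOption.CREATE,
                StandardOpenOption.WRITE, StandardOpenOption.TRUNCATE_EXISTING);
        this.io = io;
    }

    /** Copies one serialized tuple into the off-heap buffer as [length][bytes]. */
    void append(byte[] tuple) throws Exception {
        int needed = Integer.BYTES + tuple.length;
        if (needed > active.capacity()) {
            throw new IllegalArgumentException("tuple larger than buffer");
        }
        if (active.remaining() < needed) {
            flush(); // buffer is full: hand it to the I/O thread and swap
        }
        active.putInt(tuple.length).put(tuple);
    }

    /** Submits the filled buffer for writing and continues with the spare one. */
    void flush() throws Exception {
        pending.get(); // wait for the previous write so the spare buffer is free
        final ByteBuffer full = active;
        full.flip();
        active = spare;
        active.clear(); // reuse the buffer instead of allocating a new one
        spare = full;
        pending = io.submit(() -> {
            while (full.hasRemaining()) {
                channel.write(full);
            }
            return null;
        });
    }

    @Override
    public void close() throws Exception {
        flush();
        pending.get();
        channel.close();
    }

    public static void main(String[] args) throws Exception {
        Path file = Files.createTempFile("partition", ".bin");
        ExecutorService io = Executors.newSingleThreadExecutor();
        try (AsyncPartitionWriter writer = new AsyncPartitionWriter(file, 16, io)) {
            for (String s : new String[] {"abc", "def", "ghi"}) {
                writer.append(s.getBytes(StandardCharsets.UTF_8));
            }
        }
        io.shutdown();
        System.out.println("bytes written: " + Files.size(file));
    }
}
```

Waiting on the previous write before swapping keeps at most one write in flight per partition, so writes reach the file in append order and memory use stays bounded at two fixed-size buffers regardless of how many tuples pass through.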



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)