You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tajo.apache.org by "Jinho Kim (JIRA)" <ji...@apache.org> on 2013/12/16 13:34:07 UTC

[jira] [Comment Edited] (TAJO-421) Improve split for compression file

    [ https://issues.apache.org/jira/browse/TAJO-421?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13848996#comment-13848996 ] 

Jinho Kim edited comment on TAJO-421 at 12/16/13 12:32 PM:
-----------------------------------------------------------

Thank you for nice finding.
In the current implementation, compression text file only support non-split
it can't use disk volume scheduling. but If compressed file size less than a block size, we can use volume scheduling.
{noformat}
ex) test.snappy

hdfs block size : 64MB
disk volume : 1
file size <= 64 MB
{noformat}


was (Author: jhkim):
Thank you for nice finding.
In the current implementation, compression text file only support non-split
it can't use disk volume scheduling. but If compressed file size less than a block size, we can use volume scheduling.
{noformat}
ex)

block size : 64MB
disk volume : 1
file size >= 64 MB
{noformat}

> Improve split for compression file
> ----------------------------------
>
>                 Key: TAJO-421
>                 URL: https://issues.apache.org/jira/browse/TAJO-421
>             Project: Tajo
>          Issue Type: Improvement
>          Components: storage
>            Reporter: Jinho Kim
>            Assignee: Jinho Kim
>         Attachments: TAJO-421.patch, TAJO-421_2.patch
>
>
> If compressed file size less than the hdfs block size, the volume information can use.



--
This message was sent by Atlassian JIRA
(v6.1.4#6159)