Posted to issues@hbase.apache.org by "huaxiang sun (JIRA)" <ji...@apache.org> on 2016/11/11 04:48:59 UTC

[jira] [Comment Edited] (HBASE-17043) parallelize select() work in mob compaction

    [ https://issues.apache.org/jira/browse/HBASE-17043?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15648370#comment-15648370 ] 

huaxiang sun edited comment on HBASE-17043 at 11/11/16 4:48 AM:
----------------------------------------------------------------

With 700k files, the select logic took 6 to 7 seconds to finish. That is negligible compared with the file compaction itself (I/O), but I will still look at how to reduce those 6 to 7 seconds.


was (Author: huaxiang):
For 70k files, found that it took 6 ~ 7 seconds to finish the select logic. Compared with the file compact (I/O), it is nothing, still will see how to speed up to reduce this 6 ~ 7 seconds time.

> parallelize select() work in mob compaction
> -------------------------------------------
>
>                 Key: HBASE-17043
>                 URL: https://issues.apache.org/jira/browse/HBASE-17043
>             Project: HBase
>          Issue Type: Improvement
>          Components: mob
>    Affects Versions: 2.0.0
>            Reporter: huaxiang sun
>            Assignee: huaxiang sun
>            Priority: Minor
>
> Today in https://github.com/apache/hbase/blob/master/hbase-server/src/main/java/org/apache/hadoop/hbase/mob/compactions/PartitionedMobCompactor.java#L141, select() is single-threaded. Given a large number of files, it takes several seconds to finish. I will look at how this work can be divided to speed up the processing.
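
One possible way to divide the work described above, as a rough sketch only: split the file list into slices, let each worker thread assign its slice to partitions, and merge into a shared concurrent map. This is not the actual PartitionedMobCompactor code. The class name, the select() signature, and the rule of using a fixed-length file-name prefix as the partition key are all assumptions made for illustration.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

// Hypothetical sketch; none of these names come from HBase.
final class ParallelSelectSketch {

  // Assumption: a file's partition is identified by a fixed-length name prefix.
  static String partitionKey(String fileName, int keyLength) {
    return fileName.length() <= keyLength ? fileName : fileName.substring(0, keyLength);
  }

  // Groups file names by partition key using a fixed pool of worker threads.
  static Map<String, List<String>> select(List<String> files, int keyLength, int threads) {
    Map<String, List<String>> partitions = new ConcurrentHashMap<>();
    ExecutorService pool = Executors.newFixedThreadPool(threads);
    try {
      // Ceiling division so that every file lands in exactly one slice.
      int chunk = (files.size() + threads - 1) / threads;
      List<Future<?>> pending = new ArrayList<>();
      for (int start = 0; start < files.size(); start += chunk) {
        List<String> slice = files.subList(start, Math.min(start + chunk, files.size()));
        pending.add(pool.submit(() -> {
          for (String f : slice) {
            // computeIfAbsent is atomic on ConcurrentHashMap; the list is synchronized
            // because several workers may append to the same partition.
            partitions
                .computeIfAbsent(partitionKey(f, keyLength),
                    k -> Collections.synchronizedList(new ArrayList<>()))
                .add(f);
          }
        }));
      }
      // Wait for every slice to finish before returning the result.
      for (Future<?> p : pending) {
        p.get();
      }
    } catch (InterruptedException | ExecutionException e) {
      throw new IllegalStateException("parallel select failed", e);
    } finally {
      pool.shutdown();
    }
    return partitions;
  }
}
```

In this sketch the order of files inside a partition depends on thread scheduling, so any later step that needs a stable order would have to sort each list.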


