You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hbase.apache.org by "ramkrishna.s.vasudevan (JIRA)" <ji...@apache.org> on 2019/05/22 11:23:00 UTC

[jira] [Comment Edited] (HBASE-22448) Scan is slow for Multiple Column prefixes

    [ https://issues.apache.org/jira/browse/HBASE-22448?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16845781#comment-16845781 ] 

ramkrishna.s.vasudevan edited comment on HBASE-22448 at 5/22/19 11:22 AM:
--------------------------------------------------------------------------

Attached the output with some sysouts. Seems we are doing lot of SEEK_USING_HINTS for every column that we already visited for each for the cells. And this goes on for every column.


was (Author: ram_krish):
Attached the output with some sysouts. Seems we are doing lot of SEEK_USING_HINTS for every column that we already visited for each for the cells. And this goes on. 

> Scan is slow for Multiple Column prefixes
> -----------------------------------------
>
>                 Key: HBASE-22448
>                 URL: https://issues.apache.org/jira/browse/HBASE-22448
>             Project: HBase
>          Issue Type: Bug
>          Components: Scanners
>    Affects Versions: 1.4.8, 1.4.9
>            Reporter: Karthick
>            Assignee: Zheng Hu
>            Priority: Critical
>              Labels: prefix, scan, scanner
>             Fix For: 1.5.0, 1.4.10
>
>         Attachments: 0001-benchmark-UT.patch, HBaseFileImport.java, qualifiers.txt, scanquery.txt
>
>
> While scanning a row (around 10 lakhs columns) with 100 column prefixes, it takes around 4 seconds in hbase-1.2.5 and when the same query is executed in hbase-1.4.9 it takes around 50 seconds.
> Is there any way to optimise this?
>  
> *P.S:*
> We have applied the patch provided in [-HBASE-21620-|https://jira.apache.org/jira/browse/HBASE-21620] and  [-HBASE-21734-|https://jira.apache.org/jira/browse/HBASE-21734] . Attached *qualifiers*.*txt* file which contains the column keys. Use the *HBaseFileImport.java* file provided to populate in your table and use *scanquery.txt* to query.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)