Posted to dev@hbase.apache.org by Kevin Wright <ke...@gmail.com> on 2021/04/12 16:50:20 UTC

Multi-dimensional Range Queries Help

Hi!

Our application requires fast read queries that specify two ranges: one
on timestamps and another on ids. We are currently using Apache HBase as
our database, but we’re unsure how best to design the row keys and schema.
Currently, scanning over the row key (the ids) with a filter on the time
range is slower than we expect. A typical query matches about 200 rows on
the id range alone and about 10 rows on both ranges, and the table
currently holds on the order of tens of millions of rows.
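
To make the access pattern concrete, here is a minimal in-memory sketch of
it. This is not HBase client code: a sorted map stands in for a table keyed
by id, and the class name, method name, and sample data are all hypothetical.
It only illustrates that every row in the id range is read before the
timestamp check discards most of them.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.NavigableMap;
import java.util.TreeMap;

public class ScanThenFilterSketch {

    // rowsById is a stand-in for a table sorted by row key (the id);
    // the value is that row's timestamp. Both ranges are half-open.
    static List<Long> scanIdsThenFilterTime(NavigableMap<Long, Long> rowsById,
                                            long idStart, long idStop,
                                            long tsMin, long tsMax) {
        List<Long> matches = new ArrayList<>();
        // Step 1: range scan on the row key, [idStart, idStop).
        for (Map.Entry<Long, Long> row
                : rowsById.subMap(idStart, true, idStop, false).entrySet()) {
            long ts = row.getValue();
            // Step 2: the time range is checked per row, after the row is read.
            if (ts >= tsMin && ts < tsMax) {
                matches.add(row.getKey());
            }
        }
        return matches;
    }

    public static void main(String[] args) {
        // Hypothetical data: 1000 rows, timestamp = id * 10.
        NavigableMap<Long, Long> rows = new TreeMap<>();
        for (long id = 0; id < 1000; id++) {
            rows.put(id, id * 10);
        }
        // 200 rows fall in the id range; only some also fall in the time range.
        List<Long> hits = scanIdsThenFilterTime(rows, 100, 300, 1500, 1600);
        System.out.println(hits.size() + " of 200 scanned rows matched");
    }
}
```

In this toy model the cost is proportional to the number of rows in the id
range, not to the number of rows that match both ranges.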

We’re wondering whether there is something we can do to increase read
throughput with HBase (e.g., is there anything like MySQL’s composite
indexes?). We’re not sure if this is the best place to ask, but if anyone
could point us in the right direction, that would be great!

Thank you!