Posted to user@hbase.apache.org by Software Dev <st...@gmail.com> on 2014/04/30 00:25:56 UTC

Need help with row and column design

Hey all. I have some questions regarding row key and column design.

We want to calculate some metrics based on our page views broken down
by minute, day, month and year. We also want this broken down by country
and have the ability to filter by some other attributes such as the
sex of the user or whether or not the user is logged in... or both!

So we have the initial row key design as

YYYY
YYYYMMDD
YYYYMMDDHH
YYYYMMDDHHSS

Re: Need help with row and column design

Posted by Software Dev <st...@gmail.com>.
Sorry, I thought I deleted this message as it got cut off halfway when
I was writing it. Can you please look at my other post?

On Tue, Apr 29, 2014 at 3:36 PM, Ted Yu <yu...@gmail.com> wrote:
> The initial row key design would result in hot spot (w.r.t. writes)
>
> Is user id part of row key ?
>
> Have you looked at Sematext's HBaseWD library ?
>
> Lastly, Apache Phoenix may fit your needs.

Re: Need help with row and column design

Posted by Ted Yu <yu...@gmail.com>.
The initial row key design would result in a hot spot (w.r.t. writes).

Is user id part of the row key?

Have you looked at Sematext's HBaseWD library?

Lastly, Apache Phoenix may fit your needs.
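
To illustrate the hot spot concern: keys that begin with a timestamp sort next to each other, so all current writes tend to land on the same region. One common mitigation is to prefix each key with a small bucket number ("salting"). The sketch below shows the general idea only. It is not HBaseWD's or Phoenix's API, and the names salted_key, scan_prefixes and NUM_BUCKETS, the "|" separator, and the bucket count of 16 are all assumptions made for illustration.

```python
import hashlib
from typing import List

# Hypothetical bucket count; in practice it would be chosen relative to
# the number of regions/region servers.
NUM_BUCKETS = 16


def salted_key(time_key: str, country: str, num_buckets: int = NUM_BUCKETS) -> bytes:
    """Prefix a time-based logical key with a one-byte bucket number.

    The bucket is derived from a hash of the logical key, so the same
    logical key always maps to the same row (needed for counters), while
    different keys for the same time period spread across buckets.
    """
    logical = f"{time_key}|{country}".encode("utf-8")
    bucket = hashlib.md5(logical).digest()[0] % num_buckets
    return bytes([bucket]) + logical


def scan_prefixes(time_prefix: str, num_buckets: int = NUM_BUCKETS) -> List[bytes]:
    """Return one scan prefix per bucket.

    Reading a time range from salted keys requires one prefix scan per
    bucket, with the results merged on the client side.
    """
    encoded = time_prefix.encode("utf-8")
    return [bytes([b]) + encoded for b in range(num_buckets)]


if __name__ == "__main__":
    key = salted_key("201404292225", "US")
    print(key[0], key[1:])
    print(len(scan_prefixes("20140429")))
```

The trade-off is that writes are spread out, but every range read by time now fans out into one scan per bucket.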

