You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@drill.apache.org by Plamen Paskov <pl...@next-stream.com> on 2016/05/04 13:36:34 UTC

drill and hbase

Hi folks,
I'm trying to use apache drill + hbase for the following scenario. I 
have to create an events analytics system which is basically an api that 
will accept events and store them inside hbase table and after that i 
have to run funnel queries over the data.I will need to support 
different type of events each with it's own subset of parameters. I 
generated a test table with 75M rows and imported it in hbase. Now i'm 
trying to partition the events table by event_type with this command:

USE hbase;

CREATE TABLE events_part (user_id, type, timestamp, browser, 
browser_version) PARTITION BY (type) AS SELECT e.generic.user_id, 
e.generic.type FROM events AS e;

but i receive this error: *PARSE ERROR: Unable to create or drop 
tables/views. Schema [hbase] is immutable.*

I read that for hbase schema it's not possible to define workspaces and 
mark them as writable. How can i workaround this situation? I need to 
partition the data by event type because i'm expecting a lot of 
information to be stored in the table and query the data with *WHERE 
event_type = 'some_event_type'*.

Thanks in advance !

Re: drill and hbase

Posted by Steven Phillips <st...@dremio.com>.
No one has yet implemented an hbase writer in Drill. Without that, it is
not possible to write into an hbase table.

I don't know if anyone currently plans to work on this. If this something
you are interested in taking on, I can point you in the right direction.

On Wed, May 4, 2016 at 6:36 AM, Plamen Paskov <plamen.paskov@next-stream.com
> wrote:

> Hi folks,
> I'm trying to use apache drill + hbase for the following scenario. I have
> to create an events analytics system which is basically an api that will
> accept events and store them inside hbase table and after that i have to
> run funnel queries over the data.I will need to support different type of
> events each with it's own subset of parameters. I generated a test table
> with 75M rows and imported it in hbase. Now i'm trying to partition the
> events table by event_type with this command:
>
> USE hbase;
>
> CREATE TABLE events_part (user_id, type, timestamp, browser,
> browser_version) PARTITION BY (type) AS SELECT e.generic.user_id,
> e.generic.type FROM events AS e;
>
> but i receive this error: *PARSE ERROR: Unable to create or drop
> tables/views. Schema [hbase] is immutable.*
>
> I read that for hbase schema it's not possible to define workspaces and
> mark them as writable. How can i workaround this situation? I need to
> partition the data by event type because i'm expecting a lot of information
> to be stored in the table and query the data with *WHERE event_type =
> 'some_event_type'*.
>
> Thanks in advance !
>