You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Jonathan Ellis (JIRA)" <ji...@apache.org> on 2010/02/21 06:12:28 UTC

[jira] Commented: (HIVE-705) Let Hive can analyse hbase's tables

    [ https://issues.apache.org/jira/browse/HIVE-705?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12836323#action_12836323 ] 

Jonathan Ellis commented on HIVE-705:
-------------------------------------

ISTM that merging the HBase columnfamilies into a single Hive table is the wrong approach and could lead to poor performance; rather, each HBase CF should be its own Hive table, which may of course be joined with others as necessary.  (I think using the word "table" for HBase's "collection of CFs" is unfortunate in the first place since they are different animals; fundamentally, the basic unit of data access in HBase is the CF.)

I'm interested because Cassandra is also looking at adding Hive support, and we also implement a ColumnFamily data model.

> Let Hive can analyse hbase's tables
> -----------------------------------
>
>                 Key: HIVE-705
>                 URL: https://issues.apache.org/jira/browse/HIVE-705
>             Project: Hadoop Hive
>          Issue Type: New Feature
>            Reporter: Samuel Guo
>            Assignee: John Sichi
>         Attachments: hbase-0.19.3-test.jar, hbase-0.19.3.jar, HIVE-705.1.patch, HIVE-705_draft.patch, HIVE-705_revision806905.patch, HIVE-705_revision883033.patch
>
>
> Add a serde over the hbase's tables, so that hive can analyse the data stored in hbase easily.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.