You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Samuel Guo (JIRA)" <ji...@apache.org> on 2009/08/09 08:06:15 UTC

[jira] Updated: (HIVE-705) Let Hive can analyse hbase's tables

     [ https://issues.apache.org/jira/browse/HIVE-705?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Samuel Guo updated HIVE-705:
----------------------------

    Attachment: hbase-0.19.3.jar
                hbase-0.19.3-test.jar
                HIVE-705_draft.patch

Attach my patch.

There is a little different with my previous proposal.

creating a table will be:

-------------------------------------
CREATE EXTERNAL TABLE webpages(pageURL STRING, page_content STRING, anchors MAP<STRING, STRING>)
COMMENT 'This is the pages table'
ROW FORMAT SERDE 'org.apache.hadoop.hive.serde2.HBaseSerDe'
WITH SERDEPROPERTIES (
"hbase.columns.mapping" = "contents:page_content,anchors:",
)
STORED AS HBASETABLE
LOCATION '<hbase_table_location>'
--------------------------------------

The first field defined in the hive table will be mapped to hbase's table key and the left fields will be mapped to the hbase columns specified in serde properties named "hbase.columns.mapping".

And the timestamp field is not added now. I just retrieve the latest version of each hbase cell from a hbase table now.





> Let Hive can analyse hbase's tables
> -----------------------------------
>
>                 Key: HIVE-705
>                 URL: https://issues.apache.org/jira/browse/HIVE-705
>             Project: Hadoop Hive
>          Issue Type: New Feature
>            Reporter: Samuel Guo
>         Attachments: hbase-0.19.3-test.jar, hbase-0.19.3.jar, HIVE-705_draft.patch
>
>
> Add a serde over the hbase's tables, so that hive can analyse the data stored in hbase easily.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.