You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@cassandra.apache.org by "Oliver Zhou (JIRA)" <ji...@apache.org> on 2013/05/06 15:38:18 UTC

[jira] [Commented] (CASSANDRA-4131) Integrate Hive support to be in core cassandra

    [ https://issues.apache.org/jira/browse/CASSANDRA-4131?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13649749#comment-13649749 ] 

Oliver Zhou commented on CASSANDRA-4131:
----------------------------------------

Hi Dmitry,

I try your build with cassandra 1.2.3/hive 0.9.0, I have a issue that I always get the duplicated records in Hive.

Cassandra column family:
CREATE COLUMN FAMILY users
WITH comparator = UTF8Type
AND key_validation_class=UTF8Type
AND column_metadata = [
{column_name: full_name, validation_class: UTF8Type}
{column_name: email, validation_class: UTF8Type}
{column_name: state, validation_class: UTF8Type}
{column_name: gender, validation_class: UTF8Type}
{column_name: birth_year, validation_class: LongType}
];

Hive Table:
CREATE EXTERNAL TABLE IF NOT EXISTS
users (key string, full_name string)
STORED BY 'org.apache.hadoop.hive.cassandra.CassandraStorageHandler'
WITH SERDEPROPERTIES ("cassandra.columns.mapping" = ":key,users:full_name" , "cassandra.cf.name" = "users")
TBLPROPERTIES ("cassandra.ks.name" = "ks33");

Hive Query:
select * from users;
always return duplicated rows (one row appears twice)

select count(1) from users;
return 2 but exactly I only insert one row.

Do you have any idea why this happen?
                
> Integrate Hive support to be in core cassandra
> ----------------------------------------------
>
>                 Key: CASSANDRA-4131
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-4131
>             Project: Cassandra
>          Issue Type: Improvement
>            Reporter: Jeremy Hanna
>            Assignee: Edward Capriolo
>              Labels: hadoop, hive
>
> The standalone hive support (at https://github.com/riptano/hive) would be great to have in-tree so that people don't have to go out to github to download it and wonder if it's a left-for-dead external shim.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira