You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@cassandra.apache.org by "Alex Liu (JIRA)" <ji...@apache.org> on 2013/06/14 07:47:20 UTC
[jira] [Comment Edited] (CASSANDRA-5234) Table created through CQL3
are not accessble to Pig 0.10
[ https://issues.apache.org/jira/browse/CASSANDRA-5234?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13670577#comment-13670577 ]
Alex Liu edited comment on CASSANDRA-5234 at 6/14/13 5:45 AM:
--------------------------------------------------------------
pull @ https://github.com/alexliu68/cassandra/pull/3
Use CassandraStorage for any cql3 tables, you will have composite columns in "columns" bag
Use CQL3Storage for any cql3 table.
{code}
cassandra://[username:password@]<keyspace>/<columnfamily>[?[page_size=<size>]
[&columns=<col1,col2>][&output_query=<prepared_statement>]
[&where_clause=<clause>][&split_size=<size>][&partitioner=<partitioner>]]
{code}
where
page_size is the number of cql3 rows per page (the default is 1000, it's optional)
columns is the column names for the cql3 select query, it's optional
where_clause is the user defined where clause on the indexed column, it's optional
split_size is the number of C* rows per split which can be used to tune the number of mappers
output_query is the prepared query for inserting data to cql3 table (replace the = by @ and ? by #,
because Pig can't take = and ? as url parameter values)
Output row are in the following format
{code}
(((name, value), (name, value)), (value ... value), (value...value))
{code}
where the name and value tuples are key name and value pairs.
The input schema: ((name, value), (name, value), (name, value)) where keys are in the front.
was (Author: alexliu68):
pull @ https://github.com/alexliu68/cassandra/pull/3
Use CassandraStorage for any cql3 tables, you will have composite columns in "columns" bag
Use CQL3Storage for any cql3 table.
{code}
cassandra://[username:password@]<keyspace>/<columnfamily>[?[page_size=<size>][&columns=<col1,col2>][&output_query=<prepared_statement>][&where_clause=<clause>][&split_size=<size>]]
{code}
where page_size is the number of cql3 rows per page (the default is 1000, it's optional)
columns is the column names for the cql3 select query, it's optional
where_clause is the user defined where clause on the indexed column, it's optional
split_size is the number of C* rows per split which can be used to tune the number of mappers
output_query is the prepared query for inserting data to cql3 table
The schema: ((name, value), (name, value), (name, value)) where keys are in the front.
> Table created through CQL3 are not accessble to Pig 0.10
> --------------------------------------------------------
>
> Key: CASSANDRA-5234
> URL: https://issues.apache.org/jira/browse/CASSANDRA-5234
> Project: Cassandra
> Issue Type: Bug
> Components: Hadoop
> Affects Versions: 1.2.1
> Environment: Red hat linux 5
> Reporter: Shamim Ahmed
> Fix For: 1.2.2
>
> Attachments: 5234.tx
>
>
> Hi,
> i have faced a bug when creating table through CQL3 and trying to load data through pig 0.10 as follows:
> java.lang.RuntimeException: Column family 'abc' not found in keyspace 'XYZ'
> at org.apache.cassandra.hadoop.pig.CassandraStorage.initSchema(CassandraStorage.java:1112)
> at org.apache.cassandra.hadoop.pig.CassandraStorage.setLocation(CassandraStorage.java:615).
> This effects from Simple table to table with compound key.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira