You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@cassandra.apache.org by "Alex Liu (JIRA)" <ji...@apache.org> on 2013/06/14 07:47:20 UTC

[jira] [Comment Edited] (CASSANDRA-5234) Table created through CQL3 are not accessble to Pig 0.10

    [ https://issues.apache.org/jira/browse/CASSANDRA-5234?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13670577#comment-13670577 ] 

Alex Liu edited comment on CASSANDRA-5234 at 6/14/13 5:45 AM:
--------------------------------------------------------------

pull @ https://github.com/alexliu68/cassandra/pull/3

Use CassandraStorage for any cql3 tables, you will have composite columns in "columns" bag

Use CQL3Storage for any cql3 table.
{code}
cassandra://[username:password@]<keyspace>/<columnfamily>[?[page_size=<size>]
[&columns=<col1,col2>][&output_query=<prepared_statement>]
[&where_clause=<clause>][&split_size=<size>][&partitioner=<partitioner>]]
{code}

where 
  page_size is the number of cql3 rows per page (the default is 1000, it's optional)

  columns is the column names for the cql3 select query, it's optional
 
  where_clause is the user defined where clause on the indexed column, it's optional

  split_size is the number of C* rows per split which can be used to tune the number of mappers

  output_query is the prepared query for inserting data to cql3 table (replace the = by @ and ? by #,
      because Pig can't take = and ? as url parameter values)

Output row are in the following format
{code}
(((name, value), (name, value)), (value ... value), (value...value))
{code}

where the name and value tuples are key name and value pairs.


The input schema: ((name, value), (name, value), (name, value)) where keys are in the front.
                
      was (Author: alexliu68):
    pull @ https://github.com/alexliu68/cassandra/pull/3

Use CassandraStorage for any cql3 tables, you will have composite columns in "columns" bag

Use CQL3Storage for any cql3 table.
{code}
cassandra://[username:password@]<keyspace>/<columnfamily>[?[page_size=<size>][&columns=<col1,col2>][&output_query=<prepared_statement>][&where_clause=<clause>][&split_size=<size>]]
{code}

where page_size is the number of cql3 rows per page (the default is 1000, it's optional)
columns is the column names for the cql3 select query, it's optional
where_clause is the user defined where clause on the indexed column, it's optional
split_size is the number of C* rows per split which can be used to tune the number of mappers
output_query is the prepared query for inserting data to cql3 table

The schema: ((name, value), (name, value), (name, value)) where keys are in the front.
                  
> Table created through CQL3 are not accessble to Pig 0.10
> --------------------------------------------------------
>
>                 Key: CASSANDRA-5234
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-5234
>             Project: Cassandra
>          Issue Type: Bug
>          Components: Hadoop
>    Affects Versions: 1.2.1
>         Environment: Red hat linux 5
>            Reporter: Shamim Ahmed
>             Fix For: 1.2.2
>
>         Attachments: 5234.tx
>
>
> Hi,
>   i have faced a bug when creating table through CQL3 and trying to load data through pig 0.10 as follows:
> java.lang.RuntimeException: Column family 'abc' not found in keyspace 'XYZ'
> 	at org.apache.cassandra.hadoop.pig.CassandraStorage.initSchema(CassandraStorage.java:1112)
> 	at org.apache.cassandra.hadoop.pig.CassandraStorage.setLocation(CassandraStorage.java:615).
> This effects from Simple table to table with compound key. 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira