Posted to commits@cassandra.apache.org by "Paul Pak (JIRA)" <ji...@apache.org> on 2014/07/09 01:53:05 UTC

[jira] [Comment Edited] (CASSANDRA-6927) Create a CQL3 based bulk OutputFormat

    [ https://issues.apache.org/jira/browse/CASSANDRA-6927?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14055678#comment-14055678 ] 

Paul Pak edited comment on CASSANDRA-6927 at 7/8/14 11:51 PM:
--------------------------------------------------------------

For each columnFamily you want to write to, register a named output and set its schema and insert statement:
{code:java}
    MultipleOutputs.addNamedOutput(job, "myColumnFamily", CqlBulkOutputFormat.class, Object.class, List.class);
    CqlConfigHelper.setColumnFamilySchema(conf, "myColumnFamily", "CREATE TABLE myKeyspace.myColumnFamily ...");
    CqlConfigHelper.setColumnFamilyInsertStatement(conf, "myColumnFamily", "UPDATE myKeyspace.myColumnFamily SET ....");
{code}
You can then write to any of the registered columnFamilies with:
{code:java}
    MultipleOutputs multiOutputs = ...
    multiOutputs.write("myColumnFamily", null, values);
{code}
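
To illustrate the "once per columnFamily" setup, here is a rough, self-contained sketch in plain Java. It is not the patch's code and does not use Hadoop or Cassandra: the class PerColumnFamilySettings, its key prefixes, and the column family names "users" and "events" are all made up for illustration, and a simple map stands in for the job configuration. It only shows the idea that each named output carries its own schema and insert statement, looked up by name at write time.

```java
import java.util.LinkedHashMap;
import java.util.Map;

/**
 * Illustrative stand-in only. It models per-columnFamily settings keyed by
 * the named output; none of these names exist in Cassandra or Hadoop.
 */
public class PerColumnFamilySettings {
    // Hypothetical key prefixes; the actual patch may store these differently.
    public static final String SCHEMA_PREFIX = "example.schema.";
    public static final String INSERT_PREFIX = "example.insert.";

    // Stand-in for a job configuration: a flat string-to-string map.
    private final Map<String, String> conf = new LinkedHashMap<>();

    /** Records the CREATE TABLE statement for one columnFamily. */
    public void setSchema(String columnFamily, String schema) {
        conf.put(SCHEMA_PREFIX + columnFamily, schema);
    }

    /** Records the insert/update statement for one columnFamily. */
    public void setInsertStatement(String columnFamily, String statement) {
        conf.put(INSERT_PREFIX + columnFamily, statement);
    }

    /** Returns the schema registered for the columnFamily, or null if none. */
    public String schemaFor(String columnFamily) {
        return conf.get(SCHEMA_PREFIX + columnFamily);
    }

    /** Returns the statement registered for the columnFamily, or null if none. */
    public String insertStatementFor(String columnFamily) {
        return conf.get(INSERT_PREFIX + columnFamily);
    }

    public static void main(String[] args) {
        PerColumnFamilySettings settings = new PerColumnFamilySettings();
        // Repeat the setup once per columnFamily, as described above.
        for (String cf : new String[] {"users", "events"}) {
            settings.setSchema(cf, "CREATE TABLE myKeyspace." + cf + " ...");
            settings.setInsertStatement(cf, "UPDATE myKeyspace." + cf + " SET ...");
        }
        // At write time, the named output selects the matching statement.
        System.out.println(settings.insertStatementFor("events"));
    }
}
```

In the real job, the name passed to multiOutputs.write(...) plays the role of the lookup key here, which is why the same name has to be used in all three setup calls.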


was (Author: sixpak32577):
By doing:
{code:java}
    MultipleOutputs.addNamedOutput(job, "myColumnFamily", CqlBulkOutputFormat.class, Object.class, List.class);
    CqlConfigHelper.setColumnFamilySchema(conf, "myColumnFamily", "CREATE TABLE myKeyspace.myColumnFamily ...");
    CqlConfigHelper.setColumnFamilyInsertStatement(conf, "myColumnFamily", "UPDATE myKeyspace.myColumnFamily SET ....");
{code}
you'll be able to write to multiple columnFamilies by doing:
{code:java}
    MultipleOutputs multiOutputs = ...
    multiOutputs.write("myColumnFamily", null, values);
{code}

> Create a CQL3 based bulk OutputFormat
> -------------------------------------
>
>                 Key: CASSANDRA-6927
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-6927
>             Project: Cassandra
>          Issue Type: New Feature
>          Components: Hadoop
>            Reporter: Paul Pak
>            Priority: Minor
>              Labels: cql3, hadoop
>         Attachments: 6927-2.0-branch-v2.txt, trunk-6927-v3.txt, trunk-6927.txt
>
>
> This is the CQL compatible version of BulkOutputFormat.  CqlOutputFormat exists, but doesn't write SSTables directly, similar to ColumnFamilyOutputFormat for thrift.



--
This message was sent by Atlassian JIRA
(v6.2#6252)