You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@cassandra.apache.org by "Brandon Williams (Commented) (JIRA)" <ji...@apache.org> on 2011/11/02 04:06:32 UTC

[jira] [Commented] (CASSANDRA-3045) Update ColumnFamilyOutputFormat to use new bulkload API

    [ https://issues.apache.org/jira/browse/CASSANDRA-3045?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13141869#comment-13141869 ] 

Brandon Williams commented on CASSANDRA-3045:
---------------------------------------------

bq. Another idea: we could give the reducer a description of the ring, which it could plug into the partitioner/strategy to know what nodes to stream to. Gossip participation isn't technically required by the streaming protocol IIANM.

This is very close to working, the sstables can be sent without gossip or MS, but the problem lies in the StreamReply, which unfortunately goes over OTC and thus gets sent to the server, not the bulk loader. :(  I'm not sure if there's a simple way around this.
                
> Update ColumnFamilyOutputFormat to use new bulkload API
> -------------------------------------------------------
>
>                 Key: CASSANDRA-3045
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-3045
>             Project: Cassandra
>          Issue Type: Improvement
>          Components: Hadoop
>            Reporter: Jonathan Ellis
>            Assignee: Brandon Williams
>            Priority: Minor
>             Fix For: 1.1
>
>
> The bulk loading interface added in CASSANDRA-1278 is a great fit for Hadoop jobs.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira