You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@cassandra.apache.org by "Brandon Williams (Updated) (JIRA)" <ji...@apache.org> on 2011/11/04 20:23:51 UTC

[jira] [Updated] (CASSANDRA-3045) Update ColumnFamilyOutputFormat to use new bulkload API

     [ https://issues.apache.org/jira/browse/CASSANDRA-3045?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Brandon Williams updated CASSANDRA-3045:
----------------------------------------

    Attachment: 0001-Remove-gossip-SS-requirement-from-BulkLoader.patch

Posting what I have with this approach, though a bit rough.  No output format yet, just converts the loader to not require gossip.  There's a catch, however.  Since FileStreamTask uses a socket for every file, when all the files are done we don't have a socket left to acknowledge the session with, so the loading hangs at 'Waiting for targets to rebuild indexes' forever.  We can do major streaming surgery on FST, or possibly wait for CASSANDRA-3112 to get past this.
                
> Update ColumnFamilyOutputFormat to use new bulkload API
> -------------------------------------------------------
>
>                 Key: CASSANDRA-3045
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-3045
>             Project: Cassandra
>          Issue Type: Improvement
>          Components: Hadoop
>            Reporter: Jonathan Ellis
>            Assignee: Brandon Williams
>            Priority: Minor
>             Fix For: 1.1
>
>         Attachments: 0001-Remove-gossip-SS-requirement-from-BulkLoader.patch
>
>
> The bulk loading interface added in CASSANDRA-1278 is a great fit for Hadoop jobs.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira