Posted to commits@cassandra.apache.org by "Wojciech Meler (JIRA)" <ji...@apache.org> on 2011/07/04 10:28:22 UTC

[jira] [Commented] (CASSANDRA-2527) Add ability to snapshot data as input to hadoop jobs

    [ https://issues.apache.org/jira/browse/CASSANDRA-2527?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13059356#comment-13059356 ] 

Wojciech Meler commented on CASSANDRA-2527:
-------------------------------------------

It would be great to have more generic client access to snapshot data. Maybe snapshots should be visible as new keyspaces? Or maybe we should throw away snapshots and start cloning keyspaces instead? If a cloned keyspace could be made read-only, it would work out of the box :).

> Add ability to snapshot data as input to hadoop jobs
> ----------------------------------------------------
>
>                 Key: CASSANDRA-2527
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-2527
>             Project: Cassandra
>          Issue Type: Improvement
>            Reporter: Jeremy Hanna
>              Labels: hadoop
>
> It is desirable to have immutable inputs to hadoop jobs for the duration of the job.  That way, re-execution of individual tasks does not alter the output.  One way to accomplish this would be to snapshot the data that is used as input to a job.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira