You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@cassandra.apache.org by AJ <aj...@dude.podzone.net> on 2011/06/14 23:33:11 UTC

Where is my data?

Is there an official deterministic formula to compute the various 
subsets of a given cluster that comprises a complete set of data 
(redundant rows ok)?  IOW, if multiple nodes become unavailable one at a 
time, at what point can I say <100% of my data is available?

Obviously, the method would have to take into consideration the ring 
layout along with the partition type, the # of nodes, 
replication_factor, replication strat, etc..

Thanks!

Re: Where is my data?

Posted by aaron morton <aa...@thelastpickle.com>.
I wrote a blog post about this sort of thing the other day 
http://thelastpickle.com/2011/06/13/Down-For-Me/

Let me know if you spot any problems. 

Cheers

-----------------
Aaron Morton
Freelance Cassandra Developer
@aaronmorton
http://www.thelastpickle.com

On 16 Jun 2011, at 02:20, AJ wrote:

> Thanks
> 
> On 6/15/2011 3:20 AM, Sylvain Lebresne wrote:
>> You can use the thrift call describe_ring(). It will returns a map
>> that associate to each range of the
>> ring who is a replica. Once any range has all it's endpoint
>> unavailable, that range of the data is unavailable.
>> 
>> --
>> Sylvain
>> 
> 


Re: Where is my data?

Posted by AJ <aj...@dude.podzone.net>.
Thanks

On 6/15/2011 3:20 AM, Sylvain Lebresne wrote:
> You can use the thrift call describe_ring(). It will returns a map
> that associate to each range of the
> ring who is a replica. Once any range has all it's endpoint
> unavailable, that range of the data is unavailable.
>
> --
> Sylvain
>


Re: Where is my data?

Posted by Sylvain Lebresne <sy...@datastax.com>.
You can use the thrift call describe_ring(). It will returns a map
that associate to each range of the
ring who is a replica. Once any range has all it's endpoint
unavailable, that range of the data is unavailable.

--
Sylvain

On Tue, Jun 14, 2011 at 11:33 PM, AJ <aj...@dude.podzone.net> wrote:
> Is there an official deterministic formula to compute the various subsets of
> a given cluster that comprises a complete set of data (redundant rows ok)?
>  IOW, if multiple nodes become unavailable one at a time, at what point can
> I say <100% of my data is available?
>
> Obviously, the method would have to take into consideration the ring layout
> along with the partition type, the # of nodes, replication_factor,
> replication strat, etc..
>
> Thanks!
>