Posted to user@cassandra.apache.org by Ian Danforth <id...@numenta.com> on 2011/09/01 03:04:55 UTC

15 seconds to increment 17k keys?

All,

 I've got a 4 node cluster (ec2 m1.large instances, replication = 3)
that has one primary counter-type column family containing a single
column. There are millions of rows. Each operation consists of
doing a batch_insert through pycassa, which increments ~17k keys. A
majority of these keys are new in each batch.

 Each operation is taking up to 15 seconds. For our system this is a
significant bottleneck.

 Does anyone know if this write speed is expected?

Thanks in advance,

 Ian

Re: 15 seconds to increment 17k keys?

Posted by Richard Low <rl...@acunu.com>.
Assuming you have replicate_on_write enabled (which you almost
certainly do for counters), you have to do a read on a write for each
increment.  This means counter increments, even if all your data set
fits in cache, are significantly slower than normal column inserts.  I
would say ~1k increments per second is about right, although you can
probably do some tuning to improve this.
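A quick back-of-the-envelope check shows the reported numbers are consistent with that ~1k/s estimate:

```python
# Sanity check: does "17k increments in up to 15 seconds" match ~1k/s?
increments = 17_000   # keys incremented per batch (from the original post)
seconds = 15          # worst-case batch latency reported
rate = increments / seconds
print(f"{rate:.0f} increments/s")  # roughly 1133 increments/s
```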

I've also found that the pycassa client uses significant amounts of
CPU, so be careful you are not CPU bound on the client.

-- 
Richard Low
Acunu | http://www.acunu.com | @acunu

On Thu, Sep 1, 2011 at 2:31 AM, Yang <te...@gmail.com> wrote:
> 1ms per add operation is the general order of magnitude I have seen with my
> tests.
>
>
> On Wed, Aug 31, 2011 at 6:04 PM, Ian Danforth <id...@numenta.com> wrote:
>>
>> All,
>>
>>  I've got a 4 node cluster (ec2 m1.large instances, replication = 3)
>> that has one primary counter-type column family containing a single
>> column. There are millions of rows. Each operation consists of
>> doing a batch_insert through pycassa, which increments ~17k keys. A
>> majority of these keys are new in each batch.
>>
>>  Each operation is taking up to 15 seconds. For our system this is a
>> significant bottleneck.
>>
>>  Does anyone know if this write speed is expected?
>>
>> Thanks in advance,
>>
>>  Ian
>
>

Re: 15 seconds to increment 17k keys?

Posted by Richard Low <rl...@acunu.com>.
On Thu, Sep 1, 2011 at 5:16 PM, Ian Danforth <id...@numenta.com> wrote:

> Does this scale with multiples of the replication factor or directly
> with number of nodes? Or more succinctly, to double the writes per
> second into the cluster how many more nodes would I need?

The write throughput scales with number of nodes, so double to get
double the write capacity.

Increasing the replication factor in general doesn't improve
performance (and increasing without increasing number of nodes
decreases performance).  This is because operations are performed on
all available replicas (with the exception of reads with low
consistency levels and read_repair_chance < 1.0).

Note also that there is just one read per counter increment, not a
read per replica.
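A minimal sketch of that scaling rule, assuming write throughput grows roughly linearly with node count at a fixed replication factor (a rough estimate, not a guarantee):

```python
import math

def nodes_for_throughput(current_nodes, current_rate, target_rate):
    """Estimate nodes needed for target_rate, assuming linear write scaling."""
    return math.ceil(current_nodes * target_rate / current_rate)

# Doubling the ~1.1k increments/s seen on the 4-node cluster in this thread:
print(nodes_for_throughput(4, 1100, 2200))  # -> 8
```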

-- 
Richard Low
Acunu | http://www.acunu.com | @acunu

Re: 15 seconds to increment 17k keys?

Posted by Ian Danforth <id...@numenta.com>.
Does this scale with multiples of the replication factor or directly
with number of nodes? Or more succinctly, to double the writes per
second into the cluster how many more nodes would I need? (Thanks for
the note on pycassa, I've checked and it's not the limiting factor)

Ian

On Thu, Sep 1, 2011 at 3:36 AM, Richard Low <rl...@acunu.com> wrote:
> Assuming you have replicate_on_write enabled (which you almost
> certainly do for counters), you have to do a read on a write for each
> increment.  This means counter increments, even if all your data set
> fits in cache, are significantly slower than normal column inserts.  I
> would say ~1k increments per second is about right, although you can
> probably do some tuning to improve this.
>
> I've also found that the pycassa client uses significant amounts of
> CPU, so be careful you are not CPU bound on the client.
>
> --
> Richard Low
> Acunu | http://www.acunu.com | @acunu
>
> On Thu, Sep 1, 2011 at 2:31 AM, Yang <te...@gmail.com> wrote:
>> 1ms per add operation is the general order of magnitude I have seen with my
>> tests.
>>
>>
>> On Wed, Aug 31, 2011 at 6:04 PM, Ian Danforth <id...@numenta.com> wrote:
>>>
>>> All,
>>>
>>>  I've got a 4 node cluster (ec2 m1.large instances, replication = 3)
>>> that has one primary counter-type column family containing a single
>>> column. There are millions of rows. Each operation consists of
>>> doing a batch_insert through pycassa, which increments ~17k keys. A
>>> majority of these keys are new in each batch.
>>>
>>>  Each operation is taking up to 15 seconds. For our system this is a
>>> significant bottleneck.
>>>
>>>  Does anyone know if this write speed is expected?
>>>
>>> Thanks in advance,
>>>
>>>  Ian
>>
>>
>

Re: 15 seconds to increment 17k keys?

Posted by Yang <te...@gmail.com>.
1ms per add operation is the general order of magnitude I have seen with my
tests.



On Wed, Aug 31, 2011 at 6:04 PM, Ian Danforth <id...@numenta.com> wrote:

> All,
>
>  I've got a 4 node cluster (ec2 m1.large instances, replication = 3)
> that has one primary counter-type column family containing a single
> column. There are millions of rows. Each operation consists of
> doing a batch_insert through pycassa, which increments ~17k keys. A
> majority of these keys are new in each batch.
>
>  Each operation is taking up to 15 seconds. For our system this is a
> significant bottleneck.
>
>  Does anyone know if this write speed is expected?
>
> Thanks in advance,
>
>  Ian
>

Re: 15 seconds to increment 17k keys?

Posted by Oleg Anastasyev <ol...@gmail.com>.
> in the family. There are millions of rows. Each operation consists of
> doing a batch_insert through pycassa, which increments ~17k keys. A
> majority of these keys are new in each batch.
> 
>  Each operation is taking up to 15 seconds. For our system this is a
> significant bottleneck.
> 

Try splitting your batch into smaller pieces and launching them in parallel. You
may get better performance this way, because all cores are employed and there is
less copying/rebuilding of large structures inside Thrift & Cassandra. I found
that 1k rows per batch behaves better than 10k.

It is also a good idea to split the batch into slices according to the
replication strategy and send each slice directly to its natural endpoint.
This reduces the inter-node communication needed.
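The first suggestion can be sketched as follows. This is a pure-Python outline of the chunk-and-parallelize pattern; `send_batch` is a hypothetical stand-in for whatever pycassa call actually issues the counter increments, and the worker count is an assumption to be tuned:

```python
from concurrent.futures import ThreadPoolExecutor

def chunked(keys, size=1000):
    """Split a large batch of keys into fixed-size slices."""
    return [keys[i:i + size] for i in range(0, len(keys), size)]

def send_batch(chunk):
    # Hypothetical stand-in: real code would issue the counter
    # increments for `chunk` through the pycassa client here.
    return len(chunk)

keys = [f"key-{i}" for i in range(17_000)]
chunks = chunked(keys)  # 17 slices of 1,000 keys each
with ThreadPoolExecutor(max_workers=8) as pool:
    sent = sum(pool.map(send_batch, chunks))
print(len(chunks), sent)  # 17 17000
```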