You are viewing a plain text version of this content. The canonical link for it is here.

Posted to user@cassandra.apache.org by Philippe <wa...@gmail.com> on 2011/11/15 22:16:00 UTC

Network traffic patterns

Hello,
I'm trying to understand the network usage I am seeing in my cluster, can
anyone shed some light?
It's an RF=3, 12-node, cassandra 0.8.6 cluster. The nodes are
p13,p14,p15...p24 and are consecutive in that order on the ring.
Each node is only a cassandra database. I am hitting the cluster from
another server (p4).

The pattern on p4 is the pattern is to

   1. read a lot of data (some columns for hundreds to tens of thousands of
   keys, split into 512-key multigets)
   2. process the data
   3. write back a byte array to cassandra (average size is 400 bytes)


p4 reads as

Re: Network traffic patterns

Posted by Boris Yen <yu...@gmail.com>.

I think ordered partitioner might cause most of the data to be saved only
on a few nodes. This could contribute to what you saw. Try to use random
partitioner if possible.

On Mon, Nov 21, 2011 at 6:53 AM, Philippe <wa...@gmail.com> wrote:

> I'm using BOP.
> Le 20 nov. 2011 13:09, "Boris Yen" <yu...@gmail.com> a écrit :
>
> I am just curious about which partitioner you are using?
>>
>> On Thu, Nov 17, 2011 at 4:30 PM, Philippe <wa...@gmail.com> wrote:
>>
>>> Hi Todd
>>> Yes all equal hardware. Nearly no CPU usage and no memory issues.
>>> Repairs are running in tens of minutes so i don't understand why
>>> replication would be backed up.
>>>
>>> Any other ideas?
>>> Le 17 nov. 2011 02:33, "Todd Burruss" <bb...@expedia.com> a écrit :
>>>
>>> Are all of your machines equal hardware?  Since those machines are
>>>> sending data somewhere, maybe they are behind in replicating and are
>>>> continuously catching up?
>>>>
>>>> Use a tool like tcpdump to find out where the data is going
>>>>
>>>> From: Philippe <wa...@gmail.com>
>>>> Reply-To: "user@cassandra.apache.org" <us...@cassandra.apache.org>
>>>> Date: Tue, 15 Nov 2011 13:22:38 -0800
>>>> To: user <us...@cassandra.apache.org>
>>>> Subject: Re: Network traffic patterns
>>>>
>>>> Sorry about the previous message, I've enabled keyboard shortcuts on
>>>> gmail...*sigh*...
>>>>
>>>> Hello,
>>>> I'm trying to understand the network usage I am seeing in my cluster,
>>>> can anyone shed some light?
>>>> It's an RF=3, 12-node, cassandra 0.8.6 cluster. repair is performed on
>>>> each node once a week, with a rolling schedule.
>>>> The nodes are p13,p14,p15...p24 and are consecutive in that order on
>>>> the ring. Each node is only a cassandra database. I am hitting the cluster
>>>> from another server (p4).
>>>>
>>>> p4 is doing this with 20 threads in parallel
>>>>
>>>>    1. read a lot of data (some columns for hundreds to tens of
>>>>    thousands of keys, split into 512-key multigets)
>>>>    2. process the data
>>>>    3. write back a byte array to cassandra (average size is 400 bytes)
>>>>    4. go back to 1
>>>>
>>>> According to my munin graphs, network usage is about as follows. I am
>>>> not surprised at the bias towards p13-p15 as p4 is getting & storing data
>>>> mainly for keys located on one of those nodes.
>>>>
>>>>    - p4 : 1.5Mb/s in and out
>>>>    - p13-p15 : 15Mb/s in and 80Mb/s out
>>>>    - p16-p24 : 45Mb/s in and 5Mb/s out
>>>>
>>>> What I don't understand is why p4 is only seeing 1.5Mb/s while I see
>>>> 80Mb/s on p13 & p15.
>>>>
>>>> The way I understand this:
>>>>
>>>>    - p4 makes a multiget to the cluster, electing to use any node in
>>>>    the cluster (IN traffic for describe the query)
>>>>    - coordinator node replays the query on all 3 replicas (so 3
>>>>    servers each get the IN traffic, mostly p13-p15)
>>>>    - each server replies to coordinator
>>>>    - coordinator chooses matching values and sends back data to p4
>>>>
>>>> So if p13-p15 are outputting 80Mb/s why am I not seeing 80Mb/s coming
>>>> into p4 which is on the receiving end ?
>>>>
>>>> Thanks
>>>>
>>>> 2011/11/15 Philippe <wa...@gmail.com>
>>>>
>>>>> Hello,
>>>>> I'm trying to understand the network usage I am seeing in my cluster,
>>>>> can anyone shed some light?
>>>>> It's an RF=3, 12-node, cassandra 0.8.6 cluster. The nodes are
>>>>> p13,p14,p15...p24 and are consecutive in that order on the ring.
>>>>> Each node is only a cassandra database. I am hitting the cluster from
>>>>> another server (p4).
>>>>>
>>>>> The pattern on p4 is the pattern is to
>>>>>
>>>>>    1. read a lot of data (some columns for hundreds to tens of
>>>>>    thousands of keys, split into 512-key multigets)
>>>>>    2. process the data
>>>>>    3. write back a byte array to cassandra (average size is 400 bytes)
>>>>>
>>>>>
>>>>> p4 reads as
>>>>>
>>>>
>>>>
>>

Re: Network traffic patterns

Posted by Philippe <wa...@gmail.com>.

I'm using BOP.
Le 20 nov. 2011 13:09, "Boris Yen" <yu...@gmail.com> a écrit :

> I am just curious about which partitioner you are using?
>
> On Thu, Nov 17, 2011 at 4:30 PM, Philippe <wa...@gmail.com> wrote:
>
>> Hi Todd
>> Yes all equal hardware. Nearly no CPU usage and no memory issues.
>> Repairs are running in tens of minutes so i don't understand why
>> replication would be backed up.
>>
>> Any other ideas?
>> Le 17 nov. 2011 02:33, "Todd Burruss" <bb...@expedia.com> a écrit :
>>
>> Are all of your machines equal hardware?  Since those machines are
>>> sending data somewhere, maybe they are behind in replicating and are
>>> continuously catching up?
>>>
>>> Use a tool like tcpdump to find out where the data is going
>>>
>>> From: Philippe <wa...@gmail.com>
>>> Reply-To: "user@cassandra.apache.org" <us...@cassandra.apache.org>
>>> Date: Tue, 15 Nov 2011 13:22:38 -0800
>>> To: user <us...@cassandra.apache.org>
>>> Subject: Re: Network traffic patterns
>>>
>>> Sorry about the previous message, I've enabled keyboard shortcuts on
>>> gmail...*sigh*...
>>>
>>> Hello,
>>> I'm trying to understand the network usage I am seeing in my cluster,
>>> can anyone shed some light?
>>> It's an RF=3, 12-node, cassandra 0.8.6 cluster. repair is performed on
>>> each node once a week, with a rolling schedule.
>>> The nodes are p13,p14,p15...p24 and are consecutive in that order on the
>>> ring. Each node is only a cassandra database. I am hitting the cluster from
>>> another server (p4).
>>>
>>> p4 is doing this with 20 threads in parallel
>>>
>>>    1. read a lot of data (some columns for hundreds to tens of
>>>    thousands of keys, split into 512-key multigets)
>>>    2. process the data
>>>    3. write back a byte array to cassandra (average size is 400 bytes)
>>>    4. go back to 1
>>>
>>> According to my munin graphs, network usage is about as follows. I am
>>> not surprised at the bias towards p13-p15 as p4 is getting & storing data
>>> mainly for keys located on one of those nodes.
>>>
>>>    - p4 : 1.5Mb/s in and out
>>>    - p13-p15 : 15Mb/s in and 80Mb/s out
>>>    - p16-p24 : 45Mb/s in and 5Mb/s out
>>>
>>> What I don't understand is why p4 is only seeing 1.5Mb/s while I see
>>> 80Mb/s on p13 & p15.
>>>
>>> The way I understand this:
>>>
>>>    - p4 makes a multiget to the cluster, electing to use any node in
>>>    the cluster (IN traffic for describe the query)
>>>    - coordinator node replays the query on all 3 replicas (so 3 servers
>>>    each get the IN traffic, mostly p13-p15)
>>>    - each server replies to coordinator
>>>    - coordinator chooses matching values and sends back data to p4
>>>
>>> So if p13-p15 are outputting 80Mb/s why am I not seeing 80Mb/s coming
>>> into p4 which is on the receiving end ?
>>>
>>> Thanks
>>>
>>> 2011/11/15 Philippe <wa...@gmail.com>
>>>
>>>> Hello,
>>>> I'm trying to understand the network usage I am seeing in my cluster,
>>>> can anyone shed some light?
>>>> It's an RF=3, 12-node, cassandra 0.8.6 cluster. The nodes are
>>>> p13,p14,p15...p24 and are consecutive in that order on the ring.
>>>> Each node is only a cassandra database. I am hitting the cluster from
>>>> another server (p4).
>>>>
>>>> The pattern on p4 is the pattern is to
>>>>
>>>>    1. read a lot of data (some columns for hundreds to tens of
>>>>    thousands of keys, split into 512-key multigets)
>>>>    2. process the data
>>>>    3. write back a byte array to cassandra (average size is 400 bytes)
>>>>
>>>>
>>>> p4 reads as
>>>>
>>>
>>>
>

Re: Network traffic patterns

Posted by Boris Yen <yu...@gmail.com>.

I am just curious about which partitioner you are using?

On Thu, Nov 17, 2011 at 4:30 PM, Philippe <wa...@gmail.com> wrote:

> Hi Todd
> Yes all equal hardware. Nearly no CPU usage and no memory issues.
> Repairs are running in tens of minutes so i don't understand why
> replication would be backed up.
>
> Any other ideas?
> Le 17 nov. 2011 02:33, "Todd Burruss" <bb...@expedia.com> a écrit :
>
> Are all of your machines equal hardware?  Since those machines are sending
>> data somewhere, maybe they are behind in replicating and are continuously
>> catching up?
>>
>> Use a tool like tcpdump to find out where the data is going
>>
>> From: Philippe <wa...@gmail.com>
>> Reply-To: "user@cassandra.apache.org" <us...@cassandra.apache.org>
>> Date: Tue, 15 Nov 2011 13:22:38 -0800
>> To: user <us...@cassandra.apache.org>
>> Subject: Re: Network traffic patterns
>>
>> Sorry about the previous message, I've enabled keyboard shortcuts on
>> gmail...*sigh*...
>>
>> Hello,
>> I'm trying to understand the network usage I am seeing in my cluster, can
>> anyone shed some light?
>> It's an RF=3, 12-node, cassandra 0.8.6 cluster. repair is performed on
>> each node once a week, with a rolling schedule.
>> The nodes are p13,p14,p15...p24 and are consecutive in that order on the
>> ring. Each node is only a cassandra database. I am hitting the cluster from
>> another server (p4).
>>
>> p4 is doing this with 20 threads in parallel
>>
>>    1. read a lot of data (some columns for hundreds to tens of thousands
>>    of keys, split into 512-key multigets)
>>    2. process the data
>>    3. write back a byte array to cassandra (average size is 400 bytes)
>>    4. go back to 1
>>
>> According to my munin graphs, network usage is about as follows. I am not
>> surprised at the bias towards p13-p15 as p4 is getting & storing data
>> mainly for keys located on one of those nodes.
>>
>>    - p4 : 1.5Mb/s in and out
>>    - p13-p15 : 15Mb/s in and 80Mb/s out
>>    - p16-p24 : 45Mb/s in and 5Mb/s out
>>
>> What I don't understand is why p4 is only seeing 1.5Mb/s while I see
>> 80Mb/s on p13 & p15.
>>
>> The way I understand this:
>>
>>    - p4 makes a multiget to the cluster, electing to use any node in the
>>    cluster (IN traffic for describe the query)
>>    - coordinator node replays the query on all 3 replicas (so 3 servers
>>    each get the IN traffic, mostly p13-p15)
>>    - each server replies to coordinator
>>    - coordinator chooses matching values and sends back data to p4
>>
>> So if p13-p15 are outputting 80Mb/s why am I not seeing 80Mb/s coming
>> into p4 which is on the receiving end ?
>>
>> Thanks
>>
>> 2011/11/15 Philippe <wa...@gmail.com>
>>
>>> Hello,
>>> I'm trying to understand the network usage I am seeing in my cluster,
>>> can anyone shed some light?
>>> It's an RF=3, 12-node, cassandra 0.8.6 cluster. The nodes are
>>> p13,p14,p15...p24 and are consecutive in that order on the ring.
>>> Each node is only a cassandra database. I am hitting the cluster from
>>> another server (p4).
>>>
>>> The pattern on p4 is the pattern is to
>>>
>>>    1. read a lot of data (some columns for hundreds to tens of
>>>    thousands of keys, split into 512-key multigets)
>>>    2. process the data
>>>    3. write back a byte array to cassandra (average size is 400 bytes)
>>>
>>>
>>> p4 reads as
>>>
>>
>>

Re: Network traffic patterns

Posted by Philippe <wa...@gmail.com>.

Hi Todd
Yes all equal hardware. Nearly no CPU usage and no memory issues.
Repairs are running in tens of minutes so i don't understand why
replication would be backed up.

Any other ideas?
Le 17 nov. 2011 02:33, "Todd Burruss" <bb...@expedia.com> a écrit :

> Are all of your machines equal hardware?  Since those machines are sending
> data somewhere, maybe they are behind in replicating and are continuously
> catching up?
>
> Use a tool like tcpdump to find out where the data is going
>
> From: Philippe <wa...@gmail.com>
> Reply-To: "user@cassandra.apache.org" <us...@cassandra.apache.org>
> Date: Tue, 15 Nov 2011 13:22:38 -0800
> To: user <us...@cassandra.apache.org>
> Subject: Re: Network traffic patterns
>
> Sorry about the previous message, I've enabled keyboard shortcuts on
> gmail...*sigh*...
>
> Hello,
> I'm trying to understand the network usage I am seeing in my cluster, can
> anyone shed some light?
> It's an RF=3, 12-node, cassandra 0.8.6 cluster. repair is performed on
> each node once a week, with a rolling schedule.
> The nodes are p13,p14,p15...p24 and are consecutive in that order on the
> ring. Each node is only a cassandra database. I am hitting the cluster from
> another server (p4).
>
> p4 is doing this with 20 threads in parallel
>
>    1. read a lot of data (some columns for hundreds to tens of thousands
>    of keys, split into 512-key multigets)
>    2. process the data
>    3. write back a byte array to cassandra (average size is 400 bytes)
>    4. go back to 1
>
> According to my munin graphs, network usage is about as follows. I am not
> surprised at the bias towards p13-p15 as p4 is getting & storing data
> mainly for keys located on one of those nodes.
>
>    - p4 : 1.5Mb/s in and out
>    - p13-p15 : 15Mb/s in and 80Mb/s out
>    - p16-p24 : 45Mb/s in and 5Mb/s out
>
> What I don't understand is why p4 is only seeing 1.5Mb/s while I see
> 80Mb/s on p13 & p15.
>
> The way I understand this:
>
>    - p4 makes a multiget to the cluster, electing to use any node in the
>    cluster (IN traffic for describe the query)
>    - coordinator node replays the query on all 3 replicas (so 3 servers
>    each get the IN traffic, mostly p13-p15)
>    - each server replies to coordinator
>    - coordinator chooses matching values and sends back data to p4
>
> So if p13-p15 are outputting 80Mb/s why am I not seeing 80Mb/s coming into
> p4 which is on the receiving end ?
>
> Thanks
>
> 2011/11/15 Philippe <wa...@gmail.com>
>
>> Hello,
>> I'm trying to understand the network usage I am seeing in my cluster, can
>> anyone shed some light?
>> It's an RF=3, 12-node, cassandra 0.8.6 cluster. The nodes are
>> p13,p14,p15...p24 and are consecutive in that order on the ring.
>> Each node is only a cassandra database. I am hitting the cluster from
>> another server (p4).
>>
>> The pattern on p4 is the pattern is to
>>
>>    1. read a lot of data (some columns for hundreds to tens of thousands
>>    of keys, split into 512-key multigets)
>>    2. process the data
>>    3. write back a byte array to cassandra (average size is 400 bytes)
>>
>>
>> p4 reads as
>>
>
>

Re: Network traffic patterns

Posted by Todd Burruss <bb...@expedia.com>.

Are all of your machines equal hardware?  Since those machines are sending data somewhere, maybe they are behind in replicating and are continuously catching up?

Use a tool like tcpdump to find out where the data is going

From: Philippe <wa...@gmail.com>>
Reply-To: "user@cassandra.apache.org<ma...@cassandra.apache.org>" <us...@cassandra.apache.org>>
Date: Tue, 15 Nov 2011 13:22:38 -0800
To: user <us...@cassandra.apache.org>>
Subject: Re: Network traffic patterns

Sorry about the previous message, I've enabled keyboard shortcuts on gmail...*sigh*...

Hello,
I'm trying to understand the network usage I am seeing in my cluster, can anyone shed some light?
It's an RF=3, 12-node, cassandra 0.8.6 cluster. repair is performed on each node once a week, with a rolling schedule.
The nodes are p13,p14,p15...p24 and are consecutive in that order on the ring. Each node is only a cassandra database. I am hitting the cluster from another server (p4).

p4 is doing this with 20 threads in parallel

 1.  read a lot of data (some columns for hundreds to tens of thousands of keys, split into 512-key multigets)
 2.  process the data
 3.  write back a byte array to cassandra (average size is 400 bytes)
 4.  go back to 1

According to my munin graphs, network usage is about as follows. I am not surprised at the bias towards p13-p15 as p4 is getting & storing data mainly for keys located on one of those nodes.

 *   p4 : 1.5Mb/s in and out
 *   p13-p15 : 15Mb/s in and 80Mb/s out
 *   p16-p24 : 45Mb/s in and 5Mb/s out

What I don't understand is why p4 is only seeing 1.5Mb/s while I see 80Mb/s on p13 & p15.

The way I understand this:

 *   p4 makes a multiget to the cluster, electing to use any node in the cluster (IN traffic for describe the query)
 *   coordinator node replays the query on all 3 replicas (so 3 servers each get the IN traffic, mostly p13-p15)
 *   each server replies to coordinator
 *   coordinator chooses matching values and sends back data to p4

So if p13-p15 are outputting 80Mb/s why am I not seeing 80Mb/s coming into p4 which is on the receiving end ?

Thanks

2011/11/15 Philippe <wa...@gmail.com>>
Hello,
I'm trying to understand the network usage I am seeing in my cluster, can anyone shed some light?
It's an RF=3, 12-node, cassandra 0.8.6 cluster. The nodes are p13,p14,p15...p24 and are consecutive in that order on the ring.
Each node is only a cassandra database. I am hitting the cluster from another server (p4).

The pattern on p4 is the pattern is to

 1.  read a lot of data (some columns for hundreds to tens of thousands of keys, split into 512-key multigets)
 2.  process the data
 3.  write back a byte array to cassandra (average size is 400 bytes)

p4 reads as

Re: Network traffic patterns

Posted by Philippe <wa...@gmail.com>.

Sorry about the previous message, I've enabled keyboard shortcuts on
gmail...*sigh*...

Hello,
I'm trying to understand the network usage I am seeing in my cluster, can
anyone shed some light?
It's an RF=3, 12-node, cassandra 0.8.6 cluster. repair is performed on each
node once a week, with a rolling schedule.
The nodes are p13,p14,p15...p24 and are consecutive in that order on the
ring. Each node is only a cassandra database. I am hitting the cluster from
another server (p4).

p4 is doing this with 20 threads in parallel

   1. read a lot of data (some columns for hundreds to tens of thousands of
   keys, split into 512-key multigets)
   2. process the data
   3. write back a byte array to cassandra (average size is 400 bytes)
   4. go back to 1

According to my munin graphs, network usage is about as follows. I am not
surprised at the bias towards p13-p15 as p4 is getting & storing data
mainly for keys located on one of those nodes.

   - p4 : 1.5Mb/s in and out
   - p13-p15 : 15Mb/s in and 80Mb/s out
   - p16-p24 : 45Mb/s in and 5Mb/s out

What I don't understand is why p4 is only seeing 1.5Mb/s while I see 80Mb/s
on p13 & p15.

The way I understand this:

   - p4 makes a multiget to the cluster, electing to use any node in the
   cluster (IN traffic for describe the query)
   - coordinator node replays the query on all 3 replicas (so 3 servers
   each get the IN traffic, mostly p13-p15)
   - each server replies to coordinator
   - coordinator chooses matching values and sends back data to p4

So if p13-p15 are outputting 80Mb/s why am I not seeing 80Mb/s coming into
p4 which is on the receiving end ?

Thanks

2011/11/15 Philippe <wa...@gmail.com>

> Hello,
> I'm trying to understand the network usage I am seeing in my cluster, can
> anyone shed some light?
> It's an RF=3, 12-node, cassandra 0.8.6 cluster. The nodes are
> p13,p14,p15...p24 and are consecutive in that order on the ring.
> Each node is only a cassandra database. I am hitting the cluster from
> another server (p4).
>
> The pattern on p4 is the pattern is to
>
>    1. read a lot of data (some columns for hundreds to tens of thousands
>    of keys, split into 512-key multigets)
>    2. process the data
>    3. write back a byte array to cassandra (average size is 400 bytes)
>
>
> p4 reads as
>