You are viewing a plain text version of this content. The canonical link for it is here.

Posted to user@cassandra.apache.org by A J <s5...@gmail.com> on 2012/03/13 18:35:50 UTC

Why is row lookup much faster than column lookup

>From my tests, I am seeing that a CF that has less than 100 columns
but millions of rows has a much lower latency to read a column in a
row than a CF that has only a few thousands of rows but wide rows with
each having 20K columns.

Example:
cf1 has 6 Million rows and each row has about 100 columns.
t1 = time.time()
cf1.get(1234,column_count=1)
t2 = time.time() - t1
print int(t2*1000)
takes 3 ms

cf2 has 5K rows and each row has about 18K columns.
t1 = time.time()
cf2.get(1234,column_count=1)
t2 = time.time() - t1
print int(t2*1000)
takes 82ms

Anything in general on the Cassandra architecture that causes row
lookup to be much faster than column lookup ?

Thanks.