You are viewing a plain text version of this content. The canonical link for it is here.

Posted to user@cassandra.apache.org by Jim Morrison <ji...@aninteractive.com> on 2010/11/04 22:13:31 UTC

Indexes on Columns & SubColumns Clarification

Hi,

I've been doing a lot of reading and I've one thing I'm not entirely
clear on - could someone clarify?

Q: Exactly at what point does indexing stop?

I'm trying to use cassandra to store log information that is both user
& time sensitive.

So I've a basic model like this:

detailed_log: { // supercolumnfamily (?)
	username : { // supercolum (?)
		uuid { price : 100, min	: 10, max : 500 }, // columns
		uuid { price : 100, min	: 10, max : 500 },
		uuid { price : 100, min	: 10, max : 500 },

                // ... expect 25,440 of these per username, per year
	}
}

Defined as:

<ColumnFamily Name="pricelog" ColumnType="Super"
CompareWith="UTF8Type" CompareSubcolumnsWith="TimeUUIDType"/>


I've read

http://wiki.apache.org/cassandra/CassandraLimitations and
http://wiki.apache.org/cassandra/DataModelv2#ColumnFamily_containing_SuperColumns

... and lots else I could find on t'interweb but I'm still unclear if
this is the right way to go about it!?


My query model, in pseudosql is:  (( where * is [price, min, max] etc ))

SELECT * FROM detailed_log WHERE username = 'foobar' AND uuid RANGE(
start_UUID -> end_UUID );


So, the question is, can I store my data like this, and retrieve it
efficiently, or do I need to combine my keys?

Hope I'm not being dense or asking an faq - I couldn't find a clear
answer anywhere.

Thanks!

Jim