You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@cassandra.apache.org by Aliou SOW <al...@hotmail.com> on 2012/04/11 11:42:52 UTC
INserting data in Cassandra
Hello
all,
We would like to
adopt Cassandra solution
for storing our biological data which
are essentially microarray data.
These data, formatted in text tabulated files, are in the form:
Sample 1 sample
2 … sample n
probe 1 value1 value 2 … …
probe
2
…
probe n
In fact the probes can vary from one chip to another
and you can have chips with more than one million probes,
the samples by cons
vary from one project to another,
the values are floats.
So we would like to represent our column families like they are
formatted in files with the names of the probes (which are unique) as the key,
and the names of the samples as column names and float values as column values
:)
To insert data,
I just created such for example a keyspace testKS
and a column
family testCF but
just by defining the
key (because the column names
vary), then a
file jason for insertion:
{"probe1" :{
"sample 1":value1(float),
"sample 2":value2(float),
…,
"sample n":value n(float)
},
"probe2" :{
"sample 1":value1(float),
"sample 2":value2(float),
…,
"sample n":value n(float)
}
}
And I used the
tool json2sstable, but that does
not work, I
always have an error:
java.lang.RuntimeException:
Can't write Super columns to the Standard Column Family.
So I have two
questions:
1) What I did
wrong, must I define the complete structure of my column family before
using json2sstable or is it the structure of my json file which is not good?
2) Otherwise what would be the best way
to proceed for insertion based on the data I dispose?
We use Cassandra 1.0.8.
Any help would be welcome.
Thanks.
RE: INserting data in Cassandra
Posted by Aliou SOW <al...@hotmail.com>.
Thanks :)
But finally i used Hector and it works fine :D
Date: Wed, 11 Apr 2012 17:19:15 +0200
From: bernarpa@gmail.com
To: user@cassandra.apache.org
Subject: Re: INserting data in Cassandra
On 04/11/12 11:42, Aliou SOW wrote:
And I used
the
tool json2sstable, but that does
not work,
I
always have an error:
java.lang.RuntimeException:
Can't
write Super columns to the Standard Column Family.
So I have
two
questions:
1) What I did
wrong, must I define the complete structure of my
column family before
using json2sstable or is it the structure of my json file
which is not good?
I think that you just have to specify column_type = 'super' when
creating your column family (see
http://stackoverflow.com/questions/6835183/set-super-column-family-in-cassandra-cli)
Paolo
Re: INserting data in Cassandra
Posted by Paolo Bernardi <be...@gmail.com>.
On 04/11/12 11:42, Aliou SOW wrote:
>
> And I used the tool json2sstable, but that does not work, I always
> have an error:
>
> java.lang.RuntimeException: Can't write Super columns to the Standard
> Column Family.
>
> So I have two questions:
> 1) What I did wrong, must I define the complete structure of my column
> family before using json2sstable or is it the structure of my json
> file which is not good?
>
I think that you just have to specify column_type = 'super' when
creating your column family (see
http://stackoverflow.com/questions/6835183/set-super-column-family-in-cassandra-cli)
Paolo
RE: INserting data in Cassandra
Posted by Aliou SOW <al...@hotmail.com>.
Hello,
Any help Or idea?
Thanks
From: alioujigga@hotmail.com
To: user@cassandra.apache.org
Subject: INserting data in Cassandra
Date: Wed, 11 Apr 2012 09:42:52 +0000
Hello
all,
We would like to
adopt Cassandra solution
for storing our biological data which
are essentially microarray data.
These data, formatted in text tabulated files, are in the form:
Sample 1 sample
2 … sample n
probe 1 value1 value 2 … …
probe
2
…
probe n
In fact the probes can vary from one chip to another
and you can have chips with more than one million probes,
the samples by cons
vary from one project to another,
the values are floats.
So we would like to represent our column families like they are
formatted in files with the names of the probes (which are unique) as the key,
and the names of the samples as column names and float values as column values
:)
To insert data,
I just created such for example a keyspace testKS
and a column
family testCF but
just by defining the
key (because the column names
vary), then a
file jason for insertion:
{"probe1" :{
"sample 1":value1(float),
"sample 2":value2(float),
…,
"sample n":value n(float)
},
"probe2" :{
"sample 1":value1(float),
"sample 2":value2(float),
…,
"sample n":value n(float)
}
}
And I used the
tool json2sstable, but that does
not work, I
always have an error:
java.lang.RuntimeException:
Can't write Super columns to the Standard Column Family.
So I have two
questions:
1) What I did
wrong, must I define the complete structure of my column family before
using json2sstable or is it the structure of my json file which is not good?
2) Otherwise what would be the best way
to proceed for insertion based on the data I dispose?
We use Cassandra 1.0.8.
Any help would be welcome.
Thanks.