Posted to user@cassandra.apache.org by DENIZ <ni...@gmail.com> on 2015/01/13 14:58:43 UTC

Storing PDF data on Cassandra db

Hi All,

I want to store PDF documents in Cassandra. What is the best way to store this type of data in Cassandra? How can I insert a PDF file into the database and select it back? If possible, could you explain with sample CQL statements?

Thanks in Advance
Nil

Re: Storing PDF data on Cassandra db

Posted by Jonathan Haddad <jo...@jonhaddad.com>.
For a new user, there's no point in learning Thrift if that user intends to
upgrade past the version they start with.  Thrift is a deprecated
protocol and no new functionality is going into it.  In 3.0 the
sstable format is being upgraded to work primarily with native CQL
partitions / rows, with a translation layer to work with Thrift for
compatibility.

If you're simply storing k->v, Thrift has no advantage
over CQL.

CREATE TABLE pdf ( pdf_id uuid primary key, content blob );

will work just fine.  Adding additional fields as the system changes is
convenient and works with any tool that reads CQL and won't require custom
deserialization or any other nonsense.
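
Since the original question asked for sample statements, here is a minimal
sketch of an insert and a select against that table. The UUID and the blob
value are made-up placeholders, not real data: the blob is only the eight
ASCII bytes "%PDF-1.4" written as a hex literal, standing in for a whole file.

```cql
-- Insert one row. A CQL blob literal is hex digits prefixed with 0x.
-- 0x255044462d312e34 is the ASCII text "%PDF-1.4" (placeholder content).
INSERT INTO pdf (pdf_id, content)
VALUES (123e4567-e89b-12d3-a456-426655440000, 0x255044462d312e34);

-- Read the bytes back by primary key.
SELECT content FROM pdf
WHERE pdf_id = 123e4567-e89b-12d3-a456-426655440000;
```

From application code you would normally bind the file's bytes to a prepared
statement, INSERT INTO pdf (pdf_id, content) VALUES (?, ?), rather than build
a hex literal by hand.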

Jon

On Tue Jan 13 2015 at 6:14:10 AM Peter Lin <wo...@gmail.com> wrote:

>
> You want to store the raw bytes, so look at examples for saving raw bytes.
>
> I generally recommend using Thrift if you're going to do a lot of
> reading and writing of binary data. CQL is good for primitive types, and
> maps/lists of primitive types. I'm biased, but it's simpler and easier to
> use Thrift for storing file attachments.

Re: Storing PDF data on Cassandra db

Posted by Peter Lin <wo...@gmail.com>.
You want to store the raw bytes, so look at examples for saving raw bytes.

I generally recommend using Thrift if you're going to do a lot of
reading and writing of binary data. CQL is good for primitive types, and
maps/lists of primitive types. I'm biased, but it's simpler and easier to
use Thrift for storing file attachments.
