You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@hbase.apache.org by Ramasubramanian <ra...@gmail.com> on 2014/05/23 22:07:57 UTC

What is the best database to handle large volume of data

Hi,

Request your advice and suggestions on deciding what database can we consider other than oracle to store huge volume of transactional data. It is expected to get around 100 millions of data in a day and we need to keep this in the database for any updates not less than 3 months. There is a portal which shows details which these data. 

So here the volume is too large which oracle cannot handle. 

Pls suggest what is the next data that we should consider. Will be helpful if you could state a rough write & read speeds. 

Note : after 'n' months the data will be moved to Hadoop (any other options?) for analytics with tableau as BI tool. 

Regards,
Rams



Re: What is the best database to handle large volume of data

Posted by Liam Slusser <ls...@gmail.com>.
Just to counter...Oracle can handle that type of load.  My company has an
extremely large DW in Oracle RDBMS, we add close to 100 million records as
day and have three years worth of history.  The hardware it runs on isn't
as impressive as you might think and all works very well.  The key in
having an extremely large Oracle database is understanding your data and
the star-relationships of that data so queries can execute quickly since
you are somewhat limited on system resources.

Hadoop is better at storing everything and the kitchen sink and being able
to do ad-hawk queries against it...but Oracle works great with structured
data.

And since this is a HBASE mailing list, we push probably around 200m
records into hbase a day (all things logs) and are able to do queries
against it.  It works really well.  The key is having a well designed key
so you can find things quickly.

thanks,
liam


On Fri, May 23, 2014 at 1:07 PM, Ramasubramanian <
ramasubramanian.narayanan@gmail.com> wrote:

>
> So here the volume is too large which oracle cannot handle.
>
>

Re: What is the best database to handle large volume of data

Posted by Wilm Schumacher <wi...@cawoom.com>.
Hi,

your question is very general and hard to answer regarding the lack of
essential information.

However, based on my assumption on what you are trying to do I would
recommend cassandra and materialized views for your portal (if the
questions are pre-computable) and indices (if the questions are
foreseeable).

On the other hand: if the questions to your data (i.e. your portal) are
more complex and user driven, hbase would be the method of choice (map
reduce)

Cassandra is said to be faster on writes. But 100 million insertions of
reasonable rows should be easy to manage for both db systems, even if
your cluster is very small.

However, if your data stream (and deletions) is constant (thus the
database size isn't growing), and your data is very complex, couchDB
(with the bigCouch extension) could be fine for your.

But my guess for you would be cassandra

Best wishes

Wilm

ps: I hope I will not get slapped for recommending something else than
HBase on this list ;)

pps: @Ramasubramanian: My ansers should be reviewed critically. I'm not
an "noSQL" expert. I ran very small hbase and cassandra clusters and a
very small mongoDB. So if an expert gives another answer ... go with it!

Am 23.05.2014 22:44, schrieb Ramasubramanian:
> Hi,
> 
> Just to add: there will be heavy writes and updates.
> 
> Regards, Rams
> 
> 
>> On 24-May-2014, at 1:37 am, Ramasubramanian
>> <ra...@gmail.com> wrote:
>> 
>> Hi,
>> 
>> Request your advice and suggestions on deciding what database can
>> we consider other than oracle to store huge volume of transactional
>> data. It is expected to get around 100 millions of data in a day
>> and we need to keep this in the database for any updates not less
>> than 3 months. There is a portal which shows details which these
>> data.
>> 
>> So here the volume is too large which oracle cannot handle.
>> 
>> Pls suggest what is the next data that we should consider. Will be
>> helpful if you could state a rough write & read speeds.
>> 
>> Note : after 'n' months the data will be moved to Hadoop (any other
>> options?) for analytics with tableau as BI tool.
>> 
>> Regards, Rams
>> 
>> 
> 

Re: What is the best database to handle large volume of data

Posted by Ramasubramanian <ra...@gmail.com>.
Hi,

Just to add: there will be heavy writes and updates. 

Regards,
Rams


> On 24-May-2014, at 1:37 am, Ramasubramanian <ra...@gmail.com> wrote:
> 
> Hi,
> 
> Request your advice and suggestions on deciding what database can we consider other than oracle to store huge volume of transactional data. It is expected to get around 100 millions of data in a day and we need to keep this in the database for any updates not less than 3 months. There is a portal which shows details which these data. 
> 
> So here the volume is too large which oracle cannot handle. 
> 
> Pls suggest what is the next data that we should consider. Will be helpful if you could state a rough write & read speeds. 
> 
> Note : after 'n' months the data will be moved to Hadoop (any other options?) for analytics with tableau as BI tool. 
> 
> Regards,
> Rams
> 
>