You are viewing a plain text version of this content. The canonical link for it is here.
Posted to common-user@hadoop.apache.org by ravikumar visweswara <ta...@gmail.com> on 2011/12/23 16:13:54 UTC

cassandra data to hadoop

Hello All,

I have a situation to dump cassandra data to hadoop cluster for further
analytics. Lot of other relevant data which is not present in cassandra is
already available in hdfs for analysis. Both are independent clusters right
now.
Is there a suggested way to get the data periodically or continuously to
HDFS from cassandra? Any ideas or references will be very helpful for me.

Thanks and Regards
R

Re: cassandra data to hadoop

Posted by ravikumar visweswara <ta...@gmail.com>.
Thank you for your reference. I have looked at Brisk. In our situation both
are disconnected clusters for various reasons and using different
distributions (i.e cloudera). Is there any other/similar way to inject data
to HDFS

R

On Fri, Dec 23, 2011 at 7:34 AM, Sanjeev Verma <sa...@gmail.com>wrote:

> Hey Ravi:
>
> Hadoop newbie here, so pardon me if I am pointing out the obvious - have
> you taken a look at this link -
> http://wiki.apache.org/cassandra/HadoopSupport
>
> Looks like Cassandra 0.6 onwards supports output to mapreduce.
>
> Regards
> Sanjeev
>
> On Fri, 2011-12-23 at 07:13 -0800, ravikumar visweswara wrote:
> > Hello All,
> >
> > I have a situation to dump cassandra data to hadoop cluster for further
> > analytics. Lot of other relevant data which is not present in cassandra
> is
> > already available in hdfs for analysis. Both are independent clusters
> right
> > now.
> > Is there a suggested way to get the data periodically or continuously to
> > HDFS from cassandra? Any ideas or references will be very helpful for me.
> >
> > Thanks and Regards
> > R
>
>
>

Re: cassandra data to hadoop

Posted by Sanjeev Verma <sa...@gmail.com>.
Hey Ravi:

Hadoop newbie here, so pardon me if I am pointing out the obvious - have
you taken a look at this link -
http://wiki.apache.org/cassandra/HadoopSupport

Looks like Cassandra 0.6 onwards supports output to mapreduce.

Regards
Sanjeev

On Fri, 2011-12-23 at 07:13 -0800, ravikumar visweswara wrote:
> Hello All,
> 
> I have a situation to dump cassandra data to hadoop cluster for further
> analytics. Lot of other relevant data which is not present in cassandra is
> already available in hdfs for analysis. Both are independent clusters right
> now.
> Is there a suggested way to get the data periodically or continuously to
> HDFS from cassandra? Any ideas or references will be very helpful for me.
> 
> Thanks and Regards
> R