You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@cassandra.apache.org by Mark <st...@gmail.com> on 2010/10/19 17:01:25 UTC

Dumping Cassandra into Hadoop

  As the subject implies I am trying to dump Cassandra rows into Hadoop. 
What is the easiest way for me to accomplish this? Thanks.

Should I be looking into pig for something like this?

Re: Dumping Cassandra into Hadoop

Posted by Sonal Goyal <so...@gmail.com>.
Have you checked https://issues.apache.org/jira/browse/CASSANDRA-913 ?
Thanks and Regards,
Sonal

Sonal Goyal | Founder and CEO | Nube Technologies LLP
http://www.nubetech.co | http://in.linkedin.com/in/sonalgoyal





On Tue, Oct 19, 2010 at 8:31 PM, Mark <st...@gmail.com> wrote:

>  As the subject implies I am trying to dump Cassandra rows into Hadoop.
> What is the easiest way for me to accomplish this? Thanks.
>
> Should I be looking into pig for something like this?
>

Re: Dumping Cassandra into Hadoop

Posted by aaron morton <aa...@thelastpickle.com>.
Depends on what you mean by dumping into Hadoop. 

If you want to read them from a Hadoop Job then you can use either native Hadoop or Pig. See the contrib/word_count and contrib/pig examples. 

If you want to copy the data into a Hadoop File System install then I guess almost anything that can read from Cassandra and create a file should be OK. You can then copy it onto the HFS and read from there. 

Hope that helps.
Aaron


On 20 Oct 2010, at 04:01, Mark wrote:

> As the subject implies I am trying to dump Cassandra rows into Hadoop. What is the easiest way for me to accomplish this? Thanks.
> 
> Should I be looking into pig for something like this?


Re: Dumping Cassandra into Hadoop

Posted by aaron morton <aa...@thelastpickle.com>.
Depends on what you mean by dumping into Hadoop. 

If you want to read them from a Hadoop Job then you can use either native Hadoop or Pig. See the contrib/word_count and contrib/pig examples. 

If you want to copy the data into a Hadoop File System install then I guess almost anything that can read from Cassandra and create a file should be OK. You can then copy it onto the HFS and read from there. 

Hope that helps.
Aaron


On 20 Oct 2010, at 04:01, Mark wrote:

> As the subject implies I am trying to dump Cassandra rows into Hadoop. What is the easiest way for me to accomplish this? Thanks.
> 
> Should I be looking into pig for something like this?