You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@cassandra.apache.org by Sagar Kohli <sa...@impetus.co.in> on 2011/03/17 09:28:29 UTC

hadoop cassandra

hi all,

is there any example of hadoop and cassandra integration where input is from hdfs and out put to cassandra

NOTE: i have gone through word count example provided with the source code, but it does not have above case..


regards
Sagar

________________________________

Are you exploring a Big Data Strategy ? Listen to this recorded webinar on Planning your Hadoop/ NoSQL projects for 2011 at www.impetus.com/featured_webinar?eventid=37

Follow us on www.twitter.com/impetuscalling or visit www.impetus.com to know more.


NOTE: This message may contain information that is confidential, proprietary, privileged or otherwise protected by law. The message is intended solely for the named addressee. If received in error, please destroy and notify the sender. Any use of this email is prohibited when received in error. Impetus does not represent, warrant and/or guarantee, that the integrity of this communication has been maintained nor that the communication is free of errors, virus, interception or interference.

RE: hadoop cassandra

Posted by Sagar Kohli <sa...@impetus.co.in>.
thanks Jeremy, its good pointer to start with

regards
Sagar
________________________________________
From: Jeremy Hanna [jeremy.hanna1234@gmail.com]
Sent: Thursday, March 17, 2011 7:34 PM
To: user@cassandra.apache.org
Subject: Re: hadoop cassandra

You can start with a word count example that's only for hdfs.  Then you can replace the reducer in that with the ReducerToCassandra that's in the cassandra word_count example.  You need to match up your Mapper's output to the Reducer's input and set a couple of configuration variables to tell it how to hook up to cassandra, but that should be it - a working word count example that takes input from hdfs and outputs to cassandra.

We kind of figured that plenty of documentation was out there for hadoop with hdfs.  The word count example just demonstrates something specific to cassandra.  However hadoop is so pluggable that as long as the input and output types line up, you can mix and match most anything with the inputformat and outputformat (like in word count you can output to cassandra or to the local filesystem - there are two different inner classes).

Does that help?

Jeremy

On Mar 17, 2011, at 3:28 AM, Sagar Kohli wrote:

> hi all,
>
> is there any example of hadoop and cassandra integration where input is from hdfs and out put to cassandra
>
> NOTE: i have gone through word count example provided with the source code, but it does not have above case..
>
>
> regards
> Sagar
>
>
> Are you exploring a Big Data Strategy ? Listen to this recorded webinar on Planning your Hadoop/ NoSQL projects for 2011 at www.impetus.com/featured_webinar?eventid=37
>
> Follow us on www.twitter.com/impetuscalling or visit www.impetus.com to know more.
>
>
> NOTE: This message may contain information that is confidential, proprietary, privileged or otherwise protected by law. The message is intended solely for the named addressee. If received in error, please destroy and notify the sender. Any use of this email is prohibited when received in error. Impetus does not represent, warrant and/or guarantee, that the integrity of this communication has been maintained nor that the communication is free of errors, virus, interception or interference.


________________________________

Are you exploring a Big Data Strategy ? Listen to this recorded webinar on Planning your Hadoop/ NoSQL projects for 2011 at www.impetus.com/featured_webinar?eventid=37

Follow us on www.twitter.com/impetuscalling or visit www.impetus.com to know more.


NOTE: This message may contain information that is confidential, proprietary, privileged or otherwise protected by law. The message is intended solely for the named addressee. If received in error, please destroy and notify the sender. Any use of this email is prohibited when received in error. Impetus does not represent, warrant and/or guarantee, that the integrity of this communication has been maintained nor that the communication is free of errors, virus, interception or interference.

Re: hadoop cassandra

Posted by Jeremy Hanna <je...@gmail.com>.
You can start with a word count example that's only for hdfs.  Then you can replace the reducer in that with the ReducerToCassandra that's in the cassandra word_count example.  You need to match up your Mapper's output to the Reducer's input and set a couple of configuration variables to tell it how to hook up to cassandra, but that should be it - a working word count example that takes input from hdfs and outputs to cassandra.

We kind of figured that plenty of documentation was out there for hadoop with hdfs.  The word count example just demonstrates something specific to cassandra.  However hadoop is so pluggable that as long as the input and output types line up, you can mix and match most anything with the inputformat and outputformat (like in word count you can output to cassandra or to the local filesystem - there are two different inner classes).

Does that help?

Jeremy

On Mar 17, 2011, at 3:28 AM, Sagar Kohli wrote:

> hi all,
> 
> is there any example of hadoop and cassandra integration where input is from hdfs and out put to cassandra
> 
> NOTE: i have gone through word count example provided with the source code, but it does not have above case..
> 
> 
> regards
> Sagar
> 
> 
> Are you exploring a Big Data Strategy ? Listen to this recorded webinar on Planning your Hadoop/ NoSQL projects for 2011 at www.impetus.com/featured_webinar?eventid=37 
> 
> Follow us on www.twitter.com/impetuscalling or visit www.impetus.com to know more. 
> 
> 
> NOTE: This message may contain information that is confidential, proprietary, privileged or otherwise protected by law. The message is intended solely for the named addressee. If received in error, please destroy and notify the sender. Any use of this email is prohibited when received in error. Impetus does not represent, warrant and/or guarantee, that the integrity of this communication has been maintained nor that the communication is free of errors, virus, interception or interference.