You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@cassandra.apache.org by Sanjay Acharya <sa...@yahoo.com> on 2010/12/03 21:33:50 UTC

Cassandra Map-Reduce

We are in the process of evaluating Cassandra for our needs and we expect to use 
parallelism for analytics on collected data. The following BLOG mentions that 

Cassandra's Map-Reduce support does not support multiple column families 
http://allthingshadoop.com/2010/04/24/running-hadoop-mapreduce-with-cassandra-nosql/


Has the same proven to be a challenge or blockers for those using the Map-Reduce 
functionality? Please share your experience. Also are there plans to support 
multiple column family map-reduce functionality?


      

Re: Cassandra Map-Reduce

Posted by Jonathan Ellis <jb...@gmail.com>.
It doesn't make sense at the RecordReader layer to consume multiple
CFs.  Chaining them together is usually best left to a higher level
like Pig, although you could do it manually if you wanted to badly
enough.

On Fri, Dec 3, 2010 at 2:33 PM, Sanjay Acharya <sa...@yahoo.com> wrote:
> We are in the process of evaluating Cassandra for our needs and we expect to
> use parallelism for analytics on collected data. The following BLOG mentions
> that
> Cassandra's Map-Reduce support does not support multiple column families
> http://allthingshadoop.com/2010/04/24/running-hadoop-mapreduce-with-cassandra-nosql/
>
> Has the same proven to be a challenge or blockers for those using the
> Map-Reduce functionality? Please share your experience. Also are there plans
> to support multiple column family map-reduce functionality?
>
>
>



-- 
Jonathan Ellis
Project Chair, Apache Cassandra
co-founder of Riptano, the source for professional Cassandra support
http://riptano.com

Re: Cassandra Map-Reduce

Posted by Jeremy Hanna <je...@gmail.com>.
I created to explore doing that - it would seem like a reasonable thing to do with a batch/analytic/MR operation.  You might chime in to explain your use case on the ticket.

https://issues.apache.org/jira/browse/CASSANDRA-1821

On Dec 3, 2010, at 2:33 PM, Sanjay Acharya wrote:

> We are in the process of evaluating Cassandra for our needs and we expect to use parallelism for analytics on collected data. The following BLOG mentions that 
> Cassandra's Map-Reduce support does not support multiple column families http://allthingshadoop.com/2010/04/24/running-hadoop-mapreduce-with-cassandra-nosql/
> 
> Has the same proven to be a challenge or blockers for those using the Map-Reduce functionality? Please share your experience. Also are there plans to support multiple column family map-reduce functionality?
> 
> 
>