You are viewing a plain text version of this content. The canonical link for it is here.

Posted to commits@cassandra.apache.org by "Brandon Williams (Commented) (JIRA)" <ji...@apache.org> on 2012/01/23 20:39:41 UTC

[jira] [Commented] (CASSANDRA-2878) Allow map/reduce to use server-side query filters

    [ https://issues.apache.org/jira/browse/CASSANDRA-2878?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13191380#comment-13191380 ] 

Brandon Williams commented on CASSANDRA-2878:
---------------------------------------------

Problems running word count:

{noformat}

java.io.IOException: InvalidRequestException(why:(Expected 4 or 0 byte int (8)) [wordcount][output_words][word1] failed validation)
        at org.apache.cassandra.hadoop.ColumnFamilyRecordWriter$RangeClient.run(ColumnFamilyRecordWriter.java:308)
Caused by: InvalidRequestException(why:(Expected 4 or 0 byte int (8)) [wordcount][output_words][word1] failed validation)
        at org.apache.cassandra.thrift.Cassandra$batch_mutate_result.read(Cassandra.java:18724)
        at org.apache.thrift.TServiceClient.receiveBase(TServiceClient.java:78)
        at org.apache.cassandra.thrift.Cassandra$Client.recv_batch_mutate(Cassandra.java:862)
        at org.apache.cassandra.thrift.Cassandra$Client.batch_mutate(Cassandra.java:848)
        at org.apache.cassandra.hadoop.ColumnFamilyRecordWriter$RangeClient.run(ColumnFamilyRecordWriter.java:300)
{noformat}

Note: using https://github.com/jbellis/cassandra/tree/2878-rebased
                
> Allow map/reduce to use server-side query filters
> -------------------------------------------------
>
>                 Key: CASSANDRA-2878
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-2878
>             Project: Cassandra
>          Issue Type: New Feature
>          Components: Hadoop
>            Reporter: Mck SembWever
>            Assignee: Jonathan Ellis
>            Priority: Critical
>              Labels: hadoop
>             Fix For: 1.1
>
>
> Currently, when running a MapReduce job against data in a Cassandra data store, it reads through all the data for a particular ColumnFamily.  This could be optimized to only read through those rows that have to do with the query.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira