You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@cassandra.apache.org by "Shamim Ahmed (JIRA)" <ji...@apache.org> on 2013/05/07 09:01:20 UTC

[jira] [Created] (CASSANDRA-5544) Hadoop jobs assigns only one mapper in task

Shamim Ahmed created CASSANDRA-5544:
---------------------------------------

             Summary: Hadoop jobs assigns only one mapper in task 
                 Key: CASSANDRA-5544
                 URL: https://issues.apache.org/jira/browse/CASSANDRA-5544
             Project: Cassandra
          Issue Type: Bug
          Components: Hadoop
            Reporter: Shamim Ahmed


We have got very strange beheviour of hadoop cluster after upgrading 
Cassandra from 1.1.5 to Cassandra 1.2.1. We have 5 nodes cluster of Cassandra, where three of them are hodoop slaves. Now when we are submitting job through Pig script, only one map assigns in task running on one of the hadoop slaves regardless of 
volume of data (already tried with more than million rows).
Configure of pig as follows:
export PIG_HOME=/oracle/pig-0.10.0
export PIG_CONF_DIR=${HADOOP_HOME}/conf
export PIG_INITIAL_ADDRESS=192.168.157.103
export PIG_RPC_PORT=9160
export PIG_PARTITIONER=org.apache.cassandra.dht.Murmur3Partitioner


Also we have these following properties in hadoop:
 <property>
 <name>mapred.tasktracker.map.tasks.maximum</name>
 <value>10</value>
 </property>
 <property>
 <name>mapred.map.tasks</name>
 <value>4</value>
 </property>

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira