You are viewing a plain text version of this content. The canonical link for it is here.
Posted to mapreduce-dev@hadoop.apache.org by "Sreekanth Ramakrishnan (JIRA)" <ji...@apache.org> on 2009/10/07 10:28:31 UTC

[jira] Created: (MAPREDUCE-1073) Progress reported for pipes tasks is incorrect.

Progress reported for pipes tasks is incorrect.
-----------------------------------------------

                 Key: MAPREDUCE-1073
                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1073
             Project: Hadoop Map/Reduce
          Issue Type: Bug
          Components: pipes
            Reporter: Sreekanth Ramakrishnan


Currently in pipes, {{org.apache.hadoop.mapred.pipes.PipesMapRunner.run(RecordReader<K1, V1>, OutputCollector<K2, V2>, Reporter)}} we do the following:
{code}
        while (input.next(key, value)) {
          downlink.mapItem(key, value);
          if(skipping) {
            downlink.flush();
          }
        }
{code}

This would result in consumption of all the records for current task and taking task progress to 100% whereas the actual pipes application would be trailing behind. 

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.