Posted to mapreduce-issues@hadoop.apache.org by "Junping Du (JIRA)" <ji...@apache.org> on 2017/09/11 05:13:01 UTC

[jira] [Updated] (MAPREDUCE-6864) Hadoop streaming creates 2 mappers when the input has only one block

     [ https://issues.apache.org/jira/browse/MAPREDUCE-6864?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Junping Du updated MAPREDUCE-6864:
----------------------------------
    Target Version/s: 2.8.3  (was: 2.8.1)

> Hadoop streaming creates 2 mappers when the input has only one block
> --------------------------------------------------------------------
>
>                 Key: MAPREDUCE-6864
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-6864
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: mrv2
>    Affects Versions: 2.7.3
>            Reporter: Daniel Templeton
>
> If a streaming job is run against input smaller than 2 blocks, 2 mappers are created. Both operate on the same split, and both produce (duplicate) output.  In some cases the second mapper consistently fails.  I have not seen the failure with input smaller than 10 bytes or larger than a couple of MB; I have seen it with a 4 kB input.
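
A possible aid when reproducing this (not part of the original report): streaming jobs are submitted through the old `mapred` API, whose `FileInputFormat` treats the requested map count (`mapreduce.job.maps`, default 2) as a hint when sizing splits. The Python sketch below models that arithmetic. The function name and its parameters are hypothetical, the 128 MB block size is an assumed default, and this is an assumption about why two map tasks appear, not a confirmed diagnosis. Note that it yields two distinct half-file splits, so by itself it would not account for the duplicate output described above.

```python
# Hypothetical model of old-API (org.apache.hadoop.mapred) split sizing.
# Illustration only; not code from Hadoop and not a confirmed root cause.

SPLIT_SLOP = 1.1  # tolerance: the last split may be up to 10% oversized


def old_api_splits(length, num_splits_hint=2,
                   block_size=128 * 1024 * 1024, min_size=1):
    """Return a list of (offset, size) splits for one file of `length` bytes."""
    if length == 0:
        return []  # simplification for this sketch
    min_size = max(min_size, 1)
    # The requested number of maps sets a "goal" size per split.
    goal_size = length // (num_splits_hint if num_splits_hint else 1)
    split_size = max(min_size, min(goal_size, block_size))

    splits = []
    remaining = length
    # Carve off full-size splits while more than SPLIT_SLOP splits remain.
    while remaining / split_size > SPLIT_SLOP:
        splits.append((length - remaining, split_size))
        remaining -= split_size
    # Whatever is left becomes the final split.
    if remaining != 0:
        splits.append((length - remaining, remaining))
    return splits


if __name__ == "__main__":
    print(old_api_splits(4096))                     # hint of 2 maps
    print(old_api_splits(4096, num_splits_hint=1))  # hint of 1 map
```

If this model applies, rerunning the streaming job with `-D mapreduce.job.maps=1` and comparing the number of map tasks would be one way to check whether the split hint is involved.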



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
