Posted to mapreduce-issues@hadoop.apache.org by "Allen Wittenauer (JIRA)" <ji...@apache.org> on 2014/07/21 23:54:41 UTC
[jira] [Resolved] (MAPREDUCE-194) Split Information errors when input data volumn is trivial
[ https://issues.apache.org/jira/browse/MAPREDUCE-194?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Allen Wittenauer resolved MAPREDUCE-194.
----------------------------------------
Resolution: Fixed
I'm going to mark this as stale.
> Split Information errors when input data volumn is trivial
> ----------------------------------------------------------
>
> Key: MAPREDUCE-194
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-194
> Project: Hadoop Map/Reduce
> Issue Type: Bug
> Environment: CentOS 5, hadoop-0.19.0
> Reporter: Leitao Guo
>
> The MapReduce input is a text file with only 8 lines (file path: /in_wc/pretty), and we call "conf.setNumMapTasks(8)" in the program. I expected 8 map tasks to be generated, but 9 were actually generated. The map task counters in the web UI show that map tasks 0~7 each have "Map input records 1", while map task 8 has "Map input records 0".
> The following is the map task list:
> task_200903121214_0029_m_000000 hdfs://guoleitao:9200/in_wc/pretty:0+4
> task_200903121214_0029_m_000001 hdfs://guoleitao:9200/in_wc/pretty:4+4
> task_200903121214_0029_m_000002 hdfs://guoleitao:9200/in_wc/pretty:8+4
> task_200903121214_0029_m_000003 hdfs://guoleitao:9200/in_wc/pretty:12+4
> task_200903121214_0029_m_000004 hdfs://guoleitao:9200/in_wc/pretty:16+4
> task_200903121214_0029_m_000005 hdfs://guoleitao:9200/in_wc/pretty:20+4
> task_200903121214_0029_m_000006 hdfs://guoleitao:9200/in_wc/pretty:24+4
> task_200903121214_0029_m_000007 hdfs://guoleitao:9200/in_wc/pretty:28+4
> task_200903121214_0029_m_000008 hdfs://guoleitao:9200/in_wc/pretty:32+4
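
A possible explanation for the ninth task, offered as a sketch rather than a confirmed diagnosis: the listing implies a 36-byte file (the last split is 32+4). If the split size is derived by integer division of the total size by the requested number of maps, then 36 / 8 = 4, and 36 bytes cut into 4-byte pieces gives 9 splits. The code below models that arithmetic. The class SplitSketch and the method computeSplits are hypothetical names; the formula and the 1.1 slop factor reflect my understanding of the old org.apache.hadoop.mapred.FileInputFormat and are not copied from the hadoop-0.19.0 source. The 64 MB block size and minimum split size of 1 are assumptions as well.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not Hadoop source: models the assumed split arithmetic.
final class SplitSketch {
    // Assumed slop factor: the final split may be up to 10% larger than splitSize.
    static final double SPLIT_SLOP = 1.1;

    /** Returns {offset, length} pairs for a single file of totalSize bytes. */
    static List<long[]> computeSplits(long totalSize, int numSplits,
                                      long blockSize, long minSize) {
        // Integer division truncates: 36 / 8 = 4, not 4.5.
        long goalSize = totalSize / (numSplits == 0 ? 1 : numSplits);
        // minSize is assumed to be at least 1, so splitSize is never 0.
        long splitSize = Math.max(minSize, Math.min(goalSize, blockSize));

        List<long[]> splits = new ArrayList<>();
        long bytesRemaining = totalSize;
        // Emit full-size splits while more than 1.1 split sizes remain.
        while ((double) bytesRemaining / splitSize > SPLIT_SLOP) {
            splits.add(new long[] {totalSize - bytesRemaining, splitSize});
            bytesRemaining -= splitSize;
        }
        // Whatever is left becomes the final split.
        if (bytesRemaining != 0) {
            splits.add(new long[] {totalSize - bytesRemaining, bytesRemaining});
        }
        return splits;
    }

    public static void main(String[] args) {
        // 36 bytes is inferred from the listing; 64 MB and 1 are assumed defaults.
        for (long[] s : computeSplits(36, 8, 64L * 1024 * 1024, 1)) {
            System.out.println(s[0] + "+" + s[1]); // offset+length, as in the task list
        }
    }
}
```

Under these assumptions the requested map count acts as a hint for the split size rather than a hard limit on the number of tasks. A 32-byte file with the same settings would divide evenly and yield exactly 8 splits.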
--
This message was sent by Atlassian JIRA
(v6.2#6252)