You are viewing a plain text version of this content. The canonical link for it is here.
Posted to common-dev@hadoop.apache.org by "Senthil Subramanian (JIRA)" <ji...@apache.org> on 2007/06/19 02:02:26 UTC
[jira] Updated: (HADOOP-1441) Splittability of input should be
controllable by application
[ https://issues.apache.org/jira/browse/HADOOP-1441?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Senthil Subramanian updated HADOOP-1441:
----------------------------------------
Attachment: HADOOP-1441_1.patch
Added a configuration variable mapred.input.issplitable (default: true) which can be used to tell the framework not to split input files even if the underlying InputFormat is splittable.
> Splittability of input should be controllable by application
> ------------------------------------------------------------
>
> Key: HADOOP-1441
> URL: https://issues.apache.org/jira/browse/HADOOP-1441
> Project: Hadoop
> Issue Type: Improvement
> Components: mapred
> Affects Versions: 0.12.3
> Environment: ALL
> Reporter: Milind Bhandarkar
> Assignee: Milind Bhandarkar
> Fix For: 0.14.0
>
> Attachments: HADOOP-1441_1.patch
>
>
> Currently, isSplittable method of FileInputFormat always returns true. For some applications, it becomes necessary that the map task process entire file, rather than a block. Therefore, splittability of input (i.e. block-level split vs file-level-split) should be controllable by user via a configuration variable. The default could be block-level split, as is.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.