You are viewing a plain text version of this content. The canonical link for it is here.

Posted to mapreduce-issues@hadoop.apache.org by "Pranay Varma (JIRA)" <ji...@apache.org> on 2013/11/20 23:59:35 UTC

[jira] [Created] (MAPREDUCE-5635) FileInputFormat does not specify how the file is split

Pranay Varma created MAPREDUCE-5635:
---------------------------------------

             Summary: FileInputFormat does not specify how the file is split
                 Key: MAPREDUCE-5635
                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5635
             Project: Hadoop Map/Reduce
          Issue Type: Bug
    Affects Versions: 2.2.0
         Environment: Does not matter.
            Reporter: Pranay Varma




Here is what the TextInputFormat javadoc says:
[TextInputFormat|http://hadoop.apache.org/docs/current/api/org/apache/hadoop/mapreduce/lib/input/TextInputFormat.html]

An InputFormat for plain text files. Files are broken into lines. Either linefeed or carriage-return are used to signal end of line. Keys are the position in the file, and values are the line of text..

FileInputFormat should say the same on
[FileInputFormat|http://hadoop.apache.org/docs/current/api/org/apache/hadoop/mapreduce/lib/input/FileInputFormat.html]





--
This message was sent by Atlassian JIRA
(v6.1#6144)