You are viewing a plain text version of this content. The canonical link for it is here.
Posted to common-dev@hadoop.apache.org by "Pete Wyckoff (JIRA)" <ji...@apache.org> on 2008/09/19 03:28:44 UTC

[jira] Commented: (HADOOP-3566) Create an InputFormat for reading lines of text as Java Strings

    [ https://issues.apache.org/jira/browse/HADOOP-3566?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12632482#action_12632482 ] 

Pete Wyckoff commented on HADOOP-3566:
--------------------------------------

If we had a LineReader Serialization implementation (HADOOP-4203)  that returns Strings that could be plugged into a flat file input format (HADOOP-4065), you would have this inputformat, albeit with the important problem of the signature being <LongWritable, String>. 

Could we not modify 4065 to accommodate this use case or am i missing something?
 

> Create an InputFormat for reading lines of text as Java Strings
> ---------------------------------------------------------------
>
>                 Key: HADOOP-3566
>                 URL: https://issues.apache.org/jira/browse/HADOOP-3566
>             Project: Hadoop Core
>          Issue Type: Improvement
>          Components: mapred
>            Reporter: Tom White
>            Assignee: Tom White
>             Fix For: 0.19.0
>
>         Attachments: hadoop-3566-v2.patch, hadoop-3566-v3.patch, hadoop-3566-v4.patch, hadoop-3566.patch
>
>
> Such a StringInputFormat would be like TextInputFormat but with input types of Long and String, rather than LongWritable and Text. This would allow users to write MapReduce programs that used only Java native types (i.e. no Writables).
> This is currently not possible to write without changes to Hadoop due to a limitation in the RecordReader interface explained here: https://issues.apache.org/jira/browse/HADOOP-3413?focusedCommentId=12597935#action_12597935

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.