You are viewing a plain text version of this content. The canonical link for it is here.
Posted to mapreduce-issues@hadoop.apache.org by "Neelesh Srinivas Salian (JIRA)" <ji...@apache.org> on 2015/09/24 17:29:04 UTC

[jira] [Commented] (MAPREDUCE-2064) Tutorial should mention SetMapOutputKeyClass

    [ https://issues.apache.org/jira/browse/MAPREDUCE-2064?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14906496#comment-14906496 ] 

Neelesh Srinivas Salian commented on MAPREDUCE-2064:
----------------------------------------------------

The latter part of your suggestion seems to be resolved in the Word Count 1.0 and 2.0 example code on http://hadoop.apache.org/docs/r2.7.1/hadoop-mapreduce-client/hadoop-mapreduce-client-core/MapReduceTutorial.html with calls to set job.setOutputKeyClass

Closing the jira.
Thank you.

> Tutorial should mention SetMapOutputKeyClass
> --------------------------------------------
>
>                 Key: MAPREDUCE-2064
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2064
>             Project: Hadoop Map/Reduce
>          Issue Type: Improvement
>          Components: documentation
>    Affects Versions: 0.21.0
>            Reporter: Clarence Gardner
>            Priority: Minor
>              Labels: newbie
>
> The official tutorial (mapred_tutorial.html) (and all other tutorials I've seen on the web) show a program that has the same datatypes for the key/value pairs emitted by the mapper and by the reducer, and shows a configuration call to Job.setOutput{Key,Value}Class but doesn't say that it refers to both the mapper and the reducer. It sounds like it refers to the reducer output. This might be mentioned in the "Job Configuration" section. Here is a possible addition, after the "The Job is used to specify ..." paragraph.
> The job also configures the types of its key/value pairs with setOutputKeyClass(type) andsetOutputValueClass(type), which appy to both the mapper and reducer classes. If the types output by the mapper and reducer are not the same, that should be followed with setMapOutputKeyClass(type) and setMapOutputValueClass(type).
> (I'm assuming that at least a call to setOutput{Key,Value}Class is required.)



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)