You are viewing a plain text version of this content. The canonical link for it is here.
Posted to mapreduce-issues@hadoop.apache.org by "Neelesh Srinivas Salian (JIRA)" <ji...@apache.org> on 2015/09/24 17:32:05 UTC

[jira] [Resolved] (MAPREDUCE-2064) Tutorial should mention SetMapOutputKeyClass

     [ https://issues.apache.org/jira/browse/MAPREDUCE-2064?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Neelesh Srinivas Salian resolved MAPREDUCE-2064.
------------------------------------------------
       Resolution: Fixed
    Fix Version/s: 1.0.4

> Tutorial should mention SetMapOutputKeyClass
> --------------------------------------------
>
>                 Key: MAPREDUCE-2064
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2064
>             Project: Hadoop Map/Reduce
>          Issue Type: Improvement
>          Components: documentation
>    Affects Versions: 0.21.0
>            Reporter: Clarence Gardner
>            Priority: Minor
>              Labels: newbie
>             Fix For: 1.0.4
>
>
> The official tutorial (mapred_tutorial.html) (and all other tutorials I've seen on the web) show a program that has the same datatypes for the key/value pairs emitted by the mapper and by the reducer, and shows a configuration call to Job.setOutput{Key,Value}Class but doesn't say that it refers to both the mapper and the reducer. It sounds like it refers to the reducer output. This might be mentioned in the "Job Configuration" section. Here is a possible addition, after the "The Job is used to specify ..." paragraph.
> The job also configures the types of its key/value pairs with setOutputKeyClass(type) andsetOutputValueClass(type), which appy to both the mapper and reducer classes. If the types output by the mapper and reducer are not the same, that should be followed with setMapOutputKeyClass(type) and setMapOutputValueClass(type).
> (I'm assuming that at least a call to setOutput{Key,Value}Class is required.)



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)