You are viewing a plain text version of this content. The canonical link for it is here.
Posted to mapreduce-issues@hadoop.apache.org by "Mass Dosage (JIRA)" <ji...@apache.org> on 2013/07/18 11:02:49 UTC

[jira] [Commented] (MAPREDUCE-2226) TaggedInputSplit should be public

    [ https://issues.apache.org/jira/browse/MAPREDUCE-2226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13712145#comment-13712145 ] 

Mass Dosage commented on MAPREDUCE-2226:
----------------------------------------

Hey guys, this issue has bitten us too. We're currently getting around it by using reflection which is far from ideal given that we're having to do it in a mapper for each item being mapped. I agree with [~tomwhite]'s suggestion to rename the class. Is all that is missing is taking the original patch and renaming the class? If [~esammer] isn't going to do it I could have a go.
                
> TaggedInputSplit should be public
> ---------------------------------
>
>                 Key: MAPREDUCE-2226
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2226
>             Project: Hadoop Map/Reduce
>          Issue Type: Improvement
>    Affects Versions: 0.20.1
>            Reporter: E. Sammer
>            Assignee: E. Sammer
>         Attachments: MAPREDUCE-2226.diff
>
>
> Currently it's not possible to get at the original InputSplits when using MultipleInputs. This is because TaggedInputSplit (used by DelegatingInputFormat used by MultipleInputs) is not public. This means things like the following do not work:
> {code}
> public void map(...) {
>   FileSplit fis = (FileSplit)((TaggedInputSplit) reporter.getInputSplit()).getInputSplit();
>   Path p = fis.getPath();
> }
> {code}
> This prevents users from getting at input split specific data.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira