You are viewing a plain text version of this content. The canonical link for it is here.
Posted to mapreduce-issues@hadoop.apache.org by "Dan Ziemba (JIRA)" <ji...@apache.org> on 2018/03/23 00:30:00 UTC

[jira] [Commented] (MAPREDUCE-2226) TaggedInputSplit should be public

    [ https://issues.apache.org/jira/browse/MAPREDUCE-2226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16410619#comment-16410619 ] 

Dan Ziemba commented on MAPREDUCE-2226:
---------------------------------------

[~esammer] Is there a recommended better way to get the Path of the original InputSplit when using MR2? As a user of hadoop 2.6.0 (cloudera dist), this is still affecting me.  I have not found a better way to access this info other than reflection hackery as described in https://stackoverflow.com/a/11130420/1270447.

> TaggedInputSplit should be public
> ---------------------------------
>
>                 Key: MAPREDUCE-2226
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2226
>             Project: Hadoop Map/Reduce
>          Issue Type: Improvement
>    Affects Versions: 0.20.1
>            Reporter: E. Sammer
>            Assignee: E. Sammer
>            Priority: Major
>         Attachments: MAPREDUCE-2226.diff
>
>
> Currently it's not possible to get at the original InputSplits when using MultipleInputs. This is because TaggedInputSplit (used by DelegatingInputFormat used by MultipleInputs) is not public. This means things like the following do not work:
> {code}
> public void map(...) {
>   FileSplit fis = (FileSplit)((TaggedInputSplit) reporter.getInputSplit()).getInputSplit();
>   Path p = fis.getPath();
> }
> {code}
> This prevents users from getting at input split specific data.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: mapreduce-issues-unsubscribe@hadoop.apache.org
For additional commands, e-mail: mapreduce-issues-help@hadoop.apache.org