You are viewing a plain text version of this content. The canonical link for it is here.
Posted to mapreduce-issues@hadoop.apache.org by "Tom White (JIRA)" <ji...@apache.org> on 2011/02/28 05:43:36 UTC

[jira] Updated: (MAPREDUCE-2226) TaggedInputSplit should be public

     [ https://issues.apache.org/jira/browse/MAPREDUCE-2226?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Tom White updated MAPREDUCE-2226:
---------------------------------

    Status: Open  (was: Patch Available)

Is it worth making TaggedInputSplit's name better as a part of making it public, e.g. MultipleInputsSplit?

Marking as open pending a new patch.


> TaggedInputSplit should be public
> ---------------------------------
>
>                 Key: MAPREDUCE-2226
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2226
>             Project: Hadoop Map/Reduce
>          Issue Type: Improvement
>    Affects Versions: 0.20.1
>            Reporter: E. Sammer
>            Assignee: E. Sammer
>         Attachments: MAPREDUCE-2226.diff
>
>
> Currently it's not possible to get at the original InputSplits when using MultipleInputs. This is because TaggedInputSplit (used by DelegatingInputFormat used by MultipleInputs) is not public. This means things like the following do not work:
> {code}
> public void map(...) {
>   FileSplit fis = (FileSplit)((TaggedInputSplit) reporter.getInputSplit()).getInputSplit();
>   Path p = fis.getPath();
> }
> {code}
> This prevents users from getting at input split specific data.

-- 
This message is automatically generated by JIRA.
-
For more information on JIRA, see: http://www.atlassian.com/software/jira