You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@tez.apache.org by "Mohammad Kamrul Islam (JIRA)" <ji...@apache.org> on 2014/03/01 02:59:19 UTC

[jira] [Commented] (TEZ-698) Make it easy to create and configure MRInput/MROutput/ShuffleInput/SortedOutput

    [ https://issues.apache.org/jira/browse/TEZ-698?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13916709#comment-13916709 ] 

Mohammad Kamrul Islam commented on TEZ-698:
-------------------------------------------

[~bikassaha] Would you please give more details about this JIRA?

1. What methods  are we invoking now to configure MRI/O? Does the current WordCount example have this usage? I think you want those to be combined in an utility method.

2. For "pairs like ShuffleInput/SortedOutput" what configs to generate? Any current example is using?

3. What will be the right place two host those methods?


> Make it easy to create and configure MRInput/MROutput/ShuffleInput/SortedOutput
> -------------------------------------------------------------------------------
>
>                 Key: TEZ-698
>                 URL: https://issues.apache.org/jira/browse/TEZ-698
>             Project: Apache Tez
>          Issue Type: Sub-task
>            Reporter: Bikas Saha
>
> We have moved away from MR and its not necessary for anyone to write mappers and reducers or to configure them. But MR input and output and Shuffle related inputs/outputs. Currently we have to invoke a host of methods to configure them. If we can have a single API to make these configs then it would really help. Secondly for IO pairs like ShuffleInput/SortedOutput, their configs are related (KV types e.g.) So it maybe useful to have a combined API that generates configs for both in a single API.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)