Posted to common-user@hadoop.apache.org by Alexandros Konstantinakis - Karmis <ae...@ece.ntua.gr> on 2010/06/04 13:30:21 UTC
Profiling hadoop application/MultipleOutputs
Hello everyone.
I just have a couple of short questions. I am using the new API.
a) I need to profile my application. I am aware of the setProfileEnabled
method in JobConf; has this been ported to the new API yet?
I have used PerfAnal in the past with great success for non-Hadoop Java
applications. Is it possible to use that to read the profile data?
b) I need to output 1 file per reducer. I understand this is possible
with multipleOutputs or MultiOutputFormat, neither of which has been
ported to the new API. Is my best choice to use this patch
https://issues.apache.org/jira/browse/MAPREDUCE-370 ?
Thanks in advance.
Re: Profiling hadoop application/MultipleOutputs
Posted by Alex Kozlov <al...@cloudera.com>.
On Fri, Jun 4, 2010 at 4:30 AM, Alexandros Konstantinakis - Karmis <
aeolist@ece.ntua.gr> wrote:
>
>
> b) I need to output 1 file per reducer. I understand this is possible with
> multipleOutputs or MultiOutputFormat, neither of which has been ported to
> the new API. Is my best choice to use this patch
> https://issues.apache.org/jira/browse/MAPREDUCE-370 ?
>
>
Hi Alexander,
Each reducer writes one output file by default. MultipleOutputs has been
backported to CDH2 <http://archive.cloudera.com/cdh/2/>.
Alex K
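On question (a), which the reply above does not cover: one possible workaround,
sketched here under assumptions, is to set the profiling properties directly
instead of calling the JobConf setters. The property names below are the ones
the 0.20-era JobConf profiling setters are understood to write; check them
against your Hadoop version. ProfileFlags and its methods are hypothetical
helpers invented for this illustration and are not part of Hadoop. The sketch
only builds the key/value pairs and renders them as -D generic options.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical helper (not a Hadoop class): collects the configuration
// properties assumed to control task profiling in 0.20-era Hadoop.
class ProfileFlags {

    // Build the profiling properties. mapRange and reduceRange are task-id
    // ranges such as "0-2"; only tasks in those ranges would be profiled.
    static Map<String, String> profilingOptions(String mapRange, String reduceRange) {
        Map<String, String> opts = new LinkedHashMap<>();
        opts.put("mapred.task.profile", "true");            // assumed key behind setProfileEnabled
        opts.put("mapred.task.profile.maps", mapRange);     // assumed key for the map task range
        opts.put("mapred.task.profile.reduces", reduceRange); // assumed key for the reduce task range
        return opts;
    }

    // Render the properties as "-D key=value" argument pairs.
    static List<String> toGenericOptions(Map<String, String> opts) {
        List<String> args = new ArrayList<>();
        for (Map.Entry<String, String> e : opts.entrySet()) {
            args.add("-D");
            args.add(e.getKey() + "=" + e.getValue());
        }
        return args;
    }

    public static void main(String[] argv) {
        // Print the flags for profiling map tasks 0-1 and reduce task 0.
        System.out.println(String.join(" ", toGenericOptions(profilingOptions("0-1", "0"))));
    }
}
```

The -D flags are only picked up if the driver parses generic options, for
example by running through ToolRunner. Alternatively, a new-API driver could
set the same keys on job.getConfiguration() before submitting the job. Whether
PerfAnal can read the resulting profile files is not settled in this thread.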