Posted to mapreduce-issues@hadoop.apache.org by "Antonio Piccolboni (JIRA)" <ji...@apache.org> on 2014/07/23 00:44:40 UTC

[jira] [Commented] (MAPREDUCE-1767) Streaming infrastructures should provide statistics about job

    [ https://issues.apache.org/jira/browse/MAPREDUCE-1767?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14071061#comment-14071061 ] 

Antonio Piccolboni commented on MAPREDUCE-1767:
-----------------------------------------------

Such as?

> Streaming infrastructures should provide statistics about job
> -------------------------------------------------------------
>
>                 Key: MAPREDUCE-1767
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1767
>             Project: Hadoop Map/Reduce
>          Issue Type: Improvement
>          Components: contrib/streaming
>            Reporter: arkady borkovsky
>
> This should include
> -- the commands (mapper and reducer commands) executed
> -- time information (e.g. min, max, and avg start time, end time, elapsed time for tasks, total elapsed time)
> -- sizes -- bytes and records, min, max, avg per task and total, input and output
> -- information about input and output data sets (all output data sets, if there are several)
> -- all user counters (when they are implemented for streaming)
> The information should be stored in a file -- e.g. in the working directory from which the job was launched, with a name derived from the job name.
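
As a rough illustration of the summary requested above, here is a minimal Python sketch. It assumes per-task records are already available as plain dictionaries. The field names (start, end, input_bytes, and so on), the helper names, and the ".stats.json" suffix are all hypothetical; none of this is an existing Hadoop Streaming API, and user counters are left out.

```python
import json
import os
import re
from statistics import mean


def spread(values):
    """Return min, max, and avg for a non-empty list of numbers."""
    return {"min": min(values), "max": max(values), "avg": mean(values)}


def sizes(values):
    """Like spread(), plus the total across all tasks."""
    result = spread(values)
    result["total"] = sum(values)
    return result


def job_stats(job_name, mapper_cmd, reducer_cmd, tasks):
    """Build a per-job summary from hypothetical per-task records."""
    starts = [t["start"] for t in tasks]
    ends = [t["end"] for t in tasks]
    summary = {
        "job_name": job_name,
        "mapper": mapper_cmd,
        "reducer": reducer_cmd,
        "start_time": spread(starts),
        "end_time": spread(ends),
        "elapsed": spread([t["end"] - t["start"] for t in tasks]),
        # Wall-clock span from the first task start to the last task end.
        "total_elapsed": max(ends) - min(starts),
    }
    for field in ("input_bytes", "output_bytes",
                  "input_records", "output_records"):
        summary[field] = sizes([t[field] for t in tasks])
    return summary


def stats_filename(job_name):
    """Derive a file name from the job name (assumed naming scheme)."""
    return re.sub(r"[^A-Za-z0-9._-]+", "_", job_name) + ".stats.json"


def write_stats(summary, directory=None):
    """Write the summary as JSON, by default into the current directory."""
    directory = directory or os.getcwd()
    path = os.path.join(directory, stats_filename(summary["job_name"]))
    with open(path, "w") as handle:
        json.dump(summary, handle, indent=2)
    return path
```

Calling write_stats(job_stats(name, mapper, reducer, tasks)) from the directory where the job was launched would leave one JSON file per job there. How the per-task records would be collected from the framework is deliberately left open.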



--
This message was sent by Atlassian JIRA
(v6.2#6252)