Posted to mapreduce-issues@hadoop.apache.org by "Antonio Piccolboni (JIRA)" <ji...@apache.org> on 2014/07/23 00:44:40 UTC
[jira] [Commented] (MAPREDUCE-1767) Streaming infrastructures should
provide statistics about job
[ https://issues.apache.org/jira/browse/MAPREDUCE-1767?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14071061#comment-14071061 ]
Antonio Piccolboni commented on MAPREDUCE-1767:
-----------------------------------------------
Such as?
> Streaming infrastructures should provide statistics about job
> -------------------------------------------------------------
>
> Key: MAPREDUCE-1767
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-1767
> Project: Hadoop Map/Reduce
> Issue Type: Improvement
> Components: contrib/streaming
> Reporter: arkady borkovsky
>
> This should include:
> -- the commands (mapper and reducer commands) executed
> -- time information (e.g. min, max, and avg start time, end time, and elapsed time for tasks; total elapsed time)
> -- sizes: bytes and records; min, max, and avg per task and total; input and output
> -- information about input and output data sets (all output data sets, if there are several)
> -- all user counters (when they are implemented for streaming)
> The information should be stored in a file, e.g. in the working directory from which the job was launched, with a name derived from the job name.
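
To make the request above concrete, here is a minimal sketch of what such a statistics file could contain. It is not part of Hadoop Streaming: the function names, the per-task dictionary keys (`start`, `end`, `input_bytes`, and so on), and the `.stats.json` suffix are all hypothetical, chosen only for illustration. Data set information and user counters are left out.

```python
import json
import re
from pathlib import Path


def summarize(values):
    """Return min, max, and avg of a non-empty list of numbers."""
    return {"min": min(values), "max": max(values), "avg": sum(values) / len(values)}


def build_job_report(job_name, mapper_cmd, reducer_cmd, tasks):
    """Assemble the requested statistics.

    `tasks` is a list of dicts with hypothetical keys: start, end,
    input_bytes, input_records, output_bytes, output_records.
    """
    starts = [t["start"] for t in tasks]
    ends = [t["end"] for t in tasks]
    report = {
        "job_name": job_name,
        "commands": {"mapper": mapper_cmd, "reducer": reducer_cmd},
        "time": {
            "start": summarize(starts),
            "end": summarize(ends),
            "elapsed_per_task": summarize([e - s for s, e in zip(starts, ends)]),
            # Wall-clock span from the first task start to the last task end.
            "total_elapsed": max(ends) - min(starts),
        },
        "sizes": {},
    }
    for field in ("input_bytes", "input_records", "output_bytes", "output_records"):
        values = [t[field] for t in tasks]
        report["sizes"][field] = dict(summarize(values), total=sum(values))
    return report


def report_path(job_name, directory="."):
    """Derive a file name from the job name (assumed naming scheme)."""
    safe = re.sub(r"[^A-Za-z0-9._-]+", "_", job_name)
    return Path(directory) / f"{safe}.stats.json"


def write_job_report(report, directory="."):
    """Write the report as JSON into `directory` and return the path."""
    path = report_path(report["job_name"], directory)
    path.write_text(json.dumps(report, indent=2))
    return path
```

Calling `write_job_report(build_job_report(...))` with no `directory` argument would place the file in the current working directory, which matches the suggestion of storing it where the job was launched.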