Posted to mapreduce-dev@hadoop.apache.org by "Hong Tang (JIRA)" <ji...@apache.org> on 2009/07/21 09:43:14 UTC

[jira] Created: (MAPREDUCE-778) Need a standalone JobHistory log anonymizer

Need a standalone JobHistory log anonymizer
-------------------------------------------

                 Key: MAPREDUCE-778
                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-778
             Project: Hadoop Map/Reduce
          Issue Type: New Feature
            Reporter: Hong Tang


Job history logs contain a rich set of information that can help understand and characterize cluster workload and individual job execution. Examples of work that parses or utilizes job history include HADOOP-3585, MAPREDUCE-534, HDFS-459, MAPREDUCE-728, and MAPREDUCE-776. Some of the parsing tools developed in previous work already contain a component to anonymize the logs. It would be nice to combine these efforts into a common standalone tool that anonymizes job history logs while preserving much of the structure of the files, so that existing tools built on top of job history logs continue to work with no modification.
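
To make the "preserve the structure" requirement concrete, here is a minimal sketch of one possible approach; it is not the design of any of the tools cited above. It assumes history lines have the shape RecordType KEY="value" KEY="value" . and that a fixed set of keys is sensitive. The key list, the salt, and the function names are all hypothetical. Sensitive values are replaced with salted hashes, so the same input always maps to the same pseudonym and the line layout stays parseable.

```python
import hashlib
import re

# Assumed line shape: RecordType KEY="value" KEY="value" .
# The pattern tolerates backslash-escaped characters inside values.
PAIR = re.compile(r'(\w+)="((?:[^"\\]|\\.)*)"')

# Hypothetical choice of sensitive keys; a real tool would make this configurable.
SENSITIVE_KEYS = {"USER", "JOBNAME", "JOBCONF", "HOSTNAME", "TRACKER_NAME"}


def pseudonym(key, value, salt):
    """Map a value to a stable, salted pseudonym such as 'user_3f2a...'."""
    data = (salt + ":" + key + ":" + value).encode("utf-8")
    return key.lower() + "_" + hashlib.sha256(data).hexdigest()[:12]


def anonymize_line(line, salt="change-me"):
    """Rewrite sensitive values in one line; everything else is left untouched."""
    def replace(match):
        key, value = match.group(1), match.group(2)
        if key in SENSITIVE_KEYS:
            value = pseudonym(key, value, salt)
        return '{}="{}"'.format(key, value)
    return PAIR.sub(replace, line)


if __name__ == "__main__":
    sample = 'Job JOBID="job_1" USER="alice" JOBNAME="secret report" .'
    print(anonymize_line(sample))
```

Because the mapping is deterministic for a given salt, per-user or per-host aggregation in downstream tools would still work on the anonymized output. Keeping the salt private matters, since an unsalted hash of a short user name is easy to reverse by guessing.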

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.