You are viewing a plain text version of this content. The canonical link for it is here.
Posted to common-issues@hadoop.apache.org by "Todd Lipcon (JIRA)" <ji...@apache.org> on 2009/11/03 04:51:59 UTC
[jira] Commented: (HADOOP-6244) Improvements to FileContext metrics
output formatting
[ https://issues.apache.org/jira/browse/HADOOP-6244?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12772867#action_12772867 ]
Todd Lipcon commented on HADOOP-6244:
-------------------------------------
bq. While timestamps and distinguishing tags from metrics are both useful properties, changing the format of FileContext disrupts all the downstream consumers.
Agreed - this is why I left the old format in as a configuration option. Is anyone actually *using* the old format, though? When I wanted to look at metrics logs from FileContext in the past, I ended up actually writing a perl script to interpolate timestamps based on the file's mtime - pretty awful. I think now (ie before 1.0) is the time when we should feel free to change formats/APIs that are clearly bad as long as we provide a deprecated compatibility path, yes?
As for the code notes, I think all of your points are valid - I'll upload a new patch soon.
bq. better served by adding a FileContext using a standard format, like JSON, or base it on Avro.
JSON would be reasonable, but I think it's important we continue to have a simpler text metrics logging option. When fishing around on nodes it's good to be able to use perl, grep, and awk without having to install a JSON parser. Avro makes sense for those who want to do long term analysis, but I think that should be a separate patch.
> Improvements to FileContext metrics output formatting
> -----------------------------------------------------
>
> Key: HADOOP-6244
> URL: https://issues.apache.org/jira/browse/HADOOP-6244
> Project: Hadoop Common
> Issue Type: Improvement
> Components: metrics
> Affects Versions: 0.22.0
> Reporter: Todd Lipcon
> Assignee: Todd Lipcon
> Fix For: 0.22.0
>
> Attachments: hadoop-6244.txt
>
>
> The output of FileContext has two big issues: 1) it doesn't include a timestamp, 2) it doesn't differentiate between tags and metrics in formatting. This patch is to improve the output format to be more useful.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.