You are viewing a plain text version of this content. The canonical link for it is here.

Posted to issues@spark.apache.org by "Rahul Singhal (JIRA)" <ji...@apache.org> on 2014/06/12 07:41:01 UTC

[jira] [Created] (SPARK-2127) Use application specific folders to dump metrics via CsvSink

Rahul Singhal created SPARK-2127:
------------------------------------

             Summary: Use application specific folders to dump metrics via CsvSink
                 Key: SPARK-2127
                 URL: https://issues.apache.org/jira/browse/SPARK-2127
             Project: Spark
          Issue Type: Improvement
          Components: Spark Core
    Affects Versions: 1.0.0
            Reporter: Rahul Singhal
            Priority: Minor


Currently when using the CsvSink, all application's csv metrics are dumped in the root folder (configured via "*.sink.csv.director" in metrics.properties). Also, some files that have common names (e.g. "jvm.PS-MarkSweep.count.csv") are reused. And if one is running the same application multiple times, the metrics get appended to previously existing files.

This makes it harder to parse these files and extract the information that one might be looking for. I suggest that a unique folder is created every time an application is run and use it to dump the metrics from that particular run only. This unique folder could be created similar the one that is currently craeted for logging application events (e.g. "spark-pi-1402484928439").



--
This message was sent by Atlassian JIRA
(v6.2#6252)