Posted to issues@spark.apache.org by "Jungtaek Lim (JIRA)" <ji...@apache.org> on 2018/06/07 13:06:00 UTC

[jira] [Created] (SPARK-24485) Measure and log elapsed time for filesystem operations in HDFSBackedStateStoreProvider

Jungtaek Lim created SPARK-24485:
------------------------------------

             Summary: Measure and log elapsed time for filesystem operations in HDFSBackedStateStoreProvider
                 Key: SPARK-24485
                 URL: https://issues.apache.org/jira/browse/SPARK-24485
             Project: Spark
          Issue Type: Improvement
          Components: Structured Streaming
    Affects Versions: 2.4.0
            Reporter: Jungtaek Lim


There are a couple of operations in HDFSBackedStateStoreProvider that communicate with the file system (mostly remote HDFS in production), and they contribute a large part of the latency.

It would be better to measure the latency (elapsed time) of these operations and log it, to help with debugging when the state store shows unexpectedly high latency.
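
To illustrate the idea, here is a minimal, language-neutral sketch in Python of wrapping an operation so that its elapsed time is measured and logged. It is not the Spark implementation (HDFSBackedStateStoreProvider is Scala code), and every name in it (time_taken_ms, run_logged, warn_threshold_ms, the logger name) is hypothetical and chosen only for this example.

```python
import logging
import time

# Hypothetical logger name, used only for this illustration.
logger = logging.getLogger("state_store_timing")


def time_taken_ms(body):
    """Run `body` (a zero-argument callable) and return (result, elapsed milliseconds)."""
    start = time.monotonic()  # monotonic clock: unaffected by wall-clock adjustments
    result = body()
    elapsed_ms = (time.monotonic() - start) * 1000.0
    return result, elapsed_ms


def run_logged(operation, body, warn_threshold_ms=1000.0):
    """Run `body`, log how long `operation` took, and return the result of `body`.

    `warn_threshold_ms` is an assumed knob for this sketch: slow calls are
    logged at WARNING so they stand out, everything else at DEBUG.
    """
    result, elapsed_ms = time_taken_ms(body)
    level = logging.WARNING if elapsed_ms >= warn_threshold_ms else logging.DEBUG
    logger.log(level, "%s took %.1f ms", operation, elapsed_ms)
    return result


if __name__ == "__main__":
    logging.basicConfig(level=logging.DEBUG)
    # time.sleep stands in for a remote file system call such as reading a file.
    run_logged("read snapshot file", lambda: time.sleep(0.05))
```

Logging at a low level by default and raising the level only for slow calls keeps the log quiet in the normal case while still surfacing the unexpectedly slow file system operations the issue is concerned with.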



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
