Posted to issues@spark.apache.org by "Shruti Gumma (Jira)" <ji...@apache.org> on 2019/08/20 18:54:00 UTC

[jira] [Created] (SPARK-28784) StreamExecution and StreamingQueryManager should utilize CheckpointFileManager to interact with checkpoint directories

Shruti Gumma created SPARK-28784:
------------------------------------

             Summary: StreamExecution and StreamingQueryManager should utilize CheckpointFileManager to interact with checkpoint directories
                 Key: SPARK-28784
                 URL: https://issues.apache.org/jira/browse/SPARK-28784
             Project: Spark
          Issue Type: Bug
          Components: Spark Core
    Affects Versions: 2.4.0
            Reporter: Shruti Gumma


PR [https://github.com/apache/spark/pull/21048] introduced the CheckpointFileManager interface to handle all Structured Streaming checkpointing operations and to let users choose how checkpoint files are written atomically.
However, StreamExecution and StreamingQueryManager still perform some FileSystem operations on checkpoint directories directly, bypassing CheckpointFileManager.
For instance:
[https://github.com/apache/spark/blob/master/sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/StreamExecution.scala#L137]
[https://github.com/apache/spark/blob/master/sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/StreamExecution.scala#L392]

Instead, StreamExecution and StreamingQueryManager should use CheckpointFileManager for these operations.
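
A minimal sketch of what routing these operations through CheckpointFileManager could look like, assuming the direct calls in question are of the directory-creation and directory-deletion kind. The object name CheckpointDirSketch and its two helper methods are hypothetical, not part of Spark. CheckpointFileManager lives in an internal package (org.apache.spark.sql.execution.streaming), so this assumes spark-sql and hadoop-common are on the classpath and that the create, mkdirs and delete methods match the 2.4 line:

```scala
import org.apache.hadoop.conf.Configuration
import org.apache.hadoop.fs.Path
import org.apache.spark.sql.execution.streaming.CheckpointFileManager

// Hypothetical helper, for illustration only; not part of Spark.
object CheckpointDirSketch {

  // Create the checkpoint root via CheckpointFileManager instead of
  // obtaining a FileSystem from the path and calling mkdirs on it.
  def ensureCheckpointRoot(root: String, hadoopConf: Configuration): Path = {
    val path = new Path(root)
    // create() picks the manager implementation based on the Hadoop conf,
    // so a user-configured implementation is honored here too.
    val manager = CheckpointFileManager.create(path, hadoopConf)
    manager.mkdirs(path)
    path
  }

  // Remove the checkpoint root via CheckpointFileManager instead of
  // calling FileSystem.delete directly.
  def deleteCheckpointRoot(root: String, hadoopConf: Configuration): Unit = {
    val path = new Path(root)
    val manager = CheckpointFileManager.create(path, hadoopConf)
    // The built-in implementations delete recursively and tolerate a
    // missing path.
    manager.delete(path)
  }
}
```

The point of the indirection is that a custom CheckpointFileManager configured by the user would then see every operation on the checkpoint directory, not only the metadata-log writes.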



