You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Apache Spark (JIRA)" <ji...@apache.org> on 2016/11/07 22:30:58 UTC

[jira] [Commented] (SPARK-18337) Memory Sink should be able to recover from checkpoints in Complete OutputMode

    [ https://issues.apache.org/jira/browse/SPARK-18337?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15645673#comment-15645673 ] 

Apache Spark commented on SPARK-18337:
--------------------------------------

User 'brkyvz' has created a pull request for this issue:
https://github.com/apache/spark/pull/15801

> Memory Sink should be able to recover from checkpoints in Complete OutputMode
> -----------------------------------------------------------------------------
>
>                 Key: SPARK-18337
>                 URL: https://issues.apache.org/jira/browse/SPARK-18337
>             Project: Spark
>          Issue Type: Improvement
>          Components: Structured Streaming
>    Affects Versions: 2.0.1
>            Reporter: Burak Yavuz
>
> Memory sinks are not meant to be fault tolerant, but there are certain cases, where it would be nice that it can recover from checkpoints. In cases where you may use a scalable StateStore in StructuredStreaming (when you have an aggregation), and you add a filter based on a key or value in your state, it's nice to be able to continue from where you left off after failures.
> For correctness reasons, the output will ONLY be correct in Complete mode, so we could support that.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org