You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@flink.apache.org by "Stefan Richter (JIRA)" <ji...@apache.org> on 2017/02/06 09:08:41 UTC
[jira] [Created] (FLINK-5715) Asynchronous snapshotting for
HeapKeyedStateBackend
Stefan Richter created FLINK-5715:
-------------------------------------
Summary: Asynchronous snapshotting for HeapKeyedStateBackend
Key: FLINK-5715
URL: https://issues.apache.org/jira/browse/FLINK-5715
Project: Flink
Issue Type: New Feature
Components: State Backends, Checkpointing
Affects Versions: 1.3.0
Reporter: Stefan Richter
Assignee: Stefan Richter
Blocking snapshots render the HeapKeyedStateBackend practically unusable for many user in productions. Their jobs can not tolerate stopped processing for the time it takes to write gigabytes of data from memory to disk. Asynchronous snapshots would be a solution to this problem. The challenge for the implementation is coming up with a copy-on-write scheme for the in-memory hash maps that build the foundation of this backend. After taking a closer look, this problem is twofold. First, providing CoW semantics for the hashmap itself, as a mutible structure, thereby avoiding costly locking or blocking where possible. Second, CoW for the mutable value objects, e.g. through cloning via serializers.
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)