Posted to issues@flink.apache.org by "Xiaogang Shi (JIRA)" <ji...@apache.org> on 2017/02/15 08:09:41 UTC

[jira] [Closed] (FLINK-5036) Perform the grouping of keys in restoring instead of checkpointing

     [ https://issues.apache.org/jira/browse/FLINK-5036?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Xiaogang Shi closed FLINK-5036.
-------------------------------
    Resolution: Invalid

> Perform the grouping of keys in restoring instead of checkpointing
> ------------------------------------------------------------------
>
>                 Key: FLINK-5036
>                 URL: https://issues.apache.org/jira/browse/FLINK-5036
>             Project: Flink
>          Issue Type: Bug
>          Components: State Backends, Checkpointing
>            Reporter: Xiaogang Shi
>
> Whenever a snapshot of {{RocksDBKeyedStateBackend}} is taken, the values in the states are written to different files according to their key groups. This procedure is very costly when the states are very large.
> Given that snapshots are taken much more frequently than restores are performed, we can leave the keys ungrouped in the snapshot to improve the overall performance. In other words, we can perform the grouping of keys when restoring instead of when checkpointing.
> I think the implementation will be very similar to the restoring of non-partitioned states. Each task will receive a collection of snapshots, each of which contains a set of key groups. Each task will then restore its state from the given snapshots by picking the values that belong to its assigned key groups.
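
The restore-side grouping proposed in the description could look roughly like the sketch below. It is only an illustration under stated assumptions, not Flink code: the class {{RestoreTimeGrouping}}, the methods {{keyGroupOf}} and {{restore}}, and the use of plain in-memory maps as "snapshots" are all hypothetical, and the hash-modulo key-group assignment is a simplified stand-in for whatever assignment the state backend actually uses.

```java
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical sketch: snapshots are written ungrouped, and each task
// filters the entries by key group only when it restores.
public class RestoreTimeGrouping {

    // Assumed, simplified key-group assignment (hash modulo max parallelism).
    // This is a stand-in, not the assignment function used by Flink.
    static int keyGroupOf(String key, int maxParallelism) {
        return Math.floorMod(key.hashCode(), maxParallelism);
    }

    // Restores the state of one task: scans every snapshot handed to the task
    // and keeps only the entries whose key group lies in the inclusive range
    // [firstKeyGroup, lastKeyGroup] assigned to that task.
    static Map<String, String> restore(List<Map<String, String>> snapshots,
                                       int firstKeyGroup,
                                       int lastKeyGroup,
                                       int maxParallelism) {
        Map<String, String> restored = new HashMap<>();
        for (Map<String, String> snapshot : snapshots) {
            for (Map.Entry<String, String> entry : snapshot.entrySet()) {
                int keyGroup = keyGroupOf(entry.getKey(), maxParallelism);
                if (keyGroup >= firstKeyGroup && keyGroup <= lastKeyGroup) {
                    restored.put(entry.getKey(), entry.getValue());
                }
            }
        }
        return restored;
    }
}
```

In this sketch the cost of grouping moves entirely to the restore path: every task has to scan all entries of every snapshot it receives, including those it ends up discarding, which is the trade-off the proposal accepts in exchange for cheaper snapshots.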


