You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@flink.apache.org by "Roman Khachatryan (Jira)" <ji...@apache.org> on 2022/11/10 18:04:00 UTC
[jira] [Updated] (FLINK-29985) TaskManager doesn't close SlotTable on SIGTERM
[ https://issues.apache.org/jira/browse/FLINK-29985?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Roman Khachatryan updated FLINK-29985:
--------------------------------------
Summary: TaskManager doesn't close SlotTable on SIGTERM (was: SlotTable not close on TM termination)
> TaskManager doesn't close SlotTable on SIGTERM
> ----------------------------------------------
>
> Key: FLINK-29985
> URL: https://issues.apache.org/jira/browse/FLINK-29985
> Project: Flink
> Issue Type: Bug
> Components: Runtime / Task
> Affects Versions: 1.16.0, 1.15.3
> Reporter: Roman Khachatryan
> Priority: Major
>
> When a slot is released, the associated resources are released as well, in particular, MemoryManager. MemoryManager might hold not only memory, but also some arbitrary shared resources (currently, PythonSharedResources and RocksDBSharedResources).
> When TM is stopped by JManager, its slot table is closed, causing all its slot to be released
> When TM is stopped by SIGTERM (i.e. external resource manager), its slot table is NOT closed.
> That means that in standalone clusters, some resources might not be released.
>
> As of now, RocksDBSharedResources contains only ephemeral resources.
> Not sure about PythonSharedResources, but likely it is associated with a separate process.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)