You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@uima.apache.org by "Richard Eckart de Castilho (Jira)" <de...@uima.apache.org> on 2023/01/12 15:19:00 UTC

[jira] [Resolved] (UIMA-5310) UIMA-DUCC: Agent may hang in cleanup code on startup

     [ https://issues.apache.org/jira/browse/UIMA-5310?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Richard Eckart de Castilho resolved UIMA-5310.
----------------------------------------------
    Resolution: Abandoned

DUCC has been retired.

> UIMA-DUCC: Agent may hang in cleanup code on startup
> ----------------------------------------------------
>
>                 Key: UIMA-5310
>                 URL: https://issues.apache.org/jira/browse/UIMA-5310
>             Project: UIMA
>          Issue Type: Bug
>          Components: DUCC
>            Reporter: Jaroslaw Cwiklik
>            Assignee: Jaroslaw Cwiklik
>            Priority: Major
>
> When an agent starts up it checks if there are any cgroup containers left over from a previous agent. This may happen if for some reason an agent fails to stop a child process during a Ducc bounce for example. An agent tries to cleanup such processes with kill -9. Once the kill is done, the code goes into a loop checking cgroup.procs to confirm that a process is gone. If a process is still in a container, an agent waits awhile and does a check again. Typically a process dies and cgroups accounting is done quickly. The agent removes a container and proceeds to run normally.
> On rare occasions ducc_ling fails to run kill -9 command and the process persists leading to a hang. 
> An agent should not be blocking after the kill. If it finds a process still running it should report this fact and continue.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)