Posted to issues@trafodion.apache.org by "Gonzalo E Correa (JIRA)" <ji...@apache.org> on 2019/07/24 17:42:00 UTC

[jira] [Updated] (TRAFODION-3318) Change process management of DTM to improve HA behavior

     [ https://issues.apache.org/jira/browse/TRAFODION-3318?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Gonzalo E Correa updated TRAFODION-3318:
----------------------------------------
    Summary: Change process management of DTM to improve HA behavior  (was: Change process management of DTM improve HA behavior)

> Change process management of DTM to improve HA behavior
> -------------------------------------------------------
>
>                 Key: TRAFODION-3318
>                 URL: https://issues.apache.org/jira/browse/TRAFODION-3318
>             Project: Apache Trafodion
>          Issue Type: Improvement
>          Components: dtm, foundation
>            Reporter: Gonzalo E Correa
>            Priority: Major
>   Original Estimate: 120h
>  Remaining Estimate: 120h
>
> The current process management model for process type DTM enforces a soft node down behavior, which kills all processes in a node where a DTM process terminates abnormally. The monitor then recreates the DTM process along with all persistent processes hosted in that node.
> To reduce the fault zone impact, this change removes the soft node down/up functionality so that the DTM process is recreated without killing all other processes in the node. One existing rule is still enforced: if the persistent DTM process cannot be restarted within the configured number of retries in the specified time window, the node is brought down.
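
The retry rule described above can be illustrated with a minimal sketch. This is not the Trafodion monitor's code: the class name, method name, return values, and the sliding-window interpretation of "retries in the specified time window" are all assumptions made for illustration.

```python
from collections import deque


class DtmRestartPolicy:
    """Hypothetical restart limiter; names and semantics are assumptions."""

    def __init__(self, max_retries, window_secs):
        self.max_retries = max_retries    # restarts allowed per window
        self.window_secs = window_secs    # length of the time window
        self._restarts = deque()          # timestamps of recent restarts

    def on_abnormal_exit(self, now):
        """Decide what to do when the DTM process dies at time `now` (seconds)."""
        # Forget restarts that have aged out of the window.
        while self._restarts and now - self._restarts[0] >= self.window_secs:
            self._restarts.popleft()
        if len(self._restarts) >= self.max_retries:
            # Retries exhausted inside the window: the node-down rule applies.
            return "NODE_DOWN"
        # Otherwise restart only the DTM process; other processes keep running.
        self._restarts.append(now)
        return "RESTART_DTM_ONLY"


policy = DtmRestartPolicy(max_retries=2, window_secs=60)
decisions = [policy.on_abnormal_exit(t) for t in (0, 10, 20)]
print(decisions)
```

With two retries allowed per 60 seconds, the first two failures restart only the DTM process, and a third failure inside the same window triggers the node down.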



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)