Posted to issues@trafodion.apache.org by "Selvaganesan Govindarajan (JIRA)" <ji...@apache.org> on 2016/09/28 22:10:20 UTC

[jira] [Resolved] (TRAFODION-2245) Multiple sqcheck and jps processes running when monitor is downed and up as dcsserver checks if trafodion is up

     [ https://issues.apache.org/jira/browse/TRAFODION-2245?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Selvaganesan Govindarajan resolved TRAFODION-2245.
--------------------------------------------------
    Resolution: Fixed

https://github.com/apache/incubator-trafodion/pull/726

> Multiple sqcheck and jps processes running when monitor is downed and up as dcsserver checks if trafodion is up
> ---------------------------------------------------------------------------------------------------------------
>
>                 Key: TRAFODION-2245
>                 URL: https://issues.apache.org/jira/browse/TRAFODION-2245
>             Project: Apache Trafodion
>          Issue Type: Bug
>          Components: dcs
>    Affects Versions: 2.1-incubating
>         Environment: Testing trafodion when failures occurred.  HDP 2.4 distro contents and a standard installation on CentOS 6
>            Reporter: Carol Pearson
>            Assignee: Selvaganesan Govindarajan
>
> Dcsserver checks whether Trafodion is running by using sqcheck, which can hang in some circumstances.
> In this case we had a DTM failure and recovery took a while. The node went to a SoftDown state as the DTM recovered. Meanwhile, dcsserver was looking for Trafodion to come up so that it could start the mxosrvrs on that node. That resulted in many hung sqcheck processes; the notable symptom is that they all had the same ppid.
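
To illustrate the failure mode described above, here is a minimal sketch of one generic way to keep a periodic health check from piling up hung child processes: run the check with a deadline and kill its whole process group if the deadline passes. This is not the change made in the linked pull request; the function name run_check and the timeout values are assumptions made for illustration, and the sketch is POSIX-only.

```python
import os
import signal
import subprocess


def run_check(cmd, timeout_s):
    """Run a health-check command with a deadline (hypothetical helper).

    Returns the command's exit code, or None if it did not finish in time.
    """
    # start_new_session=True puts the child in its own session and process
    # group, so any helpers it spawns can be killed together with it.
    proc = subprocess.Popen(
        cmd,
        stdout=subprocess.DEVNULL,
        stderr=subprocess.DEVNULL,
        start_new_session=True,
    )
    try:
        return proc.wait(timeout=timeout_s)
    except subprocess.TimeoutExpired:
        # The check is presumed hung: kill the whole group, then reap the
        # child so it does not linger as a zombie under the caller's pid.
        os.killpg(proc.pid, signal.SIGKILL)
        proc.wait()
        return None
```

A caller could invoke this as run_check(["sqcheck"], timeout_s=30), where the 30-second deadline is an arbitrary illustrative value, and treat a None result as "status unknown" before retrying, so that at most one check process exists at a time.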



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)