Posted to issues@trafodion.apache.org by "Carol Pearson (JIRA)" <ji...@apache.org> on 2016/09/24 07:54:20 UTC

[jira] [Created] (TRAFODION-2245) Multiple sqcheck and jps processes running when monitor is downed and up as dcsserver checks if trafodion is up

Carol Pearson created TRAFODION-2245:
----------------------------------------

             Summary: Multiple sqcheck and jps processes running when monitor is downed and up as dcsserver checks if trafodion is up
                 Key: TRAFODION-2245
                 URL: https://issues.apache.org/jira/browse/TRAFODION-2245
             Project: Apache Trafodion
          Issue Type: Bug
          Components: dcs
    Affects Versions: 2.1-incubating
         Environment: Testing Trafodion under failure conditions.  HDP 2.4 distribution, standard installation on CentOS 6
            Reporter: Carol Pearson


Dcsserver checks whether Trafodion is running by using sqcheck, which can hang in some circumstances.

In this case we had a DTM failure, and recovery took a while. The node went into a SoftDown state while the DTM recovered. Meanwhile, dcsserver kept checking for Trafodion to come up so that it could start the mxosrvrs on that node. That resulted in many hung sqcheck processes; the notable symptom is that they all had the same ppid.
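
One way to confirm the symptom on an affected node is to group the sqcheck and jps processes by parent pid. The following is a minimal sketch, not part of Trafodion or DCS: it assumes the processes can be recognized by the basename of one of the first two tokens of the command line (for example, a shell running an sqcheck script), and the function names and sample paths are hypothetical.

```python
import os
import subprocess
from collections import Counter


def count_by_parent(ps_output, names=("sqcheck", "jps")):
    """Count matching processes, keyed by (ppid, name).

    Expects text in the layout of `ps -eo pid,ppid,args`, header row included.
    """
    counts = Counter()
    for line in ps_output.splitlines()[1:]:  # skip the header row
        fields = line.split(None, 2)
        if len(fields) < 3:
            continue
        _pid, ppid, args = fields
        # Assumption: the name shows up as the command itself or as the
        # script handed to an interpreter, so look at the first two tokens.
        for token in args.split()[:2]:
            base = os.path.basename(token)
            if base in names:
                counts[(int(ppid), base)] += 1
                break
    return counts


def snapshot():
    """Capture the current process table (pid, ppid and full command line)."""
    result = subprocess.run(
        ["ps", "-eo", "pid,ppid,args"],
        capture_output=True, text=True, check=True,
    )
    return result.stdout
```

Calling count_by_parent(snapshot()) a few times during recovery would show whether the count under a single ppid keeps growing, which would match the hung-sqcheck symptom described above.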


