Posted to issues@trafodion.apache.org by "Carol Pearson (JIRA)" <ji...@apache.org> on 2016/09/24 07:54:20 UTC
[jira] [Created] (TRAFODION-2245) Multiple sqcheck and jps processes running when monitor is downed and up as dcsserver checks if trafodion is up
Carol Pearson created TRAFODION-2245:
----------------------------------------
Summary: Multiple sqcheck and jps processes running when monitor is downed and up as dcsserver checks if trafodion is up
Key: TRAFODION-2245
URL: https://issues.apache.org/jira/browse/TRAFODION-2245
Project: Apache Trafodion
Issue Type: Bug
Components: dcs
Affects Versions: 2.1-incubating
Environment: Testing Trafodion under failure conditions. HDP 2.4 distribution with a standard installation on CentOS 6
Reporter: Carol Pearson
Dcsserver checks whether Trafodion is running by invoking sqcheck, which can hang in some circumstances.
In this case we had a DTM failure, and recovery took a while. The node went into a SoftDown state while the DTM recovered. Meanwhile, dcsserver was checking for Trafodion to come up so that it could start the mxosrvrs on that node. That resulted in many hung sqcheck processes; the notable symptom is that they all had the same parent process ID (ppid).
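One general way to keep hung checks from piling up is to bound each check with a timeout and kill the whole process group when it expires. The following is only a minimal sketch of that idea in Python, not the dcsserver code and not a proposed patch: the function name run_check and the 30-second default are hypothetical, and it assumes a POSIX system such as the CentOS 6 environment above.

```python
import os
import signal
import subprocess


def run_check(cmd, timeout_s=30.0):
    """Run a health-check command with an upper bound on its run time.

    Returns the command's exit code, or None if it had to be killed
    because it did not finish within timeout_s seconds.
    (Hypothetical helper for illustration; not part of Trafodion or DCS.)
    """
    # start_new_session=True puts the child in its own session and process
    # group, so any helpers it spawns can be signalled together with it.
    proc = subprocess.Popen(
        cmd,
        stdout=subprocess.DEVNULL,
        stderr=subprocess.DEVNULL,
        start_new_session=True,
    )
    try:
        return proc.wait(timeout=timeout_s)
    except subprocess.TimeoutExpired:
        try:
            # The child is its own group leader, so its pid is the group id.
            os.killpg(proc.pid, signal.SIGKILL)
        except ProcessLookupError:
            pass  # the group already exited on its own
        proc.wait()  # reap the child so it does not linger as a zombie
        return None
```

A caller would pass the check command as an argument list, for example run_check(["sqcheck"]), and treat a None result as "state unknown, try again later" rather than starting another check while the previous one is still stuck.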
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)