You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@trafodion.apache.org by "liu ming (JIRA)" <ji...@apache.org> on 2016/09/23 02:21:20 UTC
[jira] [Commented] (TRAFODION-2174) sometime sqstop hang and cannot
stop system
[ https://issues.apache.org/jira/browse/TRAFODION-2174?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15515136#comment-15515136 ]
liu ming commented on TRAFODION-2174:
-------------------------------------
Reproduce today, with more info:
all SQL processes successfully shutdown, and RMS and TM, but Monitor is still there:
[liuliumi@10 incubator-trafodion]$ cstat
uid pid ppid wchan rss vsz time stat cmd
--- --- ---- ----- --- --- ---- ---- ---
liuliumi 23105 1 hrtime 38836 378624 00:03:44 Ssl /trafodion/incubator-trafodion/core/sqf/export/bin64d/monitor COLD
liuliumi 23106 1 hrtime 38860 378624 00:03:05 Ssl /trafodion/incubator-trafodion/core/sqf/export/bin64d/monitor COLD
liuliumi 23103 1 poll_s 1392 19136 00:00:00 S mpirun -disable-auto-cleanup -demux select -env SQ_IC TCP -env MPI_ERROR_LEVEL 2 -env SQ_PIDMAP 1 -env MPI_TMPDIR /trafodion/incubator-trafodion/core/sqf/tmp -env MY_SQROOT /trafodion/incubator-trafodion/core/sqf -np 2 /trafodion/incubator-trafodion/core/sqf/export/bin64d/monitor COLD
liuliumi 24662 24611 n_tty_ 224564 1360540 00:01:33 Sl+ sqlci
liuliumi 23123 23106 futex_ 4904 134320 00:01:10 Sl sqwatchdog SQMON1.1 00001 00001 023123 $WDG001 10.0.2.15:52292 00005 00001 00001 SPARE
liuliumi 23125 23105 futex_ 4924 134320 00:01:07 Sl sqwatchdog SQMON1.1 00000 00000 023125 $WDG000 10.0.2.15:53307 00005 00000 00001 SPARE
[liuliumi@10 incubator-trafodion]$
So should look into Monitor
> sometime sqstop hang and cannot stop system
> -------------------------------------------
>
> Key: TRAFODION-2174
> URL: https://issues.apache.org/jira/browse/TRAFODION-2174
> Project: Apache Trafodion
> Issue Type: Bug
> Reporter: liu ming
> Assignee: liu ming
>
> this issue occurred randomly, not very clear what is the consequence. People use Ctrl+C and then ckillall, not sure if this is safe.
> File jira to track it, next time meet this, we may need to gather more debug info.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)