Posted to issues@trafodion.apache.org by "liu ming (JIRA)" <ji...@apache.org> on 2016/09/23 02:21:20 UTC

[jira] [Commented] (TRAFODION-2174) sometime sqstop hang and cannot stop system

    [ https://issues.apache.org/jira/browse/TRAFODION-2174?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15515136#comment-15515136 ] 

liu ming commented on TRAFODION-2174:
-------------------------------------

Reproduced today, with more info:
All SQL processes shut down successfully, as did RMS and TM, but the Monitor is still running:
[liuliumi@10 incubator-trafodion]$  cstat
uid          pid   ppid  wchan   rss   vsz   time     stat cmd
---          ---   ----  -----   ---   ---   ----     ---- ---
liuliumi     23105     1 hrtime 38836 378624 00:03:44 Ssl  /trafodion/incubator-trafodion/core/sqf/export/bin64d/monitor COLD
liuliumi     23106     1 hrtime 38860 378624 00:03:05 Ssl  /trafodion/incubator-trafodion/core/sqf/export/bin64d/monitor COLD
liuliumi     23103     1 poll_s  1392  19136 00:00:00 S    mpirun -disable-auto-cleanup -demux select -env SQ_IC TCP -env MPI_ERROR_LEVEL 2 -env SQ_PIDMAP 1 -env MPI_TMPDIR /trafodion/incubator-trafodion/core/sqf/tmp -env MY_SQROOT /trafodion/incubator-trafodion/core/sqf -np 2 /trafodion/incubator-trafodion/core/sqf/export/bin64d/monitor COLD
liuliumi     24662 24611 n_tty_ 224564 1360540 00:01:33 Sl+ sqlci
liuliumi     23123 23106 futex_  4904 134320 00:01:10 Sl   sqwatchdog SQMON1.1 00001 00001 023123 $WDG001 10.0.2.15:52292 00005 00001 00001 SPARE
liuliumi     23125 23105 futex_  4924 134320 00:01:07 Sl   sqwatchdog SQMON1.1 00000 00000 023125 $WDG000 10.0.2.15:53307 00005 00000 00001 SPARE
[liuliumi@10 incubator-trafodion]$


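So we should look into the Monitor: after sqstop, both monitor processes (pids 23105 and 23106), their sqwatchdog children, and the mpirun launcher are still alive.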
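
To make this check repeatable the next time the hang occurs, here is a minimal sketch that parses saved cstat output and lists the monitor processes left behind, plus any processes parented by them. It assumes the nine-column layout shown above (uid, pid, ppid, wchan, rss, vsz, time, stat, cmd). The function names parse_cstat, leftover_monitors and children_of are hypothetical helpers, not part of Trafodion.

```python
import os


def parse_cstat(text):
    """Parse cstat-style output into a list of dicts, one per process row.

    Assumes nine whitespace-separated columns, the last of which (cmd)
    may itself contain spaces.
    """
    rows = []
    for line in text.splitlines():
        fields = line.split(None, 8)
        # Skip the header, the dashed rule, shell prompts, and blank lines.
        if len(fields) < 9 or not fields[1].isdigit() or not fields[2].isdigit():
            continue
        rows.append({
            "uid": fields[0],
            "pid": int(fields[1]),
            "ppid": int(fields[2]),
            "wchan": fields[3],
            "stat": fields[7],
            "cmd": fields[8].strip(),
        })
    return rows


def leftover_monitors(rows):
    """Return rows whose executable (first word of cmd) is named 'monitor'."""
    return [r for r in rows
            if os.path.basename(r["cmd"].split()[0]) == "monitor"]


def children_of(rows, parents):
    """Return rows whose ppid matches the pid of one of the given parents."""
    parent_pids = {p["pid"] for p in parents}
    return [r for r in rows if r["ppid"] in parent_pids]
```

Fed the output above, this should report the two monitor rows and the two sqwatchdog rows parented by them. The mpirun row is not counted as a monitor because only the first word of cmd is compared.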

> sometime sqstop hang and cannot stop system
> -------------------------------------------
>
>                 Key: TRAFODION-2174
>                 URL: https://issues.apache.org/jira/browse/TRAFODION-2174
>             Project: Apache Trafodion
>          Issue Type: Bug
>            Reporter: liu ming
>            Assignee: liu ming
>
> This issue occurs randomly, and its consequences are not yet clear. People work around it with Ctrl+C followed by ckillall, but it is not clear whether that is safe.
> Filing this JIRA to track it; the next time it happens, we may need to gather more debug info.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)