Posted to apache-bugdb@apache.org by julian briggs <j....@sheffield.ac.uk> on 1998/03/23 11:13:21 UTC

general/1977: It hangs intermittently. When hung, Apache will not serve anything. The parent is trying to

>Number:         1977
>Category:       general
>Synopsis:       It hangs intermittently.  When hung, Apache will not serve anything.  The parent is trying to
>Confidential:   no
>Severity:       critical
>Priority:       medium
>Responsible:    apache
>State:          open
>Class:          sw-bug
>Submitter-Id:   apache
>Arrival-Date:   Mon Mar 23 02:20:00 PST 1998
>Last-Modified:
>Originator:     j.briggs@sheffield.ac.uk
>Organization:
apache
>Release:        1.3b5
>Environment:
I am running Solaris-2.6 with recommended patches on a Sun
Ultra 1 with 128 MB physical memory.  Built Apache-1.3b5 cleanly with
the SunPro C compiler and mod_rewrite.

SunOS cedar 5.6 Generic sun4u sparc SUNW,Ultra-1
Mar 23 09:13:48 cedar unix: mem = 131072K (0x8000000)
>Description:
Dear Apache,

     I am running Solaris-2.6 with recommended patches on a Sun
Ultra 1 with 128 MB physical memory.  I built Apache-1.3b5 cleanly with
the SunPro C compiler and mod_rewrite.  It hangs intermittently.

SunOS cedar 5.6 Generic sun4u sparc SUNW,Ultra-1
Mar 23 09:13:48 cedar unix: mem = 131072K (0x8000000)

When hung, Apache will not serve any requests.  The parent is trying to
kill and wait for a hung child which will not die.

kill -9 fails to kill this child.
kill -9 on the parent leaves orphaned children.
A reboot is the only cure I have found.

Julian


cedar# pseg httpd.conf
   httpd  3620  2031  0 16:08:56 ?        0:00 /usr/local/httpd/bin/httpd -f /usr/local/httpd/conf/httpd.conf
   httpd  4244  2031  0 18:29:30 ?        0:00 /usr/local/httpd/bin/httpd -f /usr/local/httpd/conf/httpd.conf
    root  2031     1  0 12:42:17 ?        0:01 /usr/local/httpd/bin/httpd -f /usr/local/httpd/conf/httpd.conf
   httpd  4302  2031  0 18:51:20 ?        0:00 /usr/local/httpd/bin/httpd -f /usr/local/httpd/conf/httpd.conf
   httpd  3616  2031  0 16:08:09 ?        0:00 /usr/local/httpd/bin/httpd -f /usr/local/httpd/conf/httpd.conf
   httpd  3644  2031  0 16:15:06 ?        0:00 /usr/local/httpd/bin/httpd -f /usr/local/httpd/conf/httpd.conf
   httpd  4301  2031  0 18:51:19 ?        0:00 /usr/local/httpd/bin/httpd -f /usr/local/httpd/conf/httpd.conf
   httpd  3711  2031  0 16:35:38 ?        0:00 /usr/local/httpd/bin/httpd -f /usr/local/httpd/conf/httpd.conf
   httpd  3838  2031  0 17:01:13 ?        0:00 /usr/local/httpd/bin/httpd -f /usr/local/httpd/conf/httpd.conf
   httpd  4309  2031  0 18:53:22 ?        0:00 /usr/local/httpd/bin/httpd -f /usr/local/httpd/conf/httpd.conf
   httpd  3615  2031  0 16:08:06 ?        0:00 /usr/local/httpd/bin/httpd -f /usr/local/httpd/conf/httpd.conf
   httpd  3667  2031  0 16:22:32 ?        0:00 /usr/local/httpd/bin/httpd -f /usr/local/httpd/conf/httpd.conf
   httpd  2380  2031  0 13:41:46 ?        0:00 /usr/local/httpd/bin/httpd -f /usr/local/httpd/conf/httpd.conf
   httpd  4255  2031  0 18:33:53 ?        0:00 /usr/local/httpd/bin/httpd -f /usr/local/httpd/conf/httpd.conf
   httpd  4307  2031  0 18:53:21 ?        0:00 /usr/local/httpd/bin/httpd -f /usr/local/httpd/conf/httpd.conf
   httpd  3660  2031  0 16:20:49 ?        0:00 /usr/local/httpd/bin/httpd -f /usr/local/httpd/conf/httpd.conf
   httpd  4308  2031  0 18:53:22 ?        0:00 /usr/local/httpd/bin/httpd -f /usr/local/httpd/conf/httpd.conf
   httpd  4271  2031  0 18:39:38 ?        0:00 /usr/local/httpd/bin/httpd -f /usr/local/httpd/conf/httpd.conf
   httpd  4292  2031  0 18:46:28 ?        0:00 /usr/local/httpd/bin/httpd -f /usr/local/httpd/conf/httpd.conf
   httpd  4284  2031  0 18:45:04 ?        0:00 /usr/local/httpd/bin/httpd -f /usr/local/httpd/conf/httpd.conf
   httpd  4290  2031  0 18:46:25 ?        0:00 /usr/local/httpd/bin/httpd -f /usr/local/httpd/conf/httpd.conf
cedar# 

kill(4309, SIGUSR1)                             = 0
waitid(P_ALL, 0, 0xEFFFFB90, WEXITED|WTRAPPED|WNOHANG) = 0
poll(0xEFFFDC10, 0, 1000)                       = 0
time()                                          = 890298196
getpid()                                        = 2031 [1]
kill(4309, SIGUSR1)                             = 0
waitid(P_ALL, 0, 0xEFFFFB90, WEXITED|WTRAPPED|WNOHANG) = 0
poll(0xEFFFDC10, 0, 1000)                       = 0
time()                                          = 890298197
getpid()                                        = 2031 [1]
kill(4309, SIGUSR1)                             = 0
waitid(P_ALL, 0, 0xEFFFFB90, WEXITED|WTRAPPED|WNOHANG) = 0
poll(0xEFFFDC10, 0, 1000)                       = 0
time()                                          = 890298198
getpid()                                        = 2031 [1]
kill(4309, SIGUSR1)                             = 0
waitid(P_ALL, 0, 0xEFFFFB90, WEXITED|WTRAPPED|WNOHANG) = 0
poll(0xEFFFDC10, 0, 1000)                       = 0
time()                                          = 890298199
getpid()                                        = 2031 [1]
kill(4309, SIGUSR1)                             = 0
waitid(P_ALL, 0, 0xEFFFFB90, WEXITED|WTRAPPED|WNOHANG) = 0
poll(0xEFFFDC10, 0, 1000)                       = 0
time()                                          = 890298200
getpid()                                        = 2031 [1]
kill(4309, SIGUSR1)                             = 0
waitid(P_ALL, 0, 0xEFFFFB90, WEXITED|WTRAPPED|WNOHANG) = 0
...
cedar# kill -9 4309
cedar# !ps
pseg 4309
   httpd  4309  2031  0 18:53:22 ?        0:00 /usr/local/httpd/bin/httpd -f /usr/local/httpd/conf/httpd.conf
cedar# 
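
For readers following the trace: the parent (PID 2031) repeats the same
cycle about once a second. It signals child 4309 with SIGUSR1, checks for
exited children without blocking (WNOHANG), then pauses for 1000 ms. The
sketch below reproduces that cycle in Python purely as an illustration.
It is not Apache's code; the function name signal_and_reap and its
parameters are hypothetical, and it waits on one PID rather than on any
child as the waitid(P_ALL, ...) call in the trace does.

```python
import os
import signal
import time


def signal_and_reap(pid, sig=signal.SIGUSR1, attempts=5, interval=1.0):
    """Signal a child, then poll for its exit without blocking.

    Returns True once the child has been reaped, or False if it is
    still present after `attempts` rounds.
    Hypothetical helper, not part of Apache.
    """
    for _ in range(attempts):
        try:
            os.kill(pid, sig)               # kill(pid, SIGUSR1) in the trace
        except ProcessLookupError:
            return True                     # no such process: nothing left to signal
        reaped, _status = os.waitpid(pid, os.WNOHANG)   # non-blocking wait
        if reaped == pid:
            return True                     # child exited and was collected
        time.sleep(interval)                # poll(..., 1000) in the trace
    return False
```

With a healthy child the function should return True after a round or
two. In the state reported here it would return False, because the
signal is accepted (kill returns 0) but the child never becomes
waitable.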
cedar# showrev -p
Patch: 101242-11  Obsoletes:  Packages: 
Patch: 105216-01 Obsoletes:  Requires:  Incompatibles:  Packages: SUNWcsu
Patch: 105393-01 Obsoletes:  Requires:  Incompatibles:  Packages: SUNWcsu
Patch: 105518-01 Obsoletes:  Requires:  Incompatibles:  Packages: SUNWcsu
Patch: 105615-02 Obsoletes:  Requires:  Incompatibles:  Packages: SUNWcsu
Patch: 105621-01 Obsoletes:  Requires:  Incompatibles:  Packages: SUNWcsu, SUNWarc
Patch: 105665-01 Obsoletes:  Requires:  Incompatibles:  Packages: SUNWcsu
Patch: 105379-01 Obsoletes:  Requires:  Incompatibles:  Packages: SUNWcsr
Patch: 105786-01 Obsoletes:  Requires:  Incompatibles:  Packages: SUNWcsr
Patch: 105669-02 Obsoletes:  Requires:  Incompatibles:  Packages: SUNWdtbas
Patch: 105558-01 Obsoletes:  Requires:  Incompatibles:  Packages: SUNWdtdst
Patch: 105375-03 Obsoletes:  Requires:  Incompatibles:  Packages: SUNWluxal, SUNWluxdv, SUNWluxop
Patch: 105357-01 Obsoletes:  Requires:  Incompatibles:  Packages: SUNWses
Patch: 105223-01 Obsoletes:  Requires:  Incompatibles:  Packages: SUNWssadv, SUNWssaop
Patch: 105356-01 Obsoletes:  Requires:  Incompatibles:  Packages: SUNWssadv
Patch: 105407-01 Obsoletes:  Requires:  Incompatibles:  Packages: SUNWvolu
Patch: 104468-06 Obsoletes:  Requires:  Incompatibles:  Packages: SUNWsadmc, SUNWsadmo, SUNWsacb, SUNWsaco, SUNWsadmm, SUNWsadma, SUNWsadmb, SUNWsacd, SUNWsadmp
Patch: 102924-25 Obsoletes:  Requires:  Incompatibles:  Packages: SUNWsunpc
cedar# 

[Wed Mar 18 18:38:05 1998] [error] (13)Permission denied: file permissions deny server access: /home/dan/public_html/f14a_thumb.gif
[Wed Mar 18 18:41:51 1998] [error] File does not exist: /home/u6gwt/public_html/html/tracks/melbourn.htm
[Thu Mar 19 09:35:03 1998] [error] child process 2380 still did not exit, sending a SIGKILL
[Thu Mar 19 09:35:03 1998] [error] child process 4244 still did not exit, sending a SIGKILL
[Thu Mar 19 09:35:03 1998] [error] child process 4271 still did not exit, sending a SIGKILL
[Thu Mar 19 09:35:03 1998] [error] child process 3660 still did not exit, sending a SIGKILL
[Thu Mar 19 09:35:03 1998] [error] child process 4255 still did not exit, sending a SIGKILL
[Thu Mar 19 09:35:03 1998] [error] child process 3711 still did not exit, sending a SIGKILL
[Thu Mar 19 09:35:03 1998] [error] child process 4284 still did not exit, sending a SIGKILL
[Thu Mar 19 09:35:03 1998] [error] child process 4302 still did not exit, sending a SIGKILL
[Thu Mar 19 09:35:03 1998] [error] child process 4307 still did not exit, sending a SIGKILL
[Thu Mar 19 09:35:03 1998] [error] child process 4309 still did not exit, sending a SIGKILL
[Thu Mar 19 09:35:20 1998] [error] could not make child process 2380 exit, attempting to continue anyway
[Thu Mar 19 09:35:20 1998] [error] could not make child process 4244 exit, attempting to continue anyway
[Thu Mar 19 09:35:20 1998] [error] could not make child process 4271 exit, attempting to continue anyway
[Thu Mar 19 09:35:20 1998] [error] could not make child process 3660 exit, attempting to continue anyway
[Thu Mar 19 09:35:20 1998] [error] could not make child process 4255 exit, attempting to continue anyway
[Thu Mar 19 09:35:20 1998] [error] could not make child process 3711 exit, attempting to continue anyway
[Thu Mar 19 09:35:20 1998] [error] could not make child process 4284 exit, attempting to continue anyway
[Thu Mar 19 09:35:20 1998] [error] could not make child process 4302 exit, attempting to continue anyway
[Thu Mar 19 09:35:20 1998] [error] could not make child process 4307 exit, attempting to continue anyway
[Thu Mar 19 09:35:20 1998] [error] could not make child process 4309 exit, attempting to continue anyway
[Thu Mar 19 09:35:20 1998] [notice] httpd: caught SIGTERM, shutting down
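
The PIDs that survived SIGKILL can be pulled out of an error log like the
one above with a few lines of Python. This is only a sketch: the function
name stuck_children is hypothetical, and the pattern assumes the exact
"could not make child process N exit" wording shown in these log lines.

```python
import re

# Matches the message logged when a child is still present after SIGKILL.
STUCK = re.compile(r"could not make child process (\d+) exit")


def stuck_children(log_text):
    """Return the sorted, de-duplicated PIDs reported as not exiting."""
    return sorted({int(m.group(1)) for m in STUCK.finditer(log_text)})
```

The resulting PIDs can then be compared with the process listing to
confirm which children are still present.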
>How-To-Repeat:
Run 1.3b5 on Solaris-2.6 for about 2-8 hours.
I have seen no pattern in the requested URLs.

>Fix:
no
>Audit-Trail:
>Unformatted:
[In order for any reply to be added to the PR database, ]
[you need to include <ap...@Apache.Org> in the Cc line ]
[and leave the subject line UNCHANGED.  This is not done]
[automatically because of the potential for mail loops. ]

