You are viewing a plain text version of this content. The canonical link for it is here.

Posted to dev@httpd.apache.org by Paul Sutton <pa...@eu.c2.net> on 1998/01/02 13:08:08 UTC

Re: win32: reason for inactive service taking 60s to shutdown

On Tue, 30 Dec 1997, Marc Slemko wrote:
> I like the windows manual:
> 
>   Note SIGINT is not supported for any Win32 application including Windows
>   NT and Windows 95. When a CTRL+C interrupt occurs, Win32 operating systems
>   generate a new thread to specifically handle that interrupt. This can
>   cause a single-thread application such as UNIX, to become multithreaded,
>   resulting in unexpected behavior. 
> 
> I wasn't aware UNIX was a single-thread application.

Welcome to Microsoft speak. Next you'll be suggesting that all the great
new NT5 features - like disk quotas, cron jobs, scripting, mountable
media, backups to non-tape devices, per-user home directories, remote
graphical access - have already been invented in other OSes. Surely not?

> Anyway, about the reason why Apache takes 60 seconds to terminate
> as a service if no requests come in.  The problem is that the child
> blocks in select() and doesn't get around to the event until the
> next time around the loop.

Yep, this is a problem. In my big patch of Win32 fixes, I changed this to
a one-second wait:

| - worker_main() now will not exit until all the connections in its
|   listen() queue are dealt with (previous it would "count_down" a
|   few requests, then die, which would loose the connections in
|   this process'es listen() queue). This now makes graceful restarts not
|   lose connections. Once worker_main() has been told to exit it
|   frees the mutex to allow another worker_main() to get into a listen()
|   [however it seems Win32 directs incoming requests to the first process
|   annoyingly]. To give the old worker_main() a chance to exit,
|   only sleep for 1 second in the select. Also catch problems if the
|   select() returns immediately with an error (previously this would
|   cause a busy loop forever).

In the current code, the parent process also has a loop containing a two
second delay. With my patch this becomes an infinite delay.

> I couldn't find anything in the Windows API to easily let us 
> deliver something to the app that will abort the select() early,
> nor could I figure out anything that will let us wait for either
> at the same time; the winsock functions are seperete from the
> WaitForSingleObject type calls.  

We can do this with an event. My patch creates an event called
"apache-signal" which is used to wake the parent up and get it to do
something. It can be used for both graceful restarts and (graceful)
shutdowns. 

You should be able to do something similar with WaitForMultipleObject
calls using both sockets and events, but I couldn't actually make this
work (waiting on a socket for a connect event, that is). Plus I think it
is nicer if the parent process is the one which gets notification of
apache "signals" (if the threads get the notification, they have to tell
the parent anyway, since otherwise the parent will happily recreate the
child process after it kills itself -- not very useful for doing a
shutdown!). 

I really would like to see my Win32 MT patch incorporated into the code. 
It fixes a lot of problems like these (although it isn't perfect, it is a
lot better than the current code. In my opinion anyway). 

Paul

Re: win32: reason for inactive service taking 60s to shutdown

Posted by Paul Sutton <pa...@awe.com>.

On Fri, 2 Jan 1998, Ben Laurie wrote:

> Paul Sutton wrote:
> > +#ifdef MULTITHREAD
> > +/* special debug stuff -- PCS */
>
> Why make this multithread only?

No reason...
 
> > +    /* after updating the shutdown_pending or restart flags, we need
> > +     * to wait up the parent process so it can see the changes. The
> > +     * parent will normally be waiting for either a child process
> > +     * to die, or for a signal on the "spache-stop" event. So set the
> > +     * "apache-signal" event.
> > +     */
> 
> This comment makes little sense to me (even after correcting the
> spelling mistakes). Should "spache-stop" be "apache-signal"?

Yep. apache-stop was the old name. It is now apache-signal.

//pcs

Re: win32: reason for inactive service taking 60s to shutdown

Posted by Ben Laurie <be...@algroup.co.uk>.

Paul Sutton wrote:
> +#ifdef MULTITHREAD
> +/* special debug stuff -- PCS */

Why make this multithread only?

> -static void sig_term(int sig)
> +#ifdef WIN32
> +static void signal_parent(void)
>  {
> +    HANDLE e;
> +    /* after updating the shutdown_pending or restart flags, we need
> +     * to wait up the parent process so it can see the changes. The
> +     * parent will normally be waiting for either a child process
> +     * to die, or for a signal on the "spache-stop" event. So set the
> +     * "apache-signal" event.
> +     */

This comment makes little sense to me (even after correcting the
spelling mistakes). Should "spache-stop" be "apache-signal"?

Anyway, the bulk of the patches are a little large to decide whether
they will work, but they kinda make sense to me. I presume you've tested
them some. I'm willing to give them a go for 1.3b4...

Cheers,

Ben.

-- 
Ben Laurie            |Phone: +44 (181) 735 0686|Apache Group member
Freelance Consultant  |Fax:   +44 (181) 735 0689|http://www.apache.org
and Technical Director|Email: ben@algroup.co.uk |Apache-SSL author
A.L. Digital Ltd,     |http://www.algroup.co.uk/Apache-SSL
London, England.      |"Apache: TDG" http://www.ora.com/catalog/apache

Re: win32: reason for inactive service taking 60s to shutdown

Posted by Paul Sutton <pa...@awe.com>.

On Fri, 2 Jan 1998, Ben Laurie wrote:
> Could you post your patch again?

Ok, here it is...

--

Date: Mon, 1 Dec 1997 11:39:10 +0000 (GMT)
Subject: [PATCH] a bundle of multithreading changes

The patch below is a first pass at fixing up the multithreading
implementation on Win32. There are a number of problems with the code at
the moment, most notably a lack of "restart" functionality without loosing
connections. There are also a lot of asserts() in the code, which cause
nasty exits and do not do any error checking in release builds. This patch
relies on my previous WIN32ERROR patch for aplog_error(). 

The changes are:

 - Replaced assert()'s with aplog_errors() in various places, so that
   we don't loose error checking in the release build. Also asserts()
   do an ungraceful exit - try to be kinder to the user.

 - Added some debug macros to send messages to the screen or error log
   (these are just for convenience - if you don't like them they can
   be removed) (APD1() thru APD5()).

 - Add model-independent functions to initiate a restart or shutdown. 
   Currently this is initiated in functions which know they are being
   called as a signal handler. This is now abstracted to a separate
   functions, which can be called from a signal handler, or on Win32
   from the service controller callback. (start_shutdown(),
   start_restart())

 - Added lots of comments to the MT part of the code probably containing
   lots of tyops, er, typos.

 - In child_sub_main() (the "worker" thread code) used ptrans for
   the temporary pool, instead of pchild - now consistent with
   Unix child_main() [Incidently I'm not sure why ptrans is a global,
   since it (should) only be used within child_main()?]

 - made the parent thread (worker_main()) exit without going into the
   main parent loop if it is being signalled to die. Create the child
   pool (pchild) here (since this pool has a lifetime of a process,
   not a "child" (thread)). Cleanup this pool on exit, and run
   child_exit_modules().

 - worker_main() now will not exit until all the connections in its
   listen() queue are dealt with (previous it would "count_down" a
   few requests, then die, which would loose the connections in
   this process'es listen() queue). This now makes graceful restarts not
   loose connections. Once worker_main() has been told to exit it
   frees the mutex to allow another worker_main() to get into a listen()
   [however it seems Win32 directs incoming requests to the first process
   annoyingly]. To give the old worker_main() a chance to exit,
   only sleep for 1 second in the select. Also catch problems if the
   select() returns immediately with an error (previously this would
   cause a busy loop forever).

 - In the parent process (master_main()) abstract the creation/removal
   of processes (create_process(), cleanup_process()). Only bother to
   create one child process at once (more are unnecessary). Use an event
   to simulate a signal (with variables used to determine what action
   should be taken). Block forever waiting for a child to die or for
   the "signal" event (previously it would loop every 2 seconds). Handle
   shutdowns and restarts (both graceful). Allow multiple graceful
   restarts. Log error conditions like when child processes have to
   be forcibly terminated.

 - Update os/win32/service.c so that a "net stop apache" sends a
   graceful shutdown "signal" (using start_shutdown()). By changing
   this to call start_restart() you can also test graceful restarts.

 - Some unused code to do ungraceful shutdown/restart. Currently 
   mutually exclusive with graceful restarts. Not compiled in by
   default. See the comments about UNGRACEFUL_RESTARTS in the patch.

On the downside, this patch

 - Makes the MT code more Win32 specific (this is necessary since
   it is useful to do WaitForMultipleObject() type things, for instance,
   and we don't have an abstraction for that). Also the Win32 stuff should
   use completion ports, and again we don't have an abstraction. However
   this does give us a good idea of stuff that does need abstracting.

 - I haven't checked for resource leaks, proper pool maintenence, or
   proper sequences of module init/child init/child exit API calls.

 - Makes http_main.c even bigger. I was going to split it out in
   core, multi-process and MT parts at the outset but that would
   have made comparing the patch pretty difficult. However I think it
   is a good idea to move the code in the #ifdef MULTITHREAD/#endif
   part to a separate http_main_mt.c (or maybe a http_main_win32 since
   it is quite specific at the moment). 

Oh well, here's the patch. If you want to get the debug messages to go to
the error log instead of the console, define -DDEBUG_TO_ERROR_LOG. 

Paul

diff -u --recursive ../../apachen/src/main/http_main.c ./main/http_main.c
--- ../../apachen/src/main/http_main.c	Thu Nov 27 14:09:34 1997
+++ ./main/http_main.c	Mon Dec  1 10:56:17 1997
@@ -108,6 +108,36 @@
 #include <bstring.h>		/* for IRIX, FD_SET calls bzero() */
 #endif
 
+#ifdef MULTITHREAD
+/* special debug stuff -- PCS */
+
+  /* APD1() to APD5() are macros to help us debug. Then can either
+   * log to the screen or the error_log file.
+   */
+
+# ifdef _DEBUG
+#  ifdef DEBUG_TO_ERROR_LOG
+#   define APD1(a) aplog_error(APLOG_MARK,APLOG_DEBUG|APLOG_NOERRNO,server_conf,a)
+#   define APD2(a,b) aplog_error(APLOG_MARK,APLOG_DEBUG|APLOG_NOERRNO,server_conf,a,b)
+#   define APD3(a,b,c) aplog_error(APLOG_MARK,APLOG_DEBUG|APLOG_NOERRNO,server_conf,a,b,c)
+#   define APD4(a,b,c,d) aplog_error(APLOG_MARK,APLOG_DEBUG|APLOG_NOERRNO,server_conf,a,b,c,d)
+#   define APD5(a,b,c,d,e) aplog_error(APLOG_MARK,APLOG_DEBUG|APLOG_NOERRNO,server_conf,a,b,c,d,e)
+#  else
+#   define APD1(a) printf("%s\n",a)
+#   define APD2(a,b) do { printf(a,b);putchar('\n'); } while(0);
+#   define APD3(a,b,c) do { printf(a,b,c);putchar('\n'); } while(0);
+#   define APD4(a,b,c,d) do { printf(a,b,c,d);putchar('\n'); } while(0);
+#   define APD5(a,b,c,d,e) do { printf(a,b,c,d,e);putchar('\n'); } while(0);
+#  endif
+# else /* !_DEBUG */
+#  define APD1(a) 
+#  define APD2(a,b) 
+#  define APD3(a,b,c) 
+#  define APD4(a,b,c,d) 
+#  define APD5(a,b,c,d,e) 
+# endif /* _DEBUG */
+#endif /* MULTITHREAD */
+
 #include "explain.h"
 
 #if !defined(max)
@@ -2062,21 +2092,92 @@
 static int volatile is_graceful;
 static int volatile generation;
 
-static void sig_term(int sig)
+#ifdef WIN32
+static void signal_parent(void)
 {
+    HANDLE e;
+    /* after updating the shutdown_pending or restart flags, we need
+     * to wait up the parent process so it can see the changes. The
+     * parent will normally be waiting for either a child process
+     * to die, or for a signal on the "spache-stop" event. So set the
+     * "apache-signal" event.
+     */
+
+    APD1("*** SIGNAL_PARENT SET ***");
+
+    e = OpenEvent(EVENT_ALL_ACCESS, FALSE, "apache-signal");
+    if (!e) {
+	/* Um, problem, can't signal the main loop, which means we can't
+	 * signal ourselves to die. Ignore for now...
+	 */
+	aplog_error(APLOG_MARK, APLOG_EMERG|APLOG_WIN32ERROR, server_conf,
+	    "OpenEvent on apache-signal event");
+	return;
+    }
+    if (SetEvent(e) == 0) {
+	/* Same problem as above */
+	aplog_error(APLOG_MARK, APLOG_EMERG|APLOG_WIN32ERROR, server_conf,
+	    "SetEvent on apache-signal event");
+	return;
+    }
+    CloseHandle(e);
+}
+#endif
+
+/*
+ * start_shutdown() and start_restart(), below, are a first stab at
+ * functions to initiate shutdown or restart within relying on signals. 
+ * Previously this was initiated in sig_term() and restart() signal handlers, 
+ * but we want to be able to start a shutdown/restart from other sources --
+ * e.g. on Win32, from the service manager. Now the service manager can
+ * call start_shutdown() or start_restart() as appropiate. 
+ */
+
+void start_shutdown(void)
+{
+    if (shutdown_pending == 1) {
+	/* Um, is this _probably_ not an error, if the user has
+	 * tried to do a shutdown twice quickly, so we won't
+	 * worry about reporting it.
+	 */
+	return;
+    }
     shutdown_pending = 1;
+
+#ifdef WIN32
+    signal_parent();	    /* get the parent process to wake up */
+#endif
 }
 
-static void restart(int sig)
+/* do a graceful restart if graceful == 1 */
+void start_restart(int graceful)
 {
+    if (restart_pending == 1) {
+	/* Probably not an error - don't bother reporting it */
+	printf("*** restart already pending\n");
+	return;
+    }
+    restart_pending = 1;
+    is_graceful = graceful;
+
 #ifdef WIN32
-    is_graceful = 0;
-#else
-    is_graceful = (sig == SIGUSR1);
+    signal_parent();	    /* get the parent process to wake up */
 #endif /* WIN32 */
-    restart_pending = 1;
 }
 
+static void sig_term(int sig)
+{
+    start_shutdown();
+}
+
+static void restart(int sig)
+{
+#ifndef WIN32
+    start_restart(sig == SIGUSR1);
+#else
+    start_restart(1);
+#endif
+}
 
 void set_signals(void)
 {
@@ -3690,15 +3791,96 @@
 #else /* ndef MULTITHREAD */
 
 
+/**********************************************************************
+ * Multithreaded implementation
+ *
+ * This code is fairly specific to Win32.
+ *
+ * The model used to handle requests is a set of threads. One "main"
+ * thread listens for new requests. When something becomes
+ * available, it does a select and places the newly available socket
+ * onto a list of "jobs" (add_job()). Then any one of a fixed number
+ * of "worker" threads takes the top job off the job list with
+ * remove_job() and handles that connection to completion. After
+ * the connection has finished the thread is free to take another
+ * job from the job list.
+ *
+ * In the code, the "main" thread is running within the worker_main()
+ * function. The first thing this function does is create the
+ * worker threads, which operate in the child_sub_main() function. The
+ * main thread then goes into a loop within worker_main() where they
+ * do a select() on the listening sockets. The select times out once
+ * per second so that the thread can check for an "exit" signal
+ * from the parent process (see below). If this signal is set, the 
+ * thread can exit, but only after it has accepted all incoming
+ * connections already in the listen queue (since Win32 appears
+ * to through away listened but unaccepted connections when a 
+ * process dies).
+ *
+ * Because the main and worker threads exist within a single process
+ * they are vulnerable to crashes or memory leaks (crashes can also
+ * be caused within modules, of course). There also needs to be a 
+ * mechanism to perform restarts and shutdowns. This is done by
+ * creating the main & worker threads within a subprocess. A
+ * main process (the "parent process") creates one (or more) 
+ * processes to do the work, then the parent sits around waiting
+ * for the working process to die, in which case it starts a new
+ * one. The parent process also handles restarts (by creating
+ * a new working process then signalling the previous working process 
+ * exit ) and shutdowns (by signalling the working process to exit).
+ * The parent process operates within the master_main() function. This
+ * process also handles requests from the service manager (NT only).
+ *
+ * Signalling between the parent and working process uses a Win32
+ * event. Each child has a unique name for the event, which is
+ * passed to it with the -c argument when the child is spawned. The
+ * parent sets (signals) this event to tell the child to die.
+ * At present all children do a graceful die - they finish all
+ * current jobs _and_ empty the listen queue before they exit.
+ * A non-graceful die would need a second event.
+ *
+ * The code below starts with functions at the lowest level -
+ * worker threads, and works up to the top level - the main()
+ * function of the parent process.
+ *
+ * The scoreboard (in process memory) contains details of the worker
+ * threads (within the active working process). There is no shared
+ * "scoreboard" between processes, since only one is ever active
+ * at once (or at most, two, when one has been told to shutdown but
+ * is processes outstanding requests, and a new one has been started).
+ * This is controlled by a "start_mutex" which ensures only one working
+ * process is active at once.
+ **********************************************************************/
+
+/* The code protected by #ifdef UNGRACEFUL_RESTARTS/#endif sections
+ * could implement a sort-of ungraceful restart for Win32. instead of
+ * graceful restarts. 
+ *
+ * However it does not work too well because it does not intercept a
+ * connection already in progress (in child_sub_main()). We'd have to
+ * get that to poll on the exit event. 
+ */
 
+/*
+ * Definition of jobs, shared by main and worker threads.
+ */
 
 typedef struct joblist_s {
     struct joblist_s *next;
     int sock;
 } joblist;
 
+/*
+ * Globals common to main and worker threads. This structure is not
+ * used by the parent process.
+ */
+
 typedef struct globals_s {
+#ifdef UNGRACEFUL_RESTART
+    HANDLE thread_exit_event;
+#else
     int exit_now;
+#endif
     semaphore *jobsemaphore;
     joblist *jobhead;
     joblist *jobtail;
@@ -3706,10 +3888,14 @@
     int jobcount;
 } globals;
 
-
 globals allowed_globals =
 {0, NULL, NULL, NULL, 0};
 
+/*
+ * add_job()/remove_job() - add or remove an accepted socket from the
+ * list of sockets connected to clients. allowed_globals.jobmutex protects
+ * against multiple concurrent access to the linked list of jobs.
+ */
 
 void add_job(int sock)
 {
@@ -3736,10 +3922,32 @@
     joblist *job;
     int sock;
 
-    ap_assert(allowed_globals.jobmutex);
+#ifdef UNGRACEFUL_RESTART
+    HANDLE hObjects[2];
+    int rv;
+
+    hObjects[0] = allowed_globals.jobsemaphore;
+    hObjects[1] = allowed_globals.thread_exit_event;
+
+    rv = WaitForMultipleObjects(2, hObjects, FALSE, INFINITE);
+    ap_assert(rv != WAIT_FAILED);
+    if (rv == WAIT_OBJECT_0 + 1) {
+	/* thread_exit_now */
+	APD1("thread got exit now event");
+	return -1;
+    }
+    /* must be semaphore */
+#else
     acquire_semaphore(allowed_globals.jobsemaphore);
+#endif
+    ap_assert(allowed_globals.jobmutex);
+
+#ifdef UNGRACEFUL_RESTART
+    if (!allowed_globals.jobhead) {
+#else
     acquire_mutex(allowed_globals.jobmutex);
     if (allowed_globals.exit_now && !allowed_globals.jobhead) {
+#endif
 	release_mutex(allowed_globals.jobmutex);
 	return (-1);
     }
@@ -3754,19 +3962,37 @@
     return (sock);
 }
 
+/*
+ * child_sub_main() - this is the main loop for the worker threads
+ *
+ * Each thread runs within this function. They wait within remove_job()
+ * for a job to become available, then handle all the requests on that
+ * connection until it is closed, then return to remove_job().
+ *
+ * The worker thread will exit when it removes a job which contains
+ * socket number -1. This provides a graceful thread exit, since
+ * it will never exit during a connection.
+ *
+ * This code in this function is basically equivalent to the child_main()
+ * from the multi-process (Unix) environment, except that we
+ *
+ *  - do not call child_init_modules (child init API phase)
+ *  - block in remove_job, and when unblocked we have an already
+ *    accepted socket, instead of blocking on a mutex or select().
+ */
 
-
-void child_sub_main(int child_num, int srv,
-		    int csd, int dupped_csd,
-		    int requests_this_child, pool *pchild)
+void child_sub_main(int child_num)
 {
     NET_SIZE_T clen;
     struct sockaddr sa_server;
     struct sockaddr sa_client;
+    pool *ptrans;
+    int requests_this_child = 0;
+    int csd = -1;
+    int dupped_csd = -1;
+    int srv = 0;
 
-    csd = -1;
-    dupped_csd = -1;
-    requests_this_child = 0;
+    ptrans = make_sub_pool(pconf);
 
     (void) update_child_status(child_num, SERVER_READY, (request_rec *) NULL);
 
@@ -3783,9 +4009,6 @@
     signal(SIGURG, timeout);
 #endif
 
-
-    pchild = make_sub_pool(NULL);
-
     while (1) {
 	BUFF *conn_io;
 	request_rec *r;
@@ -3801,16 +4024,19 @@
 	signal(SIGPIPE, timeout);
 #endif
 
-	clear_pool(pchild);
+	clear_pool(ptrans);
 
 	(void) update_child_status(child_num, SERVER_READY, (request_rec *) NULL);
 
+	/* Get job from the job list. This will block until a job is ready.
+	 * If -1 is returned then the main thread wants us to exit.
+	 */
 	csd = remove_job();
 	if (csd == -1)
 	    break;		/* time to exit */
 	requests_this_child++;
 
-	note_cleanups_for_socket(pchild, csd);
+	note_cleanups_for_socket(ptrans, csd);
 
 	/*
 	 * We now have a connection, so set it up with the appropriate
@@ -3834,7 +4060,7 @@
 	(void) update_child_status(child_num, SERVER_BUSY_READ,
 				   (request_rec *) NULL);
 
-	conn_io = bcreate(pchild, B_RDWR | B_SOCKET);
+	conn_io = bcreate(ptrans, B_RDWR | B_SOCKET);
 	dupped_csd = csd;
 #if defined(NEED_DUPPED_CSD)
 	if ((dupped_csd = dup(csd)) < 0) {
@@ -3842,11 +4068,11 @@
 			"dup: couldn't duplicate csd");
 	    dupped_csd = csd;	/* Oh well... */
 	}
-	note_cleanups_for_socket(pchild, dupped_csd);
+	note_cleanups_for_socket(ptrans, dupped_csd);
 #endif
 	bpushfd(conn_io, csd, dupped_csd);
 
-	current_conn = new_connection(pchild, server_conf, conn_io,
+	current_conn = new_connection(ptrans, server_conf, conn_io,
 				      (struct sockaddr_in *) &sa_client,
 				      (struct sockaddr_in *) &sa_server,
 				      child_num);
@@ -3880,7 +4106,7 @@
 	 * in our buffers.  If possible, try to avoid a hard close until the
 	 * client has ACKed our FIN and/or has stopped sending us data.
 	 */
-	kill_cleanups_for_socket(pchild, csd);
+	kill_cleanups_for_socket(ptrans, csd);
 
 #ifdef NO_LINGCLOSE
 	bclose(conn_io);	/* just close it */
@@ -3898,22 +4124,13 @@
 	}
 #endif
     }
-    destroy_pool(pchild);
+    destroy_pool(ptrans);
     (void) update_child_status(child_num, SERVER_DEAD, NULL);
 }
 
 
 void child_main(int child_num_arg)
 {
-
-    int srv = 0;
-    int csd = 0;
-    int dupped_csd = 0;
-    int requests_this_child = 0;
-    pool *ppool = NULL;
-
-    my_pid = getpid();
-
     /*
      * Only reason for this function, is to pass in
      * arguments to child_sub_main() on its stack so
@@ -3921,10 +4138,7 @@
      * variables and I don't need to make those
      * damn variables static/global
      */
-    child_sub_main(child_num_arg, srv,
-		   csd, dupped_csd,
-		   requests_this_child, ppool);
-
+    child_sub_main(child_num_arg);
 }
 
 
@@ -3943,9 +4157,13 @@
  * Executive routines.
  */
 
+extern void main_control_server(void *); /* in hellop.c */
+
 event *exit_event;
 mutex *start_mutex;
 
+#define MAX_SELECT_ERRORS 100
+
 void worker_main()
 {
     /*
@@ -3967,12 +4185,16 @@
     time_t end_time;
     int i;
     struct timeval tv;
-    int wait_time = 100;
+    int wait_time = 1;
     int start_exit = 0;
-    int count_down = 0;
     int start_mutex_released = 0;
     int max_jobs_per_exe;
     int max_jobs_after_exit_request;
+    HANDLE hObjects[2];
+    int count_select_errors = 0;
+    pool *pchild;
+
+    pchild = make_sub_pool(pconf);
 
     standalone = 1;
     sd = -1;
@@ -3998,7 +4220,29 @@
 
     reinit_scoreboard(pconf);
 
-    acquire_mutex(start_mutex);
+    //acquire_mutex(start_mutex);
+    hObjects[0] = (HANDLE)start_mutex;
+    hObjects[1] = (HANDLE)exit_event;
+    rv = WaitForMultipleObjects(2, hObjects, FALSE, INFINITE);
+    if (rv == WAIT_FAILED) {
+	aplog_error(APLOG_MARK,APLOG_ERR|APLOG_WIN32ERROR, server_conf,
+	    "Waiting for start_mutex or exit_event -- process will exit");
+
+	child_exit_modules(pconf, server_conf);
+	destroy_pool(pchild);
+
+	cleanup_scoreboard();
+	exit(0);
+    }
+    if (rv == WAIT_OBJECT_0 + 1) {
+	/* exit event signalled - exit now */
+	child_exit_modules(pconf, server_conf);
+	destroy_pool(pchild);
+
+	cleanup_scoreboard();
+	exit(0);
+    }
+    /* start_mutex obtained, continue into the select() loop */
 
     setup_listeners(pconf);
     set_signals();
@@ -4035,20 +4279,25 @@
 	}
     }
 
-    /* main loop */
-    for (;;) {
-	if (max_jobs_per_exe && (total_jobs > max_jobs_per_exe) && !start_exit) {
+    while (1) {
+	APD4("child PID %d: thread_main total_jobs=%d start_exit=%d",
+		my_pid, total_jobs, start_exit);
+	if ((max_jobs_per_exe && (total_jobs > max_jobs_per_exe) && !start_exit)) {
 	    start_exit = 1;
 	    wait_time = 1;
-	    count_down = max_jobs_after_exit_request;
 	    release_mutex(start_mutex);
 	    start_mutex_released = 1;
+	    APD2("process PID %d: start mutex released\n", my_pid);
 	}
 	if (!start_exit) {
 	    rv = WaitForSingleObject(exit_event, 0);
 	    ap_assert((rv == WAIT_TIMEOUT) || (rv == WAIT_OBJECT_0));
-	    if (rv == WAIT_OBJECT_0)
-		break;
+	    if (rv == WAIT_OBJECT_0) {
+		APD1("child: exit event signalled, exiting");
+		start_exit = 1;
+		/* Lets not break yet - we may have threads to clean up */
+		/* break;*/
+	    }
 	    rv = WaitForMultipleObjects(nthreads, child_handles, 0, 0);
 	    ap_assert(rv != WAIT_FAILED);
 	    if (rv != WAIT_TIMEOUT) {
@@ -4058,23 +4307,45 @@
 		break;
 	    }
 	}
+
+#if 0
+	/* Um, this made us exit before all the connections in our
+	 * listen queue were dealt with. 
+	 */
 	if (start_exit && max_jobs_after_exit_request && (count_down-- < 0))
 	    break;
+#endif
 	tv.tv_sec = wait_time;
 	tv.tv_usec = 0;
 
 	memcpy(&main_fds, &listenfds, sizeof(fd_set));
 	srv = ap_select(listenmaxfd + 1, &main_fds, NULL, NULL, &tv);
 #ifdef WIN32
-	if (srv == SOCKET_ERROR)
+	if (srv == SOCKET_ERROR) {
+	    /* Map the Win32 error into a standard Unix error condition */
 	    errno = WSAGetLastError();
+	    srv = -1;
+	}
 #endif /* WIN32 */
 
-	if (srv < 0 && errno != EINTR)
-	    aplog_error(APLOG_MARK, APLOG_ERR, server_conf, "select: (listen)");
-
-	if (srv < 0)
+	if (srv < 0) {
+	    /* Error occurred - if EINTR, loop around with problem */
+	    if (errno != EINTR) {
+		/* A "real" error occurred, log it and increment the count of
+		 * select errors. This count is used to ensure we don't go into
+		 * a busy loop of continuour errors.
+		 */
+		aplog_error(APLOG_MARK, APLOG_ERR, server_conf, "select: (listen)");
+		count_select_errors++;
+		if (count_select_errors > MAX_SELECT_ERRORS) {
+		    aplog_error(APLOG_MARK, APLOG_ERR|APLOG_NOERRNO, server_conf,
+			"Too many errors in select loop. Child process exiting.");
+		    break;
+		}
+	    }
 	    continue;
+	}
+	count_select_errors = 0;    /* reset count of errors */
 	if (srv == 0) {
 	    if (start_exit)
 		break;
@@ -4090,7 +4361,6 @@
 		sd = lr->fd;
 	    }
 	}
-
 	do {
 	    clen = sizeof(sa_client);
 	    csd = accept(sd, (struct sockaddr *) &sa_client, &clen);
@@ -4103,22 +4373,23 @@
 	} while (csd < 0 && errno == EINTR);
 
 	if (csd < 0) {
-
 #if defined(EPROTO) && defined(ECONNABORTED)
 	    if ((errno != EPROTO) && (errno != ECONNABORTED))
 #elif defined(EPROTO)
-		if (errno != EPROTO)
+	    if (errno != EPROTO)
 #elif defined(ECONNABORTED)
-		    if (errno != ECONNABORTED)
+	    if (errno != ECONNABORTED)
 #endif
-			aplog_error(APLOG_MARK, APLOG_ERR, server_conf,
-				    "accept: (client socket)");
+		aplog_error(APLOG_MARK, APLOG_ERR, server_conf,
+			    "accept: (client socket)");
 	}
 	else {
 	    add_job(csd);
 	    total_jobs++;
 	}
     }
+      
+    APD2("process PID %d exiting", my_pid);
 
     /* Get ready to shutdown and exit */
     allowed_globals.exit_now = 1;
@@ -4126,10 +4397,15 @@
 	release_mutex(start_mutex);
     }
 
+#ifdef UNGRACEFUL_RESTART
+    SetEvent(allowed_globals.thread_exit_event);
+#else
     for (i = 0; i < nthreads; i++) {
 	add_job(-1);
     }
+#endif
 
+    APD2("process PID %d waiting for worker threads to exit", my_pid);
     /* Wait for all your children */
     end_time = time(NULL) + 180;
     while (nthreads) {
@@ -4143,16 +4419,23 @@
 	break;
     }
 
+    APD2("process PID %d killing remaining worker threads", my_pid);
     for (i = 0; i < nthreads; i++) {
 	kill_thread(child_handles[i]);
 	free_thread(child_handles[i]);
     }
+#ifdef UNGRACEFUL_RESTART
+    ap_assert(CloseHandle(allowed_globals.thread_exit_event));
+#endif
     destroy_semaphore(allowed_globals.jobsemaphore);
     destroy_mutex(allowed_globals.jobmutex);
 
     child_exit_modules(pconf, server_conf);
+    destroy_pool(pchild);
 
     cleanup_scoreboard();
+
+    APD2("process PID %d exited", my_pid);
     exit(0);
 }				/* standalone_main */
 
@@ -4166,7 +4449,9 @@
 
     sprintf(buf, "%s_%d", prefix, ++(*child_num));
     _flushall();
-    *ev = create_event(0, 0, buf);
+    *ev = CreateEvent(NULL, TRUE, FALSE, buf);
+    APD2("create_event_and_spawn(): created process kill event %s", buf);
+
     ap_assert(*ev);
     pass_argv[0] = argv[0];
     pass_argv[1] = "-c";
@@ -4187,17 +4472,52 @@
 static int service_pause = 0;
 static int service_stop = 0;
 
+/**********************************************************************
+ * master_main - this is the parent (main) process. We create a
+ * child process to do the work, then sit around waiting for either
+ * the child to exit, or a restart or exit signal. If the child dies,
+ * we just respawn a new one. If we have a shutdown or graceful restart,
+ * tell the child to die when it is ready. If it is a non-graceful
+ * restart, force the child to die immediately.
+ **********************************************************************/
 
-int master_main(int argc, char **argv)
+#define MAX_PROCESSES 50 /* must be < MAX_WAIT_OBJECTS-1 */
+
+void cleanup_process(HANDLE *handles, HANDLE *events, int position, int *processes)
 {
-    /*
-     * 1. Create exit events for children
-     * 2. Spawn children
-     * 3. Wait for either of your children to die
-     * 4. Close hand to dead child & its event
-     * 5. Respawn that child
-     */
+    int i;
+    int handle = 0;
+
+    CloseHandle(handles[position]);
+    CloseHandle(events[position]);
+
+    handle = (int)handles[position];
+
+    for (i = position; i < (*processes)-1; i++) {
+	handles[i] = handles[i + 1];
+	events[i] = events[i + 1];
+    }
+    (*processes)--;
+
+    APD4("cleanup_processes: removed child in slot %d handle %d, max=%d", position, handle, *processes);
+}
 
+void create_process(HANDLE *handles, HANDLE *events, int *processes, int *child_num, char *kill_event_name, int argc, char **argv)
+{
+    int i = *processes;
+    HANDLE kill_event;
+
+    handles[i] = (HANDLE)create_event_and_spawn(argc, argv, &kill_event, child_num, kill_event_name);
+    ap_assert(handles[i] >= 0);
+    events[i] = kill_event;
+    (*processes)++;
+
+    APD4("create_processes: created child in slot %d handle %d, max=%d", 
+	(*processes)-1, handles[(*processes)-1], *processes);
+}
+
+int master_main(int argc, char **argv)
+{
     int nchild = daemons_to_start;
     event **ev;
     int *child;
@@ -4206,60 +4526,166 @@
     char buf[100];
     int i;
     time_t tmstart;
+    HANDLE signal_event;	/* used to signal shutdown/restart to parent */
+    HANDLE process_handles[MAX_PROCESSES];
+    HANDLE process_kill_events[MAX_PROCESSES];
+    int current_live_processes = 0; /* number of child process we know about */
+    int processes_to_create = 0;    /* number of child processes to create */
+    pool *pparent;  /* pool for the parent process. Cleaned on each restart */
+
+    nchild = 1;	    /* only allowed one child process for current generation */
+    processes_to_create = nchild;
+
+    is_graceful = 0;
+    ++generation;
+
+    signal_event = OpenEvent(EVENT_ALL_ACCESS, FALSE, "apache-signal");
+    if (!signal_event) {
+	aplog_error(APLOG_MARK, APLOG_EMERG|APLOG_WIN32ERROR, server_conf,
+		    "Cannot open apache-signal event");
+	exit(1);
+    }
 
     sprintf(buf, "Apache%d", getpid());
     start_mutex = create_mutex(buf);
     ev = (event **) alloca(sizeof(event *) * nchild);
-    child = (int *) alloca(sizeof(int) * nchild);
-    for (i = 0; i < nchild; i++) {
+    child = (int *) alloca(sizeof(int) * (nchild+1));
+    while (processes_to_create--) {
 	service_set_status(SERVICE_START_PENDING);
-	child[i] = create_event_and_spawn(argc, argv, &ev[i], &child_num, buf);
-	ap_assert(child[i] >= 0);
+	create_process(process_handles, process_kill_events, &current_live_processes, &child_num, buf, argc, argv);
     }
+
     service_set_status(SERVICE_RUNNING);
 
-    for (; !service_stop;) {
-	rv = WaitForMultipleObjects(nchild, (HANDLE *) child, FALSE, 2000);
-	ap_assert(rv != WAIT_FAILED);
-	if (rv == WAIT_TIMEOUT)
-	    continue;
-	cld = rv - WAIT_OBJECT_0;
-	ap_assert(rv < nchild);
-	CloseHandle((HANDLE) child[rv]);
-	CloseHandle(ev[rv]);
-	child[rv] = create_event_and_spawn(argc, argv, &ev[rv], &child_num, buf);
-	ap_assert(child[rv]);
-    }
+    restart_pending = shutdown_pending = 0;
 
-    /*
-     * Tell all your kids to stop. Wait for them for some time
-     * Kill those that haven't yet stopped
+    do { /* restart-pending */
+	if (!is_graceful) {
+	    restart_time = time(NULL);
+	}
+	clear_pool(pconf);
+	pparent = make_sub_pool(pconf);
+
+	server_conf = read_config(pconf, pparent, server_confname);
+	open_logs(server_conf, pconf);
+	init_modules(pconf, server_conf);
+	if (!is_graceful)
+	    reinit_scoreboard(pconf);
+
+	restart_pending = shutdown_pending = 0;
+
+	/* Wait for either a child process to die, or for the stop_event
+	 * to be signalled by the service manager or rpc server */
+	while (1) {
+	    /* Next line will block forever until either a child dies, or we
+	     * get signalled on the "apache-signal" event (e.g. if the user is
+	     * requesting a shutdown/restart)
+	     */
+	    if (current_live_processes == 0) {
+		/* Shouldn't happen, but better safe than sorry */
+		aplog_error(APLOG_MARK,APLOG_ERR|APLOG_NOERRNO, server_conf,
+ 			"master_main: no child processes alive! creating one");
+		create_process(process_handles, process_kill_events, &current_live_processes, &child_num, buf, argc, argv);
+		if (processes_to_create) {
+		    processes_to_create--;
+		}
+	    }
+	    process_handles[current_live_processes] = signal_event;
+	    rv = WaitForMultipleObjects(current_live_processes+1, (HANDLE *)process_handles, 
+			FALSE, INFINITE);
+	    if (rv == WAIT_FAILED) {
+		/* Something serious is wrong */
+		aplog_error(APLOG_MARK,APLOG_CRIT|APLOG_WIN32ERROR, server_conf,
+		    "WaitForMultipeObjects on process handles and apache-signal -- doing shutdown");
+		shutdown_pending = 1;
+		break;
+	    }
+	    ap_assert(rv != WAIT_TIMEOUT);
+	    cld = rv - WAIT_OBJECT_0;
+	    APD4("main process: wait finished, cld=%d handle %d (max=%d)", cld, process_handles[cld], current_live_processes);
+	    if (cld == current_live_processes) {
+		/* stop_event is signalled, we should exit now */
+		if (ResetEvent(signal_event) == 0)
+		    APD1("main process: *** ERROR: ResetEvent(stop_event) failed ***");
+		APD3("main process: stop_event signalled: shutdown_pending=%d, restart_pending=%d",
+		    shutdown_pending, restart_pending);
+		break;
+	    }
+	    ap_assert(cld < current_live_processes);
+	    cleanup_process(process_handles, process_kill_events, cld, &current_live_processes);
+	    APD2("main_process: child in slot %d died", rv);
+	    if (processes_to_create) {
+		create_process(process_handles, process_kill_events, &current_live_processes, &child_num, buf, argc, argv);
+		processes_to_create--;
+	    }
+	}
+	if (!shutdown_pending && !restart_pending) {
+	    aplog_error(APLOG_MARK,APLOG_CRIT|APLOG_NOERRNO, server_conf,
+		"master_main: no shutdown or restart variables set -- doing shutdown");
+	    shutdown_pending = 1;
+	}
+	if (shutdown_pending) {
+	    /* tell all child processes to die */
+	    for (i = 0; i < current_live_processes; i++) {
+		APD3("main process: signalling child #%d handle %d to die", i, process_handles[i]);
+		if (SetEvent(process_kill_events[i]) == 0)
+		    aplog_error(APLOG_MARK,APLOG_WIN32ERROR, server_conf,
+			"SetEvent for child process in slot #%d", i);
+	    }
+
+	    break;
+	}
+	if (restart_pending) {
+	    int children_to_kill = current_live_processes;
+
+	    APD1("--- Doing graceful restart ---");
+
+	    processes_to_create = nchild;
+	    for (i = 0; i < nchild; ++i) {
+		if (current_live_processes >= MAX_PROCESSES)
+		    break;
+		create_process(process_handles, process_kill_events, &current_live_processes, &child_num, buf, argc, argv);
+		processes_to_create--;
+	    }
+	    for (i = 0; i < children_to_kill; i++) {
+		APD3("main process: signalling child #%d handle %d to die", i, process_handles[i]);
+		if (SetEvent(process_kill_events[i]) == 0)
+		    aplog_error(APLOG_MARK,APLOG_WIN32ERROR, server_conf,
+ 			"SetEvent for child process in slot #%d", i);
+	    }
+	}
+	++generation;
+    } while (restart_pending);
+
+    /* If we dropped out of the loop we definitly want to die completely. We need to
+     * make sure we wait for all the child process to exit first.
      */
-    for (i = 0; i < nchild; i++) {
-	SetEvent(ev[i]);
-    }
 
-    for (tmstart = time(NULL); nchild && (tmstart < (time(NULL) + 60));) {
+    APD2("*** main process shutdown, processes=%d ***", current_live_processes);
+
+    CloseHandle(signal_event);
+
+    tmstart = time(NULL);
+    while (current_live_processes && ((tmstart+60) > time(NULL))) {
 	service_set_status(SERVICE_STOP_PENDING);
-	rv = WaitForMultipleObjects(nchild, (HANDLE *) child, FALSE, 2000);
-	ap_assert(rv != WAIT_FAILED);
+	rv = WaitForMultipleObjects(current_live_processes, (HANDLE *)process_handles, FALSE, 2000);
 	if (rv == WAIT_TIMEOUT)
 	    continue;
+	ap_assert(rv != WAIT_FAILED);
 	cld = rv - WAIT_OBJECT_0;
-	ap_assert(rv < nchild);
-	CloseHandle((HANDLE) child[rv]);
-	CloseHandle(ev[rv]);
-	for (i = rv; i < (nchild - 1); i++) {
-	    child[i] = child[i + 1];
-	    ev[i] = ev[i + 1];
-	}
-	nchild--;
-    }
-    for (i = 0; i < nchild; i++) {
-	TerminateProcess((HANDLE) child[i], 1);
+	ap_assert(rv < current_live_processes);
+	APD4("main_process: child in #%d handle %d died, left=%d", 
+	    rv, process_handles[rv], current_live_processes);
+	cleanup_process(process_handles, process_kill_events, cld, &current_live_processes);
+    }
+    for (i = 0; i < current_live_processes; i++) {
+	aplog_error(APLOG_MARK,APLOG_ERR|APLOG_NOERRNO, server_conf,
+ 	    "forcing termination of child #%d (handle %d)", i, process_handles[i]);
+	TerminateProcess((HANDLE) process_handles[i], 1);
     }
     service_set_status(SERVICE_STOPPED);
 
+    destroy_pool(pparent);
 
     destroy_mutex(start_mutex);
     return (0);
@@ -4314,6 +4740,7 @@
 #ifdef WIN32
 	case 'c':
 	    exit_event = open_event(optarg);
+	    APD2("child: opened process event %s", optarg);
 	    cp = strchr(optarg, '_');
 	    ap_assert(cp);
 	    *cp = 0;
diff -u --recursive ../../apachen/src/os/win32/service.c ./os/win32/service.c
--- ../../apachen/src/os/win32/service.c	Mon Nov  3 09:53:31 1997
+++ ./os/win32/service.c	Mon Dec  1 10:30:39 1997
@@ -91,7 +91,8 @@
     else
     {
         globdat.main_fn = main_fn;
-        globdat.stop_event = create_event(0, 0, NULL);
+	globdat.stop_event = CreateEvent(NULL, FALSE, FALSE, "apache-signal");
+//        globdat.stop_event = create_event(0, 0, NULL);
         globdat.stop_flag = stop;
         globdat.pause_flag = pause;
      
@@ -182,7 +183,8 @@
         //
         case SERVICE_CONTROL_STOP:
             state = SERVICE_STOP_PENDING;
-            *(globdat.stop_flag) = 1;
+//            *(globdat.stop_flag) = 1;
+	    start_shutdown();
             break;
 
         // Update the service status.

Re: win32: reason for inactive service taking 60s to shutdown

Posted by Ben Laurie <be...@algroup.co.uk>.

Paul Sutton wrote:
> I really would like to see my Win32 MT patch incorporated into the code.
> It fixes a lot of problems like these (although it isn't perfect, it is a
> lot better than the current code. In my opinion anyway).

Could you post your patch again?

Cheers,

Ben.

-- 
Ben Laurie            |Phone: +44 (181) 735 0686|Apache Group member
Freelance Consultant  |Fax:   +44 (181) 735 0689|http://www.apache.org
and Technical Director|Email: ben@algroup.co.uk |Apache-SSL author
A.L. Digital Ltd,     |http://www.algroup.co.uk/Apache-SSL
London, England.      |"Apache: TDG" http://www.ora.com/catalog/apache

Re: win32: reason for inactive service taking 60s to shutdown

Posted by Dean Gaudet <dg...@arctic.org>.

On Fri, 2 Jan 1998, Paul Sutton wrote:

> I really would like to see my Win32 MT patch incorporated into the code. 
> It fixes a lot of problems like these (although it isn't perfect, it is a
> lot better than the current code. In my opinion anyway). 

Then commit it :)  You've got a bunch +1s already I thought.   I was
certainly one of them.

Dean