Posted to modperl@perl.apache.org by Stas Bekman <sb...@stason.org> on 2000/04/15 16:54:41 UTC

[correction] Benchmarking Apache::Registry and Perl Content Handler

Well, following Gunther's suggestions I've rerun these tests, this time
preloading the script to make the benchmark fair. This has improved the
results significantly, narrowing the gaps.

But read both sets of tests and try to explain the phenomenon that you
will see. You will find it at the end, under the [ReaderMETA] tag. Thanks!




=head1 Benchmarking Apache::Registry and Perl Content Handler

=head2 Light (Empty) Code

First let's see the overhead that C<Apache::Registry> adds. In order to
do that we will use an almost empty script that only sends a basic
header and one word of content.

The I<registry.pl> script running under C<Apache::Registry>:

  benchmarks/registry.pl
  ----------------------
  use strict;
  print "Content-type: text/plain\r\n\r\n";
  print "Hello";

The Perl Content handler:

  Benchmark/Handler.pm
  --------------------
  package Benchmark::Handler;
  use strict;
  use Apache::Constants qw(:common);
  
  sub handler {
    my $r = shift;
    $r->send_http_header('text/html');
    $r->print("Hello");
    return OK;
  }
  1;

with settings:

  PerlModule Benchmark::Handler
  <Location /benchmark_handler>
    SetHandler perl-script
    PerlHandler Benchmark::Handler
  </Location>

so we get C<Benchmark::Handler> preloaded.

We will use the C<Apache::RegistryLoader> to preload the script as
well, so the benchmark will be fair and only the processing time will
be measured. In the I<startup.pl> we add:

  use Apache::RegistryLoader ();
  Apache::RegistryLoader->new->handler(
              "/perl/benchmarks/registry.pl",
   "/home/httpd/perl/benchmarks/registry.pl");
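
If there are many registry scripts to preload, listing each one by hand
gets tedious. Here is a sketch that preloads every I<.pl> file in the
benchmarks directory instead; the C<trans> callback and the directory
path are assumptions for illustration, so adjust them to your setup:

  use Apache::RegistryLoader ();
  use DirHandle ();
  
  # translate a URI to its filesystem path, so handler() needs
  # only the URI argument
  my $rl = Apache::RegistryLoader->new(
      trans => sub { my $uri = shift; return "/home/httpd$uri" });
  
  my $dir = "/home/httpd/perl/benchmarks";
  for my $file (DirHandle->new($dir)->read) {
      next unless $file =~ /\.pl$/;
      $rl->handler("/perl/benchmarks/$file");
  }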

If we check the I<Compiled Registry Scripts> section with the help of
L<Apache::Status|debug/Apache_Status_Embedded_Inter> (
http://localhost/perl-status?rgysubs ), we see the already compiled
script listed:

  Apache::ROOT::perl::benchmarks::registry_2epl

So now we can proceed with the benchmark:

  % ab -n 1000 -c 10 http://localhost/perl/benchmarks/registry.pl
  
  Time taken for tests:   16.148 seconds
  Requests per second:    61.93
  Connection Times (ms)
                min   avg   max
  Connect:        0     2   202
  Processing:    81   158    60
  Total:         81   160   262

  % ab -n 1000 -c 10 http://localhost/benchmark_handler
  
  Time taken for tests:   5.097 seconds
  Requests per second:    196.19
  Connection Times (ms)
                min   avg   max
  Connect:        0     0     3
  Processing:    40    50   237
  Total:         40    50   240

So we can see that the average added overhead is about:

  160 - 50 = 110 milli-seconds

per request.

=head2 Heavy Code

Of course this overhead is insignificant when the code itself is
significantly heavier and slower. Let's leave the above code examples
unmodified but add some CPU-intensive processing (it could also be an
I/O operation or a database query):

  my $x = 100;
  my $y;
  $y = log($x ** 100) for 0..10000;

  % ab -n 1000 -c 10 http://localhost/perl/benchmarks/registry.pl
  
  Time taken for tests:   82.614 seconds
  Requests per second:    12.10
  Connection Times (ms)
                min   avg   max
  Connect:        0     3   670
  Processing:   187   819  1211
  Total:        187   822  1881

  % ab -n 1000 -c 10  http://localhost/benchmark_handler
  
  Time taken for tests:   15.000 seconds
  Requests per second:    66.67
  Connection Times (ms)
                min   avg   max
  Connect:        0     2   112
  Processing:    22   147   770
  Total:         22   149   882

The processing time delta has grown to 673 milli-seconds (822-149).

[ReaderMETA]: Can anyone explain this phenomenon? It's not clear to me
why adding the same CPU-intensive code to the two handlers under test
enlarges the delta of the average processing time between the two
handlers. I'd expect to see the same delta (of 110 msec) in this case,
but that's not what's happening. Any ideas?

Notice that the hardware used in this test is not important: since we
are interested in the comparison rather than the absolute values, only
the delta matters.

The software used: Apache/1.3.10-dev (Unix), mod_perl/1.21_01-dev,
Perl 5.005_03 for i386-linux.

The relevant server configuration:

  MinSpareServers 10
  MaxSpareServers 10
  StartServers 10
  MaxClients 20
  MaxRequestsPerChild 10000



______________________________________________________________________
Stas Bekman             | JAm_pH    --    Just Another mod_perl Hacker
http://stason.org/      | mod_perl Guide  http://perl.apache.org/guide 
mailto:stas@stason.org  | http://perl.org    http://stason.org/TULARC/
http://singlesheaven.com| http://perlmonth.com http://sourcegarden.org
----------------------------------------------------------------------


Apache::Scoreboard won't compile...

Posted by Rusty Foster <ru...@kuro5hin.org>.
I'm at a loss as to what to do here... 

The system in question: 
slackware 7, 
Apache 1.3.12, 
mod_perl 1.22,
perl 5.005_03
Apache::Scoreboard 0.10

Here's what happens when I run make (perl Makefile.PL runs fine):

[mkdir blib stuff...]
cp lib/Apache/ScoreboardGraph.pm blib/lib/Apache/ScoreboardGraph.pm
cp Scoreboard.pm blib/lib/Apache/Scoreboard.pm
make[1]: Entering directory `/usr/src/Apache-Scoreboard-0.10/Dummy'
mkdir ../blib/arch/auto/Apache/DummyScoreboard
mkdir ../blib/lib/auto/Apache/DummyScoreboard
cp DummyScoreboard.pm ../blib/lib/Apache/DummyScoreboard.pm
/usr/bin/perl -I/usr/lib/perl5/i386-linux -I/usr/lib/perl5
/usr/lib/perl5/ExtUtils/xsubpp  -typemap /usr/lib/perl5/ExtUtils/typemap
-typemap /usr/lib/perl5/site_perl/i386-linux/auto/Apache/typemap
-typemap typemap DummyScoreboard.xs >xstmp.c && mv xstmp.c
DummyScoreboard.c
Please specify prototyping behavior for DummyScoreboard.xs (see perlxs
manual)
cc -c -I../ -I/usr/lib/perl5/site_perl/i386-linux/auto/Apache/include
-I/usr/lib/perl5/site_perl/i386-linux/auto/Apache/include/modules/perl
-I/usr/lib/perl5/site_perl/i386-linux/auto/Apache/include/include
-I/usr/lib/perl5/site_perl/i386-linux/auto/Apache/include/regex
-I/usr/lib/perl5/site_perl/i386-linux/auto/Apache/include/os/unix
-I/include -I/usr/local/apache/include -Dbool=char -DHAS_BOOL
-I/usr/local/include -O2    -DVERSION=\"0.04\" -DXS_VERSION=\"0.04\"
-fpic -I/usr/lib/perl5/i386-linux/CORE  DummyScoreboard.c
In file included from DummyScoreboard.xs:2:
/usr/lib/perl5/site_perl/i386-linux/auto/Apache/include/include/scoreboard.h:150:
field `start_time' has incomplete type
/usr/lib/perl5/site_perl/i386-linux/auto/Apache/include/include/scoreboard.h:151:
field `stop_time' has incomplete type
make[1]: *** [DummyScoreboard.o] Error 1
make[1]: Leaving directory `/usr/src/Apache-Scoreboard-0.10/Dummy'
make: *** [subdirs] Error 2


Note in particular the "Please specify prototyping behavior for
DummyScoreboard.xs" and the "incomplete type" warnings. Does anyone know
what this problem is, or even know where I should start looking? Thanks
in advance.

--R
-- 
=======================================================================
|      Rusty Foster       | "You can never entirely stop being what   |
|   rusty@kuro5hin.org    | you once were. That's why it's important  |
|    rusty@intes.net      | to be the right person today, and not put |
| http://www.kuro5hin.org | it off till tomorrow."        -Larry Wall |
=======================================================================

Re: [3rd correction] Benchmarking Apache::Registry and Perl Content Handler

Posted by Joshua Chamas <jo...@chamas.com>.
Stas Bekman wrote:
> 
> Geez, mod_perl rocks!!!! When I ran the benchmarks on the *strong*
> machine the results were all different!
> 700+ RPS for the hello benchmark!!!
> 
> The only mystery left is why registry is so slow on the slow machine
> relative to the fast machine??? And yeah, I've used the latest apache/mod_perl
> 

The only ideas I have about the difference are mostly 
hardware related... CPU L1/L2 cache sizes may differ,
and the cache on your slow CPU may be so small that you end up
losing much of the time to your (possibly also slower) RAM & bus.

Also, since you had less RAM on your slower box, you
might have been swapping, though I doubt you would have 
let that get by you.

--Joshua

[3rd correction] Benchmarking Apache::Registry and Perl Content Handler

Posted by Stas Bekman <sb...@stason.org>.
Geez, mod_perl rocks!!!! When I ran the benchmarks on the *strong*
machine the results were all different! 
700+ RPS for the hello benchmark!!!

The only mystery left is why registry is so slow on the slow machine
relative to the fast machine??? And yeah, I've used the latest apache/mod_perl

Enough exclamation marks, see it for yourself. I'll repost the
Apache::Request vs. CGI.pm comparison on the strong machine tomorrow.

=head1 Benchmarking

Now we will run different benchmarks to learn which techniques should
be used and which should not. The following software and hardware were
used for the tests:

  HW: Dual Pentium II (Deschutes) 400Mhz 512 KB cache 256MB 
      RAM (DIMM PC100)

  SW: Linux (RH 6.1) Perl 5.005_03
      Apache/1.3.12 mod_perl/1.22 mod_ssl/2.6.2 OpenSSL/0.9.5

The relevant Apache configuration:

  MinSpareServers 10
  MaxSpareServers 20
  StartServers 10
  MaxClients 20
  MaxRequestsPerChild 10000


=head2 Apache::Registry and Perl Content Handler


=head3 The Light (Empty) Code

First let's see the overhead that C<Apache::Registry> adds. In order to
do that we will use an almost empty script that only sends a basic
header and one word of content.

The I<registry.pl> script running under C<Apache::Registry>:

  benchmarks/registry.pl
  ----------------------
  use strict;
  print "Content-type: text/plain\r\n\r\n";
  print "Hello";

The Perl Content handler:

  Benchmark/Handler.pm
  --------------------
  package Benchmark::Handler;
  use strict;
  use Apache::Constants qw(:common);
  
  sub handler {
    my $r = shift;
    $r->send_http_header('text/html');
    $r->print("Hello");
    return OK;
  }
  1;

with settings:

  PerlModule Benchmark::Handler
  <Location /benchmark_handler>
    SetHandler perl-script
    PerlHandler Benchmark::Handler
  </Location>

so we get C<Benchmark::Handler> preloaded.

We will use the C<Apache::RegistryLoader> to preload the script as
well, so the benchmark will be fair and only the processing time will
be measured. In the I<startup.pl> we add:

  use Apache::RegistryLoader ();
  Apache::RegistryLoader->new->handler(
              "/perl/benchmarks/registry.pl",
   "/home/httpd/perl/benchmarks/registry.pl");

If we check the I<Compiled Registry Scripts> section with the help of
L<Apache::Status|debug/Apache_Status_Embedded_Inter> (
http://localhost/perl-status?rgysubs ), we see the already compiled
script listed:

  Apache::ROOT::perl::benchmarks::registry_2epl

=head3 The Heavy Code

We will see that this overhead is insignificant when the code itself
is significantly heavier and slower. Let's leave the above code
examples unmodified but add some CPU-intensive processing (it could
also be an I/O operation or a database query):

  my $x = 100;
  my $y;
  $y = log($x ** 100) for 0..10000;


=head3 Processing and Results

So now we can proceed with the benchmark. We will generate 5000
requests with a concurrency level of 10 (i.e. emulating 10 concurrent
users):

  % ab -n 5000 -c 10 http://localhost/perl/benchmarks/registry.pl
  % ab -n 5000 -c 10 http://localhost/benchmark_handler

And the results:


  Light code:

    Type        RPS     Av.CTime
    --------    ----    --------
    Registry    561        16
    Handler     707        13

  Heavy code:

    Type        RPS     Av.CTime
    --------    ----    --------
    Registry     68       146
    Handler      70       141

  Legend:
  ---------------------------------------------------------------
  RPS      : Requests Per Second
  Av.CTime : Average request processing time (msec) as seen by the
             client

=head3 Conclusions

=over 

=item * The Light Code

We can see that the average overhead added by C<Apache::Registry> is
about:

  16 - 13 = 3 milli-seconds

per request.

In relative terms, the difference in speed is about 19%.

=item * The Heavy Code

Looking at the average processing time, we see that the delta between
the two handlers has stayed almost the same, growing only from 3 msec
to 5 msec. This means that the identical heavy code that was added ran
for about 130 msec (146-16). It doesn't mean that the added code itself
consumed 130 msec of CPU; it means that it took 130 msec for this code
to complete in a multi-process environment, where each process gets
only a time slice of the CPU.

If we time this extra code on its own with the C<Benchmark> module:

  benchmark.pl
  ------------
  use Benchmark;
  
  timethis(1_000,
    sub {
      my $x = 100;
      my $y;
      $y = log($x ** 100) for 0..10000;
    });

  % perl benchmark.pl
  timethis 1000: 25 wallclock secs (24.93 usr +  0.00 sys = 24.93 CPU)

We see that it takes about 25 CPU seconds to complete the 1000
iterations, i.e. roughly 25 msec of CPU time per invocation.
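
A back-of-the-envelope check makes the 130 msec plausible. Assuming
the requests are purely CPU-bound and the scheduler divides the two
CPUs fairly among the ten concurrent clients (both assumptions, not
measurements):

  # 25 CPU seconds for 1000 runs => 25 msec of CPU per invocation
  my $cpu_msec = 25_000 / 1_000;      # 25
  my $clients  = 10;                  # ab -c 10
  my $cpus     = 2;                   # dual Pentium II
  my $expected = $cpu_msec * $clients / $cpus;
  print "$expected\n";                # 125 -- close to the observed
                                      # ~130 msec

So each request's wall-clock time is roughly its CPU cost multiplied by
the number of clients sharing each CPU.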

The interesting thing is that when the server under test runs on a
slow machine the results are completely different. I'll present them
here for comparison:

  Light code:

    Type        RPS     Av.CTime
    --------    ----    --------
    Registry     61       160
    Handler     196        50

  Heavy code:

    Type        RPS     Av.CTime
    --------    ----    --------
    Registry     12       822
    Handler      67       149

[ReaderMETA]: Can anyone explain this phenomenon? It's not clear to me
why adding the same CPU-intensive code to the two handlers under test
on the slow machine enlarges the delta of the average processing time
between the two handlers. I'd expect to see the same delta (of
110 msec) in this case, but that's not what's happening. Any ideas?


=back

