You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@subversion.apache.org by Jani Averbach <ja...@jaa.iki.fi> on 2005/02/09 17:20:49 UTC

SVN repositories statistics (Was: gcc evaluating svn)

On 2005-02-09 13:16+0100, Alex R. Mosteo wrote:
> I don't know if you're aware that some folks from gcc are informally 
> evaluating svn for a possible migration from cvs.
...
> http://gcc.gnu.org/ml/gcc/2005-02/msg00102.html
> http://gcc.gnu.org/ml/gcc/2005-02/msg00204.html
> http://gcc.gnu.org/ml/gcc/2005-02/msg00285.html

Boost <www.boost.org>
Re: [boost] Moving from CVS to Subversion?   
   http://lists.boost.org/MailArchives/boost/msg78409.php
Initial Subversion/RapidSVN experience
   http://lists.boost.org/MailArchives/boost/msg78376.php


NetBSD <www.netbsd.org>
Re: anoncvs problems
   http://marc.theaimsgroup.com/?l=netbsd-current-users&m=110764599704292&w=2

In this message Thor Lancelot Simon asked following:
http://marc.theaimsgroup.com/?l=netbsd-current-users&m=110774781517654&w=2
   ...
   I will repeat my question of some months ago: can anyone actually
   give an example of a repository with anywhere near as many files
   and revisions as ours, with a couple of hundred active developers
   who semi-regularly check in, and well over 100 simultaneous
   checkouts at peak periods, that is managed by Subversion?


We have had a statistic pages about "big" repositories in the past
(www/svn-repositories(2).html), but they have gone with r9756 as
defunct.

What do you think, would it make sense to start collecting something
similar, but this time the page would be devoted only for big
installations?

The relevant information would be at least the size, top revision at
the moment, average commits per day, average read/write users per day,
hardware used (CPU/RAM/IO), OS, access methods, used backend
(BDB/FSFS) and the rest of the old questions. The old questions are at
the end of this email.  For example, I would like to know these facts
of the repository of Apache Software Foundation
<http://svn.apache.org/repos/asf/>, if this information is public.

I think these kind of hard facts would be valuable for those who plan
or think switching, as addition to our 'Testimonials' page.


BR, Jani

Old questions from www/svn-repositories2.html:

1.  Your name
2.  Your email if you don't mind us putting that on the page
3.  Whether your repository is public or private
4.  A hyperlink to repository if it's public
5.  Size of the repository in MB (not your working copy)
6.  Approximate date you created the repository
7.  Current revision number of HEAD
8.  Number of users who use the repository
9.  Platform/OS that server is running on
10. Platform/OS that client(s) are using
11. What you are keeping in your Subversion repository
    (e.g. source code, docs, digital images, Word docs, etc)
12. Any other info you think is relevant

-- 
Jani Averbach

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@subversion.tigris.org
For additional commands, e-mail: dev-help@subversion.tigris.org

Re: SVN repositories statistics (Was: gcc evaluating svn)

Posted by Brad <sv...@molandernet.com>.
I had the same problem when putting together a proof of concept for my
company 6 weeks ago. Most examples I see are open source with projects
smaller than the linux kernel.  

I can tell you that my company is planning on switching over early this
summer from a proprietary product (its such a bad product, I wont
mention it). We have 250 developers (+100 testers) and just under 200k
files. Our HEAD checkout will be around 12GB for all our projects
combined. Its mostly a machine issue from what I have seen. 

Our current plan is to use a raid10 (1TB) setup on a box with 4GB of RAM
and quad opertons running a 64 bit linux kernel. It will have four
network cards that will have particular projects/users assigned to them
so that the IO load is not too high.  If that runs too slow we will
separate projects onto other boxes.  In our performance tests (we setup
some bots to peg the system) a 2.4Ghz p4 with a 5400rpm drive kept up
with 50 developers just fine. It slows down, but its still respectable.
We dont anticipate any major problems wrt machine requirements. Another
point worth mentioning is that we also plan on using bdb to start. 

Open source projects are not a very good gauge because they typically
dont peak as high. They may have more developers, but the peak IO is
much lower.  A repository of the scope that we are talking about
requires some serious hardware no matter what product you use. If the
machine or product fails, we have ~350 people picking their noses and
screaming.  We chose subversion because of its simplicity and its
potential for integration to other systems (CRM, support, tracking). 

I will definitely post our results when the conversion is completed as I
know that many large companies are looking for a canary in the coal
mine. I hope to post back around May with our results. 

Brad




On Wed, 2005-02-09 at 10:37 -0700, Jani Averbach wrote:

> On 2005-02-09 11:32-0600, Ben Collins-Sussman wrote:
> > 
> > On Feb 9, 2005, at 11:20 AM, Jani Averbach wrote:
> > >
> > >   I will repeat my question of some months ago: can anyone actually
> > >   give an example of a repository with anywhere near as many files
> > >   and revisions as ours, with a couple of hundred active developers
> > >   who semi-regularly check in, and well over 100 simultaneous
> > >   checkouts at peak periods, that is managed by Subversion?
> > >
> > 
> > How about the ASF repository?
> 
> Exactly, the point is that where a casual wanderer can find those
> facts easily?  I think it is good service for our (potential) users if
> we list (again) these facts somewhere on our www-site.
> 
> BR, Jani
> 

Re: SVN repositories statistics (Was: gcc evaluating svn)

Posted by Jani Averbach <ja...@jaa.iki.fi>.
On 2005-02-09 11:32-0600, Ben Collins-Sussman wrote:
> 
> On Feb 9, 2005, at 11:20 AM, Jani Averbach wrote:
> >
> >   I will repeat my question of some months ago: can anyone actually
> >   give an example of a repository with anywhere near as many files
> >   and revisions as ours, with a couple of hundred active developers
> >   who semi-regularly check in, and well over 100 simultaneous
> >   checkouts at peak periods, that is managed by Subversion?
> >
> 
> How about the ASF repository?

Exactly, the point is that where a casual wanderer can find those
facts easily?  I think it is good service for our (potential) users if
we list (again) these facts somewhere on our www-site.

BR, Jani

-- 
Jani Averbach


---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@subversion.tigris.org
For additional commands, e-mail: dev-help@subversion.tigris.org

Re: SVN repositories statistics (Was: gcc evaluating svn)

Posted by Jani Averbach <ja...@jaa.iki.fi>.
On 2005-02-09 11:32-0600, Ben Collins-Sussman wrote:
> 
> On Feb 9, 2005, at 11:20 AM, Jani Averbach wrote:
> >
> >   I will repeat my question of some months ago: can anyone actually
> >   give an example of a repository with anywhere near as many files
> >   and revisions as ours, with a couple of hundred active developers
> >   who semi-regularly check in, and well over 100 simultaneous
> >   checkouts at peak periods, that is managed by Subversion?
> >
> 
> How about the ASF repository?

Exactly, the point is that where a casual wanderer can find those
facts easily?  I think it is good service for our (potential) users if
we list (again) these facts somewhere on our www-site.

BR, Jani

-- 
Jani Averbach


---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscribe@subversion.tigris.org
For additional commands, e-mail: users-help@subversion.tigris.org

Re: SVN repositories statistics (Was: gcc evaluating svn)

Posted by Ben Collins-Sussman <su...@collab.net>.
On Feb 9, 2005, at 11:20 AM, Jani Averbach wrote:
>
>    I will repeat my question of some months ago: can anyone actually
>    give an example of a repository with anywhere near as many files
>    and revisions as ours, with a couple of hundred active developers
>    who semi-regularly check in, and well over 100 simultaneous
>    checkouts at peak periods, that is managed by Subversion?
>

How about the ASF repository?


---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscribe@subversion.tigris.org
For additional commands, e-mail: users-help@subversion.tigris.org

Re: List of variables/substitutions

Posted by Jani Averbach <ja...@jaa.iki.fi>.
On 2005-02-09 09:24-0800, George Garvey wrote:
>    Where is a list of substitutions found, such as $Id$? I can't
> remember this moment the real name for these things ;)

http://svnbook.red-bean.com/en/1.1/ch07s02.html#svn-ch-7-sect-2.3.4

BR, Jani

-- 
Jani Averbach


---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscribe@subversion.tigris.org
For additional commands, e-mail: users-help@subversion.tigris.org

Re: List of variables/substitutions

Posted by Toby Johnson <to...@etjohnson.us>.
George Garvey wrote:

>   Where is a list of substitutions found, such as $Id$? I can't
>remember this moment the real name for these things ;)
>  
>
Those are called keywords; they're set using the svn:keywords property.
http://svnbook.red-bean.com/en/1.0/ch07s02.html#svn-ch-7-sect-2.3.4

P.S. Please don't start a new thread by replying to an old one; it makes 
it difficult for those of us that use threaded mail clients, even if you 
change the subject. You can copy and paste the email address instead.

---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscribe@subversion.tigris.org
For additional commands, e-mail: users-help@subversion.tigris.org

List of variables/substitutions

Posted by George Garvey <tm...@inxservices.com>.
   Where is a list of substitutions found, such as $Id$? I can't
remember this moment the real name for these things ;)

---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscribe@subversion.tigris.org
For additional commands, e-mail: users-help@subversion.tigris.org

Re: SVN repositories statistics (Was: gcc evaluating svn)

Posted by Justin Erenkrantz <ju...@erenkrantz.com>.
--On Wednesday, February 9, 2005 10:20 AM -0700 Jani Averbach 
<ja...@jaa.iki.fi> wrote:

> We have had a statistic pages about "big" repositories in the past
> (www/svn-repositories(2).html), but they have gone with r9756 as
> defunct.
>
> What do you think, would it make sense to start collecting something
> similar, but this time the page would be devoted only for big
> installations?

I'd be happy to share detailed war stories with other big folks, if 
desired.  Alas, I can't keep up with all the users@ traffic.

> The relevant information would be at least the size, top revision at
> the moment, average commits per day, average read/write users per day,
> hardware used (CPU/RAM/IO), OS, access methods, used backend
> (BDB/FSFS) and the rest of the old questions. The old questions are at
> the end of this email.  For example, I would like to know these facts
> of the repository of Apache Software Foundation
> <http://svn.apache.org/repos/asf/>, if this information is public.

Over at apache.org, we're at over 150k revisions with a 6.4GB repository. 
That's with a bit more than 1/2 of our projects converted: I'd expect by 
the end of this year, CVS will be sunset.

Jakarta has started down the SVN path - we're currently devising an orderly 
migration for most of their projects.  That only leaves the XML projects as 
the big collective laggard on the migration.

We're using BDB 4.3.27 on FreeBSD 4.x.  We've been getting hit with ViewCVS 
lockup problems, but C-Mike and I have been trying to isolate this problem 
and I think we've contained it for now.

We have about 900 committers with write access, and of course, an unknown 
number of people checking out at any time.

Our repository: <http://svn.apache.org/repos/asf/>
Henk Penning has analog info: <http://www.apache.org/~henkp/>
Real-time load info is at: <http://svn.apache.org/server-status/>
Machine list: <http://www.apache.org/dev/machines.html>

(For analog, cvs is the SVN server.)

The same physical machine (minotaur) also hosts www.apache.org. 
Scalability has not been an issue yet: actually CVS is a worse performance 
drain than SVN.  Even then, we've got an SSL accelerator card that we'll be 
installing next month; but that's more because it's cool than a necessity. 
=)

HTH.  -- justin

---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscribe@subversion.tigris.org
For additional commands, e-mail: users-help@subversion.tigris.org

Re: SVN repositories statistics (Was: gcc evaluating svn)

Posted by Ben Collins-Sussman <su...@collab.net>.
On Feb 9, 2005, at 11:20 AM, Jani Averbach wrote:
>
>    I will repeat my question of some months ago: can anyone actually
>    give an example of a repository with anywhere near as many files
>    and revisions as ours, with a couple of hundred active developers
>    who semi-regularly check in, and well over 100 simultaneous
>    checkouts at peak periods, that is managed by Subversion?
>

How about the ASF repository?


---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@subversion.tigris.org
For additional commands, e-mail: dev-help@subversion.tigris.org

Re: SVN repositories statistics (Was: gcc evaluating svn)

Posted by Justin Erenkrantz <ju...@erenkrantz.com>.
--On Wednesday, February 9, 2005 10:20 AM -0700 Jani Averbach 
<ja...@jaa.iki.fi> wrote:

> We have had a statistic pages about "big" repositories in the past
> (www/svn-repositories(2).html), but they have gone with r9756 as
> defunct.
>
> What do you think, would it make sense to start collecting something
> similar, but this time the page would be devoted only for big
> installations?

I'd be happy to share detailed war stories with other big folks, if 
desired.  Alas, I can't keep up with all the users@ traffic.

> The relevant information would be at least the size, top revision at
> the moment, average commits per day, average read/write users per day,
> hardware used (CPU/RAM/IO), OS, access methods, used backend
> (BDB/FSFS) and the rest of the old questions. The old questions are at
> the end of this email.  For example, I would like to know these facts
> of the repository of Apache Software Foundation
> <http://svn.apache.org/repos/asf/>, if this information is public.

Over at apache.org, we're at over 150k revisions with a 6.4GB repository. 
That's with a bit more than 1/2 of our projects converted: I'd expect by 
the end of this year, CVS will be sunset.

Jakarta has started down the SVN path - we're currently devising an orderly 
migration for most of their projects.  That only leaves the XML projects as 
the big collective laggard on the migration.

We're using BDB 4.3.27 on FreeBSD 4.x.  We've been getting hit with ViewCVS 
lockup problems, but C-Mike and I have been trying to isolate this problem 
and I think we've contained it for now.

We have about 900 committers with write access, and of course, an unknown 
number of people checking out at any time.

Our repository: <http://svn.apache.org/repos/asf/>
Henk Penning has analog info: <http://www.apache.org/~henkp/>
Real-time load info is at: <http://svn.apache.org/server-status/>
Machine list: <http://www.apache.org/dev/machines.html>

(For analog, cvs is the SVN server.)

The same physical machine (minotaur) also hosts www.apache.org. 
Scalability has not been an issue yet: actually CVS is a worse performance 
drain than SVN.  Even then, we've got an SSL accelerator card that we'll be 
installing next month; but that's more because it's cool than a necessity. 
=)

HTH.  -- justin

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@subversion.tigris.org
For additional commands, e-mail: dev-help@subversion.tigris.org