You are viewing a plain text version of this content. The canonical link for it is here.
Posted to general@incubator.apache.org by Tony Stevenson <to...@pc-tony.com> on 2008/04/22 18:22:36 UTC

Size of websites in incubator.apache.org

Good day,

As part of rolling out the new backup server for the infra team, I have 
discovered that several podling sites are extremely large.

Namely:

119M    /x1/www/incubator.apache.org/activemq
324M    /x1/www/incubator.apache.org/cxf
102M    /x1/www/incubator.apache.org/directory
166M    /x1/www/incubator.apache.org/lucene.net
587M    /x1/www/incubator.apache.org/openjpa
299M    /x1/www/incubator.apache.org/servicemix
166M    /x1/www/incubator.apache.org/uima


I am singling out all sites that over 100MB in size here.  Can someone 
please check the contents of these directories?  I appreciate that some 
of them have graduated from the incubator and as such, these datasets 
are either redundant or should be archived.

I would appreciate a definitive directive as to what should be done with 
  these directories.

I will also be updating the documentation on how to handle 
graduation/removal from the incubator.  I'll send an update once this 
has been done too.


Cheers,
Tony




---------------------------------------------------------------------
To unsubscribe, e-mail: general-unsubscribe@incubator.apache.org
For additional commands, e-mail: general-help@incubator.apache.org


Re: Size of websites in incubator.apache.org

Posted by Erik Hatcher <er...@ehatchersolutions.com>.
>>  166M    /x1/www/incubator.apache.org/lucene.net

i've just removed some (large) old docs.

	Erik


---------------------------------------------------------------------
To unsubscribe, e-mail: general-unsubscribe@incubator.apache.org
For additional commands, e-mail: general-help@incubator.apache.org


Re: Size of websites in incubator.apache.org

Posted by Guillaume Nodet <gn...@gmail.com>.
On Tue, Apr 22, 2008 at 6:22 PM, Tony Stevenson <to...@pc-tony.com> wrote:
> Good day,
>
>  As part of rolling out the new backup server for the infra team, I have
> discovered that several podling sites are extremely large.
>
>  Namely:
>
>  119M    /x1/www/incubator.apache.org/activemq
>  324M    /x1/www/incubator.apache.org/cxf
>  102M    /x1/www/incubator.apache.org/directory
>  166M    /x1/www/incubator.apache.org/lucene.net
>  587M    /x1/www/incubator.apache.org/openjpa
>  299M    /x1/www/incubator.apache.org/servicemix

incubator.apache.org/servicemix is already redirecting to servicemix.apache.org
I'll clean the remove the content of the directory asap.

>  166M    /x1/www/incubator.apache.org/uima
>
>
>  I am singling out all sites that over 100MB in size here.  Can someone
> please check the contents of these directories?  I appreciate that some of
> them have graduated from the incubator and as such, these datasets are
> either redundant or should be archived.
>
>  I would appreciate a definitive directive as to what should be done with
> these directories.
>
>  I will also be updating the documentation on how to handle
> graduation/removal from the incubator.  I'll send an update once this has
> been done too.
>
>
>  Cheers,
>  Tony
>
>
>
>
>  ---------------------------------------------------------------------
>  To unsubscribe, e-mail: general-unsubscribe@incubator.apache.org
>  For additional commands, e-mail: general-help@incubator.apache.org
>
>



-- 
Cheers,
Guillaume Nodet
------------------------
Blog: http://gnodet.blogspot.com/

---------------------------------------------------------------------
To unsubscribe, e-mail: general-unsubscribe@incubator.apache.org
For additional commands, e-mail: general-help@incubator.apache.org


Re: Size of websites in incubator.apache.org

Posted by Emmanuel Lecharny <el...@apache.org>.
Tony Stevenson wrote:
> Good day,
Hi

> 102M    /x1/www/incubator.apache.org/directory

we have exited the incubator 3 years ago ... This directory can be 
archived or removed at will.

Thanks !

-- 
--
cordialement, regards,
Emmanuel Lécharny
www.iktek.com
directory.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: general-unsubscribe@incubator.apache.org
For additional commands, e-mail: general-help@incubator.apache.org


Re: Size of websites in incubator.apache.org

Posted by Craig L Russell <Cr...@Sun.COM>.
In case it wasn't clear, the incubator/openjpa web site already  
redirects to the live site.

There's no need to preserve it. It's obsolete.

Craig

On Apr 22, 2008, at 9:32 PM, Craig L Russell wrote:

> With 587 MB, OpenJPA wins and is still champion. ;-)
>
> It's ok to redirect the incubator site to openjpa.apache.org.
>
> Craig
>
> On Apr 22, 2008, at 10:03 AM, Robert Burrell Donkin wrote:
>
>> On Tue, Apr 22, 2008 at 5:22 PM, Tony Stevenson <to...@pc-tony.com>  
>> wrote:
>>> Good day,
>>>
>>> As part of rolling out the new backup server for the infra team, I  
>>> have
>>> discovered that several podling sites are extremely large.
>>>
>>> Namely:
>>>
>>> 119M    /x1/www/incubator.apache.org/activemq
>>
>> graduated -> activemq.apache.org
>>
>>> 324M    /x1/www/incubator.apache.org/cxf
>>
>> IIRC graduating -> cxf.apache.org
>>
>>> 102M    /x1/www/incubator.apache.org/directory
>>
>> graduated -> directory.apache.org
>>
>>> 166M    /x1/www/incubator.apache.org/lucene.net
>>> 587M    /x1/www/incubator.apache.org/openjpa
>>
>> graduated -> openjpa.apache.org
>>
>>> 299M    /x1/www/incubator.apache.org/servicemix
>>
>> graduated -> servicemix.apache.org
>>
>>> 166M    /x1/www/incubator.apache.org/uima
>>
>> still here :-)
>>
>>> I am singling out all sites that over 100MB in size here.  Can  
>>> someone
>>> please check the contents of these directories?  I appreciate that  
>>> some of
>>> them have graduated from the incubator and as such, these datasets  
>>> are
>>> either redundant or should be archived.
>>>
>>> I would appreciate a definitive directive as to what should be  
>>> done with
>>> these directories.
>>
>> IMHO graduate websites should be deleted but probably polite to  
>> inform
>> PMCs first
>>
>>> I will also be updating the documentation on how to handle
>>> graduation/removal from the incubator.  I'll send an update once  
>>> this has
>>> been done too.
>>
>> great
>>
>> - robert
>>
>> ---------------------------------------------------------------------
>> To unsubscribe, e-mail: general-unsubscribe@incubator.apache.org
>> For additional commands, e-mail: general-help@incubator.apache.org
>>
>
> Craig Russell
> Architect, Sun Java Enterprise System http://java.sun.com/products/jdo
> 408 276-5638 mailto:Craig.Russell@sun.com
> P.S. A good JDO? O, Gasp!
>

Craig Russell
Architect, Sun Java Enterprise System http://java.sun.com/products/jdo
408 276-5638 mailto:Craig.Russell@sun.com
P.S. A good JDO? O, Gasp!


Re: Size of websites in incubator.apache.org

Posted by Craig L Russell <Cr...@Sun.COM>.
With 587 MB, OpenJPA wins and is still champion. ;-)

It's ok to redirect the incubator site to openjpa.apache.org.

Craig

On Apr 22, 2008, at 10:03 AM, Robert Burrell Donkin wrote:

> On Tue, Apr 22, 2008 at 5:22 PM, Tony Stevenson <to...@pc-tony.com>  
> wrote:
>> Good day,
>>
>> As part of rolling out the new backup server for the infra team, I  
>> have
>> discovered that several podling sites are extremely large.
>>
>> Namely:
>>
>> 119M    /x1/www/incubator.apache.org/activemq
>
> graduated -> activemq.apache.org
>
>> 324M    /x1/www/incubator.apache.org/cxf
>
> IIRC graduating -> cxf.apache.org
>
>> 102M    /x1/www/incubator.apache.org/directory
>
> graduated -> directory.apache.org
>
>> 166M    /x1/www/incubator.apache.org/lucene.net
>> 587M    /x1/www/incubator.apache.org/openjpa
>
> graduated -> openjpa.apache.org
>
>> 299M    /x1/www/incubator.apache.org/servicemix
>
> graduated -> servicemix.apache.org
>
>> 166M    /x1/www/incubator.apache.org/uima
>
> still here :-)
>
>> I am singling out all sites that over 100MB in size here.  Can  
>> someone
>> please check the contents of these directories?  I appreciate that  
>> some of
>> them have graduated from the incubator and as such, these datasets  
>> are
>> either redundant or should be archived.
>>
>> I would appreciate a definitive directive as to what should be done  
>> with
>> these directories.
>
> IMHO graduate websites should be deleted but probably polite to inform
> PMCs first
>
>> I will also be updating the documentation on how to handle
>> graduation/removal from the incubator.  I'll send an update once  
>> this has
>> been done too.
>
> great
>
> - robert
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: general-unsubscribe@incubator.apache.org
> For additional commands, e-mail: general-help@incubator.apache.org
>

Craig Russell
Architect, Sun Java Enterprise System http://java.sun.com/products/jdo
408 276-5638 mailto:Craig.Russell@sun.com
P.S. A good JDO? O, Gasp!


Re: Size of websites in incubator.apache.org

Posted by Robert Burrell Donkin <ro...@gmail.com>.
On Tue, Apr 22, 2008 at 5:22 PM, Tony Stevenson <to...@pc-tony.com> wrote:
> Good day,
>
>  As part of rolling out the new backup server for the infra team, I have
> discovered that several podling sites are extremely large.
>
>  Namely:
>
>  119M    /x1/www/incubator.apache.org/activemq

graduated -> activemq.apache.org

>  324M    /x1/www/incubator.apache.org/cxf

IIRC graduating -> cxf.apache.org

>  102M    /x1/www/incubator.apache.org/directory

graduated -> directory.apache.org

>  166M    /x1/www/incubator.apache.org/lucene.net
>  587M    /x1/www/incubator.apache.org/openjpa

graduated -> openjpa.apache.org

>  299M    /x1/www/incubator.apache.org/servicemix

graduated -> servicemix.apache.org

>  166M    /x1/www/incubator.apache.org/uima

still here :-)

>  I am singling out all sites that over 100MB in size here.  Can someone
> please check the contents of these directories?  I appreciate that some of
> them have graduated from the incubator and as such, these datasets are
> either redundant or should be archived.
>
>  I would appreciate a definitive directive as to what should be done with
> these directories.

IMHO graduate websites should be deleted but probably polite to inform
PMCs first

>  I will also be updating the documentation on how to handle
> graduation/removal from the incubator.  I'll send an update once this has
> been done too.

great

- robert

---------------------------------------------------------------------
To unsubscribe, e-mail: general-unsubscribe@incubator.apache.org
For additional commands, e-mail: general-help@incubator.apache.org


Re: Size of websites in incubator.apache.org

Posted by Daniel Kulp <dk...@apache.org>.
OK.  I've gone ahead and updated the graduation guide to point to the top 
level .htaccess file.  (I hope no one minds someone not on the incubator 
PMC updating that.)

I also added several of the projects to the .htaccess file.  Basically, 
the graduated projects that used a simple .htaccess or a meta-refresh 
(yes, a couple are doing that, ick) I've put in the .htaccess.  Thus, 
their directories could be removed.   

However, a couple projects are using an .htaccess that is much more 
complex than a simple "one liner" so I left them as is.  (example: 
ftpserver)

Dan


On Wednesday 23 April 2008, Justin Erenkrantz wrote:
> On Wed, Apr 23, 2008 at 11:26 AM, Robert Burrell Donkin
>
> <ro...@gmail.com> wrote:
> >  AIUI using the top level .htaccess is better for performance so
> > that's what i recommend (but hopefully someone will jump in and
> > correct me if
>
> +1.
>
> (Not having .htaccess at all is actually best; but that requires us
> tweaking the master httpd conf files whenever a PMC wants a redirect -
> doable but feh.)  -- justin
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: general-unsubscribe@incubator.apache.org
> For additional commands, e-mail: general-help@incubator.apache.org



-- 
J. Daniel Kulp
Principal Engineer, IONA
dkulp@apache.org
http://www.dankulp.com/blog

---------------------------------------------------------------------
To unsubscribe, e-mail: general-unsubscribe@incubator.apache.org
For additional commands, e-mail: general-help@incubator.apache.org


Re: Size of websites in incubator.apache.org

Posted by Justin Erenkrantz <ju...@erenkrantz.com>.
On Wed, Apr 23, 2008 at 11:26 AM, Robert Burrell Donkin
<ro...@gmail.com> wrote:
>  AIUI using the top level .htaccess is better for performance so that's
>  what i recommend (but hopefully someone will jump in and correct me if

+1.

(Not having .htaccess at all is actually best; but that requires us
tweaking the master httpd conf files whenever a PMC wants a redirect -
doable but feh.)  -- justin

---------------------------------------------------------------------
To unsubscribe, e-mail: general-unsubscribe@incubator.apache.org
For additional commands, e-mail: general-help@incubator.apache.org


Re: Size of websites in incubator.apache.org

Posted by Robert Burrell Donkin <ro...@gmail.com>.
On Wed, Apr 23, 2008 at 7:20 PM, Daniel Kulp <dk...@apache.org> wrote:
>
>  I was adding cxf to the site-publish/.htaccess file and was going to go
>  ahead and add the others that have graduated, but I want to double check
>  something first.
>
>  Several of the graduated projects have created their own .htaccess in
>  their project directory wrather that use the top level .htaccess.
>  Example: servicemix/.htaccess
>
>  The question is: is it better to leave it like that or move them to the
>  top level .htaccess and completely remove the project directory?  I
>  don't really care which, but consistency is probably good and which ever
>  way we go, it should be documented in the post graduation checklist
>  stuff.

AIUI using the top level .htaccess is better for performance so that's
what i recommend (but hopefully someone will jump in and correct me if
i'm wrong)

- robert

---------------------------------------------------------------------
To unsubscribe, e-mail: general-unsubscribe@incubator.apache.org
For additional commands, e-mail: general-help@incubator.apache.org


Re: Size of websites in incubator.apache.org

Posted by Daniel Kulp <dk...@apache.org>.
I was adding cxf to the site-publish/.htaccess file and was going to go 
ahead and add the others that have graduated, but I want to double check 
something first.

Several of the graduated projects have created their own .htaccess in 
their project directory wrather that use the top level .htaccess.  
Example: servicemix/.htaccess

The question is: is it better to leave it like that or move them to the 
top level .htaccess and completely remove the project directory?  I 
don't really care which, but consistency is probably good and which ever 
way we go, it should be documented in the post graduation checklist 
stuff.

Dan




On Tuesday 22 April 2008, Tony Stevenson wrote:
> Good day,
>
> As part of rolling out the new backup server for the infra team, I
> have discovered that several podling sites are extremely large.
>
> Namely:
>
> 119M    /x1/www/incubator.apache.org/activemq
> 324M    /x1/www/incubator.apache.org/cxf
> 102M    /x1/www/incubator.apache.org/directory
> 166M    /x1/www/incubator.apache.org/lucene.net
> 587M    /x1/www/incubator.apache.org/openjpa
> 299M    /x1/www/incubator.apache.org/servicemix
> 166M    /x1/www/incubator.apache.org/uima
>
>
> I am singling out all sites that over 100MB in size here.  Can someone
> please check the contents of these directories?  I appreciate that
> some of them have graduated from the incubator and as such, these
> datasets are either redundant or should be archived.
>
> I would appreciate a definitive directive as to what should be done
> with these directories.
>
> I will also be updating the documentation on how to handle
> graduation/removal from the incubator.  I'll send an update once this
> has been done too.
>
>
> Cheers,
> Tony
>
>
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: general-unsubscribe@incubator.apache.org
> For additional commands, e-mail: general-help@incubator.apache.org



-- 
J. Daniel Kulp
Principal Engineer, IONA
dkulp@apache.org
http://www.dankulp.com/blog

---------------------------------------------------------------------
To unsubscribe, e-mail: general-unsubscribe@incubator.apache.org
For additional commands, e-mail: general-help@incubator.apache.org


Re: Size of websites in incubator.apache.org

Posted by Robert Burrell Donkin <ro...@gmail.com>.
On Tue, Apr 22, 2008 at 11:02 PM, Justin Erenkrantz
<ju...@erenkrantz.com> wrote:
> On Tue, Apr 22, 2008 at 12:53 PM, Tony Stevenson <to...@pc-tony.com> wrote:
>  > Justin
>  >
>  >  Is the prefereed method of handling these, moving them to archive.a.o/incubator.a.o/($podling)/ ?? Or is there an alternate location?

probably not

>  If the projects have indeed graduated, and they already have
>  $podling.apache.org up, and no one responds to clean them up within,
>  say, a week, I'd toss 'em entirely and enforce a redirect from the old
>  incubator.apache.org URL to the new <tlp>.apache.org site.

+1

>  If you feel charitable and want to save the artifacts on the backup
>  box somewhere (since they're already copied over) for a little while
>  longer, feel free...but, IMO, we don't need to persist these sites.

+1

- robert

---------------------------------------------------------------------
To unsubscribe, e-mail: general-unsubscribe@incubator.apache.org
For additional commands, e-mail: general-help@incubator.apache.org


Re: Size of websites in incubator.apache.org

Posted by Justin Erenkrantz <ju...@erenkrantz.com>.
On Tue, Apr 22, 2008 at 12:53 PM, Tony Stevenson <to...@pc-tony.com> wrote:
> Justin
>
>  Is the prefereed method of handling these, moving them to archive.a.o/incubator.a.o/($podling)/ ?? Or is there an alternate location?

If the projects have indeed graduated, and they already have
$podling.apache.org up, and no one responds to clean them up within,
say, a week, I'd toss 'em entirely and enforce a redirect from the old
incubator.apache.org URL to the new <tlp>.apache.org site.

If you feel charitable and want to save the artifacts on the backup
box somewhere (since they're already copied over) for a little while
longer, feel free...but, IMO, we don't need to persist these sites.
-- justin

---------------------------------------------------------------------
To unsubscribe, e-mail: general-unsubscribe@incubator.apache.org
For additional commands, e-mail: general-help@incubator.apache.org


Re: Size of websites in incubator.apache.org

Posted by Tony Stevenson <to...@pc-tony.com>.
Justin

Is the prefereed method of handling these, moving them to archive.a.o/incubator.a.o/($podling)/ ?? Or is there an alternate location?


Tony
Sent from my BlackBerry® wireless device

-----Original Message-----
From: "Justin Erenkrantz" <ju...@erenkrantz.com>

Date: Tue, 22 Apr 2008 10:59:05 
To:general@incubator.apache.org
Subject: Re: Size of websites in incubator.apache.org


On Tue, Apr 22, 2008 at 10:56 AM, Marshall Schor <ms...@schor.com> wrote:
>  Based on this, I would like to keep things as they are, unless there is a
> new conclusion about where things like documentation should go.

Nah - that's fine.  The issue is the TLPs that have graduated and left
a bunch of stuff in their incubator dirs.  -- justin

---------------------------------------------------------------------
To unsubscribe, e-mail: general-unsubscribe@incubator.apache.org
For additional commands, e-mail: general-help@incubator.apache.org


Re: Size of websites in incubator.apache.org

Posted by Justin Erenkrantz <ju...@erenkrantz.com>.
On Tue, Apr 22, 2008 at 10:56 AM, Marshall Schor <ms...@schor.com> wrote:
>  Based on this, I would like to keep things as they are, unless there is a
> new conclusion about where things like documentation should go.

Nah - that's fine.  The issue is the TLPs that have graduated and left
a bunch of stuff in their incubator dirs.  -- justin

---------------------------------------------------------------------
To unsubscribe, e-mail: general-unsubscribe@incubator.apache.org
For additional commands, e-mail: general-help@incubator.apache.org


Re: Size of websites in incubator.apache.org

Posted by Marshall Schor <ms...@schor.com>.
Tony Stevenson wrote:
> Good day,
>
> As part of rolling out the new backup server for the infra team, I 
> have discovered that several podling sites are extremely large.
>
> Namely:
>
> ...
> 166M    /x1/www/incubator.apache.org/uima
I checked into this and discovered that > 85% of the space is due to our 
keeping various kinds of documentation for our releases on our website, 
including ~ 40 MB for the Javadoc API documentation of the "current" 
release. 

We keep past release documentation here (but not the API Docs), for 2 
other past releases - these take ~ 40 MB. 

We ended up keeping our documentation in SVN and checking it out onto 
the website, after a long discussion of pros/cons, ending with this in a 
note from Robert Burrell Donkin, concerning where to keep the 
documentation and Javadocs, in which he said:

... <snip>
i talked it over the the infra team and their strong recommendation
was to store in svn and then checkout onto the website
... <snip>

You can see the whole email thread here:
http://www.mail-archive.com/uima-dev@incubator.apache.org/msg05150.html

Based on this, I would like to keep things as they are, unless there is 
a new conclusion about where things like documentation should go.

-Marshall Schor

---------------------------------------------------------------------
To unsubscribe, e-mail: general-unsubscribe@incubator.apache.org
For additional commands, e-mail: general-help@incubator.apache.org