Posted to users@cocoon.apache.org by Maik Dobryn <ma...@dobryn.de> on 2003/07/30 01:20:11 UTC

CLI does not follow all links while generating the offline version of the web site

Hi there,

I want to use the Cocoon 2.1m3 command line interface to generate an offline 
version of my project site.
The online version works fine but the offline version includes some broken 
links (nevertheless, brokenlinks.xml is still empty).

My site consists only of static content, for example many linked resources
that I reference using the directory generator.

Has anybody seen this kind of failure?
Could it be a problem with the resource path depth?

Thanks in advance

Maik





---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscribe@cocoon.apache.org
For additional commands, e-mail: users-help@cocoon.apache.org


Re: CLI does not follow all links while generating the offline version of the web site

Posted by Upayavira <uv...@upaya.co.uk>.
On Wed, 30 Jul 2003 19:51:00 +0200, "Maik Dobryn" <ma...@dobryn.de> said:
> 
> Upayavira,
> 
> Thank you very much for your support.
> 
> I found out that, for example, one HTML directory content list contains
> four links. Cocoon found all four links, but processed only three.
> 
> Another interesting thing: if an external resource file contains HTML
> with its own links, these links were not found by Cocoon, but were
> sometimes processed anyway.

Cocoon does not follow links beginning with http://, as it cannot
evaluate what part of the URL is relevant to Cocoon. Is the unfollowed
link an http:// link or a relative one?

Can you post the four links?

> Additionally, an upgrade to Cocoon 2.1rc1 brought no change.

There hasn't been that much change in the CLI in rc1.

Regards, Upayavira
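
One way to answer the question above about http:// versus relative links is to list the links in the generated directory page and sort them by type. The following is a minimal Python sketch and not part of Cocoon: the sample listing, its file names, and the helper names LinkCollector and classify_links are hypothetical, and treating https:// the same as http:// is an assumption.

```python
from html.parser import HTMLParser
from urllib.parse import urlparse


class LinkCollector(HTMLParser):
    """Collects the href value of every <a> tag in an HTML document."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def classify_links(html):
    """Split the links in `html` into absolute http(s) links and all others."""
    collector = LinkCollector()
    collector.feed(html)
    absolute, relative = [], []
    for link in collector.links:
        # Assumption: https:// is treated like http:// for this check.
        if urlparse(link).scheme in ("http", "https"):
            absolute.append(link)
        else:
            relative.append(link)
    return absolute, relative


# Hypothetical directory listing with four links, one of them absolute.
SAMPLE_LISTING = """
<html><body>
<a href="docs/manual.pdf">manual.pdf</a>
<a href="docs/notes.txt">notes.txt</a>
<a href="images/logo.png">logo.png</a>
<a href="http://example.org/files/archive.zip">archive.zip</a>
</body></html>
"""

if __name__ == "__main__":
    absolute, relative = classify_links(SAMPLE_LISTING)
    print("absolute:", absolute)
    print("relative:", relative)
```

Feeding the real directory page into classify_links instead of the sample would show whether the unprocessed link is the only absolute one.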



Re: CLI does not follow all links while generating the offline version of the web site

Posted by Maik Dobryn <ma...@dobryn.de>.
Upayavira,

Thank you very much for your support.

I found out that, for example, one HTML directory content list contains four
links. Cocoon found all four links, but processed only three.

Another interesting thing: if an external resource file contains HTML with
its own links, these links were not found by Cocoon, but were sometimes
processed anyway.

Additionally, an upgrade to Cocoon 2.1rc1 brought no change.

Regards, Maik







Re: CLI does not follow all links while generating the offline version of the web site

Posted by Upayavira <uv...@upaya.co.uk>.
On Wed, 30 Jul 2003 09:06:42 +0200, "Maik Dobryn" <ma...@dobryn.de> said:
> Hi Upayavira,
> 
> Thank you for the quick response.
> 
> I have a resource folder (including many subfolders) that holds more
> than 150 MB.
> The files in this folder are crawled by the Cocoon directory generator,
> which produces one HTML content list per subfolder, so every file in
> every folder is linked from a generated list.
> In the online version of my project site I can reach every file through
> these HTML lists. In the offline version, however, many of the links are
> broken because the resource files behind them were not copied to the
> offline folder.
> As a result, the new resource folder holds only 71 MB!
> 
> Any idea?

What is shown in the output of the CLI? Particularly the page that links
to your bigger folder. The number within the [xx] is the number of links
found in the page. Did it successfully find the links?

Regards, Upayavira




Re: CLI does not follow all links while generating the offline version of the web site

Posted by Maik Dobryn <ma...@dobryn.de>.
Hi Upayavira,

Thank you for the quick response.

I have a resource folder (including many subfolders) that holds more than
150 MB.
The files in this folder are crawled by the Cocoon directory generator, which
produces one HTML content list per subfolder, so every file in every folder
is linked from a generated list.
In the online version of my project site I can reach every file through these
HTML lists. In the offline version, however, many of the links are broken
because the resource files behind them were not copied to the offline folder.
As a result, the new resource folder holds only 71 MB!

Any idea?

Maik  
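
To see exactly which resource files did not make it into the offline folder, rather than only comparing total sizes, the two directory trees can be compared directly. The following is a minimal Python sketch and not part of Cocoon. It assumes the offline folder mirrors the relative layout of the source resource folder, which may not hold for every sitemap, and the function names are hypothetical.

```python
import sys
from pathlib import Path


def relative_files(root):
    """Return the set of file paths under `root`, relative to `root`."""
    root = Path(root)
    return {p.relative_to(root) for p in root.rglob("*") if p.is_file()}


def missing_files(source_dir, offline_dir):
    """List files present under `source_dir` but absent under `offline_dir`."""
    return sorted(relative_files(source_dir) - relative_files(offline_dir))


# Usage: python missing_files.py <source resource folder> <offline folder>
if __name__ == "__main__" and len(sys.argv) == 3:
    for path in missing_files(sys.argv[1], sys.argv[2]):
        print(path)
```

The printed paths can then be matched against the links in the generated HTML lists to check whether the missing files share a pattern, such as a particular subfolder or file type.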










Re: CLI does not follow all links while generating the offline version of the web site

Posted by Upayavira <uv...@upaya.co.uk>.
On Wed, 30 Jul 2003 01:20:11 +0200, "Maik Dobryn" <ma...@dobryn.de> said:
> 
> Hi there,
> 
> I want to use the Cocoon 2.1m3 command line interface to generate an
> offline version of my project site.
> The online version works fine but the offline version includes some
> broken links (nevertheless, brokenlinks.xml is still empty).
> 
> My site consists only of static content, for example many linked
> resources that I reference using the directory generator.
> 
> Has anybody seen this kind of failure?
> Could it be a problem with the resource path depth?

Maik,

Can you give me more details? How do you know that there are unreported
broken links?

At present, the CLI does an exhaustive crawl of content. It doesn't have
a maximum depth.

Regards, Upayavira
