Posted to user@nutch.apache.org by Nuther <nu...@proservice.ge> on 2007/04/19 08:29:01 UTC

nutch-0.9.release: Odd Fetcher behaviour

Hi, all.
I've just installed nutch-0.9 on my FreeBSD box and ran the fetcher.
Everything seemed to work perfectly except for one thing.
You can see it here:

2007-04-19 15:15:32,628 INFO  fetcher.Fetcher - fetching http://www.abhazia.com/news/1.0
2007-04-19 15:15:32,628 INFO  fetcher.Fetcher - fetching http://www.abhazia.com/weather/
2007-04-19 15:15:32,629 INFO  fetcher.Fetcher - fetching http://www.abhazia.com/news/rss.php
2007-04-19 15:15:32,629 INFO  fetcher.Fetcher - fetching http://www.abhazia.com/linx/+escape(document.referrer)+
2007-04-19 15:15:32,629 INFO  fetcher.Fetcher - fetching http://www.abhazia.com/sanat/
2007-04-19 15:15:32,629 INFO  fetcher.Fetcher - fetching http://www.abhazia.com/phpBB2/
2007-04-19 15:15:32,629 INFO  fetcher.Fetcher - fetching http://www.abhazia.com/news/1.2
..........................................................................................
..........................................................................................
..........................................................................................
2007-04-19 15:15:32,638 INFO  fetcher.Fetcher - fetching http://www.abhazia.com/linx/1.1
2007-04-19 15:15:32,639 INFO  fetcher.Fetcher - fetching http://www.abhazia.com/linx/1.3
2007-04-19 15:15:32,639 INFO  fetcher.Fetcher - fetching http://www.abhazia.com/geo/
2007-04-19 15:15:32,639 INFO  fetcher.Fetcher - fetching http://www.abhazia.com/news/index.php?cat=crime
2007-04-19 15:15:32,639 INFO  fetcher.Fetcher - fetching http://www.abhazia.com/news/index.php?cat=sport
2007-04-19 15:15:32,639 INFO  fetcher.Fetcher - fetching http://www.abhazia.com/news/+escape(document.referrer)+

Please note the +escape(document.referrer)+ string.
Why does Nutch include this string in the link?
Is this a bug?

P.S. nutch-0.7 doesn't show this behaviour.

Thanks,


-- 
Regards,
 Nuther                          mailto:nuther@proservice.ge

Re: nutch-0.9.release: Odd Fetcher behaviour

Posted by Nuther <nu...@proservice.ge>.
Hi, Nuther.

I'm sorry, I guess it was the site's fault.
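
A minimal sketch of how such URLs can arise, assuming the site's pages contain a script fragment that a link extractor picks up as if it were a relative href. The class name `ReferrerLinkSketch` and both helper methods are hypothetical and are not part of Nutch; the sketch only shows that resolving the raw string against the page URL yields the addresses seen in the log.

```java
import java.net.URI;

public class ReferrerLinkSketch {
    // Hypothetical helper: resolve an extracted href against the URL of
    // the page it was found on, the way any relative link is resolved.
    static String resolve(String pageUrl, String href) {
        return URI.create(pageUrl).resolve(href).toString();
    }

    // Hypothetical heuristic (an assumption, not Nutch behaviour):
    // parentheses in an href suggest script source rather than a real link.
    static boolean looksLikeScriptFragment(String href) {
        return href.contains("(") && href.contains(")");
    }

    public static void main(String[] args) {
        // The string from the log, treated as a relative link.
        String fragment = "+escape(document.referrer)+";

        System.out.println(resolve("http://www.abhazia.com/linx/", fragment));
        System.out.println(resolve("http://www.abhazia.com/news/", fragment));
        System.out.println(looksLikeScriptFragment(fragment));
    }
}
```

The first two lines printed should match the odd URLs in the fetcher log, which is consistent with the string coming from the pages themselves rather than from the fetcher.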
You wrote on 19 April 2007, 11:29:01:

> Hi, all.
> I've just installed nutch-0.9 on my FreeBSD box and ran the fetcher.
> Everything seemed to work perfectly except for one thing.
> You can see it here:

> 2007-04-19 15:15:32,628 INFO  fetcher.Fetcher - fetching http://www.abhazia.com/news/1.0
> 2007-04-19 15:15:32,628 INFO  fetcher.Fetcher - fetching http://www.abhazia.com/weather/
> 2007-04-19 15:15:32,629 INFO  fetcher.Fetcher - fetching http://www.abhazia.com/news/rss.php
> 2007-04-19 15:15:32,629 INFO  fetcher.Fetcher - fetching http://www.abhazia.com/linx/+escape(document.referrer)+
> 2007-04-19 15:15:32,629 INFO  fetcher.Fetcher - fetching http://www.abhazia.com/sanat/
> 2007-04-19 15:15:32,629 INFO  fetcher.Fetcher - fetching http://www.abhazia.com/phpBB2/
> 2007-04-19 15:15:32,629 INFO  fetcher.Fetcher - fetching http://www.abhazia.com/news/1.2
> ..........................................................................................
> ..........................................................................................
> ..........................................................................................
> 2007-04-19 15:15:32,638 INFO  fetcher.Fetcher - fetching http://www.abhazia.com/linx/1.1
> 2007-04-19 15:15:32,639 INFO  fetcher.Fetcher - fetching http://www.abhazia.com/linx/1.3
> 2007-04-19 15:15:32,639 INFO  fetcher.Fetcher - fetching http://www.abhazia.com/geo/
> 2007-04-19 15:15:32,639 INFO  fetcher.Fetcher - fetching http://www.abhazia.com/news/index.php?cat=crime
> 2007-04-19 15:15:32,639 INFO  fetcher.Fetcher - fetching http://www.abhazia.com/news/index.php?cat=sport
> 2007-04-19 15:15:32,639 INFO  fetcher.Fetcher - fetching http://www.abhazia.com/news/+escape(document.referrer)+

> Please note the +escape(document.referrer)+ string.
> Why does Nutch include this string in the link?
> Is this a bug?

> P.S. nutch-0.7 doesn't show this behaviour.

> Thanks,





-- 
Regards,
 Nuther                          mailto:nuther@proservice.ge