You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by Rozina Sorathia <Ro...@KPITCummins.com> on 2005/09/03 11:38:52 UTC
Link Analysis Score..
I wanted to know where exactly the Link Analysis Score is calculated
...Is there any code snippet available.?
How is the Link Analysis Score affecting the overall final score of the
document?
Rozina Sorathia,
Systems Executive,
KPIT Cummins Infosystems Ltd.,
rozinas@kpitcummins.com
RE: httpd/unix-directory
Posted by EM <em...@cpuedge.com>.
The issue happened quite a lot with my last fetchlist (I'm using the
official 0.7), the next time it happens I can send you a list of urls if you
like?
-----Original Message-----
From: Michael Nebel [mailto:michael@nebel.de]
Sent: Tuesday, September 06, 2005 2:42 PM
To: nutch-user@lucene.apache.org
Subject: Re: httpd/unix-directory
Hi,
looking at my apache, i get directory-listings as "Content-Type:
text-html" not "httpd/unix-directory"... What kind of server are you
crawling?
Regards
Michael
EM wrote:
> Shouldn't "httpd/unix-directory" be parsed?
>
> Message from the logs:
> fetch okay, but can't parse http://<xxxxxxxxxx>/, reason: failed(2,203):
> Content-Type not text/html: httpd/unix-directory
--
Michael Nebel
http://www.nebel.de/
http://www.netluchs.de/
Re: httpd/unix-directory
Posted by Michael Nebel <mi...@nebel.de>.
Hi,
looking at my apache, i get directory-listings as "Content-Type:
text-html" not "httpd/unix-directory"... What kind of server are you
crawling?
Regards
Michael
EM wrote:
> Shouldn't "httpd/unix-directory" be parsed?
>
> Message from the logs:
> fetch okay, but can't parse http://<xxxxxxxxxx>/, reason: failed(2,203):
> Content-Type not text/html: httpd/unix-directory
--
Michael Nebel
http://www.nebel.de/
http://www.netluchs.de/
httpd/unix-directory
Posted by EM <em...@cpuedge.com>.
Shouldn't "httpd/unix-directory" be parsed?
Message from the logs:
fetch okay, but can't parse http://<xxxxxxxxxx>/, reason: failed(2,203):
Content-Type not text/html: httpd/unix-directory
Re: Link Analysis Score..
Posted by Piotr Kosiorowski <pk...@gmail.com>.
There are many ways nutch can boost document in the index. But I suspect
you are refereing to analyze process - it uses PagrReank computation for
page score. For details read DistributedAnalysisTool - especially
computeRound method.
Regards
Piotr
Rozina Sorathia wrote:
> I wanted to know where exactly the Link Analysis Score is calculated …Is
> there any code snippet available.?
>
> How is the Link Analysis Score affecting the overall final score of the
> document?
>
>
>
>
>
>
>
> //Rozina Sorathia,//
>
> //Systems Executive,//
>
> //KPIT Cummins Infosystems Ltd.,//
>
> //rozinas@kpitcummins.com <ma...@kpitcummins.com>//
>
>
>
>
>