Posted to dev@nutch.apache.org by Anton Potehin <an...@orbita1.ru> on 2005/11/08 11:22:43 UTC
rank system
What about scoring in mapred? I have looked at crawl/crawl.java, but I did
not find anything concerned with calculating page scores. Does
mapred use a ranking system at all?
Is it possible to use mapred for clustered whole-web crawling, or does it
work with intranet crawling only?
RE: rank system
Posted by an...@orbita1.ru.
Alright, I see that in crawl/Indexer.java, in the "reduce" method, there is a dbDatum object
which contains a score. But where is this score calculated?
What formula is used to calculate the score?
-----Original Message-----
From: Stefan Groschupf [mailto:sg@media-style.com]
Sent: Tuesday, November 08, 2005 1:54 PM
To: nutch-dev@lucene.apache.org
Subject: Re: rank system
Pre score calculation is done in the indexer.
Yes, it works with complete web crawls as well, and it works very well
for that. :-)
Stefan
On 08.11.2005 at 11:22, Anton Potehin wrote:
> What about scoring in mapred? I have looked at crawl/crawl.java, but I did
> not find anything concerned with calculating page scores. Does
> mapred use a ranking system at all?
>
> Is it possible to use mapred for clustered whole-web crawling, or does it
> work with intranet crawling only?
>
>
>
Re: rank system
Posted by Stefan Groschupf <sg...@media-style.com>.
Pre score calculation is done in the indexer.
Yes, it works with complete web crawls as well, and it works very well
for that. :-)
Stefan
On 08.11.2005 at 11:22, Anton Potehin wrote:
> What about scoring in mapred? I have looked at crawl/crawl.java, but I did
> not find anything concerned with calculating page scores. Does
> mapred use a ranking system at all?
>
> Is it possible to use mapred for clustered whole-web crawling, or does it
> work with intranet crawling only?
>
>
>