Posted to user@nutch.apache.org by Lourival Júnior <ju...@gmail.com> on 2006/07/07 20:53:15 UTC

Index algorithm

Could anyone point me to a link or document about Nutch's indexing algorithm?
I haven't found many...

Regards

-- 
Lourival Junior
Universidade Federal do Pará
Curso de Bacharelado em Sistemas de Informação
http://www.ufpa.br/cbsi
Msn: junior_ufpa@hotmail.com

Re: Index algorithm

Posted by Stefan Groschupf <sg...@media-style.com>.
Hi,
Nutch uses Lucene, so you may find this interesting:
http://lucene.apache.org/java/docs/api/org/apache/lucene/search/Similarity.html
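
To make the Similarity page a bit more concrete, here is a minimal plain-Java sketch of the classic TF-IDF factors that Lucene's default Similarity is documented to use (square-root term frequency, logarithmic inverse document frequency, and inverse-square-root length normalization). The class name SimilaritySketch and the helper termScore are made up for illustration and are not part of Lucene; the real scoring also applies query-level factors that are left out here.

```java
// Illustrative only: plain-Java versions of the classic TF-IDF factors.
// SimilaritySketch and termScore are hypothetical names, not Lucene API.
class SimilaritySketch {

    /** Term-frequency factor: grows with the square root of the raw count. */
    static double tf(int freq) {
        return Math.sqrt(freq);
    }

    /** Inverse document frequency: terms found in fewer documents weigh more. */
    static double idf(int docFreq, int numDocs) {
        return Math.log((double) numDocs / (docFreq + 1)) + 1.0;
    }

    /** Length normalization: a match in a shorter field counts for more. */
    static double lengthNorm(int numTerms) {
        return 1.0 / Math.sqrt(numTerms);
    }

    /**
     * Contribution of a single query term to a single document's score,
     * assuming the usual tf * idf^2 * boost * norm shape.
     */
    static double termScore(int freq, int docFreq, int numDocs,
                            int numTerms, double boost) {
        double idf = idf(docFreq, numDocs);
        return tf(freq) * idf * idf * boost * lengthNorm(numTerms);
    }

    public static void main(String[] args) {
        // A term occurring 4 times in a 100-term field, present in 9 of 1000 docs.
        System.out.println(termScore(4, 9, 1000, 100, 1.0));
    }
}
```

The boost argument is where a per-document score, such as one computed by a scoring filter, would plausibly enter the product.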
Besides that, Nutch uses a variant of OPIC:
http://lucene.apache.org/nutch/nutch-nightly/docs/api/org/apache/nutch/scoring/opic/OPICScoringFilter.html
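
The core idea behind OPIC (On-line Page Importance Computation) is that each page holds some "cash" and hands it on to the pages it links to. The sketch below is a deliberately simplified, synchronous version of that idea and is not the code of OPICScoringFilter. The class OpicSketch, the method distribute, and the rule that a page without outlinks keeps its cash are all assumptions made for illustration.

```java
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Illustrative only: a simplified cash-distribution pass in the spirit of OPIC.
// OpicSketch and distribute are hypothetical names, not Nutch API.
class OpicSketch {

    /**
     * Runs one pass: every page splits its current cash equally among its
     * outlinks. A page with no outlinks keeps its cash (an assumption made
     * here so that the total amount of cash is preserved).
     */
    static Map<String, Double> distribute(Map<String, List<String>> outlinks,
                                          Map<String, Double> cash) {
        Map<String, Double> next = new HashMap<>();
        for (String page : outlinks.keySet()) {
            next.put(page, 0.0);
        }
        for (Map.Entry<String, List<String>> entry : outlinks.entrySet()) {
            String page = entry.getKey();
            List<String> targets = entry.getValue();
            double pageCash = cash.getOrDefault(page, 0.0);
            if (targets.isEmpty()) {
                next.merge(page, pageCash, Double::sum);
                continue;
            }
            double share = pageCash / targets.size();
            for (String target : targets) {
                next.merge(target, share, Double::sum);
            }
        }
        return next;
    }

    public static void main(String[] args) {
        Map<String, List<String>> outlinks = new HashMap<>();
        outlinks.put("a", List.of("b", "c"));
        outlinks.put("b", List.of("c"));
        outlinks.put("c", List.of("a"));

        Map<String, Double> cash = new HashMap<>();
        cash.put("a", 1.0);
        cash.put("b", 1.0);
        cash.put("c", 1.0);

        // Page "c" is linked from both "a" and "b", so it collects the most cash.
        System.out.println(distribute(outlinks, cash));
    }
}
```

Pages that receive links from many well-funded pages accumulate more cash, which is what makes the value usable as an importance score.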

Also have a look here:
http://svn.apache.org/viewvc/lucene/nutch/trunk/src/java/org/apache/nutch/indexer/Indexer.java?view=markup
search for:
// run scoring filters

Hope this answers your question.

Stefan



On 07.07.2006 at 11:53, Lourival Júnior wrote:

> Could anyone point me to a link or document about Nutch's indexing algorithm?
> I haven't found many...
>
> Regards
>
> -- 
> Lourival Junior
> Universidade Federal do Pará
> Curso de Bacharelado em Sistemas de Informação
> http://www.ufpa.br/cbsi
> Msn: junior_ufpa@hotmail.com