You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by Smith Norton <sm...@gmail.com> on 2007/09/06 13:15:41 UTC

ranking works in topN selection?

I have some questions.

1. What is the ranking algorithm for Nutch?

2. Does this ranking algorithm come into play, when at a certain crawl
depth, Nutch is confronted with 10000 possible URLs but the topN value
is only 100.

3. When exactly is the topN selection done? Generation? Fetch? which phase?