You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by Smith Norton <sm...@gmail.com> on 2007/09/06 13:15:41 UTC
ranking works in topN selection?
I have some questions.
1. What is the ranking algorithm for Nutch?
2. Does this ranking algorithm come into play, when at a certain crawl
depth, Nutch is confronted with 10000 possible URLs but the topN value
is only 100.
3. When exactly is the topN selection done? Generation? Fetch? which phase?