You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by Michael Ji <fj...@yahoo.com> on 2005/09/05 21:49:08 UTC

link analysis in OC

hi Kelvin:

Did OC compute page score same as Nutch crawling?

I found Nutch/index compute document boost value based
on the score/anchor data in segment/fetchlist data
structure.

I guess OC won't generate this boost score by itself
or use its' own data structure. So if we want to have
this score saved in lucene index, we need to use
nutch/generate.. to get the fetchlist and generate
webdb.

That means OC will live with Nutch's webdb and other
data structures.

Is my though right?

thanks,

Michael Ji

__________________________________________________
Do You Yahoo!?
Tired of spam?  Yahoo! Mail has the best spam protection around 
http://mail.yahoo.com