Posted to user@nutch.apache.org by Euan Clark <eu...@nzs.com> on 2009/08/01 16:35:02 UTC

crawlset and webgraph discrepancy

Hi,

I notice that some URLs have been fetched (status 33) but do not appear in
the webgraph nodedump.
To my mind, this indicates that no other pages in the crawlset link to the
page(s) in question.
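
One way to test that guess is sketched below. It assumes the fetched URLs, the URLs in the nodedump, and the (source, target) link pairs have already been extracted into plain Python collections. The function names and the example.com URLs are hypothetical and not part of Nutch, and the actual dump formats would need their own parsing.

```python
def missing_from_graph(fetched_urls, graph_urls):
    """Return fetched URLs that have no entry in the node dump, sorted."""
    return sorted(set(fetched_urls) - set(graph_urls))


def urls_without_inlinks(fetched_urls, links):
    """Return fetched URLs that never appear as a link target, sorted.

    `links` is an iterable of (source_url, target_url) pairs.
    """
    targets = {target for _source, target in links}
    return sorted(url for url in set(fetched_urls) if url not in targets)


# Hypothetical data: three fetched pages, only two of which link to each other.
fetched = ["http://a.example.com/", "http://b.example.com/", "http://c.example.com/"]
nodes = ["http://a.example.com/", "http://b.example.com/"]
links = [
    ("http://a.example.com/", "http://b.example.com/"),
    ("http://b.example.com/", "http://a.example.com/"),
]

missing = missing_from_graph(fetched, nodes)
orphans = urls_without_inlinks(fetched, links)
print(missing)             # ['http://c.example.com/']
print(missing == orphans)  # True if the missing pages are exactly those without inlinks
```

If the two lists match, the missing pages are exactly the ones nothing links to, which would support the explanation above.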

Shouldn't webgraph include these pages and just score them lowest?