Posted to dev@nutch.apache.org by Fuad Efendi <fu...@efendi.ca> on 2005/08/29 04:42:29 UTC
Need to reconstruct URLs from segment
Hello,
I need to read a page from a segment and reconstruct the URL of that page.
The WebDB contains the same URL (along with many others).
I can use
SegmentReader.next(FetcherOutput fo, Content co, ParseText pt, ParseData
pd) and, from the FetcherOutput, get to the FetchListEntry, the Page, and the URL.
FetcherOutput holds a FetchListEntry, which exposes the URL (FetchListEntry.url).
Will a call to IWebDBReader.getPage(url) return the same Page as
FetchListEntry.getPage()?
Thanks,
Fuad.