You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by Fuad Efendi <fu...@efendi.ca> on 2005/08/29 04:42:29 UTC

Need to reconstruct URLs from segment

Hello,

I need to read segment/page and to reconstruct URL of this page. 
WebDB contains same URL (it contains even more URLs)
I can use
SegmentReader.next(FetcherOutput fo, Content co, ParseText pt, ParseData
pd) and find some FetcherOutput, FetchListEntry, Page, URL.

FetcherOutput has an attribute FetchListEntry.url

Will method call IWebDBReader.getPage(url) return same Page, as
FetchListEntry.getPage()?

Thanks,
Fuad.