You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by Amna Waqar <am...@gmail.com> on 2011/02/01 06:08:24 UTC

help:Nutch segment architecture

Hello everyone, i want to know the exact structure of segments formed as a
result of crawl process.In what format,the content of the fetched web site
is stored,How can we see the content from segments? Does this in the form of
bytes,(unicode) or mapfile? Tell me the mapfile structure used by nutch .
Thanks in advance
Regards
Amna Waqar