Posted to user@nutch.apache.org by Saurabh Suman <sa...@rediff.com> on 2009/07/30 06:17:49 UTC
How fetcher works
Hi,
I am confused about how Fetcher.java works. Does the Fetcher fetch an HTML
page, store it first, and then parse it?
Can I just store the HTML without parsing it?
Re: How fetcher works
Posted by reinhard schwab <re...@aon.at>.
Saurabh Suman wrote:
> Hi,
> I am confused about how Fetcher.java works. Does the Fetcher fetch an HTML
> page, store it first, and then parse it?
> Can I just store the HTML without parsing it?
>
Yes, it can. The fetcher has a -noParsing option:
bin/nutch fetch
Usage: Fetcher <segment> [-threads n] [-noParsing]
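
For illustration, here is a minimal dry-run sketch of how the option might be used. It only prints the command lines instead of running Nutch. The segment path is hypothetical, the helper names fetch_cmd and parse_cmd are made up for this example, and the later "bin/nutch parse <segment>" step is my assumption about how you would parse the stored content afterwards; check the usage output of your own Nutch version.

```shell
#!/bin/sh
# Assumed location of the Nutch launcher script; override with NUTCH=/path/to/bin/nutch.
NUTCH="${NUTCH:-bin/nutch}"

# Print a fetch command that stores the fetched content but skips parsing.
# $1 = segment directory, $2 = optional thread count (maps to -threads n).
fetch_cmd() {
  if [ -n "${2:-}" ]; then
    echo "$NUTCH fetch $1 -threads $2 -noParsing"
  else
    echo "$NUTCH fetch $1 -noParsing"
  fi
}

# Print a command that parses an already fetched segment later on
# (assumption: your Nutch version provides a separate parse command).
parse_cmd() {
  echo "$NUTCH parse $1"
}

# Hypothetical segment path; use the one created by your own generate step.
SEGMENT="crawl/segments/20090730061749"

fetch_cmd "$SEGMENT"
fetch_cmd "$SEGMENT" 10
parse_cmd "$SEGMENT"
```

Once the printed lines look right for your setup, you can run them by hand from the Nutch directory. The parse step is only needed if you decide later that you want the parsed data after all.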