You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by Rajendra Patil <Ra...@KPITCummins.com> on 2005/09/01 06:55:52 UTC

how to generate segments with html pages as input

HI,
Any idea how to generate segments with some html pages instead of
fetching/crawling from urllist or dmoz . I have bunch of html pages & is
it possible to create segments with these html pages as input??? 

Thanx & Regards,
Rajendra



Re: how to generate segments with html pages as input

Posted by 牟晓峰 <he...@gmail.com>.
 take a look at the Nutch FAQ (
http://wiki.apache.org/nutch/FAQ), in section Indexing, "How do I index my
local file system?"


2005/9/1, Rajendra Patil <Ra...@kpitcummins.com>:
> HI,
> Any idea how to generate segments with some html pages instead of
> fetching/crawling from urllist or dmoz . I have bunch of html pages & is
> it possible to create segments with these html pages as input???
> 
> Thanx & Regards,
> Rajendra
> 
> 
>