Posted to user@nutch.apache.org by pavan kosuru <ko...@gmail.com> on 2009/01/17 09:48:19 UTC

Reg: Local file system crawl

Hi,
   I am trying to crawl a local directory tree on my hard disk using the
protocol-file plugin. The directory names and file names contain Unicode
(non-Roman) characters, and all the files are HTML documents that mirror
the structure of a web site. I am able to crawl the files whose names are
in Roman characters, but I am unable to crawl the files whose paths are in
Unicode as described above. Please help me resolve this.

Thanks in advance,
Pavan