Posted to user@nutch.apache.org by pavan kosuru <ko...@gmail.com> on 2009/01/09 07:40:27 UTC

Reg: Local file system crawling

Hi,
   I am trying to crawl a local directory structure on my hard disk using
the protocol-file plugin. The directory names and file names contain Unicode
(non-Roman) characters, and all the files are HTML documents that make up a
web site's structure. I am able to crawl the files whose names are in Roman
characters, but not the files with Unicode names. Please help me resolve this.

Thanks in advance,
Pavan

-- 

              ------        Pavan ...
