You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by Saurabh Suman <sa...@rediff.com> on 2009/07/09 07:21:12 UTC

How to crawl URLs getting from RSSParser

Hi Nutch guys
I used org.apache.nutch.parse.rss.RSSParser , for parsing RSS feeds. It is
showing urls on console.Now i want to crawl those urls. 
              How will i do this? Does RSSPrser class store it in crawldb or
i  need to send to all URLs to crawldb.Then run the crawl command.
   Is there another approach?
-- 
View this message in context: http://www.nabble.com/How-to-crawl-URLs-getting-from-RSSParser-tp24404179p24404179.html
Sent from the Nutch - User mailing list archive at Nabble.com.