You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by Frédéric Passaniti <f....@gmail.com> on 2014/05/22 16:34:11 UTC

Nutch readings for developers

Hello everyone,

I'm looking for some litterature/readings about HOW TO develop plugins in
nutch, understand very well the deep architecture of the crawler.
What are the different entry points for custom code in the crawling and
indexing process.
More particullary how to develop custom parsers and content extractors, how
to redirect the parsed content into a custom storage service etc...

If you have good blogs, sites, wikis or even git/googlecode small project
to have a look....

It would be much appreciated !!

Thank you !


-- 
Frédéric Passaniti