Posted to user@nutch.apache.org by Paul Lypaczewski <pa...@yahoo.ca> on 2011/01/17 02:03:10 UTC

How to access fetcher in a plugin

Hi,
I have written a plug-in for Nutch that does some customized parsing. The filter interface only gives me objects related to a single HTML page, as its signature shows:

public ParseResult filter(Content content, ParseResult parseResult, HTMLMetaTags metaTags, DocumentFragment doc)

Is there a way to access the fetcher object that is actually calling my plug-in? I need to store a global object that all plug-in filters can access, and I think it would be best to store it as a member of the fetcher.
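To make the requirement concrete, here is a minimal sketch of the kind of shared object I have in mind, kept outside the fetcher for now. SharedPluginState and getOrCreate are hypothetical names I made up for illustration; they are not part of Nutch. It assumes all my filters are loaded by the same class loader, since a static field is only shared among classes loaded together, and it uses a concurrent map because the filters may be called from several threads.

```java
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ConcurrentMap;
import java.util.function.Supplier;

/** Hypothetical process-wide holder for state shared by filters; not a Nutch class. */
final class SharedPluginState {

    // One map per class loader; safe for concurrent access from multiple threads.
    private static final ConcurrentMap<String, Object> STATE = new ConcurrentHashMap<>();

    private SharedPluginState() {
        // Static access only.
    }

    /** Returns the object stored under key, creating it with factory on first use. */
    @SuppressWarnings("unchecked")
    static <T> T getOrCreate(String key, Supplier<T> factory) {
        // The caller must use a consistent type per key; the cast is unchecked.
        return (T) STATE.computeIfAbsent(key, k -> factory.get());
    }
}
```

Inside filter(...) I would then call something like SharedPluginState.getOrCreate("my-key", MyGlobal::new), where MyGlobal is a placeholder for my own class. What I do not know is whether this is an acceptable substitute for holding the object on the fetcher itself.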

Thank you!

Paul