You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by Sourajit Basak <so...@gmail.com> on 2012/08/12 20:20:08 UTC

chaining a custom parser (1.5)

Is it possible to chain (html) parsers in v1.5 ? Suppose I want both the
standard html parser plus my custom parser.

Re: chaining a custom parser (1.5)

Posted by Sourajit Basak <so...@gmail.com>.
I didn't chain any filter yet, but my custom filter worked alongside the
"parse" phase.
@Lewis - Thanks for the pointer.



On Sun, Aug 12, 2012 at 11:58 PM, Lewis John Mcgibbney <
lewis.mcgibbney@gmail.com> wrote:

> Yes. Please see the execution of the microformats-reltag parser and
> indexing filter.
> I was running some tests today and this parserfilter is invoked
> alongside (after) the html parser.
>
> Lewis
>
> On Sun, Aug 12, 2012 at 7:20 PM, Sourajit Basak
> <so...@gmail.com> wrote:
> > Is it possible to chain (html) parsers in v1.5 ? Suppose I want both the
> > standard html parser plus my custom parser.
>
>
>
> --
> Lewis
>

Re: chaining a custom parser (1.5)

Posted by Lewis John Mcgibbney <le...@gmail.com>.
Yes. Please see the execution of the microformats-reltag parser and
indexing filter.
I was running some tests today and this parserfilter is invoked
alongside (after) the html parser.

Lewis

On Sun, Aug 12, 2012 at 7:20 PM, Sourajit Basak
<so...@gmail.com> wrote:
> Is it possible to chain (html) parsers in v1.5 ? Suppose I want both the
> standard html parser plus my custom parser.



-- 
Lewis