You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by Fuad Efendi <fu...@efendi.ca> on 2005/08/10 20:56:00 UTC
How to extend Nutch?
I need some pre-processing, to add additional fields to Document, and to
show it on a web-page
I probably need to work with plugins, and to modify config files...
nutch-conf.xsl
nutch-default.xml
nutch-site.xml
Am I right?
Thanks
-----Original Message-----
From: Fuad Efendi [mailto:fuad@efendi.ca]
Sent: Wednesday, August 10, 2005 2:15 PM
To: nutch-user@lucene.apache.org
Subject: RE: [Nutch-general] How to extend Nutch
So, I need to modify some existing classes, isn't it?
-----Original Message-----
From: ogjunk-nutch@yahoo.com [mailto:ogjunk-nutch@yahoo.com]
Sent: Wednesday, August 10, 2005 1:48 PM
To: user@nutch.org
Subject: Re: [Nutch-general] How to extend Nutch
Probably IndexingFilter or HtmlParser for indexing and for indexing I
think there is something in org.apache.nutch.search.... some class that
starts with Raw.... I just saw this in the Javadoc earlier.
Otis
--- Fuad Efendi <fu...@efendi.ca> wrote:
> I need specific pre-processing of a html-page, to add more fields to
> Document before storing it in Index, and to modify web-interface
> accordingly.
>
> Where is the base point of extension?
> Thanks!
>
Re: How to extend Nutch?
Posted by Michael Ji <fj...@yahoo.com>.
hi Fuad:
I am probably doing the same thing. I think plug-in is
the right place to put my own code.
But not sure, why we need to touch other config files.
Regards,
Michael Ji
--- Fuad Efendi <fu...@efendi.ca> wrote:
>
> I need some pre-processing, to add additional fields
> to Document, and to
> show it on a web-page
> I probably need to work with plugins, and to modify
> config files...
>
> nutch-conf.xsl
> nutch-default.xml
> nutch-site.xml
>
> Am I right?
> Thanks
>
>
> -----Original Message-----
> From: Fuad Efendi [mailto:fuad@efendi.ca]
> Sent: Wednesday, August 10, 2005 2:15 PM
> To: nutch-user@lucene.apache.org
> Subject: RE: [Nutch-general] How to extend Nutch
>
>
> So, I need to modify some existing classes, isn't
> it?
>
>
> -----Original Message-----
> From: ogjunk-nutch@yahoo.com
> [mailto:ogjunk-nutch@yahoo.com]
> Sent: Wednesday, August 10, 2005 1:48 PM
> To: user@nutch.org
> Subject: Re: [Nutch-general] How to extend Nutch
>
>
> Probably IndexingFilter or HtmlParser for indexing
> and for indexing I
> think there is something in
> org.apache.nutch.search.... some class that
> starts with Raw.... I just saw this in the Javadoc
> earlier.
>
> Otis
>
> --- Fuad Efendi <fu...@efendi.ca> wrote:
>
> > I need specific pre-processing of a html-page, to
> add more fields to
> > Document before storing it in Index, and to modify
> web-interface
> > accordingly.
> >
> > Where is the base point of extension?
> > Thanks!
> >
>
>
>
>
__________________________________________________
Do You Yahoo!?
Tired of spam? Yahoo! Mail has the best spam protection around
http://mail.yahoo.com