You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by Siva Bandhamravuri <sb...@umich.edu> on 2005/04/08 20:31:29 UTC
nutch engines
Hi,
In the nutch src/engines directory, there are the files
Altavista.src FAST.src Google.src Inktomi.src
1: What is the significance of these files?
2: I am adding some functionality to nutch and would like to test its
performance agains google. How can I do that? My nutch crawl is on specific
urls.
thanks
Siva
Re: [Nutch-dev] Re: nutch engines
Posted by Zhou LiBing <zh...@gmail.com>.
Thank you
On 4/14/05, Doug Cutting <cu...@nutch.org> wrote:
>
> Stefan Groschupf wrote:
> > Some weeks ago I was staring to write a small tool to be able comparing
> > result via command line.
> > However I never finished the work, but if you like I can send you
> > sources but there is still some work to do.
>
> Mike wrote code to do this a while back. It was difficult to upgrade
> when plugins were added, and no one was using it, so it was removed.
> But you can still find the code here:
>
>
> http://cvs.sourceforge.net/viewcvs.py/nutch/nutch/src/java/net/nutch/quality/Attic/
>
> It would probably be easier to revive this code than to start from
> scratch.
>
> Doug
>
>
> -------------------------------------------------------
> SF email is sponsored by - The IT Product Guide
> Read honest & candid reviews on hundreds of IT Products from real users.
> Discover which products truly live up to the hype. Start reading now.
> http://ads.osdn.com/?ad_id=6595&alloc_id=14396&op=click
> _______________________________________________
> Nutch-developers mailing list
> Nutch-developers@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nutch-developers
>
--
---Letter From your friend Blue at HUST CGCL---
Re: nutch engines
Posted by Doug Cutting <cu...@nutch.org>.
Stefan Groschupf wrote:
> Some weeks ago I was staring to write a small tool to be able comparing
> result via command line.
> However I never finished the work, but if you like I can send you
> sources but there is still some work to do.
Mike wrote code to do this a while back. It was difficult to upgrade
when plugins were added, and no one was using it, so it was removed.
But you can still find the code here:
http://cvs.sourceforge.net/viewcvs.py/nutch/nutch/src/java/net/nutch/quality/Attic/
It would probably be easier to revive this code than to start from scratch.
Doug
Re: [Nutch-dev] Re: nutch engines
Posted by Zhou LiBing <zh...@gmail.com>.
Could you send me a copy of your sourcecode about your tool ?
thank you
On 4/10/05, Stefan Groschupf <sg...@media-style.com> wrote:
>
> Siva,
> 1.)
> I ask myself this question as well some weeks ago.
> I guess this are firefox plugins.
> 2.) There are already tools to compare results how ever they are
> webbased like the tool from Antonio Gulli.
> http://rankcomparison.di.unipi.it/
>
> Some weeks ago I was staring to write a small tool to be able
> comparing result via command line.
> However I never finished the work, but if you like I can send you
> sources but there is still some work to do.
>
> Stefan
>
> Am 08.04.2005 um 20:31 schrieb Siva Bandhamravuri:
>
> >
> > Hi,
> > In the nutch src/engines directory, there are the files
> > Altavista.src FAST.src Google.src Inktomi.src
> >
> > 1: What is the significance of these files?
> > 2: I am adding some functionality to nutch and would like to test its
> > performance agains google. How can I do that? My nutch crawl is on
> > specific
> > urls.
> >
> > thanks
> >
> > Siva
> >
> >
> ---------------------------------------------------------------
> company: http://www.media-style.com
> forum: http://www.text-mining.org
> blog: http://www.find23.net
>
>
> -------------------------------------------------------
> SF email is sponsored by - The IT Product Guide
> Read honest & candid reviews on hundreds of IT Products from real users.
> Discover which products truly live up to the hype. Start reading now.
> http://ads.osdn.com/?ad_id=6595&alloc_id=14396&op=click
> _______________________________________________
> Nutch-developers mailing list
> Nutch-developers@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nutch-developers
>
--
---Letter From your friend Blue at HUST CGCL---
Re: nutch engines
Posted by Stefan Groschupf <sg...@media-style.com>.
Siva,
1.)
I ask myself this question as well some weeks ago.
I guess this are firefox plugins.
2.) There are already tools to compare results how ever they are
webbased like the tool from Antonio Gulli.
http://rankcomparison.di.unipi.it/
Some weeks ago I was staring to write a small tool to be able
comparing result via command line.
However I never finished the work, but if you like I can send you
sources but there is still some work to do.
Stefan
Am 08.04.2005 um 20:31 schrieb Siva Bandhamravuri:
>
> Hi,
> In the nutch src/engines directory, there are the files
> Altavista.src FAST.src Google.src Inktomi.src
>
> 1: What is the significance of these files?
> 2: I am adding some functionality to nutch and would like to test its
> performance agains google. How can I do that? My nutch crawl is on
> specific
> urls.
>
> thanks
>
> Siva
>
>
---------------------------------------------------------------
company: http://www.media-style.com
forum: http://www.text-mining.org
blog: http://www.find23.net