You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by Siva Bandhamravuri <sb...@umich.edu> on 2005/04/08 20:31:29 UTC

nutch engines

Hi,
  In the nutch src/engines directory, there are the files
Altavista.src  FAST.src  Google.src  Inktomi.src

1: What is the significance of these files?
2: I am adding some functionality to nutch and would like to test its
performance agains google. How can I do that? My nutch crawl is on specific
urls.

thanks

Siva

Re: [Nutch-dev] Re: nutch engines

Posted by Zhou LiBing <zh...@gmail.com>.
Thank you 

On 4/14/05, Doug Cutting <cu...@nutch.org> wrote: 
> 
> Stefan Groschupf wrote:
> > Some weeks ago I was staring to write a small tool to be able comparing
> > result via command line.
> > However I never finished the work, but if you like I can send you
> > sources but there is still some work to do.
> 
> Mike wrote code to do this a while back. It was difficult to upgrade
> when plugins were added, and no one was using it, so it was removed.
> But you can still find the code here:
> 
> 
> http://cvs.sourceforge.net/viewcvs.py/nutch/nutch/src/java/net/nutch/quality/Attic/
> 
> It would probably be easier to revive this code than to start from 
> scratch.
> 
> Doug
> 
> 
> -------------------------------------------------------
> SF email is sponsored by - The IT Product Guide
> Read honest & candid reviews on hundreds of IT Products from real users.
> Discover which products truly live up to the hype. Start reading now.
> http://ads.osdn.com/?ad_id=6595&alloc_id=14396&op=click
> _______________________________________________
> Nutch-developers mailing list
> Nutch-developers@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nutch-developers
> 



-- 
---Letter From your friend Blue at HUST CGCL---

Re: nutch engines

Posted by Doug Cutting <cu...@nutch.org>.
Stefan Groschupf wrote:
> Some weeks ago I was staring to write a small tool to be able  comparing 
> result via command line.
> However I never finished the work, but if you like I can send you 
> sources  but there is still some work to do.

Mike wrote code to do this a while back.  It was difficult to upgrade 
when plugins were added, and no one was using it, so it was removed. 
But you can still find the code here:

http://cvs.sourceforge.net/viewcvs.py/nutch/nutch/src/java/net/nutch/quality/Attic/

It would probably be easier to revive this code than to start from scratch.

Doug

Re: [Nutch-dev] Re: nutch engines

Posted by Zhou LiBing <zh...@gmail.com>.
Could you send me a copy of your sourcecode about your tool ?
 thank you

 On 4/10/05, Stefan Groschupf <sg...@media-style.com> wrote: 
> 
> Siva,
> 1.)
> I ask myself this question as well some weeks ago.
> I guess this are firefox plugins.
> 2.) There are already tools to compare results how ever they are
> webbased like the tool from Antonio Gulli.
> http://rankcomparison.di.unipi.it/
> 
> Some weeks ago I was staring to write a small tool to be able
> comparing result via command line.
> However I never finished the work, but if you like I can send you
> sources but there is still some work to do.
> 
> Stefan
> 
> Am 08.04.2005 um 20:31 schrieb Siva Bandhamravuri:
> 
> >
> > Hi,
> > In the nutch src/engines directory, there are the files
> > Altavista.src FAST.src Google.src Inktomi.src
> >
> > 1: What is the significance of these files?
> > 2: I am adding some functionality to nutch and would like to test its
> > performance agains google. How can I do that? My nutch crawl is on
> > specific
> > urls.
> >
> > thanks
> >
> > Siva
> >
> >
> ---------------------------------------------------------------
> company: http://www.media-style.com
> forum: http://www.text-mining.org
> blog: http://www.find23.net
> 
> 
> -------------------------------------------------------
> SF email is sponsored by - The IT Product Guide
> Read honest & candid reviews on hundreds of IT Products from real users.
> Discover which products truly live up to the hype. Start reading now.
> http://ads.osdn.com/?ad_id=6595&alloc_id=14396&op=click
> _______________________________________________
> Nutch-developers mailing list
> Nutch-developers@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nutch-developers
> 



-- 
---Letter From your friend Blue at HUST CGCL---

Re: nutch engines

Posted by Stefan Groschupf <sg...@media-style.com>.
Siva,
1.)
I ask myself this question as well some weeks ago.
I guess this are firefox plugins.
2.) There are already tools to compare results how ever they are 
webbased like the tool from Antonio Gulli.
http://rankcomparison.di.unipi.it/

Some weeks ago I was staring to write a small tool to be able  
comparing result via command line.
However I never finished the work, but if you like I can send you 
sources  but there is still some work to do.


Stefan


Am 08.04.2005 um 20:31 schrieb Siva Bandhamravuri:

>
> Hi,
>   In the nutch src/engines directory, there are the files
> Altavista.src  FAST.src  Google.src  Inktomi.src
>
> 1: What is the significance of these files?
> 2: I am adding some functionality to nutch and would like to test its
> performance agains google. How can I do that? My nutch crawl is on 
> specific
> urls.
>
> thanks
>
> Siva
>
>
---------------------------------------------------------------
company:		http://www.media-style.com
forum:		http://www.text-mining.org
blog:			http://www.find23.net