Posted to user@nutch.apache.org by Stephen Ensor <st...@gmail.com> on 2006/03/08 10:07:51 UTC

help with creating a directory ie front page menu of common terms

Hi, I am using nutch to create a vertical search site and wish to create a
directory type menu for my front page with all the most common terms in my
index.

For example say my vertical search is pets and my index is full of pet sites
and pages, the common terms would be (cat, dog, fish, food, vet, etc…).  Would
this be possible to generate using nutch and some plugin?

Any help is much appreciated, Thanks

Steve

Re: help with creating a directory ie front page menu of common terms

Posted by "Insurance Squared Inc." <gc...@insurancesquared.com>.
Just a note that while this idea is good, displaying 'recent searches' 
can be used by spammers. All they have to do is hammer your server with 
a bunch of queries to 'www.some-poker-site.com' and their website gets a 
link from yours. I'd be very leery of republishing any user inputs to 
your system, as it's prone to abuse.



Stephen Ensor wrote:

>Hi, I am using nutch to create a vertical search site and wish to create a
>directory type menu for my front page with all the most common terms in my
>index.
>
>For example say my vertical search is pets and my index is full of pet sites
>and pages, the common terms would be (cat, dog, fish, food, vet, etc…).  Would
>this be possible to generate using nutch and some plugin?
>
>Any help is much appreciated, Thanks
>
>Steve

Re: help with creating a directory ie front page menu of common terms

Posted by Stefan Groschupf <sg...@media-style.com>.
Have a look at the IndexReader object in the Lucene package.
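
As a rough sketch of what that could look like: in the Lucene versions of that era, IndexReader.terms() returns a TermEnum whose term() and docFreq() give each term and the number of documents containing it. Assuming those pairs have already been collected into a map, picking the menu entries is a sort plus a stop-word filter. The class name CommonTerms, the method topTerms, the stop-word list and the sample counts below are all hypothetical, and the Lucene calls appear only in a comment so the example runs without the library.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashMap;
import java.util.HashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;

public class CommonTerms {

    /**
     * Returns up to n terms ordered by descending document frequency,
     * skipping stop words. Ties are broken alphabetically.
     */
    static List<String> topTerms(Map<String, Integer> docFreq, int n, Set<String> stopWords) {
        List<Map.Entry<String, Integer>> entries = new ArrayList<>();
        for (Map.Entry<String, Integer> e : docFreq.entrySet()) {
            if (!stopWords.contains(e.getKey())) {
                entries.add(e);
            }
        }
        entries.sort((a, b) -> {
            int byFreq = Integer.compare(b.getValue(), a.getValue());
            return byFreq != 0 ? byFreq : a.getKey().compareTo(b.getKey());
        });
        List<String> result = new ArrayList<>();
        for (int i = 0; i < entries.size() && i < n; i++) {
            result.add(entries.get(i).getKey());
        }
        return result;
    }

    public static void main(String[] args) {
        // Assumption: in a real setup this map would be filled by walking the
        // index, roughly along these lines (not compiled here):
        //   TermEnum te = reader.terms();
        //   while (te.next()) { docFreq.put(te.term().text(), te.docFreq()); }
        //   te.close();
        // The counts below are made-up sample data for a pets index.
        Map<String, Integer> docFreq = new HashMap<>();
        docFreq.put("the", 500);
        docFreq.put("dog", 150);
        docFreq.put("cat", 120);
        docFreq.put("food", 90);
        docFreq.put("fish", 40);
        docFreq.put("vet", 30);

        // Hypothetical stop-word list; very frequent words carry no menu value.
        Set<String> stopWords = new HashSet<>(Arrays.asList("the", "and", "of"));

        System.out.println(topTerms(docFreq, 3, stopWords));
    }
}
```

Because the terms come from the index and not from user queries, a menu built this way avoids the 'recent searches' spam problem mentioned elsewhere in this thread. The result could be regenerated after each crawl and cached for the front page.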

On 08.03.2006 at 10:07, Stephen Ensor wrote:

> Hi, I am using nutch to create a vertical search site and wish to create a
> directory type menu for my front page with all the most common terms in my
> index.
>
> For example say my vertical search is pets and my index is full of pet sites
> and pages, the common terms would be (cat, dog, fish, food, vet, etc…).  Would
> this be possible to generate using nutch and some plugin?
>
> Any help is much appreciated, Thanks
>
> Steve

---------------------------------------------------------------
company:        http://www.media-style.com
forum:        http://www.text-mining.org
blog:            http://www.find23.net