Posted to solr-user@lucene.apache.org by "Plaatje, Patrick" <Pa...@getronics.com> on 2008/11/25 20:09:10 UTC

Keyword extraction

Hi all,

Struggling with a question I recently got from a colleague: is it possible
to extract keywords from indexed content?

In my opinion it should be possible to find out which words rank highest
for a piece of indexed content (in Lucene or Solr), but I have
no clue where to begin. Does anyone have suggestions?

Best,

Patrick

Re: Keyword extraction

Posted by Shalin Shekhar Mangar <sh...@gmail.com>.
You might also be interested in
http://wiki.apache.org/solr/TermVectorComponent
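
For illustration only, here is a rough Python sketch of how a term vector
request might be put together and how the returned counts could be turned
into a keyword ranking. It assumes the field is indexed with
termVectors="true" and that the component is attached to a handler selected
with qt=tvrh; that handler name, the base URL and both function names are
assumptions, not something taken from this thread. The tv.* parameter names
are the ones documented for the component, so check them against your Solr
version. The ranking helper expects the caller to have already pulled
(tf, df) pairs out of the response.

```python
from urllib.parse import urlencode


def term_vector_url(base_url, doc_query, field):
    """Build a select URL that asks the TermVectorComponent for tf and df.

    The handler name "tvrh" is an assumption; use whatever name the
    component is registered under in your solrconfig.xml.
    """
    params = {
        "q": doc_query,      # query that selects the document of interest
        "qt": "tvrh",        # assumed handler name
        "tv": "true",        # switch the component on
        "tv.fl": field,      # field to read term vectors from
        "tv.tf": "true",     # term frequency within the document
        "tv.df": "true",     # document frequency across the index
        "rows": 1,
        "wt": "json",
    }
    return base_url.rstrip("/") + "/select?" + urlencode(params)


def rank_terms(stats, n=10):
    """Rank terms by tf/df, highest first.

    stats maps term -> (tf, df), already extracted from the response.
    Terms with df == 0 are skipped to avoid dividing by zero.
    """
    scored = [(term, tf / df) for term, (tf, df) in stats.items() if df > 0]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:n]


if __name__ == "__main__":
    print(term_vector_url("http://localhost:8983/solr", "id:42", "body"))
    print(rank_terms({"solr": (4, 2), "the": (10, 100), "lucene": (3, 1)}, n=2))
```

Dividing tf by df is only one simple weighting; terms that are frequent in
the document but rare in the index float to the top, which is roughly what
"keywords" usually means.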




-- 
Regards,
Shalin Shekhar Mangar.

Re: Keyword extraction

Posted by Ryan McKinley <ry...@gmail.com>.
Lots of approaches out there...

The easiest "off the shelf" method would be to use the
MoreLikeThisHandler and get the top "interesting" terms:

http://wiki.apache.org/solr/MoreLikeThisHandler
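
As a minimal sketch of what such a request could look like, the Python
below only builds the URL; it does not contact a server. It assumes the
handler is registered at /mlt in solrconfig.xml, and the base URL, document
query, field names and the function name are all made up for the example.
The mlt.* parameters are the ones documented for the handler, so check them
against your Solr version.

```python
from urllib.parse import urlencode


def mlt_interesting_terms_url(base_url, doc_query, fields, top_n=10):
    """Build a MoreLikeThisHandler URL that asks for interesting terms.

    The "/mlt" path is an assumption; use the path the handler is
    registered under in your solrconfig.xml.
    """
    params = {
        "q": doc_query,                     # selects the source document
        "mlt.fl": ",".join(fields),         # fields to mine for terms
        "mlt.interestingTerms": "details",  # return terms with their scores
        "mlt.maxqt": top_n,                 # cap on the number of terms used
        "mlt.mintf": 1,                     # minimum term frequency in the doc
        "mlt.mindf": 1,                     # minimum document frequency
        "rows": 0,                          # the similar documents are not needed
        "wt": "json",
    }
    return base_url.rstrip("/") + "/mlt?" + urlencode(params)


if __name__ == "__main__":
    print(mlt_interesting_terms_url(
        "http://localhost:8983/solr", "id:42", ["title", "body"], top_n=5))
```

Setting rows to 0 keeps the response small, since only the interesting
terms section is of interest here.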

ryan

