Posted to solr-user@lucene.apache.org by "Plaatje, Patrick" <Pa...@getronics.com> on 2008/11/25 20:09:10 UTC
Keyword extraction
Hi all,
Struggling with a question I recently got from a colleague: is it possible
to extract keywords from indexed content?
In my opinion it should be possible to find out for which words the
indexed content ranks highest (in Lucene or Solr), but I have no clue
where to begin. Does anyone have suggestions?
Best,
Patrick
Re: Keyword extraction
Posted by Shalin Shekhar Mangar <sh...@gmail.com>.
You might also be interested in
http://wiki.apache.org/solr/TermVectorComponent
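As a rough sketch of what such a request could look like, the Python snippet below only builds the query URL. The tv.* parameter names are the ones documented on the wiki page above; the base URL, the "/tvrh" handler path, the "content" field name and the helper name term_vector_url are assumptions for illustration. The handler has to be registered in solrconfig.xml with the component attached, and the field has to be indexed with termVectors="true".

```python
from urllib.parse import urlencode


def term_vector_url(base_url, doc_query, field, handler="/tvrh"):
    """Build a TermVectorComponent request URL (hypothetical helper).

    handler is an assumed path; use whatever request handler your
    solrconfig.xml wires the TermVectorComponent into.
    """
    params = {
        "q": doc_query,       # selects the document(s) to inspect
        "fl": "id",           # keep the normal result list small
        "tv": "true",         # switch the component on
        "tv.fl": field,       # field whose term vectors are returned
        "tv.tf": "true",      # term frequency within the document
        "tv.df": "true",      # document frequency across the index
        "tv.tf_idf": "true",  # tf divided by df, a crude keyword weight
        "wt": "json",
    }
    return base_url.rstrip("/") + handler + "?" + urlencode(params)


if __name__ == "__main__":
    # Assumed local Solr instance and field name.
    print(term_vector_url("http://localhost:8983/solr", "id:doc1", "content"))
```

Sorting the returned terms by their tf-idf value on the client side would then give a simple keyword list per document.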
--
Regards,
Shalin Shekhar Mangar.
Re: Keyword extraction
Posted by Ryan McKinley <ry...@gmail.com>.
lots of approaches out there...
the easiest "off the shelf" method would be to use the
MoreLikeThisHandler and get the top "interesting" terms;
http://wiki.apache.org/solr/MoreLikeThisHandler
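a minimal sketch of such a request, in Python, building the URL only. the mlt.* parameters are the ones described on the wiki page; the "/mlt" handler path, the base URL, the field names and the helper name interesting_terms_url are assumptions, so adjust them to your own solrconfig.xml and schema.

```python
from urllib.parse import urlencode


def interesting_terms_url(base_url, doc_query, fields, handler="/mlt"):
    """Build a MoreLikeThisHandler request URL (hypothetical helper).

    handler is an assumed path; the MoreLikeThisHandler must be
    registered under that name in solrconfig.xml.
    """
    params = {
        "q": doc_query,                      # the source document
        "mlt.fl": ",".join(fields),          # fields to take terms from
        "mlt.interestingTerms": "details",   # return terms with their boost
        "mlt.mintf": 1,                      # lowered so a small test index
        "mlt.mindf": 1,                      # still yields some terms
        "wt": "json",
    }
    return base_url.rstrip("/") + handler + "?" + urlencode(params)


if __name__ == "__main__":
    # Assumed local Solr instance and field names.
    print(interesting_terms_url("http://localhost:8983/solr", "id:doc1",
                                ["title", "body"]))
```

with mlt.interestingTerms=details the response should carry the terms together with the boost used for each, which can be read as a keyword ranking for that document; "list" returns just the terms.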
ryan