Posted to java-user@lucene.apache.org by John Adam <jo...@yahoo.com> on 2004/08/02 04:25:25 UTC
reverse lookup
Is there a way to get the most significant words of a document, given its document number?
Re: reverse lookup
Posted by pr...@gmx.de.
John,
You might take a look at a patch I submitted to lucene-dev this morning
(http://issues.apache.org/bugzilla/show_bug.cgi?id=30429). It's aimed at
pseudo-relevance feedback, so it automatically adds terms from an initial
answer set to a query, but of course you can do whatever you want with the
terms after they've been ranked. It uses the Robertson Selection Value to
rank terms and find the best descriptors.
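
For reference, one commonly cited form of the Robertson Selection Value (sometimes called the offer weight) multiplies the number of feedback documents containing a term by the term's Robertson/Sparck Jones relevance weight. The sketch below assumes that form; the patch itself may compute the value differently. The class and method names are made up for illustration, with r = feedback documents containing the term, n = documents in the collection containing the term, R = number of feedback documents, and N = collection size.

```java
// Hypothetical helper, not taken from the patch: scores a candidate
// expansion term from four document counts.
class RsvSketch {

    // Robertson/Sparck Jones relevance weight with the usual 0.5 smoothing.
    static double rsjWeight(int r, int n, int R, int N) {
        double numerator = (r + 0.5) * (N - n - R + r + 0.5);
        double denominator = (n - r + 0.5) * (R - r + 0.5);
        return Math.log(numerator / denominator);
    }

    // Assumed selection value: r times the relevance weight.
    static double selectionValue(int r, int n, int R, int N) {
        return r * rsjWeight(r, n, R, N);
    }

    public static void main(String[] args) {
        // A rare term found in every feedback document versus a common
        // term found in only one of them.
        System.out.println(selectionValue(5, 10, 5, 1000));
        System.out.println(selectionValue(1, 500, 5, 1000));
    }
}
```

Under this assumption, a term that appears in most of the feedback documents but in few documents overall gets a high score, while a term that is common across the whole collection scores low or negative.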
Best regards,
Rene
> On Aug 1, 2004, at 10:25 PM, John Adam wrote:
>
> Is there a way to get most significant words of a document if i give a
> document number.
>
---------------------------------------------------------------------
To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
For additional commands, e-mail: lucene-user-help@jakarta.apache.org
Re: reverse lookup
Posted by Erik Hatcher <er...@ehatchersolutions.com>.
On Aug 1, 2004, at 10:25 PM, John Adam wrote:
>
> Is there a way to get most significant words of a document if i give a
> document number.
>
Have a look at the term vector support, new in v1.4. For a document
number and field name, it gives you the field's terms and their frequencies:

    TermFreqVector vector =
        reader.getTermFreqVector(id, "contents");

Note: you do need to set the storeTermVectors flag on the field during
indexing to enable it.
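
A TermFreqVector exposes its data as two parallel arrays, via getTerms() and getTermFrequencies(). As a rough sketch of turning those into a "most significant words" list, the code below sorts terms by raw frequency and keeps the first few. The class and method names are hypothetical, the arrays are passed in directly so the example runs without Lucene, and it uses current Java syntax. Raw frequency is only a crude measure of significance: common words will dominate unless stop words were removed at indexing time.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper: ranks terms by raw frequency. The two arrays stand in
// for vector.getTerms() and vector.getTermFrequencies().
class TopTermsSketch {

    static List<String> topTerms(String[] terms, int[] freqs, int limit) {
        // Sort term positions by descending frequency. List.sort is stable,
        // so ties keep the order in which the terms were given.
        List<Integer> order = new ArrayList<>();
        for (int i = 0; i < terms.length; i++) {
            order.add(i);
        }
        order.sort((a, b) -> Integer.compare(freqs[b], freqs[a]));

        // Keep at most `limit` terms.
        List<String> top = new ArrayList<>();
        for (int i = 0; i < Math.min(limit, order.size()); i++) {
            top.add(terms[order.get(i)]);
        }
        return top;
    }

    public static void main(String[] args) {
        String[] terms = {"index", "lucene", "search", "the"};
        int[] freqs = {3, 7, 5, 7};
        System.out.println(topTerms(terms, freqs, 2));
    }
}
```

With a real index you would pass vector.getTerms() and vector.getTermFrequencies() instead of the literal arrays.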
Erik