You are viewing a plain text version of this content. The canonical link for it is here.
Posted to java-user@lucene.apache.org by Peter Becker <pb...@dstc.edu.au> on 2003/07/15 09:33:15 UTC

Doing it all backwards

Hi,

is there any way to get the keywords for certain fields in document 
easily? The situation is that I have small sets of documents coming back 
from queries and I want to compare those in terms of similarity. The 
questions are: what are the common terms within each set and what are 
the terms least common between the sets.

Performance is not much of an issue and the documents are still 
accessible, I just wonder which pieces of your code might be reusable 
for this task. After reading some code (e.g. 
DocumentWriter.invertDocument(..)) I am afraid that there is not much I 
could reuse easily, but I thought I better ask :-)

  Peter


---------------------------------------------------------------------
To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
For additional commands, e-mail: lucene-user-help@jakarta.apache.org