You are viewing a plain text version of this content. The canonical link for it is here.
Posted to solr-user@lucene.apache.org by peelman <nj...@hotmail.com> on 2009/10/29 03:56:36 UTC

Re: TermVector (TF-IDF Scores) or MoreLikeThis From Subset of Documents



peelman wrote:
> 
> I have an index of about 3 million documents, and specific list of
> document ids that belong in that 3 million (somewhere around 20-50
> documents on average).  With my filtered list of documents I want to be
> able to get TF-IDF scores or run a MoreLikeThis query against ONE
> particular document but calculated based on only that small subset,
> instead of the scores from the entire 3 million document index.
> 
> Is there an easy way to do this using a filtered/subquery, or via any
> other means?
> 
> Presently I am testing by creating a new index out of the subset of
> documents to get the TF-IDF scores, but obviously that is not going to
> work or scale in a finished implementation.
> 
> Thanks in advance.
> 

-- 
View this message in context: http://www.nabble.com/TermVector-%28TF-IDF-Scores%29-or-MoreLikeThis-From-Subset-of-Documents-tp26105328p26105460.html
Sent from the Solr - User mailing list archive at Nabble.com.