You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by "Neil O. Rouben" <kn...@gmail.com> on 2005/05/11 14:50:03 UTC
Contributing Query Expansion Module (corrected)
I would like to contribute a module that performs Query Expansion (QE)
in Lucene. Please let me know how I may go about doing that. For more
details about module please see http://lucene-qe.sourceforge.net
I implemented Rocchio Query Expansion method. Terms for expansion
could be acquired in local document repository or by using www through
the use of Google's Web API.
Query Expansion - Adding search terms to a user's search. Query
expansion is the process of a search engine adding search terms to a
user's weighted search. The intent is to improve precision and/or
recall. The additional terms may be taken from a thesaurus. For
example a search for "car" may be expanded to: car cars auto autos
automobile automobiles [foldoc.org].
Performance
Experiments were conducted on the data from TREC 2004 Robust Track
(trec.nist.gov).
Note: This data is provided for reference purposes only. Better
performance on the specific data set may not necessary be repeated on
the different data sets, etc...
Tag MAP P10 %no
Lucene QE 0.2433 0.3936 18.10%
Lucene gQE 0.2332 0.3984 14%
KB-R-FIS gQE 0.2322 0.4076 14%
Lucene 0.2 0.37 15%
MAP - mean average precision
P10 - average of precision at 10 documents retrieved
%no - percentage of topics with no relevant in the top 10 retrieved
Lucene - unmodified version 1.4.3
Lucene QE - Lucene with local query expansion
Lucene gQE – Lucene system that utilized Rocchio's query expansion
along with Google.
KB-R-FIS gQE – My Fuzzy Inference System that utilized Rocchio's query
expansion along with Google.