Posted to dev@lucene.apache.org by Grant Ingersoll <gs...@apache.org> on 2008/12/11 14:36:33 UTC
Re: [jira] Commented: (LUCENE-965) Implement a state-of-the-art retrieval function in Lucene
I don't think the original authors have followed up on this patch at
all since first posting.
On Nov 27, 2008, at 6:44 AM, Ian Holsman (JIRA) wrote:
>
> [ https://issues.apache.org/jira/browse/LUCENE-965?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12651332#action_12651332 ]
>
> Ian Holsman commented on LUCENE-965:
> ------------------------------------
>
> It's a bit late over here, but when I try to apply the patch it
> doesn't seem to have the AXSimilarity class in it.
> Is there a file missing here, or should I not be looking at applying
> patches late at night?
>
>> Implement a state-of-the-art retrieval function in Lucene
>> ---------------------------------------------------------
>>
>> Key: LUCENE-965
>> URL: https://issues.apache.org/jira/browse/LUCENE-965
>> Project: Lucene - Java
>> Issue Type: Improvement
>> Components: Search
>> Affects Versions: 2.2
>> Reporter: Hui Fang
>> Fix For: 3.0
>>
>> Attachments: axiomaticFunction.patch
>>
>>
>> We implemented the axiomatic retrieval function, which is a
>> state-of-the-art retrieval function, to replace the default
>> similarity function in Lucene. We compared the performance of
>> these two functions; the results are reported at
>> http://sifaka.cs.uiuc.edu/hfang/lucene/Lucene_exp.pdf
>> The report shows that the performance of the axiomatic retrieval
>> function is much better than the default function. The axiomatic
>> retrieval function is able to find more relevant documents and
>> users can see more relevant documents in the top-ranked documents.
>> Incorporating such a state-of-the-art retrieval function could
>> improve the search performance of all the applications which were
>> built upon Lucene.
>> Most changes related to the implementation are made in
>> AXSimilarity, TermScorer and TermQuery.java. However, many test
>> cases are hand-coded to check that the implementation of the
>> default function is correct. Thus, I also modified many test
>> files so that the new retrieval function passes those cases.
>> In fact, we found that some old test cases are not reasonable.
>> For example, in testQueries02 of TestBoolean2.java, the query is
>> "+w3 xx", and we have two documents, "w1 xx w2 yy w3" and
>> "w1 w3 xx w2 yy w3". The second document should be more relevant
>> than the first one, because it has more occurrences of the query
>> term "w3". But the original test case would require us to rank
>> the first document higher than the second one, which is not
>> reasonable.
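The term-frequency argument in the quoted description can be checked with a minimal sketch. This is plain Java, not Lucene code: the class name TermFrequencyDemo and the helper termFrequency are hypothetical, and the sketch only counts raw occurrences of each query term in the two example documents. It does not reproduce either the default similarity or the axiomatic function.

```java
import java.util.Arrays;

public class TermFrequencyDemo {

    // Hypothetical helper (not a Lucene API): counts how many
    // whitespace-separated tokens in the document equal the term.
    public static int termFrequency(String document, String term) {
        return (int) Arrays.stream(document.trim().split("\\s+"))
                .filter(term::equals)
                .count();
    }

    public static void main(String[] args) {
        // The two documents cited from testQueries02 in the description.
        String doc1 = "w1 xx w2 yy w3";
        String doc2 = "w1 w3 xx w2 yy w3";

        // Terms of the query "+w3 xx" (w3 is required, xx is optional).
        for (String term : new String[] {"w3", "xx"}) {
            System.out.println(term
                    + ": doc1=" + termFrequency(doc1, term)
                    + ", doc2=" + termFrequency(doc2, term));
        }
    }
}
```

Both documents contain "xx" once, while "w3" occurs once in the first document and twice in the second, which is the difference the description relies on.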
>
> --
> This message is automatically generated by JIRA.
> -
> You can reply to this email to add a comment to the issue online.
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
> For additional commands, e-mail: java-dev-help@lucene.apache.org
>
--------------------------
Grant Ingersoll
Lucene Helpful Hints:
http://wiki.apache.org/lucene-java/BasicsOfPerformance
http://wiki.apache.org/lucene-java/LuceneFAQ