You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by "Richard Marr (JIRA)" <ji...@apache.org> on 2009/06/13 08:15:07 UTC

[jira] Updated: (LUCENE-1690) Morelikethis queries are very slow compared to other search types

     [ https://issues.apache.org/jira/browse/LUCENE-1690?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Richard Marr updated LUCENE-1690:
---------------------------------

    Attachment: LUCENE-1690.patch

This patch implements a basic hashmap term frequency cache. It shouldn't affect any applications that don't opt-in to using it, and applications that do should see an order of magnitude performance improvement for MLT queries.

This cache implementation is tied to the MLT object but can be cleared on demand.

> Morelikethis queries are very slow compared to other search types
> -----------------------------------------------------------------
>
>                 Key: LUCENE-1690
>                 URL: https://issues.apache.org/jira/browse/LUCENE-1690
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: contrib/*
>    Affects Versions: 2.4.1
>            Reporter: Richard Marr
>            Priority: Minor
>         Attachments: LUCENE-1690.patch
>
>   Original Estimate: 2h
>  Remaining Estimate: 2h
>
> The MoreLikeThis object performs term frequency lookups for every query.  From my testing that's what seems to take up the majority of time for MoreLikeThis searches.  
> For some (I'd venture many) applications it's not necessary for term statistics to be looked up every time. A fairly naive opt-in caching mechanism tied to the life of the MoreLikeThis object would allow applications to cache term statistics for the duration that suits them.
> I've got this working in my test code. I'll put together a patch file when I get a minute. From my testing this can improve performance by a factor of around 10.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org