You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucenenet.apache.org by "Digy (JIRA)" <ji...@apache.org> on 2008/01/10 20:53:33 UTC
[jira] Created: (LUCENENET-107) Search while sorting is slow
Search while sorting is slow
----------------------------
Key: LUCENENET-107
URL: https://issues.apache.org/jira/browse/LUCENENET-107
Project: Lucene.Net
Issue Type: Bug
Environment: Lucene.Net 2.1
Reporter: Digy
Attachments: TestSort.rar
Mark wrote:
"
I have an index that when i search for term with out sorting the results are returned in .7 seconds or less. When I choose to search with a column chosen for sorting the results will return in 7 - 10 seconds. The larger the # of hits the longer it takes when sorting. The fields that can be sorted on are unique to the document,stored in the index as un_tokenized. I am using lucene.net 2.0. Index size is normally around 2.5 gig but i have the same problem on a 500 meg index too. Has anyone else experienced this?
Any help will be greatly appreciated.
Thanks,
Mark Burks
"
I have prepared a test case where TestIndex contains ~200000 docs.
a) making a search and then sorting the (500)results (I used SortedDictionary for sorting) takes ~140 msec.
b) making a search with Lucene's Sort takes ~6046 msec.
So, there is something to be fixed in Lucene.Net
DIGY
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
[jira] Commented: (LUCENENET-107) Search while sorting is slow
Posted by "Erich Eichinger (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/LUCENENET-107?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12558298#action_12558298 ]
Erich Eichinger commented on LUCENENET-107:
-------------------------------------------
> Should there be a new method or parameter forcing not to cache the terms
Digy: Yes, I think there should be. At least in our website projects here, we're using heavy concurrent Reader / Writer scenarios. Taking this a bit further, it's maybe a good idea to introduce some kind of parameter-profile for tuning Lucene for certain usage scenarios. Users then could easily select from "readonly", "mostly read/few writes", "read/write equal", ... profiles.
> Search while sorting is slow
> ----------------------------
>
> Key: LUCENENET-107
> URL: https://issues.apache.org/jira/browse/LUCENENET-107
> Project: Lucene.Net
> Issue Type: Bug
> Environment: Lucene.Net 2.1
> Reporter: Digy
> Attachments: TestSort.rar
>
>
> Mark wrote:
> "
> I have an index that when i search for term with out sorting the results are returned in .7 seconds or less. When I choose to search with a column chosen for sorting the results will return in 7 - 10 seconds. The larger the # of hits the longer it takes when sorting. The fields that can be sorted on are unique to the document,stored in the index as un_tokenized. I am using lucene.net 2.0. Index size is normally around 2.5 gig but i have the same problem on a 500 meg index too. Has anyone else experienced this?
> Any help will be greatly appreciated.
> Thanks,
> Mark Burks
> "
> I have prepared a test case where TestIndex contains ~200000 docs.
> a) making a search and then sorting the (500)results (I used SortedDictionary for sorting) takes ~140 msec.
> b) making a search with Lucene's Sort takes ~6046 msec.
> So, there is something to be fixed in Lucene.Net
> DIGY
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
[jira] Updated: (LUCENENET-107) Search while sorting is slow
Posted by "Digy (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/LUCENENET-107?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Digy updated LUCENENET-107:
---------------------------
Attachment: TestSort.rar
> Search while sorting is slow
> ----------------------------
>
> Key: LUCENENET-107
> URL: https://issues.apache.org/jira/browse/LUCENENET-107
> Project: Lucene.Net
> Issue Type: Bug
> Environment: Lucene.Net 2.1
> Reporter: Digy
> Attachments: TestSort.rar
>
>
> Mark wrote:
> "
> I have an index that when i search for term with out sorting the results are returned in .7 seconds or less. When I choose to search with a column chosen for sorting the results will return in 7 - 10 seconds. The larger the # of hits the longer it takes when sorting. The fields that can be sorted on are unique to the document,stored in the index as un_tokenized. I am using lucene.net 2.0. Index size is normally around 2.5 gig but i have the same problem on a 500 meg index too. Has anyone else experienced this?
> Any help will be greatly appreciated.
> Thanks,
> Mark Burks
> "
> I have prepared a test case where TestIndex contains ~200000 docs.
> a) making a search and then sorting the (500)results (I used SortedDictionary for sorting) takes ~140 msec.
> b) making a search with Lucene's Sort takes ~6046 msec.
> So, there is something to be fixed in Lucene.Net
> DIGY
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
[jira] Commented: (LUCENENET-107) Search while sorting is slow
Posted by "Digy (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/LUCENENET-107?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12558225#action_12558225 ]
Digy commented on LUCENENET-107:
--------------------------------
Most of the time of searching&sorting is spent in warming up the cache (FieldCacheImpl.Cache.Get.CreateValue).
Should there be a new method or parameter forcing not to cache the terms and just do the sort if IndexSearcher(or IndexReader) is to be used just once?
DIGY
> Search while sorting is slow
> ----------------------------
>
> Key: LUCENENET-107
> URL: https://issues.apache.org/jira/browse/LUCENENET-107
> Project: Lucene.Net
> Issue Type: Bug
> Environment: Lucene.Net 2.1
> Reporter: Digy
> Attachments: TestSort.rar
>
>
> Mark wrote:
> "
> I have an index that when i search for term with out sorting the results are returned in .7 seconds or less. When I choose to search with a column chosen for sorting the results will return in 7 - 10 seconds. The larger the # of hits the longer it takes when sorting. The fields that can be sorted on are unique to the document,stored in the index as un_tokenized. I am using lucene.net 2.0. Index size is normally around 2.5 gig but i have the same problem on a 500 meg index too. Has anyone else experienced this?
> Any help will be greatly appreciated.
> Thanks,
> Mark Burks
> "
> I have prepared a test case where TestIndex contains ~200000 docs.
> a) making a search and then sorting the (500)results (I used SortedDictionary for sorting) takes ~140 msec.
> b) making a search with Lucene's Sort takes ~6046 msec.
> So, there is something to be fixed in Lucene.Net
> DIGY
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
[jira] Closed: (LUCENENET-107) Search while sorting is slow
Posted by "Digy (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/LUCENENET-107?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Digy closed LUCENENET-107.
--------------------------
Resolution: Won't Fix
Assignee: Digy
Divergence from java.
> Search while sorting is slow
> ----------------------------
>
> Key: LUCENENET-107
> URL: https://issues.apache.org/jira/browse/LUCENENET-107
> Project: Lucene.Net
> Issue Type: Bug
> Environment: Lucene.Net 2.1
> Reporter: Digy
> Assignee: Digy
> Attachments: TestSort.rar
>
>
> Mark wrote:
> "
> I have an index that when i search for term with out sorting the results are returned in .7 seconds or less. When I choose to search with a column chosen for sorting the results will return in 7 - 10 seconds. The larger the # of hits the longer it takes when sorting. The fields that can be sorted on are unique to the document,stored in the index as un_tokenized. I am using lucene.net 2.0. Index size is normally around 2.5 gig but i have the same problem on a 500 meg index too. Has anyone else experienced this?
> Any help will be greatly appreciated.
> Thanks,
> Mark Burks
> "
> I have prepared a test case where TestIndex contains ~200000 docs.
> a) making a search and then sorting the (500)results (I used SortedDictionary for sorting) takes ~140 msec.
> b) making a search with Lucene's Sort takes ~6046 msec.
> So, there is something to be fixed in Lucene.Net
> DIGY
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.