Posted to dev@lucene.apache.org by "Yonik Seeley (JIRA)" <ji...@apache.org> on 2010/08/25 20:07:16 UTC

[jira] Created: (SOLR-2089) Faceting: order term ords before converting to values

Faceting: order term ords before converting to values
-----------------------------------------------------

                 Key: SOLR-2089
                 URL: https://issues.apache.org/jira/browse/SOLR-2089
             Project: Solr
          Issue Type: Improvement
    Affects Versions: 1.4
            Reporter: Yonik Seeley


We should be able to speed up multi-valued faceting that sorts by count and returns many values by sorting the term ords before converting them to strings.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


[jira] Commented: (SOLR-2089) Faceting: order term ords before converting to values

Posted by "Yonik Seeley (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/SOLR-2089?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12902731#action_12902731 ] 

Yonik Seeley commented on SOLR-2089:
------------------------------------

I've committed this to trunk.  Will backport to 3x.




[jira] Commented: (SOLR-2089) Faceting: order term ords before converting to values

Posted by "Yonik Seeley (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/SOLR-2089?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12902623#action_12902623 ] 

Yonik Seeley commented on SOLR-2089:
------------------------------------

Results:
docs=10M, docs matching query=1M, facet on field of 100,000 unique terms, facet.method=fc (multivalued)

||facet.limit||time to facet on trunk (ms)||time to facet with patch (ms)||
|100|63|63|
|1000|228|191|
|5000|722|307|
|10000|1033|316|

So a decent speedup when facet.limit is very high: about 2.4x at facet.limit=5000 and about 3.3x at 10000, with no change at 100.
It will also help when facet.limit is high relative to the number of unique terms, since the speedup comes from looking up the term ords in sorted order and therefore not having to seek as often.
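
To illustrate the idea (this is not the code from SOLR-2089.patch), here is a minimal, self-contained sketch. Everything in it is hypothetical: `TermDict` is a toy stand-in for a term dictionary in which moving backwards counts as a seek, and `lookupInCountOrder` / `lookupInOrdOrder` are invented names. It only shows that visiting the top ords in ascending order yields the same count-ordered result with fewer repositionings.

```java
import java.util.Arrays;

class OrdSortedFacetLookup {

    /** Toy term dictionary (an assumption, not Lucene's API): index == ord. */
    static final class TermDict {
        private final String[] terms;
        private int pos = 0;
        int seeks = 0;

        TermDict(String[] terms) {
            this.terms = terms;
        }

        String lookup(int ord) {
            if (ord < pos) {
                seeks++; // moving backwards forces a re-seek in this toy model
            }
            pos = ord;
            return terms[ord];
        }
    }

    /** Baseline: convert ords to terms in the order they come out of the count sort. */
    static String[] lookupInCountOrder(TermDict dict, int[] ordsByCount) {
        String[] out = new String[ordsByCount.length];
        for (int i = 0; i < ordsByCount.length; i++) {
            out[i] = dict.lookup(ordsByCount[i]);
        }
        return out;
    }

    /** Visit the same ords in ascending ord order, writing each term back to its count-ordered slot. */
    static String[] lookupInOrdOrder(TermDict dict, int[] ordsByCount) {
        int n = ordsByCount.length;
        Integer[] slots = new Integer[n];
        for (int i = 0; i < n; i++) {
            slots[i] = i;
        }
        // Sort the result slots by the ord they hold, not by count.
        Arrays.sort(slots, (a, b) -> Integer.compare(ordsByCount[a], ordsByCount[b]));
        String[] out = new String[n];
        for (int slot : slots) {
            out[slot] = dict.lookup(ordsByCount[slot]);
        }
        return out;
    }

    public static void main(String[] args) {
        String[] terms = {"a", "b", "c", "d", "e"};
        int[] ordsByCount = {3, 0, 4, 1}; // top ords, highest count first

        TermDict d1 = new TermDict(terms);
        TermDict d2 = new TermDict(terms);
        System.out.println(Arrays.toString(lookupInCountOrder(d1, ordsByCount)) + " seeks=" + d1.seeks);
        System.out.println(Arrays.toString(lookupInOrdOrder(d2, ordsByCount)) + " seeks=" + d2.seeks);
    }
}
```

Both methods return the terms in count order; in this toy model the count-ordered pass repositions twice for the sample input, while the ord-sorted pass never moves backwards. The real cost of a seek depends on the term index, so the numbers here say nothing about actual timings.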

I plan on committing soon if there are no objections.




[jira] Updated: (SOLR-2089) Faceting: order term ords before converting to values

Posted by "Yonik Seeley (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/SOLR-2089?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Yonik Seeley updated SOLR-2089:
-------------------------------

    Attachment: SOLR-2089.patch

Here's a draft patch that sorts the term ords before the term value lookup.
Currently untested for both speed and correctness.




[jira] Resolved: (SOLR-2089) Faceting: order term ords before converting to values

Posted by "Yonik Seeley (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/SOLR-2089?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Yonik Seeley resolved SOLR-2089.
--------------------------------

    Fix Version/s: 3.1
                   4.0
       Resolution: Fixed

OK, this has been backported to 3x, along with a fix for a bug that Mark noticed (it was not triggering a test failure because of the way internal exceptions are handled in faceting).  SolrTestCaseJ4.assertQ has been updated to fail if faceting is on and an exception appears in the response.
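
As a rough illustration of that kind of check (not the actual SolrTestCaseJ4 code), the sketch below evaluates an XPath expression against an XML response and reports whether an exception entry is present in the facet section. The response layout, with the exception stored as an element named `exception` under `facet_counts`, is an assumption here, and `FacetExceptionCheck` and `facetingSucceeded` are hypothetical names.

```java
import java.io.StringReader;
import javax.xml.xpath.XPath;
import javax.xml.xpath.XPathConstants;
import javax.xml.xpath.XPathExpressionException;
import javax.xml.xpath.XPathFactory;
import org.xml.sax.InputSource;

class FacetExceptionCheck {

    // Assumed layout: <lst name="facet_counts"> holds a child named "exception" on failure.
    static final String NO_FACET_EXCEPTION =
        "count(//lst[@name='facet_counts']/*[@name='exception'])=0";

    /** Returns true when the response carries no exception entry in its facet section. */
    static boolean facetingSucceeded(String responseXml) {
        XPath xpath = XPathFactory.newInstance().newXPath();
        try {
            return (Boolean) xpath.evaluate(
                NO_FACET_EXCEPTION,
                new InputSource(new StringReader(responseXml)),
                XPathConstants.BOOLEAN);
        } catch (XPathExpressionException e) {
            throw new IllegalStateException("could not evaluate facet check", e);
        }
    }

    public static void main(String[] args) {
        String ok = "<response><lst name=\"facet_counts\"><lst name=\"facet_fields\"/></lst></response>";
        String bad = "<response><lst name=\"facet_counts\">"
            + "<str name=\"exception\">java.lang.RuntimeException</str></lst></response>";
        System.out.println(facetingSucceeded(ok));
        System.out.println(facetingSucceeded(bad));
    }
}
```

A test helper could add such an expression to the caller's own checks whenever the request has faceting enabled, so that a swallowed faceting exception fails the test instead of passing silently.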

