You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by "Robert Muir (JIRA)" <ji...@apache.org> on 2013/11/28 06:53:35 UTC

[jira] [Updated] (SOLR-5512) Optimize DocValuesFacets

     [ https://issues.apache.org/jira/browse/SOLR-5512?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Robert Muir updated SOLR-5512:
------------------------------

    Attachment: SOLR-5512.patch

patch.

also fixes a few unrelated bugs that got in the way:
* BitDocSet's DISI's cost() method doesnt prorate for the size of the segment relative to the whole thing (since the bitset is unfortunately top-level)
* If you hit an exception during faceting, the stacktrace is completely lost because of a bug in SimpleFacets (from SOLR-2548)

> Optimize DocValuesFacets 
> -------------------------
>
>                 Key: SOLR-5512
>                 URL: https://issues.apache.org/jira/browse/SOLR-5512
>             Project: Solr
>          Issue Type: Improvement
>            Reporter: Robert Muir
>         Attachments: SOLR-5512.patch
>
>
> This works well in the general case (esp with huge numbers of unique values), but the SortedSetDocValuesAccumulator in lucene/facets does the algorithm better for typical cases (smaller number of unique values wrt the size of the document set).
> In this case, it collects directly with per-segment local ords, then remaps as a second step. So this is a lot less remapping.
> Its too bad the code is separate at the moment, for now lets steal the heuristic.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org