You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@lucene.apache.org by "Gautam Worah (Jira)" <ji...@apache.org> on 2021/05/18 17:33:00 UTC

[jira] [Created] (LUCENE-9964) FacetResult.labelValues.value is not accurate for duplicate labels in a document

Gautam Worah created LUCENE-9964:
------------------------------------

             Summary: FacetResult.labelValues.value is not accurate for duplicate labels in a document
                 Key: LUCENE-9964
                 URL: https://issues.apache.org/jira/browse/LUCENE-9964
             Project: Lucene - Core
          Issue Type: Improvement
          Components: modules/facet
    Affects Versions: 8.8.1
            Reporter: Gautam Worah


As part of a separate [bug|https://github.com/apache/lucene/pull/131] in FacetResult#value we discovered that FacetResult.labelValues.value is not accurate for duplicate labels in a document that uses SortedNumericDocValuesFields.

In theory, each label should only be counted once from a document when returning the labelValues, but today, each duplicate label in a document is counted uniquely.

A test case showing the current (inaccurate) output is [here|https://github.com/gautamworah96/lucene/commit/042878117308f76629a27b0bcf83e25f074dc8b1#diff-6fda2d7520edb7e11d4af3f5b6e80c073a0e20790ec42692573137959837d742]



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@lucene.apache.org
For additional commands, e-mail: issues-help@lucene.apache.org