You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@lucene.apache.org by GitBox <gi...@apache.org> on 2022/03/02 00:57:22 UTC

[GitHub] [lucene] gsmiller opened a new pull request #719: LUCENE-10444: Support alternate aggregation functions in association facets

gsmiller opened a new pull request #719:
URL: https://github.com/apache/lucene/pull/719


   This is a backport of #718. It provides backwards-compatiblity by delegating existing "sum association faceting" implementations to the new classes. This also provided an easy way to benchmark the change against the existing version. Results of `luceneutil` on `wikimedium10m` are here:
   
   ```
                               TaskQPS baseline      StdDevQPS candidate      StdDev                Pct diff p-value
          BrowseDayOfYearSSDVFacets       12.45     (17.5%)       12.15     (16.2%)   -2.4% ( -30% -   38%) 0.658
        BrowseRandomLabelTaxoFacets       18.18     (17.9%)       17.78     (16.8%)   -2.2% ( -31% -   39%) 0.693
                          MedPhrase       10.66      (3.1%)       10.53      (3.0%)   -1.2% (  -7% -    5%) 0.220
                        AndHighHigh       70.63      (3.6%)       69.99      (4.3%)   -0.9% (  -8% -    7%) 0.462
                   HighSloppyPhrase       31.76      (3.4%)       31.52      (3.3%)   -0.7% (  -7% -    6%) 0.483
                    MedSloppyPhrase        7.89      (4.0%)        7.84      (2.7%)   -0.7% (  -7% -    6%) 0.524
                            Respell       51.61      (1.0%)       51.29      (1.5%)   -0.6% (  -3% -    1%) 0.135
                            LowTerm     1877.52      (3.6%)     1866.08      (4.0%)   -0.6% (  -7% -    7%) 0.615
                         AndHighMed      215.42      (3.1%)      214.17      (3.2%)   -0.6% (  -6% -    5%) 0.560
                             Fuzzy2       71.94      (1.0%)       71.56      (1.5%)   -0.5% (  -2% -    1%) 0.180
                         OrHighHigh       26.01      (3.9%)       25.88      (4.5%)   -0.5% (  -8% -    8%) 0.702
                          LowPhrase      141.37      (2.0%)      140.65      (2.1%)   -0.5% (  -4% -    3%) 0.440
                          OrHighMed       91.32      (3.6%)       90.93      (4.1%)   -0.4% (  -7% -    7%) 0.722
              BrowseMonthTaxoFacets       28.47     (23.2%)       28.35     (23.4%)   -0.4% ( -38% -   60%) 0.955
                             IntNRQ      127.41      (1.5%)      126.93      (1.6%)   -0.4% (  -3% -    2%) 0.449
                             Fuzzy1       64.42      (1.1%)       64.24      (1.9%)   -0.3% (  -3% -    2%) 0.577
                          OrHighLow     1010.14      (2.5%)     1007.54      (2.8%)   -0.3% (  -5% -    5%) 0.756
                         AndHighLow     1629.56      (2.7%)     1626.70      (2.3%)   -0.2% (  -5% -    4%) 0.825
                    LowSloppyPhrase       54.82      (2.1%)       54.75      (1.3%)   -0.1% (  -3% -    3%) 0.810
                           Wildcard      290.56     (10.4%)      290.30     (10.1%)   -0.1% ( -18% -   22%) 0.978
               BrowseDateSSDVFacets        2.35      (7.5%)        2.35      (6.3%)   -0.1% ( -12% -   14%) 0.969
                           HighTerm     1446.58      (4.3%)     1446.18      (4.6%)   -0.0% (  -8% -    9%) 0.985
                         HighPhrase      336.00      (1.2%)      335.97      (1.6%)   -0.0% (  -2% -    2%) 0.983
                       HighSpanNear       17.08      (3.9%)       17.09      (4.2%)    0.0% (  -7% -    8%) 0.993
                        MedSpanNear       18.96      (2.7%)       18.97      (3.3%)    0.0% (  -5% -    6%) 0.966
                MedIntervalsOrdered       16.32      (2.3%)       16.34      (2.0%)    0.1% (  -4% -    4%) 0.857
                            MedTerm     1927.39      (3.3%)     1932.58      (4.8%)    0.3% (  -7% -    8%) 0.836
               HighIntervalsOrdered       14.37      (3.9%)       14.41      (4.4%)    0.3% (  -7% -    8%) 0.828
                LowIntervalsOrdered       13.96      (3.1%)       14.00      (2.1%)    0.3% (  -4% -    5%) 0.699
            AndHighMedDayTaxoFacets       57.83      (2.2%)       58.05      (2.2%)    0.4% (  -3% -    4%) 0.582
                        LowSpanNear       25.48      (2.5%)       25.60      (2.2%)    0.5% (  -4% -    5%) 0.504
           AndHighHighDayTaxoFacets       20.55      (1.9%)       20.66      (2.5%)    0.5% (  -3% -    5%) 0.445
                            Prefix3      174.40     (13.7%)      175.42     (12.2%)    0.6% ( -22% -   30%) 0.887
               MedTermDayTaxoFacets       36.25      (4.1%)       36.46      (4.0%)    0.6% (  -7% -    9%) 0.649
               BrowseDateTaxoFacets       21.85     (21.0%)       21.98     (20.5%)    0.6% ( -33% -   53%) 0.927
                       OrHighNotLow     1422.88      (5.2%)     1431.97      (3.2%)    0.6% (  -7% -    9%) 0.639
                      OrHighNotHigh      909.10      (5.1%)      915.05      (3.0%)    0.7% (  -7% -    9%) 0.617
        BrowseRandomLabelSSDVFacets        9.34      (7.1%)        9.41      (7.2%)    0.7% ( -12% -   16%) 0.764
                       OrHighNotMed      963.33      (4.5%)      969.96      (3.7%)    0.7% (  -7% -    9%) 0.598
          BrowseDayOfYearTaxoFacets       21.89     (21.2%)       22.05     (20.7%)    0.7% ( -33% -   54%) 0.913
              BrowseMonthSSDVFacets       13.44     (15.1%)       13.55     (14.4%)    0.8% ( -24% -   35%) 0.861
                      OrNotHighHigh      890.96      (4.6%)      899.54      (3.2%)    1.0% (  -6% -    9%) 0.444
                           PKLookup      170.01      (3.6%)      171.68      (4.2%)    1.0% (  -6% -    9%) 0.425
                       OrNotHighMed     1002.77      (4.5%)     1013.80      (2.9%)    1.1% (  -5% -    8%) 0.354
                       OrNotHighLow     1401.09      (2.6%)     1416.61      (2.9%)    1.1% (  -4% -    6%) 0.198
             OrHighMedDayTaxoFacets        6.95      (4.5%)        7.04      (4.8%)    1.3% (  -7% -   11%) 0.378
               HighTermTitleBDVSort       45.01     (27.2%)       45.86     (22.6%)    1.9% ( -37% -   71%) 0.810
              HighTermDayOfYearSort       99.77     (16.7%)      103.20     (20.4%)    3.4% ( -28% -   48%) 0.561
                  HighTermMonthSort       94.79     (17.3%)      102.32     (26.9%)    7.9% ( -30% -   63%) 0.268
                         TermDTSort       81.57     (26.2%)       89.10     (23.4%)    9.2% ( -31% -   79%) 0.240
   ```


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@lucene.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@lucene.apache.org
For additional commands, e-mail: issues-help@lucene.apache.org