You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@lucy.apache.org by "David E. Wheeler (Commented) (JIRA)" <ji...@apache.org> on 2011/11/08 18:23:51 UTC

[lucy-issues] [jira] [Commented] (LUCY-183) Eliminate spurious "extra" query normalization

    [ https://issues.apache.org/jira/browse/LUCY-183?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13146411#comment-13146411 ] 

David E. Wheeler commented on LUCY-183:
---------------------------------------

The fix seems to have broken something else. See [this test failure|http://www.cpantesters.org/cpan/report/35c78d0a-09cf-11e1-90dc-a20eabe3e1f7], especially this:

{code}

#   Failed test 'Should have results for simple search'
#   at t/base.t line 274.
#     Structures begin differing at:
#          $got->{hits}[0]{excerpt} = 'This is the <strong>pair</strong><strong> README file. Here you will find all thingds related to </strong><strong>pair</strong>, including installation information'
#     $expected->{hits}[0]{excerpt} = 'This is the <strong>pair</strong> README file. Here you will find all thingds related to <strong>pair</strong>, including installation information'
{code}
                
> Eliminate spurious "extra" query normalization
> ----------------------------------------------
>
>                 Key: LUCY-183
>                 URL: https://issues.apache.org/jira/browse/LUCY-183
>             Project: Lucy
>          Issue Type: Bug
>          Components: Search
>    Affects Versions: 0.1.0 (incubating), 0.2.0 (incubating), 0.2.1 (incubating)
>            Reporter: Marvin Humphrey
>            Assignee: Marvin Humphrey
>             Fix For: 0.2.2 (incubating), 0.3.0 (incubating)
>
>         Attachments: normalization.patch
>
>
> Query normalization is supposed to scale all scores uniformly by a simple
> multiplier, but the child nodes in complex queries are presently getting
> "extra" normalization applied to them.  This has the effect of scaling
> different subqueries by different amounts, changing the balance of the
> subqueries within a complex query, interfering with IDF weighting and subtly
> degrading relevancy.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira