You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by "Dennis Kubes (JIRA)" <ji...@apache.org> on 2010/08/09 20:49:15 UTC

[jira] Created: (NUTCH-877) Allow setting of slop values for non-quote phrase queries on query-basic plugin

Allow setting of slop values for non-quote phrase queries on query-basic plugin
-------------------------------------------------------------------------------

                 Key: NUTCH-877
                 URL: https://issues.apache.org/jira/browse/NUTCH-877
             Project: Nutch
          Issue Type: Improvement
          Components: searcher
    Affects Versions: 1.2
         Environment: All
            Reporter: Dennis Kubes
            Assignee: Dennis Kubes
             Fix For: 1.2


Patch adds a configuration variable for setting slop values on phrase queries.  The default slop value, which currently can't be changed through configuration, is Integer.MAX_VALUE.  It produces something like this, which doesn't seem right to me.  If you are searching for a phrase you usually want it within a certain distance:

2.9141337E-4 = weight(content:"my phrase"~2147483647 in 1029), product of:

    * 0.07163286 = queryWeight(content:"my phrase"~2147483647), product of:
          o 9.657982 = idf(content: my=13470 phrase=534)
          o 0.0074169594 = queryNorm

This patch adds the query.phrase.slop configuration value to the nutch-default.xml file.  It has a default setting of 5.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (NUTCH-877) Allow setting of slop values for non-quote phrase queries on query-basic plugin

Posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/NUTCH-877?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12896889#action_12896889 ] 

Chris A. Mattmann commented on NUTCH-877:
-----------------------------------------

+1 from me too on this Dennis. Commit away! Please add this to branches/branch-1.2. Once Julien applies NUTCH-878, I'll respin a 1.2 RC.



> Allow setting of slop values for non-quote phrase queries on query-basic plugin
> -------------------------------------------------------------------------------
>
>                 Key: NUTCH-877
>                 URL: https://issues.apache.org/jira/browse/NUTCH-877
>             Project: Nutch
>          Issue Type: Improvement
>          Components: searcher
>    Affects Versions: 1.2
>         Environment: All
>            Reporter: Dennis Kubes
>            Assignee: Dennis Kubes
>             Fix For: 1.2
>
>         Attachments: NUTCH-877-1-20100809.patch
>
>
> Patch adds a configuration variable for setting slop values on phrase queries.  The default slop value, which currently can't be changed through configuration, is Integer.MAX_VALUE.  It produces something like this, which doesn't seem right to me.  If you are searching for a phrase you usually want it within a certain distance:
> 2.9141337E-4 = weight(content:"my phrase"~2147483647 in 1029), product of:
>     * 0.07163286 = queryWeight(content:"my phrase"~2147483647), product of:
>           o 9.657982 = idf(content: my=13470 phrase=534)
>           o 0.0074169594 = queryNorm
> This patch adds the query.phrase.slop configuration value to the nutch-default.xml file.  It has a default setting of 5.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (NUTCH-877) Allow setting of slop values for non-quote phrase queries on query-basic plugin

Posted by "Dennis Kubes (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/NUTCH-877?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Dennis Kubes updated NUTCH-877:
-------------------------------

    Attachment: NUTCH-877-1-20100809.patch

Adds query.phrase.slop configuration variable to nutch-default and setting of variable in setConf method of query-basic plugin for phrase queries.

> Allow setting of slop values for non-quote phrase queries on query-basic plugin
> -------------------------------------------------------------------------------
>
>                 Key: NUTCH-877
>                 URL: https://issues.apache.org/jira/browse/NUTCH-877
>             Project: Nutch
>          Issue Type: Improvement
>          Components: searcher
>    Affects Versions: 1.2
>         Environment: All
>            Reporter: Dennis Kubes
>            Assignee: Dennis Kubes
>             Fix For: 1.2
>
>         Attachments: NUTCH-877-1-20100809.patch
>
>
> Patch adds a configuration variable for setting slop values on phrase queries.  The default slop value, which currently can't be changed through configuration, is Integer.MAX_VALUE.  It produces something like this, which doesn't seem right to me.  If you are searching for a phrase you usually want it within a certain distance:
> 2.9141337E-4 = weight(content:"my phrase"~2147483647 in 1029), product of:
>     * 0.07163286 = queryWeight(content:"my phrase"~2147483647), product of:
>           o 9.657982 = idf(content: my=13470 phrase=534)
>           o 0.0074169594 = queryNorm
> This patch adds the query.phrase.slop configuration value to the nutch-default.xml file.  It has a default setting of 5.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (NUTCH-877) Allow setting of slop values for non-quote phrase queries on query-basic plugin

Posted by "Julien Nioche (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/NUTCH-877?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12896831#action_12896831 ] 

Julien Nioche commented on NUTCH-877:
-------------------------------------

+1 

> Allow setting of slop values for non-quote phrase queries on query-basic plugin
> -------------------------------------------------------------------------------
>
>                 Key: NUTCH-877
>                 URL: https://issues.apache.org/jira/browse/NUTCH-877
>             Project: Nutch
>          Issue Type: Improvement
>          Components: searcher
>    Affects Versions: 1.2
>         Environment: All
>            Reporter: Dennis Kubes
>            Assignee: Dennis Kubes
>             Fix For: 1.2
>
>         Attachments: NUTCH-877-1-20100809.patch
>
>
> Patch adds a configuration variable for setting slop values on phrase queries.  The default slop value, which currently can't be changed through configuration, is Integer.MAX_VALUE.  It produces something like this, which doesn't seem right to me.  If you are searching for a phrase you usually want it within a certain distance:
> 2.9141337E-4 = weight(content:"my phrase"~2147483647 in 1029), product of:
>     * 0.07163286 = queryWeight(content:"my phrase"~2147483647), product of:
>           o 9.657982 = idf(content: my=13470 phrase=534)
>           o 0.0074169594 = queryNorm
> This patch adds the query.phrase.slop configuration value to the nutch-default.xml file.  It has a default setting of 5.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Resolved: (NUTCH-877) Allow setting of slop values for non-quote phrase queries on query-basic plugin

Posted by "Julien Nioche (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/NUTCH-877?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Julien Nioche resolved NUTCH-877.
---------------------------------

    Resolution: Fixed

Committed revision 989733.


> Allow setting of slop values for non-quote phrase queries on query-basic plugin
> -------------------------------------------------------------------------------
>
>                 Key: NUTCH-877
>                 URL: https://issues.apache.org/jira/browse/NUTCH-877
>             Project: Nutch
>          Issue Type: Improvement
>          Components: searcher
>    Affects Versions: 1.2
>         Environment: All
>            Reporter: Dennis Kubes
>            Assignee: Dennis Kubes
>             Fix For: 1.2
>
>         Attachments: NUTCH-877-1-20100809.patch
>
>
> Patch adds a configuration variable for setting slop values on phrase queries.  The default slop value, which currently can't be changed through configuration, is Integer.MAX_VALUE.  It produces something like this, which doesn't seem right to me.  If you are searching for a phrase you usually want it within a certain distance:
> 2.9141337E-4 = weight(content:"my phrase"~2147483647 in 1029), product of:
>     * 0.07163286 = queryWeight(content:"my phrase"~2147483647), product of:
>           o 9.657982 = idf(content: my=13470 phrase=534)
>           o 0.0074169594 = queryNorm
> This patch adds the query.phrase.slop configuration value to the nutch-default.xml file.  It has a default setting of 5.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.