Posted to dev@nutch.apache.org by "Dennis Kubes (JIRA)" <ji...@apache.org> on 2010/08/09 20:51:16 UTC
[jira] Updated: (NUTCH-877) Allow setting of slop values for non-quote phrase queries on query-basic plugin
[ https://issues.apache.org/jira/browse/NUTCH-877?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Dennis Kubes updated NUTCH-877:
-------------------------------
Attachment: NUTCH-877-1-20100809.patch
Adds the query.phrase.slop configuration variable to nutch-default.xml and sets it in the setConf method of the query-basic plugin for use in phrase queries.
> Allow setting of slop values for non-quote phrase queries on query-basic plugin
> -------------------------------------------------------------------------------
>
> Key: NUTCH-877
> URL: https://issues.apache.org/jira/browse/NUTCH-877
> Project: Nutch
> Issue Type: Improvement
> Components: searcher
> Affects Versions: 1.2
> Environment: All
> Reporter: Dennis Kubes
> Assignee: Dennis Kubes
> Fix For: 1.2
>
> Attachments: NUTCH-877-1-20100809.patch
>
>
> This patch adds a configuration variable for setting the slop value on phrase queries. The default slop is Integer.MAX_VALUE, and it currently cannot be changed through configuration. That produces explain output like the following, which doesn't seem right to me. When you search for a phrase, you usually want the terms within a certain distance of each other:
> 2.9141337E-4 = weight(content:"my phrase"~2147483647 in 1029), product of:
>   0.07163286 = queryWeight(content:"my phrase"~2147483647), product of:
>     9.657982 = idf(content: my=13470 phrase=534)
>     0.0074169594 = queryNorm
> This patch adds the query.phrase.slop configuration value to the nutch-default.xml file. It has a default setting of 5.
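To make the effect of the slop value concrete, here is a minimal, self-contained sketch. It is not the code from the attached patch and it does not use Nutch or Lucene: java.util.Properties stands in for the Nutch configuration, and the class PhraseSlopSketch and its methods readSlop and matchesWithinSlop are hypothetical names introduced only for illustration. The matcher is a deliberate simplification that handles a two-term phrase in order and counts the tokens between the terms; Lucene's real sloppy phrase matching is more general. Only the property name query.phrase.slop and the default of 5 come from the issue text.

```java
import java.util.Properties;

/** Illustrative only: not the NUTCH-877 patch and not Lucene's phrase matcher. */
final class PhraseSlopSketch {

    // Property name and default value as stated in the issue description.
    static final String SLOP_KEY = "query.phrase.slop";
    static final int DEFAULT_SLOP = 5;

    /** Reads the slop from a Properties object (an assumed stand-in for Nutch's configuration). */
    static int readSlop(Properties conf) {
        return Integer.parseInt(conf.getProperty(SLOP_KEY, String.valueOf(DEFAULT_SLOP)));
    }

    /**
     * Simplified in-order check for a two-term phrase: returns true when
     * `second` occurs after `first` with at most `slop` tokens in between.
     */
    static boolean matchesWithinSlop(String[] tokens, String first, String second, int slop) {
        for (int i = 0; i < tokens.length; i++) {
            if (!tokens[i].equals(first)) {
                continue;
            }
            for (int j = i + 1; j < tokens.length; j++) {
                int tokensBetween = j - i - 1;
                if (tokens[j].equals(second) && tokensBetween <= slop) {
                    return true;
                }
            }
        }
        return false;
    }

    public static void main(String[] args) {
        // Six tokens separate "my" and "phrase" in this made-up document.
        String[] doc = "my very long and rather winding descriptive phrase here".split(" ");

        // No override is set, so the default from the issue text applies.
        int configured = readSlop(new Properties());

        System.out.println("slop=" + configured + " matches: "
                + matchesWithinSlop(doc, "my", "phrase", configured));
        System.out.println("slop=Integer.MAX_VALUE matches: "
                + matchesWithinSlop(doc, "my", "phrase", Integer.MAX_VALUE));
    }
}
```

With an unbounded slop, any document that contains both terms in this order counts as a phrase match regardless of how far apart they are, which is the behaviour the description objects to. A small configurable slop restricts matches to terms that are actually near each other.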
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.