Posted to dev@nutch.apache.org by "byron miller (JIRA)" <ji...@apache.org> on 2005/04/20 23:26:27 UTC

[jira] Created: (NUTCH-48) "Did you mean" query enhancement/refignment feature request

"Did you mean"  query enhancement/refignment feature request
------------------------------------------------------------

         Key: NUTCH-48
         URL: http://issues.apache.org/jira/browse/NUTCH-48
     Project: Nutch
        Type: New Feature
  Components: web gui  
 Environment: All platforms
    Reporter: byron miller


Looking to implement a "Did you mean" feature: when a query returns <= x results, the result page would recommend a corrected, related, or spell-checked query to try.
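
A rough illustration of the trigger described above, not code from any patch on this issue: a suggestion is offered only when the hit count is at or below a threshold. The class name, the maxHits parameter (standing in for "x") and the suggester function are all hypothetical; a real spell checker would be plugged in where the toy lambda is.

```java
import java.util.Optional;
import java.util.function.Function;

class DidYouMeanSketch {
    /**
     * Returns a suggestion only when the original query produced few hits.
     * maxHits plays the role of "x" in the description; suggester is a
     * hypothetical stand-in for whatever spell checker is plugged in.
     */
    public static Optional<String> suggest(String query, long totalHits, long maxHits,
                                           Function<String, String> suggester) {
        if (totalHits > maxHits) {
            return Optional.empty(); // enough results, no hint needed
        }
        String candidate = suggester.apply(query);
        // Skip empty suggestions and ones identical to what the user typed.
        if (candidate == null || candidate.isEmpty() || candidate.equalsIgnoreCase(query)) {
            return Optional.empty();
        }
        return Optional.of(candidate);
    }

    public static void main(String[] args) {
        // Toy suggester that only knows one misspelling.
        Function<String, String> toy = q -> q.replace("serach", "search");
        System.out.println(suggest("serach engine", 2, 5, toy));   // few hits, typo found
        System.out.println(suggest("search engine", 2, 5, toy));   // few hits, nothing to fix
        System.out.println(suggest("serach engine", 900, 5, toy)); // plenty of hits
    }
}
```

Returning an Optional keeps the "no suggestion" case explicit, so the result page can simply skip the hint when it is empty.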

Note from Doug to users list:


David Spencer has worked on this some.

http://www.searchmorph.com/weblog/index.php?id=23

I think the code on his site might be more recent than what's committed
to the lucene/contrib directory.


-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
   http://www.atlassian.com/software/jira


[jira] Commented: (NUTCH-48) "Did you mean" query enhancement/refignment feature request

Posted by "Daniel Drozdovich (JIRA)" <ji...@apache.org>.
    [ http://issues.apache.org/jira/browse/NUTCH-48?page=comments#action_12428269 ] 
            
Daniel Drozdovich commented on NUTCH-48:
----------------------------------------

Hi, could somebody give me a version of the given patch for 0.7.2? I have not yet moved to 0.8, as it is still not stable.
Regards, Daniel.

> "Did you mean"  query enhancement/refignment feature request
> ------------------------------------------------------------
>
>                 Key: NUTCH-48
>                 URL: http://issues.apache.org/jira/browse/NUTCH-48
>             Project: Nutch
>          Issue Type: New Feature
>          Components: web gui
>         Environment: All platforms
>            Reporter: byron miller
>         Assigned To: Sami Siren
>            Priority: Minor
>         Attachments: did-you-mean-combined08.patch, rss-spell.patch, spell-check.patch
>
>
> Looking to implement a "Did you mean" feature for query result pages that return < = x amount of results to invoke a response that would recommend a fixed/related or spell checked query to try.
> Note from Doug to users list:
> David Spencer has worked on this some.
> http://www.searchmorph.com/weblog/index.php?id=23
> I think the code on his site might be more recent than what's committed
> to the lucene/contrib directory.


[jira] Updated: (NUTCH-48) "Did you mean" query enhancement/refignment feature request

Posted by "Andy Liu (JIRA)" <ji...@apache.org>.
     [ http://issues.apache.org/jira/browse/NUTCH-48?page=all ]

Andy Liu updated NUTCH-48:
--------------------------

    Attachment: spell-check.patch

Run this command:

bin/nutch org.apache.nutch.spell.NGramSpeller -i [main index] -o [output spelling index] -f content -minThreshold 500

to generate the n-gram spelling index.  minThreshold tells NGramSpeller to include only terms whose document frequency is higher than the given value (500 above).  Your index will contain a lot of misspelled words, so this parameter helps you exclude many of them.  You'll have to experiment to find which value works best for you.

After you generate the index, you can test using:

bin/nutch org.apache.nutch.spell.SpellCheckerBean [spelling index]

To activate spell checking, uncomment the line in search.jsp that includes spell-check.jsp.  You'll also have to edit a config parameter in nutch-site.xml to define where your spelling index is located.

There are a number of other settings you can tweak when generating and using your n-gram spelling index.  Look at the comments in NGramSpeller (written by David Spencer) and SpellCheckerBean for more details.
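
To make the two ideas above concrete, here is a minimal sketch, assuming a plain in-memory map instead of a Lucene index. It is not the NGramSpeller code: the class and method names are made up for illustration, and the single gram size of 3 is an arbitrary choice. It only shows how a document-frequency cutoff drops rare (likely misspelled) terms before each surviving term is broken into character n-grams.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

class NGramIndexSketch {
    /** Splits a term into overlapping character n-grams of length n. */
    public static List<String> ngrams(String term, int n) {
        List<String> grams = new ArrayList<>();
        for (int i = 0; i + n <= term.length(); i++) {
            grams.add(term.substring(i, i + n));
        }
        return grams;
    }

    /**
     * Keeps only terms whose document frequency is higher than minThreshold
     * and maps each kept term to its n-grams. The strict comparison follows
     * the wording "higher than" in the comment above; it is an assumption
     * about the real tool's behaviour.
     */
    public static Map<String, List<String>> build(Map<String, Integer> docFreq,
                                                  int minThreshold, int n) {
        Map<String, List<String>> index = new TreeMap<>();
        for (Map.Entry<String, Integer> e : docFreq.entrySet()) {
            if (e.getValue() > minThreshold) {
                index.put(e.getKey(), ngrams(e.getKey(), n));
            }
        }
        return index;
    }

    public static void main(String[] args) {
        // Made-up document frequencies; "serach" is the rare misspelling.
        Map<String, Integer> docFreq = Map.of("search", 12000, "serach", 40, "engine", 9000);
        System.out.println(build(docFreq, 500, 3));
    }
}
```

With the made-up frequencies in main, "serach" falls below the cutoff of 500 and never reaches the spelling index, which is the effect minThreshold is meant to have.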




[jira] Updated: (NUTCH-48) "Did you mean" query enhancement/refignment feature request

Posted by "Stefan Neufeind (JIRA)" <ji...@apache.org>.
     [ http://issues.apache.org/jira/browse/NUTCH-48?page=all ]

Stefan Neufeind updated NUTCH-48:
---------------------------------

    Attachment: did-you-mean-combined08.patch

Here are both patches combined into one, built against 0.8-dev (namely: nightly-2006-05-20).

- The necessary API changes in 0.8-dev are incorporated in the patch.
- Some smaller things are also fixed, e.g.:
--- missing ../ in front of link to search.jsp
--- missing </p> at end of did-you-mean-part

Small to-do left: maybe move the text "Did you mean" into a template so it can be translated into other languages. But I guess that can be done when finally merging this into the dev-tree.

Patch tested and proved to work.




[jira] Commented: (NUTCH-48) "Did you mean" query enhancement/refignment feature request

Posted by "Stefan Neufeind (JIRA)" <ji...@apache.org>.
    [ http://issues.apache.org/jira/browse/NUTCH-48?page=comments#action_12415970 ] 

Stefan Neufeind commented on NUTCH-48:
--------------------------------------

Could somebody please have a look? I currently lack a test-system to try that ...




[jira] Updated: (NUTCH-48) "Did you mean" query enhancement/refignment feature request

Posted by "Aled Rhys Jones (JIRA)" <ji...@apache.org>.
     [ http://issues.apache.org/jira/browse/NUTCH-48?page=all ]

Aled Rhys Jones updated NUTCH-48:
---------------------------------

    Attachment: rss-spell.patch

Added a patch that adds the spelling correction to the RSS feed in the following OpenSearch format:
<opensearch:Query role="correction" searchTerms="corrected keywords"/>

This patch must be applied after spell-check.patch.
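
For illustration only, a minimal sketch of how such an element could be produced. This is not taken from rss-spell.patch; the class and method names are hypothetical, and only the element layout comes from the format shown above. The one real point it makes is that the corrected terms should be XML-escaped before they are written into the attribute.

```java
class OpenSearchCorrectionSketch {
    /** Escapes the characters that are unsafe inside a double-quoted XML attribute. */
    public static String escapeAttr(String s) {
        StringBuilder out = new StringBuilder(s.length());
        for (int i = 0; i < s.length(); i++) {
            char c = s.charAt(i);
            switch (c) {
                case '&': out.append("&amp;"); break;
                case '<': out.append("&lt;"); break;
                case '>': out.append("&gt;"); break;
                case '"': out.append("&quot;"); break;
                default: out.append(c);
            }
        }
        return out.toString();
    }

    /** Builds the correction element in the format quoted above. */
    public static String correctionElement(String corrected) {
        return "<opensearch:Query role=\"correction\" searchTerms=\""
                + escapeAttr(corrected) + "\"/>";
    }

    public static void main(String[] args) {
        System.out.println(correctionElement("corrected keywords"));
        System.out.println(correctionElement("nutch & lucene"));
    }
}
```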





[jira] Assigned: (NUTCH-48) "Did you mean" query enhancement/refignment feature request

Posted by "Sami Siren (JIRA)" <ji...@apache.org>.
     [ http://issues.apache.org/jira/browse/NUTCH-48?page=all ]

Sami Siren reassigned NUTCH-48:
-------------------------------

    Assign To: Sami Siren




[jira] Updated: (NUTCH-48) "Did you mean" query enhancement/refignment feature request

Posted by "byron miller (JIRA)" <ji...@apache.org>.
     [ http://issues.apache.org/jira/browse/NUTCH-48?page=all ]

byron miller updated NUTCH-48:
------------------------------

    Priority: Minor  (was: Major)




[jira] Commented: (NUTCH-48) "Did you mean" query enhancement/refignment feature request

Posted by "Sami Siren (JIRA)" <ji...@apache.org>.
    [ http://issues.apache.org/jira/browse/NUTCH-48?page=comments#action_12415016 ] 

Sami Siren commented on NUTCH-48:
---------------------------------

Stefan, I tried to apply your combined patch, but it seems that the test case does not compile.




[jira] Commented: (NUTCH-48) "Did you mean" query enhancement/refignment feature request

Posted by "Aled Rhys Jones (JIRA)" <ji...@apache.org>.
    [ http://issues.apache.org/jira/browse/NUTCH-48?page=comments#action_12371991 ] 

Aled Rhys Jones commented on NUTCH-48:
--------------------------------------

This functionality also needs to be incorporated into the RSS results feed.




[jira] Commented: (NUTCH-48) "Did you mean" query enhancement/refignment feature request

Posted by "Andy Liu (JIRA)" <ji...@apache.org>.
     [ http://issues.apache.org/jira/browse/NUTCH-48?page=comments#action_63390 ]
     
Andy Liu commented on NUTCH-48:
-------------------------------

I have implemented a rough version of this feature using David Spencer's code.  I will submit a patch when I get the chance.

