You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/04/01 16:35:07 UTC
[jira] [Updated] (NUTCH-453) Move stop words to a config file
[ https://issues.apache.org/jira/browse/NUTCH-453?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Markus Jelsma updated NUTCH-453:
--------------------------------
Bulk close of legacy issues:
http://www.lucidimagination.com/search/document/2738eeb014805854/clean_up_open_legacy_issues_in_jira
> Move stop words to a config file
> --------------------------------
>
> Key: NUTCH-453
> URL: https://issues.apache.org/jira/browse/NUTCH-453
> Project: Nutch
> Issue Type: Improvement
> Components: indexer, searcher
> Reporter: Steve Severance
> Priority: Minor
>
> Move the stop words from the code to a config file. This will allow the stop words to be modified without recompiling the code. The format could be the same as the regex-urlfilter where regexs are used to define the words or a plain text file of words could be used.
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira