You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by "Luis Lopez (JIRA)" <ji...@apache.org> on 2015/03/19 23:51:38 UTC
[jira] [Created] (NUTCH-1971) The crawldb.url.filters property is
not present in any configuration file
Luis Lopez created NUTCH-1971:
---------------------------------
Summary: The crawldb.url.filters property is not present in any configuration file
Key: NUTCH-1971
URL: https://issues.apache.org/jira/browse/NUTCH-1971
Project: Nutch
Issue Type: Improvement
Components: crawldb
Affects Versions: 1.9
Reporter: Luis Lopez
In CrawlDbFilter.java there is a line for getting a boolean that sets if the filters are going to be applied or not:
public static final String URL_FILTERING = "crawldb.url.filters";
However in nutch-default.xml that property is not present. Currently the only way to set this value is using the -filter parameter from the command line.
The same applies to:
public static final String URL_NORMALIZING = "crawldb.url.normalizers";
public static final String URL_NORMALIZING_SCOPE = "crawldb.url.normalizers.scope";
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)