You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by "Luis Lopez (JIRA)" <ji...@apache.org> on 2015/03/19 23:51:38 UTC

[jira] [Created] (NUTCH-1971) The crawldb.url.filters property is not present in any configuration file

Luis Lopez created NUTCH-1971:
---------------------------------

             Summary: The crawldb.url.filters property is not present in any configuration file
                 Key: NUTCH-1971
                 URL: https://issues.apache.org/jira/browse/NUTCH-1971
             Project: Nutch
          Issue Type: Improvement
          Components: crawldb
    Affects Versions: 1.9
            Reporter: Luis Lopez


In CrawlDbFilter.java there is a line for getting a boolean that sets if the filters are going to be applied or not: 

  public static final String URL_FILTERING = "crawldb.url.filters";

However in nutch-default.xml that property is not present. Currently the only way to set this value is using the -filter parameter from the command line. 

The same applies to:  
public static final String URL_NORMALIZING = "crawldb.url.normalizers";
public static final String URL_NORMALIZING_SCOPE = "crawldb.url.normalizers.scope";



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)