You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by "Marcos Bori (JIRA)" <ji...@apache.org> on 2017/08/25 12:16:00 UTC

[jira] [Created] (NUTCH-2413) When fetching and parsing together, parameter "parse.filter.urls" is ignored

Marcos Bori created NUTCH-2413:
----------------------------------

             Summary: When fetching and parsing together, parameter "parse.filter.urls" is ignored
                 Key: NUTCH-2413
                 URL: https://issues.apache.org/jira/browse/NUTCH-2413
             Project: Nutch
          Issue Type: Bug
          Components: fetcher, parser
         Environment: Apache Nutch release 1.13.
            Reporter: Marcos Bori


In a situation when we want to:
(1) Execute the fetch and parse together ("fetcher.parse" setting to "true")
(2) Avoid applying the URL filters when executing this phase.

Condition (2) can be configured when parsing is executed as a separate process by setting "parse.filter.urls" to "false".
However, this setting ("parse.filter.urls") is ignored when we execute the fetch and parse phases together. 




--
This message was sent by Atlassian JIRA
(v6.4.14#64029)