Posted to commits@nutch.apache.org by sn...@apache.org on 2017/12/05 10:22:43 UTC

[nutch] branch master updated: NUTCH-2468 should filter out invalid URLs by default - enable plugin urlfilter-validator by default

This is an automated email from the ASF dual-hosted git repository.

snagel pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/nutch.git


The following commit(s) were added to refs/heads/master by this push:
     new d8754b7  NUTCH-2468 should filter out invalid URLs by default - enable plugin urlfilter-validator by default
d8754b7 is described below

commit d8754b7f88e73949dadaa0412aedea4427207f25
Author: Sebastian Nagel <sn...@apache.org>
AuthorDate: Tue Dec 5 11:00:17 2017 +0100

    NUTCH-2468 should filter out invalid URLs by default
    - enable plugin urlfilter-validator by default
---
 conf/nutch-default.xml | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/conf/nutch-default.xml b/conf/nutch-default.xml
index 745588d..8fce39c 100644
--- a/conf/nutch-default.xml
+++ b/conf/nutch-default.xml
@@ -1225,7 +1225,7 @@
 
 <property>
   <name>plugin.includes</name>
-  <value>protocol-http|urlfilter-regex|parse-(html|tika)|index-(basic|anchor)|indexer-solr|scoring-opic|urlnormalizer-(pass|regex|basic)</value>
+  <value>protocol-http|urlfilter-(regex|validator)|parse-(html|tika)|index-(basic|anchor)|indexer-solr|scoring-opic|urlnormalizer-(pass|regex|basic)</value>
   <description>Regular expression naming plugin directory names to
   include.  Any plugin not matching this expression is excluded.
   In any case you need at least include the nutch-extensionpoints plugin. By
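
For readers checking what the one-line change does in practice, here is a
minimal sketch (not part of the commit) that compares the old and new
plugin.includes values from the hunk above. It assumes a plugin is included
only when its whole directory name matches the expression, which is why
re.fullmatch is used. The helper names is_included and newly_included and
the candidate list are hypothetical, chosen for illustration only.

```python
import re

# Both values are copied verbatim from the diff hunk above.
OLD_INCLUDES = (
    "protocol-http|urlfilter-regex|parse-(html|tika)|index-(basic|anchor)"
    "|indexer-solr|scoring-opic|urlnormalizer-(pass|regex|basic)"
)
NEW_INCLUDES = (
    "protocol-http|urlfilter-(regex|validator)|parse-(html|tika)"
    "|index-(basic|anchor)|indexer-solr|scoring-opic"
    "|urlnormalizer-(pass|regex|basic)"
)


def is_included(includes, plugin_id):
    """Return True if the whole plugin directory name matches the expression.

    Assumption: names are matched in full, not by substring search.
    """
    return re.fullmatch(includes, plugin_id) is not None


def newly_included(old, new, plugin_ids):
    """List the plugin names accepted by `new` but rejected by `old`."""
    return [
        p for p in plugin_ids
        if is_included(new, p) and not is_included(old, p)
    ]


if __name__ == "__main__":
    # Hypothetical candidate names; only the second is enabled by this commit.
    candidates = [
        "urlfilter-regex",
        "urlfilter-validator",
        "urlfilter-validate",
        "parse-html",
    ]
    print(newly_included(OLD_INCLUDES, NEW_INCLUDES, candidates))
    # prints ['urlfilter-validator']
```

Under the full-match assumption, the name "urlfilter-validate" is accepted by
neither value; only "urlfilter-validator" is newly enabled.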
