You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@nutch.apache.org by ma...@apache.org on 2019/02/22 15:49:07 UTC
[nutch] 02/03: NUTCH-2692 Subcollection to support case-insensitive
white and black lists
This is an automated email from the ASF dual-hosted git repository.
markus pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/nutch.git
commit 3fa2f4a7efac598258eb01a4387b5fde43c1a813
Author: Markus Jelsma <ma...@apache.org>
AuthorDate: Fri Feb 22 16:46:42 2019 +0100
NUTCH-2692 Subcollection to support case-insensitive white and black lists
---
conf/host-protocol-mapping.txt | 11 +++++++++++
1 file changed, 11 insertions(+)
diff --git a/conf/host-protocol-mapping.txt b/conf/host-protocol-mapping.txt
new file mode 100644
index 0000000..d0a1b70
--- /dev/null
+++ b/conf/host-protocol-mapping.txt
@@ -0,0 +1,11 @@
+# This file defines a hostname to protocol plugin mapping. Each line takes a
+# host name followed by a tab, followed by the ID of the protocol plugin. You
+# can find the ID in the protocol plugin's plugin.xml file.
+#
+# <hostname>\t<plugin_id>\n
+# nutch.apache.org org.apache.nutch.protocol.httpclient.Http
+# tika.apache.org org.apache.nutch.protocol.http.Http
+#
+nutch.apache.org org.apache.nutch.protocol.httpclient.Http
+tika.apache.org org.apache.nutch.protocol.http.Http
+