You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@nutch.apache.org by jo...@apache.org on 2015/03/13 05:36:04 UTC

svn commit: r1666356 - /nutch/trunk/conf/mimetype-filter.txt

Author: jorgelbg
Date: Fri Mar 13 04:36:03 2015
New Revision: 1666356

URL: http://svn.apache.org/r1666356
Log:
NUTCH-1962 Need to have mimetype-filter.txt file available by default

Added:
    nutch/trunk/conf/mimetype-filter.txt

Added: nutch/trunk/conf/mimetype-filter.txt
URL: http://svn.apache.org/viewvc/nutch/trunk/conf/mimetype-filter.txt?rev=1666356&view=auto
==============================================================================
--- nutch/trunk/conf/mimetype-filter.txt (added)
+++ nutch/trunk/conf/mimetype-filter.txt Fri Mar 13 04:36:03 2015
@@ -0,0 +1,20 @@
+# config file for mimetype-filter plugin
+
+# This plugin can be configured to work in one of two modes (similar to
+# suffix-urlfilter)
+
+# default to reject ('-'): in this mode, all documents will be rejected except
+# for those specified in this configuration file.
+
+# default to accept ('+'): in this mode, all documents will be accepted except
+# for those specified in this configuration file.
+
+# The format of this config file is one mimetype per line, with no preceding
+# whitespace. Order, in which suffixes are specified, doesn't matter. Blank
+# lines and comments (#) are allowed.
+
+# block everything
+-
+
+# allow only documents with a text/html mimetype
+text/html