Posted to dev@nutch.apache.org by "Junqiang Zhang (JIRA)" <ji...@apache.org> on 2018/10/29 11:03:00 UTC
[jira] [Created] (NUTCH-2670) org.apache.nutch.indexer.IndexerMapReduce does not read the value of "indexer.delete" from nutch-site.xml
Junqiang Zhang created NUTCH-2670:
-------------------------------------
Summary: org.apache.nutch.indexer.IndexerMapReduce does not read the value of "indexer.delete" from nutch-site.xml
Key: NUTCH-2670
URL: https://issues.apache.org/jira/browse/NUTCH-2670
Project: Nutch
Issue Type: Bug
Components: indexer
Affects Versions: 1.15, 1.14
Environment: macOS Mojave and High Sierra
MacBook Pro (Retina, 13-inch, Mid 2014)
Oracle Java 1.8.0_144-b01 and previous versions
Reporter: Junqiang Zhang
Inside org.apache.nutch.indexer.IndexerMapReduce.IndexerReducer, the setup() method should read the value of "indexer.delete" from nutch-site.xml and assign it to the variable "delete". See the following line of code:
(line 201) delete = conf.getBoolean(INDEXER_DELETE, false);
However, the value of "indexer.delete" set in nutch-site.xml or nutch-default.xml is not assigned to the variable "delete". I put the following setting in nutch-site.xml, in nutch-default.xml, and in both of them. In every case, the variable "delete" remains false.
<property>
<name>indexer.delete</name>
<value>true</value>
<description>Whether the indexer will delete documents GONE or REDIRECTS by indexing filters
</description>
</property>
I also changed the default value in that line of code:
delete = conf.getBoolean(INDEXER_DELETE, true);
Whatever value of "indexer.delete" is set in nutch-site.xml or nutch-default.xml, and whichever default is passed to getBoolean(), the value of "delete" remains false.
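The observation above narrows down where to look, and a small sketch may make the reasoning concrete. A getBoolean-style lookup falls back to its default only when the key is absent or not a valid boolean, so a result of false with a default of true suggests that "indexer.delete" is present in the job configuration with the value "false" by the time setup() runs, for example because something set it explicitly after the XML files were loaded. The class below is hypothetical: ConfLookupSketch and its Map-based getBoolean are stand-ins written for illustration, assumed to mirror the lookup semantics of Hadoop's Configuration.getBoolean, and are not Nutch or Hadoop code.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical stand-in for illustration only; not part of Nutch or Hadoop.
public class ConfLookupSketch {
    static final String INDEXER_DELETE = "indexer.delete";

    // Assumed to mirror Configuration.getBoolean(name, defaultValue):
    // the default is used only when the key is missing, empty,
    // or not parseable as "true"/"false".
    static boolean getBoolean(Map<String, String> conf, String name, boolean defaultValue) {
        String value = conf.get(name);
        if (value == null || value.trim().isEmpty()) {
            return defaultValue;
        }
        value = value.trim();
        if ("true".equalsIgnoreCase(value)) {
            return true;
        }
        if ("false".equalsIgnoreCase(value)) {
            return false;
        }
        return defaultValue;
    }

    public static void main(String[] args) {
        Map<String, String> conf = new HashMap<>();

        // Case 1: key absent, so the default decides the result.
        System.out.println(getBoolean(conf, INDEXER_DELETE, true));   // prints "true"

        // Case 2: value as it would arrive from nutch-site.xml.
        conf.put(INDEXER_DELETE, "true");
        System.out.println(getBoolean(conf, INDEXER_DELETE, false));  // prints "true"

        // Case 3: key overwritten with "false" after loading; the default is ignored.
        conf.put(INDEXER_DELETE, "false");
        System.out.println(getBoolean(conf, INDEXER_DELETE, true));   // prints "false"
    }
}
```

Only the third case reproduces the reported behavior, which is why checking for code paths or command-line options that set "indexer.delete" explicitly before the reducer starts seems like a reasonable next step.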
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)