Posted to dev@nutch.apache.org by "Moreno Feltscher (JIRA)" <ji...@apache.org> on 2018/01/12 23:23:00 UTC
[jira] [Created] (NUTCH-2495) Use -deleteGone instead of clean job in crawler script while indexing
Moreno Feltscher created NUTCH-2495:
---------------------------------------
Summary: Use -deleteGone instead of clean job in crawler script while indexing
Key: NUTCH-2495
URL: https://issues.apache.org/jira/browse/NUTCH-2495
Project: Nutch
Issue Type: Improvement
Reporter: Moreno Feltscher
Assignee: Moreno Feltscher
Instead of running {{bin/nutch clean}} after indexing the documents, the crawler script should run {{bin/nutch index}} with the {{-deleteGone}} flag. In addition to deleting gone and duplicate documents, this flag also deletes redirects from the index.
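
A rough sketch of what the indexing step could look like with this change. The helper name {{index_cmd}}, the example crawl directory and segment name, and the crawldb/linkdb/segments layout are assumptions for illustration, not taken from the actual crawler script. The function only prints the command so it can be inspected without a Nutch installation.

```shell
#!/bin/sh
# Sketch only: index_cmd is a hypothetical helper, and the
# <crawl_path>/crawldb, /linkdb and /segments layout is assumed.

# Build the indexing command with -deleteGone appended and print it
# instead of executing it.
index_cmd() {
  crawl_path=$1
  segment=$2
  echo "bin/nutch index $crawl_path/crawldb -linkdb $crawl_path/linkdb $crawl_path/segments/$segment -deleteGone"
}

# Example with placeholder values.
index_cmd crawl 20180112232300
```

With the flag passed at indexing time, the separate {{bin/nutch clean}} call after indexing would be dropped from the script, as proposed above.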
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)