Posted to dev@nutch.apache.org by "Nguyen Manh Tien (JIRA)" <ji...@apache.org> on 2013/12/23 05:53:50 UTC
[jira] [Created] (NUTCH-1690) IndexClean: mark url as unindexed after clean to not delete again
Nguyen Manh Tien created NUTCH-1690:
---------------------------------------
Summary: IndexClean: mark url as unindexed after clean to not delete again
Key: NUTCH-1690
URL: https://issues.apache.org/jira/browse/NUTCH-1690
Project: Nutch
Issue Type: Improvement
Components: indexer
Reporter: Nguyen Manh Tien
Priority: Minor
We should mark a deleted page so that it is not deleted again and again. That can be done simply by removing the index marker when we delete.
I also changed solrclean to delete duplicated URLs.
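
To illustrate the idea, here is a minimal, self-contained sketch. It is not the attached patch and does not use the Nutch API: the class IndexCleanSketch, the Page type, the INDEX_MARK key and the clean() method are all hypothetical names, and an in-memory map stands in for the web table. It only shows that removing the index marker at delete time makes a second clean run a no-op for that page.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Illustrative only: every name here is an assumption, not Nutch code.
class IndexCleanSketch {

    // Hypothetical marker key meaning "this page is currently in the index".
    static final String INDEX_MARK = "index_mark";

    // Hypothetical stand-in for a stored page row.
    static final class Page {
        final Map<String, String> markers = new HashMap<>();
        final boolean gone; // true if the page should no longer be indexed

        Page(boolean gone, boolean indexed) {
            this.gone = gone;
            if (indexed) {
                markers.put(INDEX_MARK, "y");
            }
        }
    }

    // Returns the URLs for which a delete would be sent to the index.
    // A page qualifies only if it is gone AND still carries the index marker;
    // the marker is removed in the same pass, so the next run skips the page.
    static List<String> clean(Map<String, Page> pages) {
        List<String> deleted = new ArrayList<>();
        for (Map.Entry<String, Page> entry : pages.entrySet()) {
            Page page = entry.getValue();
            if (page.gone && page.markers.containsKey(INDEX_MARK)) {
                deleted.add(entry.getKey());     // stand-in for the index delete request
                page.markers.remove(INDEX_MARK); // mark the URL as unindexed
            }
        }
        return deleted;
    }

    public static void main(String[] args) {
        Map<String, Page> pages = new LinkedHashMap<>();
        pages.put("http://example.com/gone", new Page(true, true));
        pages.put("http://example.com/live", new Page(false, true));

        System.out.println("first run deletes:  " + clean(pages));
        System.out.println("second run deletes: " + clean(pages));
    }
}
```

Without the marker removal, every clean run would send a delete for the same gone page again.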
--
This message was sent by Atlassian JIRA
(v6.1.5#6160)