You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/05/19 22:25:43 UTC

[jira] [Commented] (NUTCH-1765) SolrClean to remove redirected URLs from Solr

    [ https://issues.apache.org/jira/browse/NUTCH-1765?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14002329#comment-14002329 ] 

Julien Nioche commented on NUTCH-1765:
--------------------------------------

Iain, 

Nutch has changed a bit since 1.6 and the SOLRClean code has been deprecated in favour of the generic pluggable indexing backend mechanism. Could you please check whether this is still the case with generic clean command?

Thanks

> SolrClean to remove redirected URLs from Solr
> ---------------------------------------------
>
>                 Key: NUTCH-1765
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1765
>             Project: Nutch
>          Issue Type: Improvement
>          Components: indexer
>    Affects Versions: 1.6
>            Reporter: Iain Lopata
>            Priority: Minor
>              Labels: Solr
>             Fix For: 1.9
>
>
> SolrClean currently only removes urls with a status of STATUS_DB_GONE from the Solr Index.  It should also remove urls with a status of  STATUS_DB_REDIR_TEMP and  STATUS_DB_REDIR_PERM.



--
This message was sent by Atlassian JIRA
(v6.2#6252)