You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2014/01/16 17:08:19 UTC

[jira] [Commented] (NUTCH-1706) IndexerMapReduce does not remove db_redir_temp etc

    [ https://issues.apache.org/jira/browse/NUTCH-1706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13873537#comment-13873537 ] 

Markus Jelsma commented on NUTCH-1706:
--------------------------------------

One record fetched and in CrawlDB, it is a redirect but is not going to be removed because the check is in the wrong place. This patch fixes that issue and the redirect is deleted as it should.

Please confirm.

> IndexerMapReduce does not remove db_redir_temp etc
> --------------------------------------------------
>
>                 Key: NUTCH-1706
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1706
>             Project: Nutch
>          Issue Type: Bug
>          Components: indexer
>    Affects Versions: 1.7
>            Reporter: Markus Jelsma
>            Assignee: Markus Jelsma
>            Priority: Blocker
>             Fix For: 1.8
>
>         Attachments: NUTCH-1706-trunk.patch
>
>
> Code path is wrong in IndexerMapReduce, the delete code should be located after all reducer values have been gathered.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)