Posted to dev@nutch.apache.org by "Prakhar Chaube (Jira)" <ji...@apache.org> on 2021/11/30 13:11:00 UTC

[jira] [Created] (NUTCH-2911) Add cleanup call in Fetcher.java

Prakhar Chaube created NUTCH-2911:
-------------------------------------

             Summary: Add cleanup call in Fetcher.java
                 Key: NUTCH-2911
                 URL: https://issues.apache.org/jira/browse/NUTCH-2911
             Project: Nutch
          Issue Type: Improvement
          Components: fetcher
            Reporter: Prakhar Chaube


Fetcher's inner class, FetcherRun, overrides the Hadoop Mapper's run() method.
Although FetcherRun does not strictly need an explicit call to the Mapper's cleanup() (which is an empty method by default), adding one would improve the readability and completeness of the run() method.
Ideally, every implementation of Mapper is supposed to do the following tasks:
1. Perform setup.
2. Call map on the data set.
3. Perform cleanup.

Moreover, if a custom Fetcher is written by extending Fetcher.java, a cleanup() override could easily be missed, because the overridden run() never calls it.
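
The lifecycle described above can be illustrated with a minimal, self-contained sketch. It does not use the Hadoop or Nutch classes: SketchMapper, SketchFetcherRun and CustomFetcherRun are hypothetical stand-ins, and the record type and event log are assumptions made only for illustration. It shows an overridden run() that still calls cleanup() in a finally block, so that a subclass's cleanup() override is not skipped.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical stand-ins; these are NOT the Hadoop Mapper or Nutch Fetcher classes.
class MapperLifecycleSketch {

  /** Minimal mapper-like base class with the three lifecycle hooks. */
  static class SketchMapper {
    final List<String> events = new ArrayList<>();

    protected void setup() { events.add("setup"); }

    protected void map(String record) { events.add("map:" + record); }

    // Empty by default, like the cleanup() described in the issue.
    protected void cleanup() { }

    public void run(List<String> records) {
      setup();
      try {
        for (String record : records) {
          map(record);
        }
      } finally {
        cleanup();
      }
    }
  }

  /** Stand-in for a class that overrides run() with its own loop. */
  static class SketchFetcherRun extends SketchMapper {
    @Override
    public void run(List<String> records) {
      setup();
      try {
        for (String record : records) {
          map(record);
        }
      } finally {
        // The explicit call proposed in the issue: without it,
        // a subclass's cleanup() override would never run.
        cleanup();
      }
    }
  }

  /** Stand-in for a custom fetcher that relies on cleanup(). */
  static class CustomFetcherRun extends SketchFetcherRun {
    @Override
    protected void cleanup() { events.add("cleanup"); }
  }

  /** Runs the custom subclass and returns the recorded lifecycle events. */
  static List<String> runSketch(List<String> records) {
    SketchMapper mapper = new CustomFetcherRun();
    mapper.run(records);
    return mapper.events;
  }

  public static void main(String[] args) {
    // Prints [setup, map:a, map:b, cleanup]
    System.out.println(runSketch(Arrays.asList("a", "b")));
  }
}
```

Placing the cleanup() call in a finally block is one possible choice; it ensures the hook runs even if map() throws.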



--
This message was sent by Atlassian Jira
(v8.20.1#820001)