You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by "Sami Siren (JIRA)" <ji...@apache.org> on 2006/09/22 19:03:22 UTC

[jira] Created: (NUTCH-370) Generator loosed urls when run with LocalJobRunner

Generator loosed urls when run with LocalJobRunner
--------------------------------------------------

                 Key: NUTCH-370
                 URL: http://issues.apache.org/jira/browse/NUTCH-370
             Project: Nutch
          Issue Type: Bug
          Components: generator
    Affects Versions: 0.8, 0.8.1, 0.9.0
         Environment: linux
            Reporter: Sami Siren
         Assigned To: Sami Siren


When generator is run with LocalJobRunner part of generated urls get lost. This is because two map outputs are created and only one of them is processed in reduce phase.

When -numFetchers 1 is provided as command line parameters problem goes away.


-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] Closed: (NUTCH-370) Generator looses urls when run with LocalJobRunner

Posted by "Sami Siren (JIRA)" <ji...@apache.org>.
     [ http://issues.apache.org/jira/browse/NUTCH-370?page=all ]

Sami Siren closed NUTCH-370.
----------------------------

    Resolution: Duplicate

actually this is a duplicate of #361

> Generator looses urls when run with LocalJobRunner
> --------------------------------------------------
>
>                 Key: NUTCH-370
>                 URL: http://issues.apache.org/jira/browse/NUTCH-370
>             Project: Nutch
>          Issue Type: Bug
>          Components: generator
>    Affects Versions: 0.8, 0.9.0, 0.8.1
>         Environment: linux
>            Reporter: Sami Siren
>         Assigned To: Sami Siren
>
> When generator is run with LocalJobRunner part of generated urls get lost. This is because two map outputs are created and only one of them is processed in reduce phase.
> When -numFetchers 1 is provided as command line parameters problem goes away.

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] Updated: (NUTCH-370) Generator looses urls when run with LocalJobRunner

Posted by "Sami Siren (JIRA)" <ji...@apache.org>.
     [ http://issues.apache.org/jira/browse/NUTCH-370?page=all ]

Sami Siren updated NUTCH-370:
-----------------------------

    Summary: Generator looses urls when run with LocalJobRunner  (was: Generator loosed urls when run with LocalJobRunner)

> Generator looses urls when run with LocalJobRunner
> --------------------------------------------------
>
>                 Key: NUTCH-370
>                 URL: http://issues.apache.org/jira/browse/NUTCH-370
>             Project: Nutch
>          Issue Type: Bug
>          Components: generator
>    Affects Versions: 0.8, 0.9.0, 0.8.1
>         Environment: linux
>            Reporter: Sami Siren
>         Assigned To: Sami Siren
>
> When generator is run with LocalJobRunner part of generated urls get lost. This is because two map outputs are created and only one of them is processed in reduce phase.
> When -numFetchers 1 is provided as command line parameters problem goes away.

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira