You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by "Markus Jelsma (Created) (JIRA)" <ji...@apache.org> on 2012/03/20 20:25:37 UTC

[jira] [Created] (NUTCH-1318) Parse time outs crash parsing fetcher

Parse time outs crash parsing fetcher
-------------------------------------

                 Key: NUTCH-1318
                 URL: https://issues.apache.org/jira/browse/NUTCH-1318
             Project: Nutch
          Issue Type: Bug
    Affects Versions: 1.4
            Reporter: Markus Jelsma
            Assignee: Markus Jelsma
            Priority: Critical
             Fix For: 1.5


Some fetch lists can never be fetched and parsed successfully because a single timing out record can cause most and eventually all subsequeny records to time out as well. Finally the mapper will hang completely and so killing the entire fetch job, loosing 99% of the records that were processed.

I'm not sure what's going on, something may be leaking somewhere.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Updated] (NUTCH-1318) Parse time outs crash parsing fetcher

Posted by "Markus Jelsma (Updated) (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/NUTCH-1318?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Markus Jelsma updated NUTCH-1318:
---------------------------------

    Fix Version/s:     (was: 1.5)
                   1.6

20120304-push-1.6
                
> Parse time outs crash parsing fetcher
> -------------------------------------
>
>                 Key: NUTCH-1318
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1318
>             Project: Nutch
>          Issue Type: Bug
>    Affects Versions: 1.4
>            Reporter: Markus Jelsma
>            Assignee: Markus Jelsma
>            Priority: Critical
>             Fix For: 1.6
>
>
> Some fetch lists can never be fetched and parsed successfully because a single timing out record can cause most and eventually all subsequeny records to time out as well. Finally the mapper will hang completely and so killing the entire fetch job, loosing 99% of the records that were processed.
> I'm not sure what's going on, something may be leaking somewhere.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Resolved] (NUTCH-1318) Parse time outs crash parsing fetcher

Posted by "Markus Jelsma (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/NUTCH-1318?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Markus Jelsma resolved NUTCH-1318.
----------------------------------

    Resolution: Duplicate

Closing issue in favor of NUTCH-1387.
                
> Parse time outs crash parsing fetcher
> -------------------------------------
>
>                 Key: NUTCH-1318
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1318
>             Project: Nutch
>          Issue Type: Bug
>    Affects Versions: 1.4
>            Reporter: Markus Jelsma
>            Assignee: Markus Jelsma
>            Priority: Critical
>             Fix For: 1.6
>
>
> Some fetch lists can never be fetched and parsed successfully because a single timing out record can cause most and eventually all subsequeny records to time out as well. Finally the mapper will hang completely and so killing the entire fetch job, loosing 99% of the records that were processed.
> I'm not sure what's going on, something may be leaking somewhere.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira