You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2018/01/31 23:15:00 UTC

[jira] [Comment Edited] (NUTCH-2466) Sitemap processor to follow redirects

    [ https://issues.apache.org/jira/browse/NUTCH-2466?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16347762#comment-16347762 ] 

Markus Jelsma edited comment on NUTCH-2466 at 1/31/18 11:14 PM:
----------------------------------------------------------------

Another note, curious to see browser developers allow over ten redirects. I never observed any fruition to follow more than a few. Stranger even is IE's choice to jump from eleven to 120!

If anyone reading this can clarify the usefulness of following more than ten redirects? Or even 120? 

They made bad choices, or i haven't seen their views about the variety of crap on the web. Probably the latter is true.


was (Author: markus17):
Another note, curious to see browser developers allow over ten redirects. I never observed any fruition to follow more than a few. Stranger even is IE's choice to jump from eleven to 120!

If anyone reading this can clarify the usefulness of following more than ten redirects? Or even 120? 

That made bad choices, or i haven't seen their views about the variety of crap on the web. Probably the latter is true.

> Sitemap processor to follow redirects
> -------------------------------------
>
>                 Key: NUTCH-2466
>                 URL: https://issues.apache.org/jira/browse/NUTCH-2466
>             Project: Nutch
>          Issue Type: Bug
>    Affects Versions: 1.13
>            Reporter: Markus Jelsma
>            Assignee: Markus Jelsma
>            Priority: Minor
>             Fix For: 1.15
>
>         Attachments: NUTCH-2466.patch, NUTCH-2466.patch, NUTCH-2466.patch
>
>
> It does follow http > https, but not the following redirect, e.g. sitemap_index.xml that some websites have.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)